Open
Milestone
Nov 1, 2025鈥揇ec 31, 2025
setup storage API and re-harvest legacy data sources as RAW
Using the new set of harvesting libraries, the data storage API we want to re-harvest RAW data from all legacy data sources and store them witin our system.
For this, the data storage API should be set up and configured correctly to be able to consume the RAW data and provide it back for further processing and provide log entries.
The following endpoint times were harvested in the prototype and should be included as follows
| type of endpoint/source | what to do |
|---|---|
| sitemap harvesting | as is |
| OAI PMH | as is |
| GitLab instances | exclude until solution for codemeta.py is |
| indigo (KIT & Hifis) | either this or atomfeed |
| atomfeed (KIT & HIFIS) | either this or indigo |
| DataCIte | as Schema.org base on ROR IDs of centers |
Loading
Loading
Loading
Loading