Skip to content

Add Datacite types and properties to HDO

Problem

I would like to complete the mapping of the datacite schema to schema.org, however I cannot find appropriate types and properties in other ontologies. I was wondering if the HDO would be a suitable place the missing entities.

Backround-Info

The datacite commons API provides metadata in json (compliant with the datacite schema) and in json-ld (schema.org). However the mapping from datacite to schema.org ist incomplete. That results in the loss of information when data is requestet in json-ld.

The GBV (Gemeinsamer Bibliotheksverbund) terminology service provided a namespace to datacite entities and defined them via skos. Therefore it possible to add datacite-resource-types via @type or schema:additionalType to any (incomplete) schema.org mapping. However the GBV defined all datacite resources as skos:Concept or skos:ConceptScheme even datacite relation types, which makes it impossible to model relationships between nodes with datacite context.

The most important missing property is datacite:Collects / datacite:IsCollectedBy. Which relates a dataset to an instrument. Missing datacite resource types (such as datacite:Instrument) are annoying but not as problematic as missing relation types since there is a default resource type which is schema:CreativeWork.

Mapped relation types:

  • datacite:IsIdenticalTo => schema:sameAs
  • datacite:IsPartOf => schema:isPartOf
  • datacite:HasPart => schema:hasPart
  • datacite:IsPreviousVersionOf => schema:predecessor_of
  • datacite:IsNewVersionOf => schema:successor_of
  • datacite:References => schema:citation
  • datacite:HasTranslation => schema:workTranslation
  • datacite:IsTranslationOf => schema:translationOfWork

Missing realation types:

  • datacite:IsCitedBy / datacite:Cites
  • datacite:IsSupplementTo / datacite:IsSupplementedBy
  • datacite:IsContinuedBy / datacite:Continues
  • datacite:IsPublishedIn / datacite:IsReferencedBy
  • datacite:References
  • datacite:IsDocumentedBy / datacite:Documents
  • datacite:IsCompiledBy / datacite:Compiles
  • datacite:IsVariantFormOf
  • datacite:IsOriginalFormOf
  • datacite:HasMetadata / datacite:IsMetadataFor
  • datacite:Reviews / datacite:IsReviewedBy
  • datacite:IsDerivedFrom / datacite:IsSourceOf
  • datacite:Describes / datacite:IsDescribedBy
  • datacite:HasVersion / datacite:IsVersionOf
  • datacite:Requires / datacite:IsRequiredBy
  • datacite:Obsoletes / datacite:IsObsoletedBy
  • datacite:Collects / datacite:IsCollectedBy

Missing data types

  • datacite:InteractiveResource => nil
  • datacite:Instrument
  • datacite:Model => nil
  • datacite:OutputManagementPlan => nil
  • datacite:PhysicalObject => nil
  • datacite:Preprint => nil
  • datacite:Standard => nil
  • datacite:StudyRegistration
  • datacite:Workflow => nil

Mapped data types

  • datacite:Audiovisual => schema:MediaObject
  • datacite:Book => schema:Book
  • datacite:BookChapter => schema:Chapter
  • datacite:Collection => schema:Collection
  • datacite:ComputationalNotebook => schema:SoftwareSourceCode
  • datacite:ConferencePaper => schema:Article
  • datacite:ConferenceProceeding => schema:Periodical
  • datacite:DataPaper => schema:Article
  • datacite:Dataset => schema:Dataset
  • datacite:Dissertation => schema:Thesis
  • datacite:Event => schema:Event
  • datacite:Image => schema:ImageObject
  • datacite:Journal => schema:Periodical
  • datacite:JournalArticle => schema:ScholarlyArticle
  • datacite:PeerReview => Review
  • datacite:Report => schema:Report
  • datacite:Service => schema:Service
  • datacite:Software => schema:SoftwareSourceCode
  • datacite:Sound => schema:AudioObject
  • datacite:Text => schema:ScholaryArticle
  • datacite:Other => schema:CreativeWork

not part of DataCite schema, but used internally

  • datacite:Periodical => schema:Periodical
  • datacite:DataCatalog => schema:DataCatalog
  • datacite:Award => schema:Grant
  • datacite:Project" => schema:Project

Materials

This is what I found so far in my research:

An ontology that might work:

  • DISCO "A vocabulary for publishing metadata about data sets (research and survey data) into the Web of Linked Data"

Other ontologies mentioning "Instruments"

Bolognese Mapping tool

Datacite Resources

GBV Resources (namespace for datacite skos)