Repository metadata consistency with RDA recommendations #247

Open
mbjones opened this issue Oct 19, 2023 · 0 comments

The RDA Data Repository Attributes working group is proposing repository metadata attributes to be used in describing repositories (see their Draft Version 5), and these overlap with the ESIP science-on-schema.org guidelines for repository metadata. I did a little crosswalk to see how consistent they are, and I worked up a document for the Arctic Data Center that illustrates usage.

Here's a comparison table of how the ESIP and RDA field lists correspond.

| number | label | schema_mapping | esip_recommended | rda_recommended |
|---|---|---|---|---|
| 1 | Repository Name | schema:name | TRUE | TRUE |
| 2 | URL | schema:url | TRUE | TRUE |
| 3 | Country | schema:addressCountry | | TRUE |
| 4 | Language | schema:inLanguage | | TRUE |
| 5 | Institution | schema:parentOrganization | TRUE | TRUE |
| 6 | Contact | schema:contactPoint | TRUE | TRUE |
| 7 | Description | schema:description | TRUE | TRUE |
| 8 | Research Area | schema:category | TRUE | TRUE |
| 9 | Persistent Identifiers | ex:persistentIdentifiers | | TRUE |
| 10 | Machine Interoperability | ex:machineInteroperability | | TRUE |
| 11 | Metadata | ex:metadata | | TRUE |
| 12 | Curation | ex:curation | | TRUE |
| 13 | Terms of Deposit | schema:termsOfService | | TRUE |
| 14 | Terms of Access | ex:termsOfAccess | | TRUE |
| 15 | Dataset Use License | schema:license | | TRUE |
| 16 | Certification | schema:credentialCategory | | TRUE |
| 17 | Preservation Policy | ex:preservationPolicy | | TRUE |
| 18 | Repository Legal Name | schema:legalName | TRUE | |
| 19 | Repository Alternate Name | schema:alternateName | | |
| 20 | Repository Logo URL | schema:logo | TRUE | |
| 21 | Repository Linked Identifier | @id | TRUE | |
| 22 | Repository Persistent Identifier | schema:identifier | TRUE | |
| 23 | Repository Equivalent Identifier | schema:sameAs | TRUE | |
| 24 | Repository Service Provider | schema:provider | TRUE | |
| 25 | Founding Date | schema:foundingDate | TRUE | |
| 26 | Funder | schema:funder | TRUE | |
| 27 | Repository Catalog | schema:hasOfferCatalog | TRUE | |
| 28 | Repository Address | schema:address | TRUE | |
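
The overlap in the table above can be tallied with a few lines of Python. This is only a sketch: the `CROSSWALK` list and the `partition` helper are illustrative names, and for rows that carry a single TRUE the column is an assumption (rows 3-4 and 9-17 are read as RDA-only, rows 18 and 20-28 as ESIP-only).

```python
# Crosswalk rows as (label, mapping, esip_recommended, rda_recommended).
# ASSUMPTION: for rows with a single TRUE, the column assignment is inferred
# (rows 3-4 and 9-17 read as RDA-only, rows 18 and 20-28 as ESIP-only).
CROSSWALK = [
    ("Repository Name", "schema:name", True, True),
    ("URL", "schema:url", True, True),
    ("Country", "schema:addressCountry", False, True),
    ("Language", "schema:inLanguage", False, True),
    ("Institution", "schema:parentOrganization", True, True),
    ("Contact", "schema:contactPoint", True, True),
    ("Description", "schema:description", True, True),
    ("Research Area", "schema:category", True, True),
    ("Persistent Identifiers", "ex:persistentIdentifiers", False, True),
    ("Machine Interoperability", "ex:machineInteroperability", False, True),
    ("Metadata", "ex:metadata", False, True),
    ("Curation", "ex:curation", False, True),
    ("Terms of Deposit", "schema:termsOfService", False, True),
    ("Terms of Access", "ex:termsOfAccess", False, True),
    ("Dataset Use License", "schema:license", False, True),
    ("Certification", "schema:credentialCategory", False, True),
    ("Preservation Policy", "ex:preservationPolicy", False, True),
    ("Repository Legal Name", "schema:legalName", True, False),
    ("Repository Alternate Name", "schema:alternateName", False, False),
    ("Repository Logo URL", "schema:logo", True, False),
    ("Repository Linked Identifier", "@id", True, False),
    ("Repository Persistent Identifier", "schema:identifier", True, False),
    ("Repository Equivalent Identifier", "schema:sameAs", True, False),
    ("Repository Service Provider", "schema:provider", True, False),
    ("Founding Date", "schema:foundingDate", True, False),
    ("Funder", "schema:funder", True, False),
    ("Repository Catalog", "schema:hasOfferCatalog", True, False),
    ("Repository Address", "schema:address", True, False),
]


def partition(rows):
    """Group mappings by which recommendation lists include them."""
    groups = {"both": [], "esip_only": [], "rda_only": [], "neither": []}
    for _label, mapping, esip, rda in rows:
        if esip and rda:
            key = "both"
        elif esip:
            key = "esip_only"
        elif rda:
            key = "rda_only"
        else:
            key = "neither"
        groups[key].append(mapping)
    return groups


if __name__ == "__main__":
    for name, mappings in partition(CROSSWALK).items():
        print(f"{name} ({len(mappings)}): {', '.join(mappings)}")
```

Under that reading, six fields are shared by both lists, which is where the two sets of guidance already agree.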

And here's a schema.org document for the Arctic Data Center with a union of both sets of fields.

arctic-data-center.json metadata:
{
  "@context": [
    "https://schema.org/",
    {
        "ex": "https://example.org/"
    }
  ],
  "@type": [
    "Service",
    "Organization",
    "ResearchProject"
  ],
  "@id": "https://arcticdata.io",
  "name": "Arctic Data Center",
  "legalName": "Arctic Data Center",
  "alternateName": "ADC",
  "logo": "https://arcticdata.io/wp-content/themes/aurora/library/images/logo_.png",
  "url": "https://arcticdata.io",
  "description": "The Arctic Data Center is the primary data and software repository for the Arctic section of NSF Polar Programs.",
  "identifier": [
    {
      "@type": "PropertyValue",
      "name": "ROR:055hrh286",
      "propertyID": "https://registry.identifiers.org/registry/ror",
      "value": "ror:055hrh286",
      "url": "https://ror.org/055hrh286"
    },
    {
      "@type": "PropertyValue",
      "name": "Re3data DOI: 10.17616/R37P98",
      "propertyID": "https://registry.identifiers.org/registry/doi",
      "value": "doi:10.17616/R37P98",
      "url": "https://doi.org/10.17616/R37P98"
    },
    {
      "@type": "PropertyValue",
      "name": "wikidata:Q77285095",
      "propertyID": "https://registry.identifiers.org/registry/wikidata",
      "value": "wikidata:Q77285095",
      "url": "https://www.wikidata.org/wiki/Q77285095"
    },
    {
      "@type": "PropertyValue",
      "name": "grid:grid.507882.0",
      "propertyID": "https://registry.identifiers.org/registry/grid",
      "value": "grid:grid.507882.0",
      "url": "https://www.grid.ac/institutes/grid.507882.0"
    }
  ],
  "sameAs": [
    "https://ror.org/055hrh286",
    "https://www.grid.ac/institutes/grid.507882.0",
    "https://www.wikidata.org/wiki/Q77285095",
    "https://www.re3data.org/repository/r3d100011973",
    "https://doi.org/10.17616/R37P98",
    "urn:node:ARCTIC"
  ],
  "category": [
    "Arctic Research"
  ],
  "provider": {
    "@id": "https://arcticdata.io"
  },
  "contactPoint": {
    "@type": "ContactPoint",
    "name": "Support",
    "email": "[email protected]",
    "url": "https://arcticdata.io/support/",
    "contactType": "customer support"
  },
  "foundingDate": "2016-02-01",
  "funder": {
    "@type": "Organization",
    "@id": "https://doi.org/10.13039/100000087",
    "legalName": "Office of Polar Programs",
    "alternateName": "OPP",
    "url": "https://www.nsf.gov/div/index.jsp?div=OPP",
    "identifier": {
      "@type": "PropertyValue",
      "propertyID": "https://registry.identifiers.org/registry/doi",
      "value": "doi:10.13039/100000087",
      "url": "https://doi.org/10.13039/100000087"
    },
    "parentOrganization": {
      "@type": "Organization",
      "@id": "https://doi.org/10.13039/100000085",
      "legalName": "Directorate for Geosciences",
      "alternateName": "NSF-GEO",
      "url": "http://www.nsf.gov",
      "identifier": {
        "@type": "PropertyValue",
        "propertyID": "https://registry.identifiers.org/registry/doi",
        "value": "doi:10.13039/100000085",
        "url": "https://doi.org/10.13039/100000085"
      },
      "parentOrganization": {
        "@type": "Organization",
        "@id": "https://doi.org/10.13039/100000001",
        "legalName": "National Science Foundation",
        "alternateName": "NSF",
        "url": "http://www.nsf.gov",
        "identifier": {
          "@type": "PropertyValue",
          "propertyID": "https://registry.identifiers.org/registry/doi",
          "value": "doi:10.13039/100000001",
          "url": "https://doi.org/10.13039/100000001"
        }
      }
    }
  },
  "hasOfferCatalog": {
    "@type": "OfferCatalog",
    "name": "Arctic Data Center Data Catalog",
    "itemListElement": [
      {
        "@type": "DataCatalog",
        "@id": "https://arcticdata.io/catalog/data",
        "name": "Arctic Data Center Catalog",
        "audience": {
          "@type": "Audience",
          "audienceType": "public",
          "name": "General Public"
        }
      }
    ]
  },
  "address": {
    "@type": "PostalAddress",
    "streetAddress": "1021 Anacapa Street",
    "addressLocality": "Santa Barbara",
    "addressRegion": "CA",
    "postalCode": "93101",
    "addressCountry": "US"
  },
  "parentOrganization": {
    "@type": "Organization",
    "@id": "https://ror.org/0146z4r19",
    "legalName": "National Center for Ecological Analysis and Synthesis",
    "alternateName": "NCEAS",
    "url": "http://nceas.ucsb.edu",
    "identifier": {
      "@type": "PropertyValue",
      "propertyID": "https://registry.identifiers.org/registry/ror",
      "value": "ror:0146z4r19",
      "url": "https://ror.org/0146z4r19"
    },
    "parentOrganization": {
      "@type": "Organization",
      "@id": "https://ror.org/02t274463",
      "legalName": "University of California, Santa Barbara",
      "alternateName": "UCSB",
      "url": "http://ucsb.edu",
      "identifier": {
        "@type": "PropertyValue",
        "propertyID": "https://registry.identifiers.org/registry/ror",
        "value": "ror:02t274463",
        "url": "https://ror.org/02t274463"
      }
    }
  },
  "inLanguage": "en-US",
  "addressCountry": "US",
  "license": [
    "http://spdx.org/licenses/CC0-1.0",
    "https://spdx.org/licenses/CC-BY-4.0"
  ],
  "credentialCategory": "CoreTrustSeal",
  "termsOfService": [
    "http://spdx.org/licenses/CC0-1.0",
    "https://spdx.org/licenses/CC-BY-4.0"
  ],
  "ex:persistentIdentifiers": [
    "https://registry.identifiers.org/registry/doi",
    "https://registry.identifiers.org/registry/orcid",
    "https://registry.identifiers.org/registry/ror",
    "https://registry.identifiers.org/registry/rrid",
    "https://registry.identifiers.org/registry/d1id",
    "https://registry.identifiers.org/registry/ark"
  ],
  "ex:machineInteroperability": [
    "DataONE", "OAI-PMH", "DataCite", "REST", "SPARQL"
  ],
  "ex:metadata": [
    "EML", "ISO-19115", "DDI", "Dublin Core", "FGDC CSDGM", "METS", "DataCite", "OAI-ORE", "other"
  ],
  "ex:curation": "https://arcticdata.io/submit/",
  "ex:preservationPolicy": "https://arcticdata.io/preservation/",
  "ex:termsOfAccess": [
    "http://spdx.org/licenses/CC0-1.0",
    "https://spdx.org/licenses/CC-BY-4.0"
  ]
}
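
One way to confirm that a repository document like the one above really carries every field in the crosswalk is to compare its top-level keys against the `schema_mapping` column. The sketch below is illustrative rather than part of either guideline: `EXPECTED_KEYS`, `missing_keys`, and `SAMPLE` are hypothetical names, and the check looks only at literal top-level keys instead of doing full JSON-LD expansion.

```python
import json

# Property names from the schema_mapping column, with the "schema:" prefix
# dropped because the document's default context is https://schema.org/.
EXPECTED_KEYS = [
    "name", "url", "addressCountry", "inLanguage", "parentOrganization",
    "contactPoint", "description", "category",
    "ex:persistentIdentifiers", "ex:machineInteroperability", "ex:metadata",
    "ex:curation", "termsOfService", "ex:termsOfAccess", "license",
    "credentialCategory", "ex:preservationPolicy", "legalName",
    "alternateName", "logo", "@id", "identifier", "sameAs", "provider",
    "foundingDate", "funder", "hasOfferCatalog", "address",
]


def missing_keys(doc, expected=EXPECTED_KEYS):
    """Return the expected top-level keys that are absent from the object."""
    return [key for key in expected if key not in doc]


# A cut-down stand-in for arctic-data-center.json, for illustration only.
SAMPLE = json.loads("""
{
  "@context": ["https://schema.org/", {"ex": "https://example.org/"}],
  "@id": "https://arcticdata.io",
  "name": "Arctic Data Center",
  "url": "https://arcticdata.io",
  "ex:curation": "https://arcticdata.io/submit/"
}
""")

if __name__ == "__main__":
    absent = missing_keys(SAMPLE)
    print(f"{len(absent)} of {len(EXPECTED_KEYS)} crosswalk fields missing")
```

Run against the full document above, `missing_keys` should return an empty list, since that document is the union of both field sets.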

Note that I didn't have good mappings for the following terms, which I placed in the example namespace (ex) for now:

  • ex:persistentIdentifiers
  • ex:machineInteroperability
  • ex:metadata
  • ex:curation
  • ex:preservationPolicy
  • ex:termsOfAccess
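
The placeholder terms listed above can also be pulled out of a document programmatically, which helps track how many remain as mappings get settled. This is a minimal sketch; `placeholder_terms` is a hypothetical helper, and it matches on the literal `ex:` prefix rather than resolving the JSON-LD context.

```python
def placeholder_terms(node, prefix="ex:"):
    """Recursively collect keys that start with the placeholder prefix."""
    found = set()
    if isinstance(node, dict):
        for key, value in node.items():
            if key.startswith(prefix):
                found.add(key)
            # Descend into nested objects and arrays as well.
            found.update(placeholder_terms(value, prefix))
    elif isinstance(node, list):
        for item in node:
            found.update(placeholder_terms(item, prefix))
    return sorted(found)


if __name__ == "__main__":
    # A small excerpt of the repository document, for illustration only.
    doc = {
        "@context": ["https://schema.org/", {"ex": "https://example.org/"}],
        "name": "Arctic Data Center",
        "ex:curation": "https://arcticdata.io/submit/",
        "ex:preservationPolicy": "https://arcticdata.io/preservation/",
    }
    print(placeholder_terms(doc))
```

The `"ex"` entry inside `@context` is not reported, because the key there has no trailing colon.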

Thoughts on whether we should make any changes to the ESIP guidelines to be consistent with the RDA recommendations?
