Documents#

About#

Documents: these include datasets, reports, and other documents.

Creative works (documents)#

Documents include maps, reports, guidance, and other creative works. For this reason, OIH starts with a generic schema.org/CreativeWork example and then provides examples for more specific kinds of creative work.

{
  "@context": {
    "@vocab": "https://schema.org/"
  },
  "@type": "CreativeWork",
  "@id": "https://example.org/id/XYZ",
  "name": "Name or title of the document",
  "description": "Description of the creative work to aid in searching",
  "url": "https://www.sample-data-repository.org/creativework/report.pdf",
  "contributor": {
    "@type": "Organization",
    "@id": "http://www.foo.org/orgID",
    "legalName": "Some Institute"
  },
  "author": {
    "@id": "https://www.sample-data-repository.org/person/51317",
    "@type": "Person",
    "name": "Dr Uta Passow",
    "givenName": "Uta",
    "familyName": "Passow",
    "url": "https://www.sample-data-repository.org/person/51317"
  },
  "identifier": {
    "@id": "https://doi.org/10.5066/F7VX0DMQ",
    "@type": "PropertyValue",
    "propertyID": "https://registry.identifiers.org/registry/doi",
    "value": "doi:10.5066/F7VX0DMQ",
    "url": "https://doi.org/10.5066/F7VX0DMQ"
  },
  "keywords": {
    "@type": "DefinedTerm",
    "inDefinedTermSet": {
      "@type": "DefinedTermSet",
      "name": "Name of the set",
      "description": "Description of the set",
      "url": "url for the set"
    },
    "termCode": "A code that identifies this DefinedTerm within a DefinedTermSet"
  },
  "provider": {
    "@id": "https://www.repositoryB.org",
    "@type": "Organization",
    "legalName": "Sample Data Repository Office",
    "name": "SDRO",
    "sameAs": "http://www.re3data.org/repository/r3dxxxxxxxxx",
    "url": "https://www.sample-data-repository.org"
  },
  "license": "http://spdx.org/licenses/CC0-1.0",
  "publisher": {
    "@id": "https://www.publishingrus.org",
    "@type": "Organization",
    "legalName": "Some Institute"
  }
}

Details: Identifier#

Each profile has a few key elements we need to know about. One of them is the authoritative reference, or canonical identifier, for a resource.

{
    "@context": {
        "@vocab": "https://schema.org/"
    },
    "@id": "https://example.org/id/XYZ",
    "@type": "CreativeWork",
    "identifier": {
        "@id": "https://doi.org/10.5066/F7VX0DMQ",
        "@type": "PropertyValue",
        "propertyID": "https://registry.identifiers.org/registry/doi",
        "url": "https://doi.org/10.5066/F7VX0DMQ",
        "value": "doi:10.5066/F7VX0DMQ"
    }
}
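
A harvester usually wants one resolvable identifier out of this block. The following is a minimal sketch, assuming a record shaped like the one above. The function name `canonical_identifier` and the fallback order (`url`, then a `doi:`-prefixed `value`, then the node's `@id`) are illustrative assumptions, not part of the OIH guidance.

```python
import json

# Example record, copied from the identifier snippet above.
RECORD = json.loads("""
{
    "@context": {"@vocab": "https://schema.org/"},
    "@id": "https://example.org/id/XYZ",
    "@type": "CreativeWork",
    "identifier": {
        "@id": "https://doi.org/10.5066/F7VX0DMQ",
        "@type": "PropertyValue",
        "propertyID": "https://registry.identifiers.org/registry/doi",
        "url": "https://doi.org/10.5066/F7VX0DMQ",
        "value": "doi:10.5066/F7VX0DMQ"
    }
}
""")


def canonical_identifier(record):
    """Return a resolvable identifier URL for a record, or None.

    Hypothetical helper: the fallback order below is an assumption.
    """
    identifier = record.get("identifier")
    if isinstance(identifier, str):
        # schema.org also allows a plain text or URL identifier.
        return identifier
    if not isinstance(identifier, dict):
        return None
    if identifier.get("url"):
        return identifier["url"]
    value = identifier.get("value")
    if isinstance(value, str) and value.lower().startswith("doi:"):
        # Turn "doi:10.x/y" into the resolver form.
        return "https://doi.org/" + value[4:]
    return identifier.get("@id")


print(canonical_identifier(RECORD))
```

Preferring `url` keeps the lookup independent of any one identifier scheme; the `doi:` branch only matters for records that carry a bare `value`.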

Publisher and provider#

Our JSON-LD documents are graphs, and JSON-LD framing can be used to extract a subset of a graph. Here we look more closely at the provider and publisher properties, which are both of type Organization.

{
    "@context": {
        "@vocab": "https://schema.org/"
    },
    "@id": "https://example.org/id/XYZ",
    "@type": "CreativeWork",
    "provider": {
        "@id": "https://www.repositoryB.org",
        "@type": "Organization",
        "legalName": "Sample Data Repository Office",
        "name": "SDRO",
        "sameAs": "http://www.re3data.org/repository/r3dxxxxxxxxx",
        "url": "https://www.sample-data-repository.org"
    },
    "publisher": {
        "@id": "https://www.publishingrus.org",
        "@type": "Organization",
        "legalName": "Some Institute"
    }
}
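
Framing proper is defined by the JSON-LD 1.1 Framing specification and is normally done with a JSON-LD library. As a rough, dependency-free illustration of the idea only, the sketch below keeps a node's `@context`, `@id` and `@type` plus a chosen set of properties. The helper name `subset_properties` is hypothetical, and this is plain dictionary filtering, not an implementation of the framing algorithm.

```python
import json

# A trimmed copy of the full CreativeWork example from earlier on this page.
DOCUMENT = {
    "@context": {"@vocab": "https://schema.org/"},
    "@type": "CreativeWork",
    "@id": "https://example.org/id/XYZ",
    "name": "Name or title of the document",
    "license": "http://spdx.org/licenses/CC0-1.0",
    "provider": {
        "@id": "https://www.repositoryB.org",
        "@type": "Organization",
        "legalName": "Sample Data Repository Office",
    },
    "publisher": {
        "@id": "https://www.publishingrus.org",
        "@type": "Organization",
        "legalName": "Some Institute",
    },
}


def subset_properties(node, properties):
    """Keep the JSON-LD keywords of a node plus the named properties."""
    keep = {"@context", "@id", "@type"} | set(properties)
    return {key: value for key, value in node.items() if key in keep}


organizations = subset_properties(DOCUMENT, ["provider", "publisher"])
print(json.dumps(organizations, indent=4, sort_keys=True))
```

The same call with `["author"]` or `["license"]` would produce subsets like the ones in the following sections, provided the input document contains those properties.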

Author type Person#

As with the publisher and provider, framing can subset the graph to a single property. Here we look more closely at the author property, which points to a node of type Person.

{
    "@context": {
        "@vocab": "https://schema.org/"
    },
    "@id": "https://example.org/id/XYZ",
    "@type": "CreativeWork",
    "author": {
        "@id": "https://www.sample-data-repository.org/person/51317",
        "@type": "Person",
        "familyName": "Passow",
        "givenName": "Uta",
        "name": "Dr Uta Passow",
        "url": "https://www.sample-data-repository.org/person/51317"
    }
}

License#

{
    "@context": {
        "@vocab": "https://schema.org/"
    },
    "@id": "https://example.org/id/XYZ",
    "@type": "CreativeWork",
    "license": "http://spdx.org/licenses/CC0-1.0"
}

License as URL#

{
  "@context": "https://schema.org/",
  "license": "https://creativecommons.org/licenses/by/4.0/"
}

License as CreativeWork#

{
  "@context": "https://schema.org/",
  "license": {
    "@type": "CreativeWork",
    "name": "Creative Commons Attribution 4.0",
    "url": "https://creativecommons.org/licenses/by/4.0/"
  }
}

License as SPDX URL#

- Use a simple URL.
- SPDX provides URLs for many licenses, including those that have no URL of their own.
- SPDX is a source that harvesters can rely on (for example, by using the URL to look up more information about the license).

{
  "@context": "https://schema.org/",
  "license": "https://spdx.org/licenses/CC-BY-4.0"
}

OR, include both the SPDX and the Creative Commons URLs in an array:

{
  "@context": "https://schema.org/",
  "license": ["https://spdx.org/licenses/CC-BY-4.0", "https://creativecommons.org/licenses/by/4.0/"]
}
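
Because `license` may be a URL string, a CreativeWork object, or an array, a consumer has to normalize the value before comparing licenses. The sketch below is one way to do that, assuming only the shapes shown in this section. `license_urls` is a hypothetical helper name, and falling back from `url` to `@id` for a CreativeWork license is an assumption.

```python
def license_urls(record):
    """Return the license URLs found in a record as a list of strings."""
    value = record.get("license")
    if value is None:
        return []
    # Treat a single value as a one-element list.
    items = value if isinstance(value, list) else [value]
    urls = []
    for item in items:
        if isinstance(item, str):
            urls.append(item)
        elif isinstance(item, dict):
            # A CreativeWork license: prefer "url", fall back to "@id".
            url = item.get("url") or item.get("@id")
            if url:
                urls.append(url)
    return urls


as_url = {"license": "https://creativecommons.org/licenses/by/4.0/"}
as_creative_work = {
    "license": {
        "@type": "CreativeWork",
        "name": "Creative Commons Attribution 4.0",
        "url": "https://creativecommons.org/licenses/by/4.0/",
    }
}
as_array = {
    "license": [
        "https://spdx.org/licenses/CC-BY-4.0",
        "https://creativecommons.org/licenses/by/4.0/",
    ]
}

for record in (as_url, as_creative_work, as_array):
    print(license_urls(record))
```

Returning a list in every case lets calling code treat the three forms uniformly.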

References#

For datasets we can use SOS Dataset
The OBPS group is using the JericoS3 API (ref: https://www.jerico-ri.eu/)
- Traditional knowledge points here
- It sounds like they use DSpace
For other documents, these are likely to be some schema:CreativeWork, which has many subtypes we can explore. See also Adam Leadbetter’s work at Ocean Best Practices
- This is a great start and perhaps helps to highlight why SHACL shapes are useful
- https://irishmarineinstitute.github.io/erddap-lint/
- earthcubearchitecture-project418/p419dcatservices
EMODnet (Conor Delaney)
- ERDDAP also
- Are we talking about links from schema.org to OGC and ERDDAP services?
- Are these methods?
- It sounds like they may link to external metadata for interoperability that they have developed in the community
NOAA is connected as well
- Interested in OGC assets
- ERDDAP data platform