-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Link to ODIS via JSON-LD/schema.org #115
Comments
Pinging this issue - May is upon us |
@pbuttigieg let's plan in the week of May 22 to discuss. Can you send @gsautter some example input you would like to have so we can study and have an informed discussion? |
@myrmoteras seems the project schedules didn't align, but ODIS is now sustained under UNESCO, so we have more flexibility We've created a getting started guide that may be enough for your developers to create the initial link |
@pbuttigieg @gsautter can we meet sometimes tomorrow ater 4pm or Friday afternoon after 3pm and discuss next steps? |
Hi Donat,
@pbuttigieg <https://github.com/pbuttigieg> @gsautter
<https://github.com/gsautter> can we meet sometimes tomorrow ater 4pm or
Friday afternoon after 3pm and discuss next steps?
sure, just tell me when, and whether to use Zoom or Skype ... preferring Skype
personally, since the char is far better suited for sharing links and files,
etc., specifically keeping them available and accessible after the meeting ends.
Best,
Guido
|
Hi Donat, The questions regarding duplication is increasing. The answer is not that it doesn’t matter and that tools can de-douplicate, but that our GBIF record is different that it has a link to a treatment, and publications. With other words, this shows how the data is being used. At the same time there is a development to create a digital specimen that includes all the links to the various representation of a the physical or original observation. In a sense, let me handle these sorts of requests, that is forward them to me and I will take care of it. However, what's specifically strange about this one is that the specimen with the georeference is not the one the one the treatment is about ... the latter specimen is only mentioned in passing, of sorts. How should we mark something like this, in general? See http://tb.plazi.org/GgServer/html/03F0943BFFCF4D070BF8FDEBFA9C1675 All the best, From: Guido Sautter [email protected] EXTERNAL SENDER Hi Tammy, https://www.gbif.org/dataset/49a11228-6c4d-478f-b958-52610eaab951 Which is one of my papers. The entry includes only one georeferenced specimen – which is not correct. The paper details the locations of all the samples examined and provides a full station list and map. The georeferenced provided is for a mention of a specimen not covered in our paper! How do we correct this? We can remove this materials citation if you want, but, treatment subject or not, it is a georeferenced specimen that making available as data is surely worthwhile. I would also like to ask about duplication that may be occurring through these uploads. I am currently working to prepare datasets for OBIS/GBIF of specimens held in the Discovery Collections – what will happen if I upload the data on these specimens to OBIS? We will be creating duplicates. I’m sure you will have come across this for other museums, sharing specimen data to GBIF. Also, I would not worry about avoiding duplicates all too much, considering that there are catalogs like WoRMS and ITIS that already have overlapping occurrence data and still are both individual datasets in GBIF. And ultimately, if the detail data of any two occurrence records match up exactly, consumer applications can eliminate the duplicates. Better to have occurrence data available to the public, if at the risk of duplication, than to have it locked in publications altogether. Kind regards, |
discussion from 20240709 Notes |
property trimming for Plazi blueprints plazi/treatmentBank#115
Here's a an initial template - based on this PR iodepo/odis-in#25 - for Treatments as Datasets, based on one of your examples provided. Note that the schema: Note I left some additional properties in there just in case those are useful for other treatments. You can of course delete these if not relevant. I think this is enough to begin sharing TreatmentBank's content via ODIS and related systems. If you have any questions please ping me in iodepo/odis-in#25. |
If you wish to be more literal about taxonomic concepts as claims, you can use this pattern to associate the Claim that a taxon is present with Treatments wherein the claim appears. If each treatment's JSON-LD has its own "appearance": [
{
"@id": "https://treatment.plazi.org/json-ld/url-to-a-json-ld-representation-of-some-treatments-metadata/"
},
....
] if the JSON-LD files for Treatments don't have their own URLs/IRIs, then you could include metadata about them verbatim: "appearance": [
{
"@type": "Dataset",
"@id": "URL: Optional. A URL that resolves to *this* JSON-LD document, NOT the URL of the Dataset that this JSON-LD document describes. To link to the Dataset itself, please use 'url' and/or 'identifier')",
"name": "Maera gujaratensis, Thacker & Myers & Trivedi, 2024",
"description": "A dataset representing a TreamentBank Treatment record of Maera gujaratensis, see https://plazi.org/treatmentbank/what-treatment/",
"url": "https://treatment.plazi.org/id/423BD146-4079-EE78-44C5-FE28FE592E10",
"identifier": "423BD146-4079-EE78-44C5-FE28FE592E10"
},
{
"@type": "Dataset",
"@id": "URL: Optional. A URL that resolves to *this* JSON-LD document, NOT the URL of the Dataset that this JSON-LD document describes. To link to the Dataset itself, please use 'url' and/or 'identifier')",
"name": "Maera gujaratensis, Smith & Jones & Li, 2024",
"description": "A dummy dataset representing another TreamentBank Treatment record of Maera gujaratensis, see https://plazi.org/treatmentbank/what-treatment/",
"url": "https://treatment.plazi.org/id/MADE-UP-423BD146-4079-EE78-44C5-FE28FE592E10",
"identifier": "MADE-UP-423BD146-4079-EE78-44C5-FE28FE592E10"
}
....
] |
This is great for modeling treatments as a whole, thanks. Question remains how to specifically embed the materials citations within this framework, though, and yet more specifically any given coordinates, especially since my understanding was for the main goal to be sharing occurrence data.
I see ... good to have an example for a figure citation. Not sure Also, how to include the DOI of a cited publication? And which |
Working on the template further, I'm getting the impression the basis for the example was the HTML page proper ... in case it helps, there is the underlying XML document, which holds a good few more details than we can conveniently show in the HTML: https://tb.plazi.org/GgServer/xml/423BD1464079EE7844C5FE28FE592E10 (the HTML is produced from this XML via XSLT). |
Looking at an example of a materialsCitation: <materialsCitation id="7AFA6A0D4079EE7D451DFD8DFE982B22" ID-GBIF-Occurrence="4903570308" collectionCode="LFSC, R" latitude="22.1985" longLatPrecision="7" longitude="72.1082" pageId="11" pageNumber="574" specimenCount="1" typeStatus="holotype">
<typeStatus id="1529DEF24079EE7D451DFD8DFE1E2B46" box="[335,439,527,553]" pageId="11" pageNumber="574" type="holotype">Holotype</typeStatus>
male,
<quantity id="0D6ACDB54079EE7D4654FD8DFDE32B45" box="[518,586,527,554]" metricMagnitude="-3" metricUnit="m" metricValue="8.0" pageId="11" pageNumber="574" unit="mm" value="8.0">8 mm</quantity>
, (
<collectionCode id="AC83F8954079EE7D460DFD8DFD0C2B46" box="[607,677,527,553]" pageId="11" pageNumber="574">LFSC</collectionCode>
.ZRC-209) Gopnath (
<geoCoordinate id="AFA606974079EE7D47C5FD8DFBB92B45" box="[919,1040,527,554]" degrees="22.1985" direction="north" orientation="latitude" pageId="11" pageNumber="574" precision="5" value="22.1985">22.1985 N</geoCoordinate>
<geoCoordinate id="AFA606974079EE7D404BFD8DFB392B46" box="[1049,1168,527,553]" degrees="72.1082" direction="east" orientation="longitude" pageId="11" pageNumber="574" precision="5" value="72.1082">72.1082 E</geoCoordinate>
), 25 March, 2023, coll. D.
<collectionCode id="AC83F8954079EE7D44E0FDB1FF612B22" box="[178,200,563,589]" country="Chile" name="Departamento de Geologia, Universidad de Chile" pageId="11" pageNumber="574">R</collectionCode>
. Thacker
</materialsCitation>
It looks like you can either treat this as a Dataset about a physical sample (as a value of isBasedOn) or you can try to link the material thing to the Treatment itself. Perhaps we need a call for this, as it's a little metaphysically fraught.
A table is certainly a CreativeWork. For figures, you could used the ImageObject subtype. Yes, for tables (which don't have a good subtype in schema.org), you can use the CreativeWork type and use the same properties (or any other property from CreativeWork).
You can use the identifier property for DOIs (some examples already provided) at the right level. Yes, you can use CreativeWork for uncertain cases. Document or DigitalDocument may be useful too. |
Frankly, I'm not philosophical at all as to what exactly we model these things as, just want a representation that will allow data consumers to make the most of what we send them ... and you know the consumers way better than I do, so what would you recommend? Ultimately, such an occurrence states the presence of a physical object (specimen) identified to be an instance of some taxon, in a certain place at a certain time, and then there is additional information, e.g. who made the observation, and where the specimen was taken (if it's collected, which rhinos and elephants and the like usually are not these days) or where the record of the observation (e.g. a picture from a camera trap) can be found ... it really takes various forms, and we could get very philosophical about it, but that sure isn't my goal ... |
Following on from discussions at BICIKL and subsequent techical calls, we're exploring how to expose ocean-relevant content (including coastal zones) to the IOC-UNESCO Ocean Data and Information System (ODIS)
Once the initial material has been reviewed by the Traitbank team, we can have another technical call to go through a few examples and create some reference JSON-LD documents. Following that, and the creation of a sitemap registered in ODISCat, we can then begin testing the harvest and dissemination to the ODIS Federation.
This issue will be cross-linked to a counterpart on the odis-arch tracker
The text was updated successfully, but these errors were encountered: