-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FAIR Protocol Buffer? #17
Comments
Apologies, but it is not clear to me what "FAIR protocol buffer" is supposed to mean in the context of the |
Hi Mike - perhaps my question is misplaced, it relates to the meta-data requirement on the bdbag in order to enable FAIRness; e.g., provenance, unique identifier, keywords, licensing, that sort of thing. Thoughts? |
The (BD)Bag specification describes a container: it is silent on many of the issues raised in the FAIR principles, like data licenses and vocabularies. However, the metadata directory provides a natural place to address those issues. We can, for example, include Research Object (RO) metadata: see https://github.com/fair-research/bdbag/blob/master/profiles/bdbag-ro-profile.json. (See https://n2t.net/minid:b9dt2t for an example of a BDBag that includes simple RO metadata.) As Carl Kesselman noted in a recent email exchange, one could address the licensing issue, for example, by:
If such conventions are defined, we can integrate them into the BDBag tools. |
Myself and a student have been reviewing various community FAIR efforts, mapping these to requirements for a simple metadata model. We considered those ambitious, rigorous efforts such as DATS and HCLS, and decided to start with a more rudimentary, well-scoped set of requirements that are computable, but also decoupled from implementation. For example, we took into account the convention you describe for licensing, and we also take into account versioning for objects, APIs, and even ID's (consider, for example, AAC53040 is the accession ID for the p53 protein sequence object, and the most recent version is AAC53040.1). |
I agree that more needs to be done to expand the FAIR metadata needed. Many of those requirements are covered by the underlying specs, for instance Research Object Bundle manifests lists basic provenance per resource. BDBags support RO manifest using the bdbag_ro.py module. I will admit license was not listed there, we can in theory use the "aggregates": [
{ "uri": "../data/file.txt",
"dct:license": {
"uri": "http://www.apache.org/licenses/LICENSE-2.0",
"name": "Apache License, Version 2.0"
}
}
] But this should probably feed upstream to include in a general Research Object profile of FAIR metadata attributes. There is also schema.org/license as used by for instance BioSchemas Dataset. |
I see this repo is under 'fair-research' - has anybody started on defining a FAIR protocol buffer?
The text was updated successfully, but these errors were encountered: