
Create DOI upon creation of studyset/have button to automatically generate DOI if desired #319

Open
jdkent opened this issue Jun 9, 2022 · 0 comments


jdkent commented Jun 9, 2022

I think the idea, at least as the DB models are currently written, is that nimads_data is just a static JSON file containing a NIMADS Dataset object. That's basically just a list of Study objects, yes. So in your example above, there wouldn't be an analysis field at the top level (analyses would be nested within studies). Re: your other questions:

should the data under nimads_data be in JSON-LD form?

Let's go with just JSON for now. We can't guarantee that every piece of data will have a unique address, and given all the other downsides of JSON-LD we've talked about, it seems reasonable to just drop that entirely.

If we are not using JSON-LD yet, would looking at a study with api/studies/?nested=true be a good approximation for now?

Yep. And then a dataset would just be a list of studies, plus the other fields (description, doi, etc.).
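As a rough illustration of that shape, here is a minimal sketch of a dataset built as a plain JSON-serializable object. The field names (`description`, `doi`, `pmid`, `studies`, `analyses`, `points`, `images`) and the `make_dataset` helper are assumptions made for illustration, not the actual NIMADS schema or Neurostore code.

```python
import json

# Hypothetical helper and field names; the real NIMADS schema may differ.
def make_dataset(studies, description=None, doi=None, pmid=None):
    """Bundle a list of nested study dicts with dataset-level metadata."""
    return {
        "description": description,
        "doi": doi,
        "pmid": pmid,
        "studies": list(studies),
    }

# A study roughly as it might look when fetched in nested form:
# analyses live inside the study, not at the top level of the dataset.
study = {
    "name": "Example study",
    "analyses": [
        {"name": "contrast A", "points": [], "images": []},
    ],
}

dataset = make_dataset([study], description="Illustrative dataset")

# The static JSON that would be stored under nimads_data.
nimads_data = json.dumps(dataset)
```

Note that the dataset has no top-level `analyses` key; analyses are reachable only through each entry in `studies`.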

Do the doi, pmid, and other identifying keys assume the dataset exists in some published form?

The implication is that there's a digital object published somewhere that contains a description of the dataset. I think of this as "this dataset is associated with this DOI", and not "this dataset is published at this DOI". It's roughly the same relationship as between NeuroVault collections and associated DOIs.

would it be appropriate to interpret a dataset as a subset of the entries in /api/studies?

Not quite, because in principle, a user could push a dataset to Neurostore that hadn't been constructed using our tools, and possibly contains data that comes from other sources. This is why data in the dataset are stored as flat JSON, and not in a relational form. It's probably true that most of the time, the listed studies will be a subset of what's available from /api/studies, but we can't make that assumption. [EDIT: oh, and also, datasets will often be filtered; i.e., even when the studies are a subset of the API, they will typically contain only some of the data for each study (e.g., only images and not coordinates).]
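To make the filtering point concrete, the following sketch keeps only the images for each analysis and drops the coordinates. The `keep_only` function and the `images`/`points` keys are hypothetical names chosen for this example; nothing here is taken from the Neurostore codebase.

```python
import copy

# Assumed per-analysis data keys; the real schema may name these differently.
DATA_KEYS = ("images", "points")

def keep_only(study, kept_keys=("images",)):
    """Return a copy of `study` whose analyses retain only `kept_keys` data."""
    filtered = copy.deepcopy(study)  # leave the source study untouched
    for analysis in filtered.get("analyses", []):
        for key in DATA_KEYS:
            if key not in kept_keys:
                analysis.pop(key, None)
    return filtered

full_study = {
    "name": "Example study",
    "analyses": [
        {
            "name": "contrast A",
            "images": [{"filename": "map.nii.gz"}],
            "points": [{"x": 0, "y": 0, "z": 0}],
        },
    ],
}

images_only = keep_only(full_study)
```

Because the result is a standalone copy stored as flat JSON, it does not need to stay in sync with whatever the API holds for the same study.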

Originally posted by @tyarkoni in #28 (comment)
