Define high level observing, sampling, and collecting processes #92

Open
ramonawalls opened this issue Oct 9, 2018 · 14 comments

@ramonawalls
Collaborator

These terms will go into OBI (see obi-ontology/obi#969), but will be used extensively in BCO, so I would like to reach some consensus among this group before proposing the definitions to OBI.

The proposed (and I think mostly accepted) OBI hierarchy will be:

assay
-specimen collecting process (input material entity, output material entity)
--material sampling process (outputs a physical specimen that is representative of a larger population)
-observing process (input material entity, output data)
--observing process based on sampling (input material entity, output data that is intended to be representative of a larger population)
--other kinds of observing processes
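
For readers who prefer to see it as data, here is a minimal sketch of the hierarchy above as a child-to-parent map. The labels come from the list; the dictionary layout and the helper name `ancestors` are assumptions made for illustration, not OBI identifiers or an OBI API.

```python
# Illustrative only: labels are taken from the proposed hierarchy, but the
# data layout and the helper function are assumptions made for this sketch.
PARENT = {
    "specimen collecting process": "assay",
    "material sampling process": "specimen collecting process",
    "observing process": "assay",
    "observing process based on sampling": "observing process",
}

# Kind of output each class produces, as listed in the hierarchy.
OUTPUT = {
    "specimen collecting process": "material entity",
    "material sampling process": "material entity",
    "observing process": "data",
    "observing process based on sampling": "data",
}


def ancestors(label):
    """Return the superclasses of a label, from its direct parent up to the root."""
    chain = []
    while label in PARENT:
        label = PARENT[label]
        chain.append(label)
    return chain


for name in PARENT:
    print(f"{name}: outputs {OUTPUT[name]}; superclasses {ancestors(name)}")
```

Keeping the output type next to each class makes the split visible: the specimen collecting branch yields material entities, while the observing branch yields data.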

We will then import these classes into BCO and make subclasses specific for biodiversity, ecology, evolution, etc. (i.e., non-biomedicine).

Assay and specimen collecting process have been discussed extensively, and their definitions are stable and useful to many researchers, so I don't want to change those.

The terms that need clearer definitions are:
-material sampling process
-observing process
-observing process based on sampling

I suggest that we use STATO statistical sampling process (http://purl.obolibrary.org/obo/STATO_0000502), which STATO defines as a planned process which aims at assembling a population of observation units (samples) in as unbiased a manner as possible in order to obtain or infer information about the actual population from which these samples have been drawn, to help define material sampling process and observing process based on sampling.

I will post strawman definitions for discussion in the comments.

@ramonawalls ramonawalls self-assigned this Oct 9, 2018
@robgur
Collaborator

robgur commented Oct 10, 2018 via email

@dr-shorthair

@ramonawalls some suggested changes
-specimen collecting process (input material entity, output material entity)
--material sampling process (outputs a physical specimen that is representative of a larger population or entity)
-observing process (input material entity, output data)
--observing process based on sampling (input material entity, output data that is intended to be representative of a larger population)
--other kinds of observing processes

@robgur
I was thinking about biased and unbiased sampling earlier. Biased sampling is common in geochemistry, e.g. crushing and then taking all the dense or magnetic grains. So I initially bristled at the definition of sampling that @ramonawalls quoted from STATO, which says that sampling should be unbiased. But I think it's OK: the population being characterized is the heavy/magnetic part of the rock formation (in the geology case), so while the sub-sampling of the initial specimen is biased, it is intended to be an unbiased representation of something else. Does this apply to the cases that you have in mind?
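
A toy sketch of that point, under loudly stated assumptions: the densities, the 20% heavy fraction, the cutoff of 3.0 g/cm^3, and all variable names are invented for illustration and are not real measurements. It only shows that a sub-sample that is biased with respect to the whole formation can still track the heavy fraction it is meant to characterize.

```python
import random
from statistics import mean

random.seed(0)  # fixed seed so the toy example is repeatable

CUTOFF = 3.0  # hypothetical density threshold (g/cm^3) for "heavy" grains

# Hypothetical rock formation: roughly 20% heavy grains, 80% light grains.
formation = [
    random.uniform(4.0, 6.0) if random.random() < 0.2 else random.uniform(2.5, 2.8)
    for _ in range(10_000)
]

# Collect a specimen at random, then keep only its heavy grains
# (the deliberately biased sub-sampling step).
specimen = random.sample(formation, 2_000)
heavy_subsample = [d for d in specimen if d > CUTOFF]

# The population actually being characterized: the heavy part of the formation.
heavy_in_formation = [d for d in formation if d > CUTOFF]

# Far from the mean density of the whole formation ...
gap_to_whole = mean(heavy_subsample) - mean(formation)
# ... but close to the mean density of the formation's heavy fraction.
gap_to_heavy = abs(mean(heavy_subsample) - mean(heavy_in_formation))

print(f"gap to whole formation: {gap_to_whole:.2f}")
print(f"gap to heavy fraction:  {gap_to_heavy:.2f}")
```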

@ramonawalls
Collaborator Author

Will try to schedule a call for next week. Working on definitions now.

@ramonawalls
Collaborator Author

Everyone who is interested, but at least @dr-shorthair @robgur @pbuttigieg @tucotuco please fill out the doodle poll at https://doodle.com/poll/6zibkfq6nww2spqn ASAP

@ramonawalls
Collaborator Author

I did not realize it was going to use three-hour blocks, but please supply your general availability and then we can narrow it down.

@ramonawalls
Collaborator Author

@dr-shorthair @robgur @pbuttigieg @tucotuco
Sorry to make you all do this twice, but please fill out the Doodle poll again, now with times that (more or less) work for all time zones.

@robgur
Collaborator

robgur commented Oct 11, 2018 via email

@dr-shorthair

dr-shorthair commented Oct 11, 2018 via email

@robgur
Collaborator

robgur commented Oct 11, 2018 via email

@dr-shorthair

If Guru wants to join us from Brisbane, it is essentially impossible to find a time that works:
https://www.timeanddate.com/worldclock/meetingtime.html?iso=20181016&p1=152&p2=37&p3=156&p4=197&p5=224&p6=47

@ramonawalls
Collaborator Author

ramonawalls commented Oct 12, 2018 via email

@dr-shorthair

dr-shorthair commented Mar 31, 2020

I see that specimens are back in the mix #94 (comment).
So can I re-open the samples and specimens discussion?

AFAIK the collections (museums) community defines a Specimen as a material entity that is explicitly curated.
And to science and stats practitioners a Sample is a (usually continuant) entity that is designed to be representative of a larger entity, which might be a population or a universe (and is also usually a continuant).

Samples are not always specimens - statistical samples in social science are not curated, for example. And samples are not necessarily material entities.

Specimens are not always samples, though I suspect most are, because why would you curate it if it was not representative of a larger truth? (Specimens in a fine-art museum or gallery are representative of 'things of beauty' or some related concept.) The key to its sample-ness is that we can explain that there is a larger entity, related through an isSampleOf relation.

FWIW - a type-specimen is a sample because it is representative of a taxon (?).
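
To make the two tests concrete, here is a minimal sketch, assuming a hypothetical `Entity` record. The field names, the example entities, and the "voting-age population" target are invented for illustration; only the two criteria (curated material entity, and a stated isSampleOf relation) come from the text above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Entity:
    """Hypothetical record; the field names are assumptions for this sketch."""
    name: str
    is_material: bool
    is_curated: bool = False
    is_sample_of: Optional[str] = None  # the larger entity it represents, if stated


def is_specimen(e: Entity) -> bool:
    # Collections-community reading: a material entity that is explicitly curated.
    return e.is_material and e.is_curated


def is_sample(e: Entity) -> bool:
    # Sample-ness rests on a stated isSampleOf relation to a larger entity.
    return e.is_sample_of is not None


# A statistical sample in social science: not material, not curated.
survey = Entity("survey responses", is_material=False,
                is_sample_of="voting-age population")
# A type specimen: curated, and representative of a taxon.
holotype = Entity("type specimen", is_material=True, is_curated=True,
                  is_sample_of="taxon")
# A curated object for which no larger entity has been stated.
keepsake = Entity("curated object, no stated target", is_material=True,
                  is_curated=True)

for e in (survey, holotype, keepsake):
    print(f"{e.name}: sample={is_sample(e)}, specimen={is_specimen(e)}")
```

The two predicates are independent, which is the point: neither class subsumes the other, and an entity can satisfy one, both, or neither.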

@ramonawalls
Collaborator Author

For sure, Simon. My work plan is to first update the workflow for Darwin Core imports and do a release with them as modules, then dive fully into specimens, sampling, and observations. I won't make any permanent changes without your input! Will probably schedule a call about it in a few weeks.


3 participants