-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ab/request stac metadata #712
base: master
Are you sure you want to change the base?
Conversation
I'm ok asking users to help with collecting the STAC metadata, but think we need to make it more "optional". Or at least indicate it can be filled in later in consultation with the Data Team. I'm concerned too many questions in the request will be a barrier to submission. Also do we trust users to come up with a good collection ID? or is that something the Data Team should control. |
|
||
This collection will be published as a Spatio-Temporal Asset Catalog (STAC) Collection. You can read the complete STAC collection spec here: (https://github.com/radiantearth/stac-spec/blob/master/collection-spec/collection-spec.md). | ||
|
||
If you need help with any of the fields below, please let us know. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please fill in any fields you know. If you don't know the answer it can be updated later in consultation with the Data Team.
|
||
If you need help with any of the fields below, please let us know. | ||
|
||
**id:** Identifier for the Collection that is unique across the provider. This is typically an abbreviated and hyphenated or camel-cased version of the dataset name, For example `gedi_l2a_v002` for "GEDI L2A Elevation and Height Metrics Data Global Footprint Level V002". |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure we should leave this up to users. So maybe it should be clear this is a suggestion.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agree w/ Alex here. As the ID needs to be unique, we have to manage it within the catalog and unless we have a feedback loop of which IDs are valid, it will be difficult to have a scientist set the ID.
Is it unique for a "provider" ? Most recent provider (as Provider is a list)? Or is it unique within the catalog?
|
||
_We will reuse the Dataset Description from the first section if not otherwise indicated._ | ||
|
||
**spatial extent:** A bounding box for the potential spatial extents covered by the collection. Read more in the [spatial-extent-object section of the spec](https://github.com/radiantearth/stac-spec/blob/master/collection-spec/collection-spec.md#spatial-extent-object). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since Spatial Extent and Temporal Extent are the most important can we move these to to the top of list?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also the "additional information" section at the bottom mentions spatial and temporal info again
|
||
_Please provide the temporal extent as an interval. If the dataset has a single date time or is currently ongoing, a single date is appropriate._ | ||
|
||
**links:** A list of references to other documents. There must be a at least one link and we highly recommend and may require in the future link to documentation which includes details about how to access and open the data. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How is this different from
URL or DOI to dataset description in the top section?
|
||
_We can create an id from the Dataset Name from the first section if not otherwise indicated._ | ||
|
||
**title:** A short descriptive one-line title for the collection. Technically, this is not requried by the STAC Spec but it is used by the STAC browser. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Title/Description - why don't we just use what's above like the statement and skip asking again?
@@ -29,6 +29,36 @@ assignees: freitagb, wildintellect | |||
**Intended science use case** | |||
*Please describe how you intend to use the data, or the expected relevance to MAAP users.* | |||
|
|||
## Spatio-Temporal Asset Catalog (STAC) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does this need to be a separate section? Why does the user need to know that it's specifically for STAC?
There are a lot of questions here that could be helped by some usability testing and some observation. Maybe we should plan some soon. At least from two angles off the top of my head: If it's easy to update/edit metadata maybe there's a quick workflow to just get data in, and then ppl can go back later to add in more extensive metadata. |
It's not super easy to get external data in at this time without Data Team help. Metadata is a bit of hassle to update. But I think this is premature. The likely scenarios:
in 2 and 3 Data Team will then likely request additional information to ensure good STAC records. @abarciauskas-bgse are we going ahead with these changes? should someone else pick up this task? I'd like to get this PR solved so we can convert to the new Github Forms before the June 12 UWG meeting. |
|
okay I need to go re-review the form. ignore my #1 above :D |
In terms of my #2... a thumbnail of some sort might also be helpful, but certainly not required. It looks like this would be in the Collection |
I think we should request a bit more information to help us complete the STAC metadata. At a minimum, I think we should request spatial and temporal extents, since it's not always trivial to determine what this is from the files themselves.