Replies: 2 comments
-
Here is a working PR: #102 As noted - we will want to coordinate with ASF and make sure the schema in the tests matches any updated ingest schema. |
Beta Was this translation helpful? Give feedback.
0 replies
-
So, from what I am gleaning from David/Piyush, it will be best to:
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Goal
Update the GUNW metadata to:
Current Approach
For the GUNWs, they are uniquely identified by input reference and secondary scenes. We download entire corpus due to this data being uniquely specific to
echo10
using Andrew Johnston's queries (see "Checking CMR" here).Worth investigating for current GUNWs
Finding ways to query unique products that we want to generate is highly valuable because we can then multithread these requests and quickly de-duplicate. In an ideal situation, we would just request GUNWs that have specific reference/secondary scenes. However currently this is not possible.
We do generate a 4 digit hash at the end of a GUNW id, i.e.
S1-GUNW-D-R-144-tops-20230125_20221220-140020-00121W_00036N-PP-28c2-v2_0_6
has hash28c2
. This is generated using the string form the sorted reference and secondary scenes as indicated here. We would then assume "exact" GUNW match and not containment of reference and secondary scenes. This is ok.(From Andrew) The CMR Search API supports searching against keywords using wildcards, see the "pattern" keyword under Parameter Options. Performance might not be great for these searches, but you can try it out and see if it meets your needs. For example, to find all GUNWs with "28c2" in their name:$(10 + 26)^4 = 1.7e6$ possibilities, there are 14 hits for different GUNWs. This will require some more filtering, i.e. they occur across unique tracks and that is easily accessible from the IDs or CMR. But this is work. An example to get a unique hit would be to use track and date in addition to hash as in
https://cmr.earthdata.nasa.gov/search/granules.umm_json?short_name=SENTINEL-1_INTERFEROGRAMS&granule_ur=*28c2*&options[granule_ur][pattern]=true
. But even though there arehttps://cmr.earthdata.nasa.gov/search/granules.umm_json?short_name=SENTINEL-1_INTERFEROGRAMS&granule_ur=*065*20170220*28c2*&options[granule_ur][pattern]=true
.Frame IDs
For the frames, the unique identification is:
This will not be compatible for de-duplicating old products since 2 GUNWs could potentially share 2 of the same SLCs and span slightly different areas because are using the fixed frame's bounding box to direct ISCE2 processing.
It is also clear we need to get Secondary Date into CMR correctly as we are only using start/stop times from the reference date as indicated here.
Will also have to add frame ID
Questions / Comments
Beta Was this translation helpful? Give feedback.
All reactions