Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sporadic MA manifest IT failure due to sequencing_protocol #6697

Open
hannes-ucsc opened this issue Nov 10, 2024 · 7 comments
Open

Sporadic MA manifest IT failure due to sequencing_protocol #6697

hannes-ucsc opened this issue Nov 10, 2024 · 7 comments
Assignees
Labels
+ [priority] High bug [type] A defect preventing use of the system as specified data [subject] Data or metadata [use of this label is uncommon] orange [process] Done by the Azul team spike:5 [process] Spike estimate of five points

Comments

@hannes-ucsc
Copy link
Member

https://gitlab.dev.singlecell.gi.ucsc.edu/ucsc/azul/-/jobs/103847

======================================================================
FAIL: test_indexing (integration_test.IndexingIntegrationTest.test_indexing) [None] (format=<ManifestFormat.verbatim_jsonl: 'verbatim.jsonl'>, catalog='dcp3-it')
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/builds/ucsc/azul/test/integration_test.py", line 351, in subTest
    yield
  File "/builds/ucsc/azul/test/integration_test.py", line 1627, in _test_managed_access_manifest
    self.assertSetEqual(private_only, authorized - unauthorized)
AssertionError: Items in the first set but not the second:
<frozendict {'value': <frozendict {'instrument_manufacturer_model': <frozendict {'text': 'Illumina NovaSeq 6000'}>, 'method': <frozendict {'text': 'high throughput sequencing'}>, 'paired_end': False, 'protocol_core': <frozendict {'protocol_id': '76a4bca7-715e-fcb4-78e6-61acadf18708'}>, 'provenance': <frozendict {'document_id': '76a4bca7-715e-fcb4-78e6-61acadf18708', 'submission_date': '2021-06-24T16:21:49.702568Z'}>, 'schema_type': 'protocol', 'schema_version': '10.1.0', 'describedBy': 'https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol'}>, 'type': 'sequencing_protocol'}>
======================================================================
FAIL: test_indexing (integration_test.IndexingIntegrationTest.test_indexing) [None] (format=<ManifestFormat.verbatim_pfb: 'verbatim.pfb'>, catalog='dcp3-it')
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/builds/ucsc/azul/test/integration_test.py", line 351, in subTest
    yield
  File "/builds/ucsc/azul/test/integration_test.py", line 1627, in _test_managed_access_manifest
    self.assertSetEqual(private_only, authorized - unauthorized)
AssertionError: Items in the first set but not the second:
'76a4bca7-715e-fcb4-78e6-61acadf18708'
----------------------------------------------------------------------
Ran 18 tests in 371.012s
FAILED (failures=2, skipped=3)
make: *** [Makefile:247: integration_test] Error 1
Cleaning up project directory and file based variables 00:01
ERROR: Job failed: exit code 1

In both subtests it's a particular sequencing_protocol entity that's in private_only but not in authorized - unauthorized.

@hannes-ucsc hannes-ucsc added the orange [process] Done by the Azul team label Nov 10, 2024
@hannes-ucsc hannes-ucsc changed the title Sporadic MA manifest IT failure due to protocol Sporadic MA manifest IT failure due to sequencing_protocol Nov 10, 2024
@hannes-ucsc hannes-ucsc added the spike:5 [process] Spike estimate of five points label Nov 12, 2024
@hannes-ucsc hannes-ucsc added the + [priority] High label Nov 12, 2024
@hannes-ucsc
Copy link
Member Author

I would also like to request these changes to be made to the IT:

Index: test/integration_test.py
IDEA additional info:
Subsystem: com.intellij.openapi.diff.impl.patch.CharsetEP
<+>UTF-8
===================================================================
diff --git a/test/integration_test.py b/test/integration_test.py
--- a/test/integration_test.py	(revision dfb41a228ef60b0f3d8ceb968dc94808edfa542b)
+++ b/test/integration_test.py	(date 1731220150195)
@@ -1532,7 +1532,7 @@
                 for bundle in hit['bundles']
             }
 
-        managed_access_bundles = set.union(*(
+        ma_bundles = set.union(*(
             bundle_uuids(file)
             for file in files
             if len(file['sources']) == 1
@@ -1542,8 +1542,8 @@
         files_url = furl(url=endpoint, path='index/files', args=params)
         response = self._get_url_json(GET, files_url)
         public_bundle = self.random.choice(sorted(bundle_uuids(one(response['hits']))))
-        self.assertNotIn(public_bundle, managed_access_bundles)
-        all_bundles = {public_bundle, *managed_access_bundles}
+        self.assertNotIn(public_bundle, ma_bundles)
+        all_bundles = {public_bundle, *ma_bundles}
 
         filters = {
             special_fields.bundle_uuid: {
@@ -1611,26 +1611,27 @@
                 'format': format.value,
                 'filters': json.dumps({special_fields.bundle_uuid: {'is': list(bundles)}})
             })
-            content = BytesIO(self._get_url_content(PUT, manifest_url))
-            return {
+            readers = {
                 ManifestFormat.verbatim_jsonl: read_verbatim_jsonl_manifest,
                 ManifestFormat.verbatim_pfb: read_verbatim_pfb_manifest
-            }[format](content)
+            }
+            content = BytesIO(self._get_url_content(PUT, manifest_url))
+            return readers[format](content)
 
         for format in ManifestFormat.verbatim_jsonl, ManifestFormat.verbatim_pfb:
             if format in metadata_plugin.manifest_formats:
                 with self.subTest(format=format):
-                    unauthorized = get_verbatim_manifest(format, all_bundles)
+                    public = get_verbatim_manifest(format, all_bundles)
                     with self._service_account_credentials:
-                        authorized = get_verbatim_manifest(format, all_bundles)
-                        private_only = get_verbatim_manifest(format, managed_access_bundles)
-                    self.assertSetEqual(private_only, authorized - unauthorized)
+                        all = get_verbatim_manifest(format, all_bundles)
+                        ma_only = get_verbatim_manifest(format, ma_bundles)
+                    self.assertSetEqual(ma_only, all - public)
 
         if ManifestFormat.curl in metadata_plugin.manifest_formats:
             # Create a single-file curl manifest and verify that the OAuth2
             # token is present on the command line
-            managed_access_file_id = one(self.random.choice(files)['files'])['uuid']
-            filters = {'fileId': {'is': [managed_access_file_id]}}
+            ma_file_id = one(self.random.choice(files)['files'])['uuid']
+            filters = {'fileId': {'is': [ma_file_id]}}
             manifest_url.set(args=dict(catalog=catalog,
                                        filters=json.dumps(filters),
                                        format='curl'))

@nadove-ucsc
Copy link
Contributor

nadove-ucsc commented Nov 13, 2024

This protocol (76a4bca7-715e-fcb4-78e6-61acadf18708) occurs in 26 bundles in the IT catalog. 10 of these bundles are from the public source 1b3866aa-d2a3-42cf-888d-7fb57a77c5aa ("Singlecelltranscriptionalandchromatinaccessibility"), while the other 16 are from source 2c4ec571-f489-4277-b215-da8dedd86803 ("Asingle-cellatlasofthehealthybreasttissuesrevealsc").

GET azul_v2_nadove_dcp3-it_bundles/_search?size=100
{
  "query": {"bool": {"filter": [
    {
      "terms": {
        "contents.sequencing_protocols.document_id.keyword": [
          "76a4bca7-715e-fcb4-78e6-61acadf18708"
        ]
      }
    }
  ]}},
  "_source": "source.id"
}
$ cat hits.json | jq .hits.hits[]._source.source.id | sort | uniq -c
     10 "1b3866aa-d2a3-42cf-888d-7fb57a77c5aa"
     16 "2c4ec571-f489-4277-b215-da8dedd86803"

Thus, the protocol being present in all three manifests (private source only, public source only, and both) is the correct and expected behavior. The problem lies in the test condition, which seems to assume that entities cannot occur in more than one source. Apparently, they can, albeit rarely.

How we adjust the test condition depends on the pervasiveness of entities that break this assumption. If only protocols are affected, then we can simply filter protocol entities out of the manifests before comparing the set difference. If, for example, donors or samples can also occur in multiple sources, then we may want to consider a more substantial redesign for this test.

@hannes-ucsc
Copy link
Member Author

hannes-ucsc commented Nov 13, 2024

From the DCP/2 specs

Disjunctivity: No metadata entity may be included in more than one of the snapshots that make up the data release. Expressed in terms of the TDR schema this means that for every row index i in any table T of any snapshot S1 in the release, there is no row index j in table T of any other snapshot S2 in the release for which S1.T[i].T_id = S2.T[j].T_id.

I think what we are observing is disallowed in the spec. I think the fix is to remove the snapshot for one of the projects. In order to decide which one, we should investigate a bit more, to understand how this happened, and if prod has the same issue.

@hannes-ucsc hannes-ucsc added bug [type] A defect preventing use of the system as specified data [subject] Data or metadata [use of this label is uncommon] labels Nov 13, 2024
@dsotirho-ucsc
Copy link
Contributor

Spike to retrieve the snapshot table rows for that sequencing protocol entity and to look for equivalent snapshots in prod using the project title and shortname.

@hannes-ucsc
Copy link
Member Author

I got this, too, running IT locally.

======================================================================
FAIL: test_indexing (integration_test.IndexingIntegrationTest.test_indexing) [None] (format=<ManifestFormat.verbatim_jsonl: 'verbatim.jsonl'>, catalog='dcp3-it')
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/hannes/workspace/hca/azul.hannes.local/test/integration_test.py", line 351, in subTest
    yield
  File "/Users/hannes/workspace/hca/azul.hannes.local/test/integration_test.py", line 1627, in _test_managed_access_manifest
    self.assertSetEqual(private_only, authorized - unauthorized)
AssertionError: Items in the first set but not the second:
<frozendict {'value': <frozendict {'instrument_manufacturer_model': <frozendict {'text': 'Illumina NovaSeq 6000'}>, 'method': <frozendict {'text': 'high throughput sequencing'}>, 'paired_end': False, 'protocol_core': <frozendict {'protocol_id': '76a4bca7-715e-fcb4-78e6-61acadf18708'}>, 'provenance': <frozendict {'document_id': '76a4bca7-715e-fcb4-78e6-61acadf18708', 'submission_date': '2021-06-24T16:21:49.702568Z'}>, 'schema_type': 'protocol', 'schema_version': '10.1.0', 'describedBy': 'https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol'}>, 'type': 'sequencing_protocol'}>

======================================================================
FAIL: test_indexing (integration_test.IndexingIntegrationTest.test_indexing) [None] (format=<ManifestFormat.verbatim_pfb: 'verbatim.pfb'>, catalog='dcp3-it')
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/hannes/workspace/hca/azul.hannes.local/test/integration_test.py", line 351, in subTest
    yield
  File "/Users/hannes/workspace/hca/azul.hannes.local/test/integration_test.py", line 1627, in _test_managed_access_manifest
    self.assertSetEqual(private_only, authorized - unauthorized)
AssertionError: Items in the first set but not the second:
'76a4bca7-715e-fcb4-78e6-61acadf18708'

@dsotirho-ucsc
Copy link
Contributor

Same type of failure for a library_preparation_protocol

GitLab.dev IT:
https://gitlab.dev.singlecell.gi.ucsc.edu/ucsc/azul/-/jobs/104160

======================================================================
FAIL: test_indexing (integration_test.IndexingIntegrationTest.test_indexing) [None] (format=<ManifestFormat.verbatim_jsonl: 'verbatim.jsonl'>, catalog='dcp3-it')
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/builds/ucsc/azul/test/integration_test.py", line 351, in subTest
    yield
  File "/builds/ucsc/azul/test/integration_test.py", line 1627, in _test_managed_access_manifest
    self.assertSetEqual(private_only, authorized - unauthorized)
AssertionError: Items in the first set but not the second:
<frozendict {'value': <frozendict {'cdna_library_amplification_method': <frozendict {'text': 'PCR'}>, 'end_bias': '3 prime end bias', 'input_nucleic_acid_molecule': <frozendict {'text': 'RNA'}>, 'library_construction_method': <frozendict {'text': "10x 3' v2"}>, 'library_preamplification_method': <frozendict {'text': 'RT-PCR'}>, 'nucleic_acid_source': 'single cell', 'primer': 'poly-dT', 'protocol_core': <frozendict {'protocol_id': "10x-3'-v2"}>, 'provenance': <frozendict {'document_id': 'af78f3c0-356b-478f-95ba-d6a14db8769b', 'submission_date': '2021-06-24T16:21:49.702568Z'}>, 'strand': 'first', 'schema_type': 'protocol', 'schema_version': '6.2.0', 'describedBy': 'https://schema.humancellatlas.org/type/protocol/sequencing/6.2.0/library_preparation_protocol'}>, 'type': 'library_preparation_protocol'}>
======================================================================
FAIL: test_indexing (integration_test.IndexingIntegrationTest.test_indexing) [None] (format=<ManifestFormat.verbatim_pfb: 'verbatim.pfb'>, catalog='dcp3-it')
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/builds/ucsc/azul/test/integration_test.py", line 351, in subTest
    yield
  File "/builds/ucsc/azul/test/integration_test.py", line 1627, in _test_managed_access_manifest
    self.assertSetEqual(private_only, authorized - unauthorized)
AssertionError: Items in the first set but not the second:
'af78f3c0-356b-478f-95ba-d6a14db8769b'
----------------------------------------------------------------------

@dsotirho-ucsc
Copy link
Contributor

Sequencing protocol 76a4bca7 was found in 2 dev snapshots and 26 prod snapshots

dev

datarepo-dev-3041c2cf.hca_dev_2af52a1365cb4973b51339be38f2df3f__20210830_20210903.sequencing_protocol
Row(('1e430267-2f48-4943-9f68-38bcab0f8762', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2021, 6, 24, 16, 21, 49, 702568, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2021-06-24T16:21:49.702568Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-dev-d4b988d6.hca_dev_a004b1501c364af69bbd070c06dbc17d__20210830_20210903.sequencing_protocol
Row(('e2961d02-c29e-422b-b69e-7ab10a1ca19b', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2021, 6, 24, 16, 21, 49, 702568, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2021-06-24T16:21:49.702568Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

prod

datarepo-cc9e8ac9.hca_prod_815c5ef50fb14eb798821d160362468e__20240806_dcp2_20240807_dcp41.sequencing_protocol
Row(('d5b8eef8-97ed-47da-827a-837331a9a14d', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2024, 7, 29, 17, 50, 26, 986835, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2024-07-29T17:50:26.986835Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-2a9467bc.hca_prod_29ed827bc5394f4cbb6bce8f9173dfb7__20221208_dcp2_20221208_dcp23.sequencing_protocol
Row(('59159ffb-872d-48f3-a308-46988ec2a946', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2022, 12, 1, 18, 41, 11, 696205, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2022-12-01T18:41:11.696205Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-a0ffb40f.hca_prod_2af52a1365cb4973b51339be38f2df3f__20220111_dcp2_20231102_dcp33.sequencing_protocol
Row(('5aa6e7e4-48c1-436c-a2e6-f6f176675571', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 11, 16, 0, 12, 53792, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-11T16:00:12.053792Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-69bbc337.hca_prod_9c20a245f2c043ae82c92232ec6b594f__20220212_dcp2_20231102_dcp33.sequencing_protocol
Row(('d043bc5f-4554-4489-abdb-091a5de81498', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 10, 20, 29, 15, 511029, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-10T20:29:15.511029Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-11a44864.hca_prod_8a666b76daaf4b1f9414e4807a1d1e8b__20220630_dcp2_20231102_dcp33.sequencing_protocol
Row(('b846ad07-eb34-4011-9cee-31f5189e5ec6', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 11, 2, 51, 47, 885350, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-11T02:51:47.885350Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-d5d4057a.hca_prod_1c5eaabf075b4b7aa9e607792c2034b3__20231101_dcp2_20231102_dcp33.sequencing_protocol
Row(('aabbcec9-f6bf-4c3a-8f90-703383e4c3e6', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 10, 20, 58, 7, 436018, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-10T20:58:07.436018Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-ddcd2940.hca_prod_a004b1501c364af69bbd070c06dbc17d__20220118_dcp2_20220121_dcp12.sequencing_protocol
Row(('1990495c-780f-484b-990b-2bd079cb15f9', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2021, 11, 10, 20, 23, 18, 142391, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2021-11-10T20:23:18.142391Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-674de9c8.hca_prod_75dbbce90cde489c88a793e8f92914a3__20220117_dcp2_20220121_dcp12.sequencing_protocol
Row(('a0cef7d3-b41b-4568-8594-6a82b8105e7e', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2021, 11, 10, 20, 23, 18, 142391, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2021-11-10T20:23:18.142391Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-32cb91ae.hca_prod_4bcc16b57a4745bbb9c0be9d5336df2d__20240327_dcp2_20240807_dcp41.sequencing_protocol
Row(('0a06476e-6fec-401f-9814-6f998106ef57', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2024, 7, 26, 19, 3, 20, 179570, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2024-07-26T19:03:20.179570Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-e9df1043.hca_prod_248c5dc36b754fb4ad8acc771968483f__20240806_dcp2_20241007_dcp43.sequencing_protocol
Row(('73bd17bf-2cc3-4ded-b03c-ab68e5adf7b2', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2024, 9, 21, 0, 11, 59, 607234, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2024-09-21T00:11:59.607234Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-0b11f34c.hca_prod_94023a08611d4f22a8c990956e091b2e__20220118_dcp2_20231102_dcp33.sequencing_protocol
Row(('52ac6242-81fe-44b0-9b40-4db1da79c281', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 11, 0, 47, 16, 346677, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-11T00:47:16.346677Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-ca76b840.hca_prod_21ea8ddb525f4f1fa82031f0360399a2__20220111_dcp2_20231102_dcp33.sequencing_protocol
Row(('6e5b347b-6438-427b-8ed0-7b2a4e4d3c50', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 11, 0, 27, 5, 859297, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-11T00:27:05.859297Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-cad8e8e7.hca_prod_581de139461f4875b40856453a9082c7__20231212_dcp2_20240328_dcp37.sequencing_protocol
Row(('9ac75183-8fc0-4202-baad-c7d11d8d0bb1', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2024, 3, 20, 3, 26, 30, 348181, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2024-03-20T03:26:30.348181Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-a71cbef5.hca_prod_8ab8726d81b94bd2acc24d50bee786b4__20220118_dcp2_20220121_dcp12.sequencing_protocol
Row(('7b0b967a-1e44-4350-acc7-0a3f48623a05', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2021, 11, 10, 20, 23, 18, 142391, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2021-11-10T20:23:18.142391Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-83783d1c.hca_prod_5eafb94b02d8423e81b83673da319ca0__20220117_dcp2_20220120_dcp12.sequencing_protocol
Row(('9f354e12-be43-4c78-ae1e-a7ab661ed572', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2021, 11, 10, 20, 23, 18, 142391, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2021-11-10T20:23:18.142391Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-c2886bdd.hca_prod_838d46603d624b08b32ddc5cbd93919d__20240531_dcp2_20240807_dcp41.sequencing_protocol
Row(('a003f3cd-ac58-4a0d-a848-69c9dc39366b', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2024, 7, 26, 16, 31, 31, 670437, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2024-07-26T16:31:31.670437Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-46bbfa8a.hca_prod_16e9915978bc44aab47955a5e903bf50__20221101_dcp2_20231102_dcp33.sequencing_protocol
Row(('2a9c1b62-af9c-4e66-99c4-0c1c7155664d', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 10, 20, 46, 12, 135805, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-10T20:46:12.135805Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-935adc8a.hca_prod_9fc0064b84ce40a5a768e6eb3d364ee0__20220630_dcp2_20220630_dcp18.sequencing_protocol
Row(('ee0cebe3-568b-47c0-bff3-cd6f122808cb', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2022, 6, 22, 13, 58, 50, 880576, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2022-06-22T13:58:50.880576Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-c85d293d.hca_prod_f598aee0d269403690e9d6d5b1c84429__20240903_dcp2_20240904_dcp42.sequencing_protocol
Row(('6ad0434f-683a-4af3-9e62-d2d0d157eca6', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2024, 8, 26, 16, 43, 22, 222833, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2024-08-26T16:43:22.222833Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-50b00aaf.hca_prod_2079bb2e676e4bbf8c68f9c6459edcbb__20240327_dcp2_20240807_dcp41.sequencing_protocol
Row(('79ff40ec-a337-4e7b-8874-8c6e328c0e59', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2024, 7, 25, 21, 54, 21, 143789, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2024-07-25T21:54:21.143789Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-b64e953d.hca_prod_1ffa222328a64133a5a4badd00faf4bc__20231101_dcp2_20231102_dcp33.sequencing_protocol
Row(('815c90cc-5d83-4c97-811c-7ae7136ec407', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 23, 17, 17, 21, 205471, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-23T17:17:21.205471Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-526f3da5.hca_prod_3d49e5e5976f44cbb6b9079016c31c56__20230614_dcp2_20231102_dcp33.sequencing_protocol
Row(('af1d80ed-225b-467a-a6ae-48c0a0ad8b4c', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 11, 0, 22, 41, 458599, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-11T00:22:41.458599Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-0e1a9ef4.hca_prod_f0f89c1474604bab9d4222228a91f185__20220119_dcp2_20230616_dcp29.sequencing_protocol
Row(('19d6c86f-5450-42d0-938b-fb74eb30d35d', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 6, 8, 0, 18, 51, 60081, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-06-08T00:18:51.060081Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-3ed34ae5.hca_prod_cea413af79b34f118b48383fe9a65fbe__20230614_dcp2_20230616_dcp29.sequencing_protocol
Row(('68001975-48be-4c48-a4ac-fab84ee95657', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 6, 5, 21, 48, 11, 898757, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-06-05T21:48:11.898757Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-3ad8245f.hca_prod_da74b50760ee4dd1bd02807bb051a337__20230614_dcp2_20231102_dcp33.sequencing_protocol
Row(('506e7331-d80e-4619-9056-165719f4ff4d', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 10, 23, 28, 1, 456894, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-10T23:28:01.456894Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

datarepo-2e06a188.hca_prod_73011a86475548ac9f70a28903b4ad77__20230616_dcp2_20231102_dcp33.sequencing_protocol
Row(('07aaa25e-f86c-488b-8755-926b31cfa793', '76a4bca7-715e-fcb4-78e6-61acadf18708', datetime.datetime(2023, 10, 10, 23, 39, 58, 759941, tzinfo=datetime.timezone.utc), '{"instrument_manufacturer_model":{"text":"Illumina NovaSeq 6000"},"method":{"text":"high throughput sequencing"},"paired_end":false,"protocol_core":{"protocol_id":"76a4bca7-715e-fcb4-78e6-61acadf18708"},"provenance":{"document_id":"76a4bca7-715e-fcb4-78e6-61acadf18708","submission_date":"2023-10-10T23:39:58.759941Z"},"schema_type":"protocol","schema_version":"10.1.0","describedBy":"https://schema.humancellatlas.org/type/protocol/sequencing/10.1.0/sequencing_protocol"}'), {'datarepo_row_id': 0, 'sequencing_protocol_id': 1, 'version': 2, 'content': 3})

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
+ [priority] High bug [type] A defect preventing use of the system as specified data [subject] Data or metadata [use of this label is uncommon] orange [process] Done by the Azul team spike:5 [process] Spike estimate of five points
Projects
None yet
Development

No branches or pull requests

3 participants