| The Poseidon Community Archive | PCA |[GitHub](https://github.com/poseidon-framework/community-archive)|[Explorer](https://server.poseidon-adna.org/explorer/community-archive)|
+| The Poseidon Minotaur Archive | PMA |[GitHub](https://github.com/poseidon-framework/minotaur-archive)|[Explorer](http://server.poseidon-adna.org:3000/explorer/minotaur-archive)|
+| The Poseidon AADR Archive | PAA |[GitHub](https://github.com/poseidon-framework/aadr-archive)|[Explorer](https://server.poseidon-adna.org/explorer/aadr-archive)|
We opted to maintain these archives with [Git](https://git-scm.com) on [GitHub](https://github.com/poseidon-framework) to ensure clean version management on the file level. The context data in a Poseidon package is text data that can be stored and maintained easily with line-wise version control. Unfortunately, that does not hold for the large genotype data files, which are not suitable for direct handling in Git. Instead we rely on GitHub's [large file storage system](https://docs.github.com/en/repositories/working-with-files/managing-large-files/about-large-files-on-github), which comes with a number of technical drawbacks both for developers and users of the Poseidon archives.
We therefore generally do **not** recommend using the GitHub repositories for end-user-level access to the data. You can explore and download the packages through:
-- a simple [archive viewer on this webpage](archive_explorer)
+- an [archive viewer](https://server.poseidon-adna.org) provided by our webserver
- our [Web API](web_api)
- the [`fetch`](trident#fetch-command) and [`list --remote`](trident#list-command) commands of trident, which use the API internally
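
As a rough illustration of programmatic access through the Web API, the following Python sketch builds a request URL for one archive and decodes the JSON reply. The host is taken from the archive viewer link above; the `/packages` endpoint and the `archive` query parameter are assumptions not confirmed by this text, and the helper names `packages_url` and `list_packages` are hypothetical. Check the Web API documentation before relying on any of them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Host taken from the archive viewer link above.
API_BASE = "https://server.poseidon-adna.org"


def packages_url(archive: str) -> str:
    """Build the URL for listing the packages of one archive.

    ASSUMPTION: the `/packages` endpoint and the `archive` query
    parameter are not confirmed by this text; check the Web API docs.
    """
    return f"{API_BASE}/packages?{urlencode({'archive': archive})}"


def list_packages(archive: str, timeout: float = 30.0):
    """Fetch the URL built above and decode the reply as JSON."""
    with urlopen(packages_url(archive), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only builds and prints the URL; call list_packages() to hit the network.
    print(packages_url("community-archive"))
```

The archive names used here (`community-archive`, `minotaur-archive`, `aadr-archive`) follow the repository names in the table above.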
We do recommend that you work through GitHub, though, if you want to report an issue or suggest changes to the data (as described [here](archive_submission_guide.md)).
The Poseidon Community Archive is the oldest and arguably the most important public archive. It stores publication-wise packages, that is, one package for the genotype data released with one paper.
**The PCA focusses on author submissions**, that is, Poseidon packages prepared by the authors of the packaged publication and containing exactly the genotype data used for the analysis in the paper. This ensures a high degree of computational reproducibility. Author submissions are also ideal for the context data in the .janno file, because the respective domain experts are generally the most knowledgeable about the spatiotemporal origin of their samples.
@@ -30,10 +32,14 @@ For historical reasons the PCA does not only contain author submissions, though.
The Poseidon Minotaur Archive mirrors the PCA in that it features publication-wise packages, sometimes even the very same as the PCA. To distinguish them clearly, package titles and sample-wise Poseidon_IDs in the PMA carry the suffix `_MNT` (for "MiNoTaur").
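
The suffix convention makes it easy to tell PMA names apart in scripts. The following Python sketch checks for the `_MNT` suffix and strips it to recover a base name. The function names and the sample ID are hypothetical, and whether a stripped name matches an entry in the PCA is an assumption that has to be checked per package.

```python
MNT_SUFFIX = "_MNT"  # suffix carried by PMA package titles and Poseidon_IDs


def is_minotaur_name(name: str) -> bool:
    """Return True if a package title or Poseidon_ID carries the PMA suffix."""
    return name.endswith(MNT_SUFFIX)


def strip_minotaur_suffix(name: str) -> str:
    """Remove a trailing `_MNT`; names without the suffix are returned unchanged."""
    if is_minotaur_name(name):
        return name[: -len(MNT_SUFFIX)]
    return name


if __name__ == "__main__":
    # "Sample001_MNT" is a made-up ID used only for illustration.
    print(strip_minotaur_suffix("Sample001_MNT"))
```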
**Packages in the PMA include consistently reprocessed genotype data**, run through the [Minotaur workflow](minotaur). Consistent bioinformatic processing across publications is an important quality for meta-analysis. So while the context information in the .janno file overlaps between PCA and PMA, the PMA packages are adjusted for data reuse beyond paper-wise reproducibility.
The Poseidon AADR Archive is conceptually the simplest archive. It features **"poseidonized" versions of releases of the [Allen Ancient DNA Resource (AADR)](https://reich.hms.harvard.edu/allen-ancient-dna-resource-aadr-downloadable-genotypes-present-day-and-ancient-dna-data)**. The code and decisions for the cleaning and packaging process are documented [here](https://github.com/poseidon-framework/aadr2poseidon). Due to file size limitations of GitHub's LFS system, each AADR dataset is split into smaller sub-packages by data type and geographic origin.