Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove deduplication on strain #33

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Commits on Aug 9, 2023

  1. Skip deduplication of sequences

    Sequences are written by accession, which should already be unique.
    Metadata is deduplicated on strain, which may not be unique.
    victorlin committed Aug 9, 2023
    Configuration menu
    Copy the full SHA
    0f4b7a4 View commit details
    Browse the repository at this point in the history
  2. Remove deduplication

    On the phylogenetic workflow side, the metadata is processed with
    accession as the ID column, which should be unique. De-duplicating on
    strain (which is what was previously done) would only result in a loss
    of data.
    victorlin committed Aug 9, 2023
    Configuration menu
    Copy the full SHA
    c376a30 View commit details
    Browse the repository at this point in the history