-
Notifications
You must be signed in to change notification settings - Fork 28
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #202 from nextstrain/add-measles-dataset
Add measles dataset
- Loading branch information
Showing
18 changed files
with
72,209 additions
and
29,713 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
## Unreleased | ||
|
||
Initial release. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,32 @@ | ||
# Measles dataset | ||
|
||
| Key | Value | | ||
| ----------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------- | | ||
| name | Measles N450 (WHO-2012) | ||
| authors | [Nextstrain](https://nextstrain.org) | | ||
| reference | NC_001498.1 | | ||
| workflow | https://github.com/nextstrain/measles/tree/main/nextclade | | ||
| path | `nextstrain/measles/N450/WHO-2012` | | ||
|
||
|
||
## Scope of this dataset | ||
|
||
This dataset assigns genotypes to measles samples based on [criteria outlined by the WHO](https://www.who.int/publications/i/item/WER8709). | ||
|
||
The WHO has defined 24 measles genotypes based on N gene and H gene sequences from 28 reference strains. For new measles samples, genotypes can be assigned based on genetic similarity to the reference strains at the "N450" region (a 450 bp region of the N gene). | ||
|
||
The reference tree used in this dataset includes N450 sequences for the 28 reference strains, along with other representative strains for each genotype. | ||
|
||
This dataset can be used to assign genotypes to any sequence that includes at least 400 bp of the N450 region, including whole genome sequences. Sequence data beyond the N450 region will be reported as an insertion in the Nextclade output. | ||
|
||
## Features | ||
|
||
This dataset supports: | ||
|
||
- Assignment of genotypes | ||
- Phylogenetic placement | ||
- Sequence quality control (QC) | ||
|
||
## What are Nextclade datasets | ||
|
||
Read more about Nextclade datasets in the Nextclade documentation: https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html |
Oops, something went wrong.