For taxonomic classification of viruses from metagenomic sequences, we developed VirusTaxo using diverse (e.g., 402 DNA and 280 RNA) genera of viruses. VirusTaxo has an average accuracy of 93% at genus level prediction in DNA and RNA viruses. VirusTaxo outperformed existing taxonomic classifiers of viruses where it assigned taxonomy of a larger fraction of metagenomic contigs compared to other methods. Benchmarking of VirusTaxo on a collection of SARS-CoV-2 sequencing libraries and metavirome datasets suggests that VirusTaxo can characterize virus taxonomy from highly diverse contigs and provide a reliable decision on the taxonomy of viruses.
- Log into https://latch.bio.
- Find VirusTaxo within Workflows > All Workflows tab or, go here https://console.latch.bio/explore/63583/info.
- Add VirusTaxo to your workspace (There's a button for it)
- You will find VirusTaxo in My Workflows tab and click on it.
- Now you can run prediction analysis
- Provide input
fasta
file. You can uploadfasta
file containing viral (single or multiple) sequences. Or, you can use test input data containing 1,553 SARS-CoV-2 genome sequences provided by us. Download from here. - Select Genome Type as RNA since SARS-CoV-2 is an RNA virus.
Note: If you do not know what the genome type is, select Unknown option. In case of DNA viruses, select DNA. - Launch the workflow.
Input Fasta File
: Provide concatenated viral sequences in a single fasta file.Virus Genome Type
: For DNA viruses, set this toDNA
. For RNA viruses, set this toRNA
. If the fasta file contains both DNA and RNA viral sequences, then selectingUnknown
genome type will predict the taxonomy of both DNA and RNA viruses.
- VirusTaxo Tool: https://console.latch.bio/explore/63583/info
- Website: https://omics-lab.com/virustaxo
- Paper Code: https://github.com/omics-lab/VirusTaxo
- Workflow Source Code: https://github.com/nahid18/virustaxo-wf
Q: Can I use fastq
files?
A: No, you cannot. But you can assemble the fastq
reads using Megahit. Then use the Megahit contigs (in fasta
format) in VirusTaxo.