v3.3.0 Cenote-Taker 3: Sauceman's grotto
Changes from v3.2.1
:
-
redoing read coverage parsing from samtools coverage table to sequin comment files:
make_sequin_cmts.py
-
Adding read coverage to summary table. Adding virus gcode to virus summary table:
virus_summary.py
-
Fixing minor error/warning when trying to create a directory that exists:
assess_virus_genes1.py
-
adding sequencing tech argument
--seqtech
-
Added/integrated a new taxonomy database
hallmark
. This database is compiled from: 1) genbank nr clustered -> extracting virus proteins with taxonomical labels for genus, family, and class -> filtering to seqs with CT3 hallmark gene hits -> reclustering at 90% AAI with cd-hit and 2) refseq virus -> filtering to seqs with CT3 hallmark gene hits.
- a) I've found this new
hallmark
taxdb returns better or identical taxonomical calls to the originalrefseq
without size/memory increase - b) Zenodo files have been update to include this database. Zenodo DB v4.1
- c) users can choose original
refseq
taxdb or newhallmark
db with the--taxdb
flag - d) To add the
hallmark
taxdb, runget_ct3_dbs -o ct3_DBs --hallmark_tax T
What's Changed
Full Changelog: v3.2.1...v3.3.0