Releases: mtisza1/Cenote-Taker3
v3.3.2 Cenote-Taker 3: Large Cenotoid Earthlings
Updates:
- version change from
3.3.1
to3.3.2
- using
shutil.which
instead offind_executable
to check for external dependencies - python dependency check for
pyrodigal-gv
allows for packages called eitherpyrodigal-gv
orpyrodigal_gv
(this changed between0.3.1
and0.3.2
of that tool) #15 - Changing "Chunk" suffix from
@Chunk_#
to@C#
to shorten virus names. This helps with loading .gbf files into Geneious #13
What's Changed
Full Changelog: v3.3.1...v3.3.2
v3.3.1 Cenote-Taker 3: Majestic Gull Attack
Changes from v3.3.0
:
- Added argument
--max_dtr_assess
. Users can enter a maximum contig length to evaluate DTRs (default 1000000). Practically, long DTR-containing contigs are probably bacterial chromosomes and should be evaluated for prophages and pruned. Since DTR-containing contigs do not get pruned, this was preventing some bacterial chromosome assemblies from getting pruning treatment. #6 - Fixed
hhpred_to_table.py
gene_name variable parsing, which previously incorrectly used .strip() function. - Fixed location of MMSEQS_outdir in
get_ct3_dbs
command to avoid error. - For read-mapping,
--seqtech
now effectsminimap2
settings
What's Changed
Full Changelog: v3.3.0...v3.3.1
v3.3.0 Cenote-Taker 3: Sauceman's grotto
Changes from v3.2.1
:
-
redoing read coverage parsing from samtools coverage table to sequin comment files:
make_sequin_cmts.py
-
Adding read coverage to summary table. Adding virus gcode to virus summary table:
virus_summary.py
-
Fixing minor error/warning when trying to create a directory that exists:
assess_virus_genes1.py
-
adding sequencing tech argument
--seqtech
-
Added/integrated a new taxonomy database
hallmark
. This database is compiled from: 1) genbank nr clustered -> extracting virus proteins with taxonomical labels for genus, family, and class -> filtering to seqs with CT3 hallmark gene hits -> reclustering at 90% AAI with cd-hit and 2) refseq virus -> filtering to seqs with CT3 hallmark gene hits.
- a) I've found this new
hallmark
taxdb returns better or identical taxonomical calls to the originalrefseq
without size/memory increase - b) Zenodo files have been update to include this database. Zenodo DB v4.1
- c) users can choose original
refseq
taxdb or newhallmark
db with the--taxdb
flag - d) To add the
hallmark
taxdb, runget_ct3_dbs -o ct3_DBs --hallmark_tax T
What's Changed
Full Changelog: v3.2.1...v3.3.0
v3.2.1 Cenote-Taker 3: In the Skeloton
Changes from v3.2.0
:
- Added
-wd
/--working_directory
flags for specifying working directory for outputs. - Added pyarrow to enviroment files. Prevents pandas futurewarning.
- Fixed downcasting in
fillna()
commands in python/pandas. - Fixed paths so that hhsuite databases are download and referred to correctly.
- Added
--genbank
flag which allows users to disable genbank outputs. - Fixed taxonomy lineage parsing in
vote_taxonomy.py
in certain cases. - Fixed
hhpred_to_table.py
parsing of hhsuite output files to correctly hand CDD and PDB accessions/descriptions. - Updated scripts so that in annotation mode
-am True
(unless flag--caller adaptive
) to skip unnecessary initialpyhmmer
steps. This reduced runtime by about 25%.
v3.2.0 Cenote-Taker 3: Swamp Lightning
This is the first official release of Cenote-Taker 3
.
Everything you loved about Cenote-Taker 2
, but better and faster.