CAT v5.2
·
282 commits
to master
since this release
Notable changes
- A big update to the database files:
CAT prepare
now uses a new mapping file that allows us to assign taxids to up to 20% more proteins than with the previous prot.accession2taxid file. A big shout-out to NCBI for picking up our bug report and combining information from multiple resources into a single mapping file! - Another update to
CAT prepare
: file integrity of downloads is assessed based on md5 checksums. - Change to the output files: the ORF2LCA output file contains a new column for the number of hits the classification is based on.
- Change to the output files: the 'classification' column in the output files previously contained the verdicts
classified
andunclassified
. These two options are now calledtaxid assigned
andno taxid assigned
, respectively, in order to prevent confusion about cases where a contig or bin is classified but does not have a classification up to rank superkingdom. - Change to the output files: while converting to official ranks with
CAT add_names
, if a classification does not go to lower ranks this is now calledno support
as opposed to the previousnot classified
. This should prevent confusion about the meaning of 'classified' even further!