Releases: muellan/metacache
MetaCache 2.4.3
Improved Documentation
Fixed documentation for query parameter -hitdiff.
See also this issue / question.
MetaCache 2.4.2
Improved sequence id extraction from filenames and sequence headers.
The default setting is now smarter: it first tries to find NCBI-style accession or accession.version identifiers, then GenBank identifiers, and finally falls back to the filename (without path and extension).
The new command line option -sequence-id-format <type> allows the user to select a preferred method for sequence id extraction.
Available values for <type> are:
- smart: (default) works as described above
- ncbi: only use NCBI-style accession or accession.version identifiers
- genbank: only use genbank identifiers
- filename: only use filename (without path and extension)
- leadingword: only use first contiguous stretch of non-whitespace characters
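The fallback order of the default setting can be illustrated with a short Python sketch. It is not MetaCache's implementation: the function name `extract_sequence_id`, both regular expressions, and the choice to return only the digits of a `gi|...` identifier are assumptions made for this example.

```python
import re
from pathlib import PurePosixPath

# Assumed patterns, for illustration only; MetaCache's actual rules may differ.
# "NCBI-style": 1-4 uppercase letters, optional underscore, digits, optional ".version".
ACCESSION_RE = re.compile(r"\b([A-Z]{1,4}_?\d{5,10}(?:\.\d+)?)\b")
# "genbank": assumed here to mean a "gi|<digits>" field in the header.
GENBANK_RE = re.compile(r"gi\|(\d+)")


def extract_sequence_id(header, filename, fmt="smart"):
    """Return a sequence id from a header line (without '>') and a filename."""
    ncbi = ACCESSION_RE.search(header)
    gi = GENBANK_RE.search(header)
    words = header.split()
    candidates = {
        "ncbi": ncbi.group(1) if ncbi else None,
        "genbank": gi.group(1) if gi else None,
        # stem drops the directory part and the final extension only
        "filename": PurePosixPath(filename).stem,
        "leadingword": words[0] if words else None,
    }
    if fmt == "smart":
        # try accession first, then genbank id, then fall back to the filename
        for key in ("ncbi", "genbank", "filename"):
            if candidates[key]:
                return candidates[key]
        return None
    return candidates[fmt]
```

With these assumed patterns, a header such as `NC_000913.3 Escherichia coli` would yield the accession, while a header without any recognizable identifier would fall back to the filename stem.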
MetaCache 2.4.1
fixed abundance table formatting
- prevent scientific notation from being used for read counts
- row showing unclassified reads had the taxon column missing, now shown with taxon "--"
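As a rough illustration of the two formatting fixes above, the following Python sketch prints a count in fixed-point notation and substitutes "--" for a missing taxon. The function name `format_abundance_row` and the tab-separated column layout are assumptions for this example and do not reflect MetaCache's actual table format.

```python
def format_abundance_row(taxon, read_count, total_reads):
    """Format one table row; columns (taxon, count, percentage) are assumed."""
    # a missing taxon (e.g. unclassified reads) is shown as "--"
    name = taxon if taxon is not None else "--"
    percent = 100.0 * read_count / total_reads if total_reads else 0.0
    # ".0f" forces fixed-point output; a general format such as "g"
    # would switch to scientific notation for large counts
    return f"{name}\t{read_count:.0f}\t{percent:.4f}%"
```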
MetaCache 2.4.0
Changed handling of non-unique sequence IDs during database build
If a reference sequence is inserted whose ID (e.g. NCBI accession) is already present in the database, the newer sequence is now inserted with a modified ID (an exclamation mark plus a duplication counter is appended) and a warning is printed to stderr.
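The renaming scheme can be sketched in a few lines of Python. This is a minimal illustration, not MetaCache's code: the dictionary-based `db`, the function name `insert_with_unique_id`, the warning text, and the counter starting at 1 are all assumptions.

```python
import sys


def insert_with_unique_id(db, seq_id, sequence):
    """Insert a sequence into db (a dict); rename it if the ID already exists."""
    if seq_id not in db:
        db[seq_id] = sequence
        return seq_id
    # append "!" plus a duplication counter until the ID is unused
    counter = 1
    while f"{seq_id}!{counter}" in db:
        counter += 1
    new_id = f"{seq_id}!{counter}"
    print(f"warning: duplicate sequence ID '{seq_id}', inserted as '{new_id}'",
          file=sys.stderr)
    db[new_id] = sequence
    return new_id
```

Under these assumptions, inserting the ID `NC_000913.3` three times would produce the entries `NC_000913.3`, `NC_000913.3!1`, and `NC_000913.3!2`.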
Added min/max length filter
A minimum and maximum length for reads can now be set with -min-readlen <#> and -max-readlen <#>. Reads with lengths outside of this range will not be processed, i.e., they are treated as if they were not present in the input file. The numbers of discarded and processed reads are printed to stderr. The default behavior, in which all reads are processed, remains unchanged.
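The filter semantics described above can be sketched as follows. The function name `filter_reads_by_length`, its parameters, and the message wording are hypothetical; only the behavior (skip out-of-range reads, report counts on stderr, keep everything by default) is taken from the description.

```python
import sys


def filter_reads_by_length(reads, min_len=0, max_len=None):
    """Keep reads with min_len <= length <= max_len; report counts on stderr."""
    kept = []
    discarded = 0
    for read in reads:
        n = len(read)
        if n < min_len or (max_len is not None and n > max_len):
            discarded += 1  # treated as if absent from the input
        else:
            kept.append(read)
    print(f"{len(kept)} reads processed, {discarded} reads discarded",
          file=sys.stderr)
    return kept
```

With the defaults (`min_len=0`, `max_len=None`) every read passes, which mirrors the unchanged default behavior.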
Other changes
- cleaned up some includes
- updated dates
- changed some aspects of default code formatting
MetaCache 2.3.2
- improved parsing of assembly_summary files with inconsistent headers
MetaCache 2.3.1
- fixed type mismatch bug that could prevent compilation with uint64_t for MC_TARGET_ID_TYPE / MC_WINDOW_ID_TYPE / DMC_KMER_TYPE
- allow up to 10 alphanumeric characters in NCBI-style accession ids
- GPU version: removed outdated CUDA 10.2 and CUB from documentation
MetaCache 2.3.0
- Removed compaction step from GPU version and sped up GPU queries. This also removes the dependency on CUB.
- Set CUDA arch=native by default to automatically detect GPU architecture.
- Fixed make with multiple MACROS (#34).
MetaCache 2.2.3
Improved merge mode:
- Added -out option
- Recover from malformed input files (#33)
- Show more output on verbose info level
MetaCache 2.2.2
- Fixed kmers on GPU for k != 16 (default was working correctly)
- Fixed shown query parameters when running abundance estimation
MetaCache 2.2.1
- Fixed canonical kmer on GPU for k != 16 (default was working correctly)
- Fixed merge mode