[0.9.0] (16 Dec 2024)
This major release of Dorado introduces several new features and enhancements. The `polish` command, currently experimental, is optimised for refining draft assemblies of human genomes. This release also adds faster DNA modification calling models and improved 6mA false positive rate (FPR) in native human samples. Barcode demultiplexing accuracy has been significantly enhanced for kits with barcodes at both ends, including `SQK-NBD114`. Note that using custom barcode kits now requires the `--kit-name` option. A feature has been added to enable running `dorado correct` in blocks, allowing work to be divided into smaller pieces for easy submission to a compute cluster. Additional updates include the `qs` tag for mean basecall Q-scores in FASTQ output, an upgrade to POD5 to support systems with large page sizes, improvements to Poly(A) tail length estimation, and various bug fixes to enhance stability and functionality.
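
As background for the `qs` tag, a mean Q-score is commonly computed by averaging per-base error probabilities and converting the result back to the Phred scale, not by averaging the Q values directly. The sketch below illustrates that convention for a FASTQ quality string. It is an assumption about the general approach, not a copy of Dorado's implementation, and the function name `mean_qscore` and the Phred+33 offset default are illustrative.

```python
import math


def mean_qscore(quality: str, phred_offset: int = 33) -> float:
    """Mean Q-score of a FASTQ quality string, averaged in probability space.

    Hypothetical helper for illustration; Dorado's own calculation may
    differ in detail (for example in which bases it includes).
    """
    if not quality:
        return 0.0
    # Convert each Phred character to an error probability: p = 10^(-Q/10).
    probs = [10 ** (-(ord(ch) - phred_offset) / 10) for ch in quality]
    mean_p = sum(probs) / len(probs)
    # Convert the mean error probability back to the Phred scale.
    return -10 * math.log10(mean_p)
```

Averaging in probability space means a few low-quality bases pull the score down sharply: a read with one Q10 base and one Q40 base scores roughly Q13 here, whereas a plain arithmetic mean of the Q values would give Q25.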
- 2b96c0b - New Dorado `polish` feature for assembly polishing
- 0bab166 - Faster modified base models for DNA `4mC_5mC`, `5mC_5hmC`, `5mCG_5hmCG`, and `6mA`
- e637166 - Enable running dorado correct in blocks, for easy submission to a compute cluster
- 40296da - Reduced false positive classification rates for kits with barcodes at both ends
- 35da003 - Improve barcode classification when barcodes can be on either end
- cbcdf38 - Only classify barcodes which are present on sample sheet if provided
- 2449d03 - Correct `AF02F_14` and `AH10R_80` barcodes from `TWIST-96A-UDI`
- 631e94c - Prevent Dorado `demux` from stripping alignment information when `--no-trim` is specified
- affea85 - Prevent missing filenames when using `--emit-summary` with Dorado `demux`
- 3dec15a - Improve poly(A) tail estimation accuracy, including with interrupted tails
- df57d34 - Limit poly(A) estimation to reads with plausible signal to prevent stalls in calculation
- 6cf701a - Add `min_primer_separation` option to custom poly(A) configuration
- bf51bd4 - Add `qs` tag with mean basecall Q-score to FASTQ output
- dac076d - Upgrade to POD5 v0.3.23 to support systems with large page sizes for POD5 and .fast5
- c7a7a58 - Prevent silent failure or segfault on Windows with bad custom barcode files
- 1e829d5 - Do not allow basecalling if target directory includes both POD5 and .fast5 files
- 05d0981 - Fix modified base trim for reverse-aligned BAM records
- afdb068 - Fix invalid `MM` tag after trimming when no mods are present
- 0d788d7 - Prevent crash when insufficient permissions to read an input file/folder
- dbece01 - Update custom barcoding documentation to accurately reflect demultiplexing logic
- 6db40ec - Correct model context info shown in `dorado download --list-structured`
- 03acc12 - Use the `-o` short option only for `--output-dir` and not for `--overlap`
- 8d9c017 - Add support for reading gzip-compressed FASTQ files
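
For downstream scripts that consume FASTQ output, the following is a minimal Python sketch that reads either plain or gzip-compressed FASTQ and looks up a header tag such as `qs`. It assumes four-line records and that tags appear in the header as tab-separated SAM-style `TAG:TYPE:VALUE` fields; check both assumptions against your own files. The helper names `open_fastq`, `read_fastq`, and `get_tag` are hypothetical and are not part of Dorado.

```python
import gzip
from typing import Iterator, Optional, Tuple

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of any gzip stream


def open_fastq(path: str):
    """Open a FASTQ file as text, transparently handling gzip compression."""
    with open(path, "rb") as fh:
        magic = fh.read(2)
    if magic == GZIP_MAGIC:
        return gzip.open(path, "rt")
    return open(path, "rt")


def read_fastq(path: str) -> Iterator[Tuple[str, str, str]]:
    """Yield (header, sequence, quality) for each four-line FASTQ record."""
    with open_fastq(path) as fh:
        while True:
            header = fh.readline().rstrip("\n")
            if not header:
                break  # end of file (or a blank line)
            seq = fh.readline().rstrip("\n")
            fh.readline()  # skip the '+' separator line
            qual = fh.readline().rstrip("\n")
            yield header[1:], seq, qual  # drop the leading '@'


def get_tag(header: str, tag: str) -> Optional[str]:
    """Return the value of a TAG:TYPE:VALUE field, or None if absent.

    Assumes tags are tab-separated after the read ID (an assumption about
    the header layout, not a documented guarantee).
    """
    for field in header.split("\t")[1:]:
        parts = field.split(":", 2)
        if len(parts) == 3 and parts[0] == tag:
            return parts[2]
    return None
```

Detecting compression from the magic bytes instead of the file extension lets the same code handle a `.fastq` file that is actually gzipped, or the reverse.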