Skip to content

v0.3.0

Compare
Choose a tag to compare
@iiSeymour iiSeymour released this 18 May 13:52
· 4994 commits to master since this release

[0.3.0] (18 May 2023)

This is a major release of Dorado which introuduces: Duplex pairing and splitting for directly going from POD5 to duplex reads, major performance improvements to simplex and duplex basecalling on A100 GPUs via int8 model quantization and the output of aligned BAM from Dorado and support for producing summary tsv files from BAM.

  • ddb7c1e - Improvements to modified basecalling performance
  • f879af5 - Add support for CPU basecalling of modified bases
  • 282a66c - Add duplex pair alignment accuracy check to reduce risk of incorrect matches
  • 3bb0ffc - Add dx:i tag to Dorado output to indicate whether a read is duplex dx:i:1 or simplex dx:i:0
  • 78d6bc4 - Improvements to Duplex calling performance
  • 20972d4 - Added ability to filter reads by read length, default of 5
  • b8ceee4 - Include simplex output in SAM when running duplex basecalling
  • 724bafd - Add dorado summary command which produces a summary.tsv file from a SAM/BAM file
  • da13d36 - Add splitting of live splitting of concatemer reads into duplex pairs
  • 8992e67 - Fix for segfault in older glibc version
  • d1377a0 - Add [email protected] simplex model
  • 88b547c - Add 5kHz duplex Stereo model and duplex 5kHz support
  • 2660956 - Various CPU performance improvements
  • f4ea664 - All context modbase tags
  • 338911d - Add v4.2 6mA 5mC modbase models
  • 1684168 - Improved support for short read duplex basecalling
  • 6fe6adb - Add verbose logging option for duplex basecalling.
  • a035d7f - Check model sample rate agrees with raw data, add option to skip check
  • 26c1112 - Add ability to perform automatic duplex pairing in dorado.
  • a824a7d - Output uncompressed BAM when dorado output is a pipe.
  • f27d672 - Add CPU to list of devices in help
  • 3329bb5 - Fix Fast5 basecalling
  • 4d91533 - Improvements to reduce possibility of out of memory issues on CUDA devices via a GPU device mutex
  • 14de2e8 - Improvements to progress bar reporting
  • 2095fea - Add alignment and BAM generation to duplex and simplex
  • fef15ae - Improvements to stereo duplex encoding.
  • c6dc187 - Add --reference option to basecalling to allow basecalls to be aligned to a reference.
  • 51ca9e7 - Add v4.2.0 5kHz simplex models
  • 1a215e7 - Reduce CPU load from mean_q_score_from_qstring
  • d3f7320 - Handle empty read-ids file
  • 25e2cd1 - Upgrade to Pod5 v0.1.20
  • 98eb30d - Add Cutlass LSTM kernels for significant performance improvement on A100 GPUs
  • 6aea63a - Reduce CPU load due to trimming
  • 7459371 - Increase per-device ScalerNode thread count
  • 758d0d9 - Minor improvements to RemoraEncoder::encode_kmer
  • b2af21b - Add read filter node to filter reads by Q score, length etc.
  • fb60425 - Reduce stereo duplex CPU load
  • 0bca7d8 - Reduce torch indexing overhead in modified basecalling
  • 4632f05 - Expose k, w comandmline options to dorado aligner
  • d560661 - Improved read trimming
  • 3cd1c80 - Improve performance of reverse_complement calculation
  • 92ef398 - Fix segfault in modified basecalling