I recently encountered significant differences in basecalling and demultiplexing results between Dorado v0.7.3 and MinKNOW.
When using Dorado for basecalling, most reads were categorized as "unclassified." For example, the size of the FASTQ for barcode 16 was 125 MB.
However, when I basecalled the same dataset using MinKNOW 24.11, the size of the FASTQ for barcode 16 was much larger (959 MB). After downstream assembly, I found that the MinKNOW output for this barcode was heavily contaminated with reads that appeared to belong to other barcodes.
I am trying to determine whether:
Dorado is overly strict during demultiplexing, or
MinKNOW is too lenient, leading to contamination and misclassified reads.
Additionally, I am unsure whether the contamination originates from mis-demultiplexing or inherent issues with the sample.
Steps to reproduce the issue:
Basecall using Dorado v0.7.3 with the following command:
I would recommend running the dorado basecaller command with --no-trim and then letting dorado demux handle trimming of the barcodes, adapters, and primers. It is possible that adapter trimming in the basecaller step is interfering with barcode detection in the demux stage.
Without more information I'm not sure it's possible to tell, but if dorado gives similar results after that change then I'd say it points in that direction.
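For reference, the two-step workflow suggested above might look roughly like the sketch below. The model name, input directory, kit name, and output paths are placeholders I made up, not values from the original report, so substitute the ones for your run. The script defaults to a dry run that only prints the commands.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Placeholder values (assumptions, not taken from the issue); adjust to your run.
MODEL="${MODEL:-hac}"            # basecalling model or speed tier
POD5_DIR="${POD5_DIR:-pod5/}"    # directory with the raw signal files
KIT="${KIT:-SQK-NBD114-24}"      # hypothetical kit; use the kit you actually ran
BAM="${BAM:-calls.bam}"          # untrimmed basecalls
OUT_DIR="${OUT_DIR:-demux_out}"  # one output file per barcode ends up here
DRY_RUN="${DRY_RUN:-1}"          # 1 = print the commands, 0 = execute them

# Step 1: basecall without trimming, so barcode flanks stay intact for demux.
basecall() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "dorado basecaller $MODEL $POD5_DIR --no-trim > $BAM"
  else
    dorado basecaller "$MODEL" "$POD5_DIR" --no-trim > "$BAM"
  fi
}

# Step 2: let demux classify the reads and trim the barcodes itself.
demux() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "dorado demux --kit-name $KIT --output-dir $OUT_DIR --emit-fastq $BAM"
  else
    dorado demux --kit-name "$KIT" --output-dir "$OUT_DIR" --emit-fastq "$BAM"
  fi
}

basecall
demux
```

Run it once as-is to check the printed commands, then rerun with DRY_RUN=0. If the per-barcode output is still much smaller than the MinKNOW output after this change, trimming in the basecaller step is probably not the cause.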
Issue Report
Please describe the issue:
Run environment:
Thank you for your help and insights!