-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Train for long reads e.g. nanopore, pacbio #2
Comments
Would you recommend using the current version of RiboDetector on long read metatranscriptomic datasets? |
I have tested it few years ago on simulated Nanopore data (simulated with high error rate of 10-15%), the recall is about 92-95%. The error rate of Nanopore data has dropped substantially in recent years, so performance on the latest real dataset should be better, though I'm not entirely sure. You could try it, but some rRNA reads may still remain. |
Thanks for your reply. I am giving it a go now! Do you think the best setting for the -l parameter is still the mean read length for ONT reads, as these will obviously be more variable for Nanopore reads, including some much longer (3-6x) than the mean read length? |
Good question, for long reads you don't need to set the actual mean read length for |
For optimal performance, I should train a model specifically for long-read data. |
Thanks! If this is something you are still interested in doing, then please let me know the result! |
No description provided.
The text was updated successfully, but these errors were encountered: