Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Train for long reads e.g. nanopore, pacbio #2

Open
dawnmy opened this issue May 17, 2021 · 6 comments
Open

Train for long reads e.g. nanopore, pacbio #2

dawnmy opened this issue May 17, 2021 · 6 comments
Labels
enhancement New feature or request

Comments

@dawnmy
Copy link
Member

dawnmy commented May 17, 2021

No description provided.

@dawnmy dawnmy added the enhancement New feature or request label Feb 24, 2022
@harrytchild
Copy link

Would you recommend using the current version of RiboDetector on long read metatranscriptomic datasets?

@dawnmy
Copy link
Member Author

dawnmy commented Sep 23, 2024

I have tested it few years ago on simulated Nanopore data (simulated with high error rate of 10-15%), the recall is about 92-95%. The error rate of Nanopore data has dropped substantially in recent years, so performance on the latest real dataset should be better, though I'm not entirely sure. You could try it, but some rRNA reads may still remain.

@harrytchild
Copy link

Thanks for your reply. I am giving it a go now! Do you think the best setting for the -l parameter is still the mean read length for ONT reads, as these will obviously be more variable for Nanopore reads, including some much longer (3-6x) than the mean read length?

@dawnmy
Copy link
Member Author

dawnmy commented Sep 23, 2024

Good question, for long reads you don't need to set the actual mean read length for -l. You can try 200.

@dawnmy
Copy link
Member Author

dawnmy commented Sep 23, 2024

For optimal performance, I should train a model specifically for long-read data.

@harrytchild
Copy link

Thanks! If this is something you are still interested in doing, then please let me know the result!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants