Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enquiry: Kraken2 result. Pear sequences tagged as apple #169

Open
CeciliaDeng opened this issue Oct 29, 2024 · 4 comments
Open

Enquiry: Kraken2 result. Pear sequences tagged as apple #169

CeciliaDeng opened this issue Oct 29, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@CeciliaDeng
Copy link
Collaborator

Description of the bug

I used the assemblyQC to check the quality of transcripts in several published Pear genomes. The BUSCO completeness scores look good (>95% complete completeness). In the Kraken result, all the pear transcripts were classified as apple. For example, BartDHv2 was classified 100% Malus, broken down to 41% M. domestica, 49% M. sylvestris. The remaining 10% was not labeled and the color looks like Malus. Why were the 10% transcripts not labelled to species level? And why the whole transcriptome was classified as apple, not pear (Pyrus)? Was Pyrus not included in the Kraken2 DB? Thank you.
image

Command used and terminal output

No response

Relevant files

No response

System information

No response

@CeciliaDeng CeciliaDeng added the bug Something isn't working label Oct 29, 2024
@GallVp
Copy link
Member

GallVp commented Oct 30, 2024

At PFR, we are using k2_pluspfp_20230314 database. It does include plant data.

A new update has been released here: https://benlangmead.github.io/aws-indexes/k2

Updating the data might fix this issue. Not sure, though. Kraken2 is not the most accurate tool.

@GallVp
Copy link
Member

GallVp commented Oct 30, 2024

See the database update issue #170

@GallVp
Copy link
Member

GallVp commented Nov 5, 2024

Hi @CeciliaDeng

I have updated the Kraken 2 database. Can you please run your data with the new version 2.2.0 and see if it has resolved your problem. Thank you!

@GallVp GallVp added the awaiting-feedback Waiting for input from user label Nov 5, 2024
@GallVp GallVp removed their assignment Nov 21, 2024
@GallVp GallVp removed the awaiting-feedback Waiting for input from user label Nov 21, 2024
@GallVp
Copy link
Member

GallVp commented Nov 21, 2024

From MS Teams,

Results for pear genomes (and transcriptomes) remain the same with the update kraken2 DB.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants