Allow for higher levels in taxid input #108

Open
tillenglert opened this issue Mar 8, 2024 · 1 comment
Labels
enhancement Improvement for existing functionality

Comments

@tillenglert
Collaborator

Description of feature

Currently, the taxid input only supports strain-level taxids.

Supporting higher taxonomic levels such as species raises a problem:

A species can contain more than one strain. Some strains may be pathogenic and others not, and depending on the application this distinction is crucial, so one would want to be able to discriminate between them.

Therefore, several strategies for resolving higher-level taxids should be available, e.g. largest, subset, refseq.
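To make the proposal concrete, here is a minimal sketch of how such strategies could be dispatched. It is not existing pipeline code: the `Assembly` record, the `select_assemblies` function, and the interpretation of each strategy name are assumptions for illustration (largest = the assembly with the largest genome, refseq = only RefSeq assemblies, subset = only user-listed strain taxids).

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Assembly:
    """Hypothetical record for one strain-level assembly under a species taxid."""
    accession: str      # placeholder accession, not a real identifier
    strain_taxid: int
    genome_size: int    # in base pairs
    is_refseq: bool


def select_assemblies(
    candidates: List[Assembly],
    strategy: str,
    subset_taxids: Optional[Iterable[int]] = None,
) -> List[Assembly]:
    """Reduce the assemblies found under a higher-level taxid (assumed semantics)."""
    if not candidates:
        raise ValueError("no assemblies found for this taxid")
    if strategy == "largest":
        # Keep only the single assembly with the largest genome.
        return [max(candidates, key=lambda a: a.genome_size)]
    if strategy == "refseq":
        # Keep only assemblies flagged as RefSeq.
        return [a for a in candidates if a.is_refseq]
    if strategy == "subset":
        # Keep only the strains the user explicitly listed.
        wanted = set(subset_taxids or ())
        return [a for a in candidates if a.strain_taxid in wanted]
    raise ValueError(f"unknown strategy: {strategy!r}")
```

With an explicit strategy argument, the choice made for a species-level taxid is visible in the call rather than hidden in a default.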

@tillenglert tillenglert added the enhancement Improvement for existing functionality label Mar 8, 2024
@skrakau
Member

skrakau commented Mar 8, 2024

The idea was to enable input at the species level. In any case, only one assembly should be used, not several (e.g. all strains of a species), since this would introduce a bias. One could consider using the associated genome for species-level taxids (not for strain-level ones!), which is one representative strain (check the definition).
One would also need to think about how best to handle this and document it properly, so that the user knows exactly what happens in the background.
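A rough sketch of what this could look like: each input taxid resolves to exactly one assembly, and the result carries a note that can be written to a report so the user sees what was chosen. All names here (`Resolution`, `resolve_taxid`, the two lookup dictionaries) are hypothetical, and how the representative genome of a species is actually obtained is left open.

```python
from typing import Dict, NamedTuple


class Resolution(NamedTuple):
    """Hypothetical record of which assembly was used for an input taxid."""
    input_taxid: int
    rank: str
    assembly: str
    note: str  # human-readable explanation, e.g. for a log or report


def resolve_taxid(
    taxid: int,
    rank: str,
    strain_assembly: Dict[int, str],
    species_representative: Dict[int, str],
) -> Resolution:
    """Return exactly one assembly per input taxid (illustrative only)."""
    if rank == "strain":
        if taxid not in strain_assembly:
            raise ValueError(f"no assembly known for strain taxid {taxid}")
        return Resolution(taxid, rank, strain_assembly[taxid],
                          "strain-level input: used the assembly of this strain")
    if rank == "species":
        if taxid not in species_representative:
            raise ValueError(f"no representative genome for species taxid {taxid}")
        # One representative only: other strains of the species are ignored
        # on purpose, to avoid biasing the result towards well-sampled species.
        return Resolution(taxid, rank, species_representative[taxid],
                          "species-level input: used one representative assembly")
    raise ValueError(f"unsupported rank: {rank!r}")
```

Returning the note together with the assembly keeps the selection and its documentation in one place, so the two cannot drift apart.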
