Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Surface more info on preprocessing tools #3301

Open
emmahodcroft opened this issue Nov 26, 2024 · 0 comments
Open

Surface more info on preprocessing tools #3301

emmahodcroft opened this issue Nov 26, 2024 · 0 comments
Labels
discussion Open questions preprocessing Issues related to the preprocessing component

Comments

@emmahodcroft
Copy link
Member

From feedback from Aine, she would like more info on the preprocessing of sequences.

For example, how the alignment constructed, what's reference used, whether masked, etc.

If we are tracking the version of Nextclade and the version of Nextclade dataset that's being used in the metadata (or the equivalent generic terms for any other tool in the preprocessing pipeline), this would be an easy thing to surface on the sequence detail page and in the metadata download file.

We may need to discuss exactly what we'd like to store and surface. If these could be (optionally) links, it could save effort in trying to explain by linking directly to the tool where users can figure this out for themselves/use that tool's docs (in our case that would be Nextclade for Pathoplexus but could of course be any tool that someone may want to link to).

@emmahodcroft emmahodcroft added discussion Open questions preprocessing Issues related to the preprocessing component labels Nov 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discussion Open questions preprocessing Issues related to the preprocessing component
Projects
None yet
Development

No branches or pull requests

1 participant