Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make changes to allow software to run more easily on different computers #5

Open
petercombs opened this issue Apr 15, 2019 · 1 comment

Comments

@petercombs
Copy link
Owner

So probably the long-term value here is in making a complete package that will do the selection calculations on even more MPRA data, with the operation on the Kircher data being a nice proof-of-concept. To that end, I should:

  • Include the aligners in the conda installation (where applicable?) to minimize the number of pre-installation steps that need to be done.

  • Give the option for blast to be done on the NCBI server. To limit the amount of requests, one could imagine having a maximum number of submissions, and above that requiring a local download, but I would think that this is not actually a big deal, since the refseq_genomes is pretty big, so you'd have to do a lot of blast searches to make up the difference. I'm still glad I have it locally, but no sense requiring it.

@petercombs
Copy link
Owner Author

Okay, so one problem with using the NCBI server to do blast is that it fails on relatively large results, ironically exactly like the ones we are getting. There may be some way to filter this, but it's not obvious how to do that in an easy programmatic way. We don't know in advance how far up the tree we'll need to go to get an outgroup, for instance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant