-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Estimate cost of GWAS regression steps #32
Comments
On instance pricing:
If various dask memory issues could be solved and we could use preemptible standard instances, the total cost would be around $53k. |
@eric-czech I assume this stat:
and this comment https://github.com/pystatgen/sgkit/issues/390#issuecomment-748205731 are for the same run, correct? And if it is, how is the 11 hr 5 mins here, connected with about 2hrs it took to run the regressions for chr21? |
No, that caption is definitely misleading -- I was either wrong when I wrote it or trying to make it clear that the individual phenotypes can be seen as single spikes. Here is a full version of that readout that also includes the run of the 265 phenotypes: |
This is an estimate of the VM rental time necessary to do the GWAS regressions (similar to #8).
Here are current figures:
A ballpark cost to keep a cluster of this size running that long is 60 nodes * (231 days *24 hrs) * $0.946424/hr = $314,818.
Clearly we have got to find some room to improve this.
The text was updated successfully, but these errors were encountered: