Investigate use of preemptible GCP instances for GWAS #453

Closed
tomwhite opened this issue Feb 3, 2021 · 3 comments

tomwhite commented Feb 3, 2021

In #390 (and processing in general), using preemptible instances on GCP would bring a cost saving of ~5x.
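
For reference, a minimal sketch of how preemptible workers might be requested with dask-cloudprovider. The `preemptible` keyword and the project/zone/machine-type values are assumptions, so check the installed version's `GCPCluster` signature:

```python
# Sketch only: assumes a dask-cloudprovider version whose GCPCluster
# accepts a `preemptible` flag; project id and zone are placeholders.
from dask_cloudprovider.gcp import GCPCluster
from dask.distributed import Client

cluster = GCPCluster(
    projectid="my-project",        # hypothetical project id
    zone="us-east1-c",             # hypothetical zone
    machine_type="n1-standard-8",
    n_workers=16,
    preemptible=True,              # request preemptible (spot) instances
)
client = Client(cluster)
```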

tomwhite commented Feb 3, 2021

I ran a few experiments to simulate preemption by stopping a worker VM midway through a job.
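
Roughly, each run looked like the sketch below: submit the job, then stop one worker VM from outside the cluster partway through. The instance name, zone, and timing are placeholders:

```python
# Rough sketch of the experiment: start a job, then stop one worker VM
# partway through. Instance name and zone are hypothetical placeholders.
import subprocess
import time

# ... submit the GWAS computation via dask.distributed here ...

time.sleep(50)  # let the job get roughly halfway through a ~100s run
subprocess.run(
    ["gcloud", "compute", "instances", "stop", "dask-worker-3",
     "--zone", "us-east1-c"],
    check=True,
)
```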

Here is a normal run on a cluster of 16 instances with no preemption. It took 102s.

[screenshot: no-preemption]

Here is a run where I stopped one worker. The job took longer (124s), but completed fine:

[screenshot: stop1worker]

Stopping two workers extends the runtime even more (162s), but the job still completes:

[screenshot: stop2workers]

When I tried combining persisting the input dataset (#449) with preemption, the results were more mixed. Stopping one worker caused disk spilling:

[screenshot: persist_stop1worker]
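
For context, "persisting the input dataset" here means pinning the Dask-backed dataset's chunks in worker memory before running the downstream computation. A minimal illustrative sketch (the dataset and array shapes are stand-ins, not the real genotype data):

```python
# Illustrative only: persist a Dask-backed xarray dataset in worker memory
# before computing, so downstream steps reuse the materialised chunks
# instead of re-reading the input (mirrors #449).
import dask.array as da
import xarray as xr
from dask.distributed import Client

client = Client()  # or the GCPCluster client from above

# Stand-in for the real genotype dataset
ds = xr.Dataset(
    {"call_genotype": (("variants", "samples"),
                       da.zeros((100_000, 1_000), chunks=(10_000, 1_000)))}
)

ds = ds.persist()  # pin chunks in worker memory
mean_per_variant = ds["call_genotype"].mean(dim="samples").compute()
```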

tomwhite commented Feb 3, 2021

Note that all of these experiments were done simply by stopping the worker abruptly. There is an open (unmerged) Dask issue about making workers handle shutdown gracefully. The idea is that a worker that is shutting down would copy its in-memory state to other workers in the cluster, so the work doesn't need to be recomputed.
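
The scheduler can already do something similar on request: `Client.retire_workers` replicates a retiring worker's in-memory results onto other workers before removing it. A sketch of calling it when a preemption notice is received (how the notice is detected and delivered is an assumption, not existing Dask behaviour):

```python
# Sketch: on receiving a GCP preemption notice for a worker, ask the
# scheduler to retire it gracefully so its results are replicated first.
# The notice-detection hook is assumed; only retire_workers is real Dask API.
from dask.distributed import Client

client = Client("tcp://scheduler-address:8786")  # placeholder address

def on_preemption_notice(worker_address: str) -> None:
    # Existing distributed API: moves the worker's keys to other workers,
    # then removes the worker from the cluster.
    client.retire_workers(workers=[worker_address])

on_preemption_notice("tcp://10.0.0.5:41234")  # hypothetical worker address
```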

This might be a challenge given GCP's preemption limits, however. An instance being preempted on GCP is given 30 seconds' notice before being forcibly terminated. An n1-standard-8 instance has 30GB of memory and a maximum egress bandwidth of 16Gbps (2GB/s), so copying a full worker's memory to another machine would take half the notice period (15s), not counting serialization cost etc.

The maximum worker-to-worker bandwidth I've seen on a cluster has been <0.5GB/s, so there's quite a gap there: at that rate the transfer would take around 60s, twice the notice period. (It still might be useful to give the worker notice of an impending shutdown so it doesn't accept new tasks, though.)
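
The back-of-the-envelope numbers behind this, for concreteness:

```python
# Back-of-the-envelope transfer times for migrating a full worker's memory.
memory_gb = 30            # n1-standard-8 memory
notice_s = 30             # GCP preemption notice period

egress_gbps = 16                      # advertised max egress bandwidth
egress_gb_per_s = egress_gbps / 8     # 2 GB/s
observed_gb_per_s = 0.5               # bandwidth actually observed

print(memory_gb / egress_gb_per_s)    # 15.0 s -> half the notice period
print(memory_gb / observed_gb_per_s)  # 60.0 s -> double the notice period
```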

tomwhite commented Jan 6, 2023
