You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! I am really enjoying experimenting with Boltz, but finding that I run into out of memory errors on my 40GB A100 (NVIDIA A100-SXM4-40GB) on any single proteins above about 1200 AA in length. I am using the most recent version of pip, 0.3.2 and just wondering if perhaps there were any parameter options to boltz (reducing sample steps, etc) or pytorch settings (such as reducing float sizes) which were worth trying.... or really anything else.
At first hiking up the shared memory available to my docker container worked, but now allowing 128gb, it seems like it is very likely a GPU memory issue. On a slightly smaller protein, boltz is taking 36GB of GPU memory.
Super excited to keep using this awesome tool, and great work to the team!
Could you please make sure to run this in a new output directory and let me know if it happens again? I suspect that you may be using an MSA that was preprocessed prior to the latest version. We've run tests and are able to pass 1800AA on a 40G GPU.
Hello! I am really enjoying experimenting with Boltz, but finding that I run into out of memory errors on my 40GB A100 (NVIDIA A100-SXM4-40GB) on any single proteins above about 1200 AA in length. I am using the most recent version of pip, 0.3.2 and just wondering if perhaps there were any parameter options to boltz (reducing sample steps, etc) or pytorch settings (such as reducing float sizes) which were worth trying.... or really anything else.
At first hiking up the shared memory available to my docker container worked, but now allowing 128gb, it seems like it is very likely a GPU memory issue. On a slightly smaller protein, boltz is taking 36GB of GPU memory.
Super excited to keep using this awesome tool, and great work to the team!
Relevant logs for context:
And parameter settings:
2024-12-15 03:02:54,901 - boltz_watcher - INFO - Running command: boltz predict /data/jobs/DATA-1/af_out_complexes/temp_MOT_HUM_RPP_B07_FAS1_MON_STD_pos1_MOT_HUM_RPP_B07_FAS1_MON_STD_pos2/MOT_HUM_RPP_B07_FAS1_MON_STD_pos1_MOT_HUM_RPP_B07_FAS1_MON_STD_pos2.fa --use_msa_server --out_dir /data/jobs/DATA-1/af_out_complexes --accelerator gpu --recycling_steps 3 --sampling_steps 200 --diffusion_samples 1
The text was updated successfully, but these errors were encountered: