GPU prover #16

Brechtpd · 2022-09-12T20:49:07Z

Look into using the GPU to speed up certain prover work:

- FFT
- MSM
- Custom gates?
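
For readers unfamiliar with why MSM maps well to GPUs, here is a minimal sketch of the bucket (Pippenger-style) method. It is only an illustration under loud assumptions: the "points" are integers modulo a prime standing in for elliptic-curve points, and the names `msm_bucket`, `msm_naive`, `window_bits`, and `scalar_bits` are hypothetical, not taken from any library listed in this issue. The point is that each window, and each bucket within a window, can be accumulated independently, which is the part a GPU would parallelize.

```python
# Toy additive group: integers modulo a prime stand in for curve points.
# This is an assumption for illustration only, not real curve arithmetic.
P = 2**61 - 1

def add(a, b):
    """Group addition in the toy group."""
    return (a + b) % P

def msm_naive(scalars, points):
    """Reference result: sum of s_i * G_i computed directly."""
    acc = 0
    for s, g in zip(scalars, points):
        acc = add(acc, (s * g) % P)
    return acc

def msm_bucket(scalars, points, window_bits=4, scalar_bits=64):
    """Bucket method: split each scalar into window_bits-wide digits."""
    num_windows = (scalar_bits + window_bits - 1) // window_bits
    mask = (1 << window_bits) - 1
    window_sums = []
    for w in range(num_windows):          # windows are independent of each other
        buckets = [0] * (1 << window_bits)
        for s, g in zip(scalars, points):
            digit = (s >> (w * window_bits)) & mask
            if digit:
                buckets[digit] = add(buckets[digit], g)
        # Compute sum over d of d * buckets[d] using running sums (additions only).
        running = 0
        total = 0
        for d in range(mask, 0, -1):
            running = add(running, buckets[d])
            total = add(total, running)
        window_sums.append(total)
    # Combine windows from most to least significant (Horner's rule with doublings).
    result = 0
    for total in reversed(window_sums):
        for _ in range(window_bits):
            result = add(result, result)
        result = add(result, total)
    return result
```

With real curve points, `add` would be point addition and the doublings would be point doublings; the structure of the loops stays the same.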

Libraries:

- https://github.com/ingonyama-zk/icicle/
- https://github.com/arkworks-rs (accel)

mratsim · 2023-06-14T14:11:58Z

Others:

- https://github.com/matter-labs/era-bellman-cuda
- https://github.com/z-prize/2022-entries/tree/main/open-division/prize1-msm/prize1a-msm-gpu
  - The two winning teams, Matter Labs and Yrrid, made a combined implementation: https://github.com/matter-labs/z-prize-msm-gpu-combined
- Add the 3x GPU prover (o1-labs/snarky#356)
- https://github.com/MariusVanDerWijden/gpusnarks

See also my quick analysis at: mratsim/constantine#92

There are two additional backends that might be interesting:

- AMD GPUs, in particular because AMD offers significantly more memory than Nvidia (see AMD teasing: https://community.amd.com/t5/gaming/building-an-enthusiast-pc/ba-p/599407), but they aren't available in cloud machines.
- Apple Metal: thanks to unified memory, Mac Studios and Mac Pros can access up to 192GB of memory, enough to fit the super-circuit. However, Metal assembly is closed source. I tried to look into reverse-engineering efforts, from either Apple LLVM or Asahi Linux, to at least find add-with-carry, but I'm not hopeful.
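
To show why add-with-carry matters for field arithmetic, here is a minimal sketch of multi-limb addition. It assumes 32-bit limbs in little-endian order, and the helper names (`add_with_carry`, `bigint_add`, `to_limbs`, `from_limbs`) are hypothetical. On hardware that exposes a carry flag, each loop iteration is a single instruction; without it, the carry has to be recomputed with extra comparisons on every limb.

```python
LIMB_BITS = 32                      # assumed limb width, typical for GPU integer units
LIMB_MASK = (1 << LIMB_BITS) - 1

def to_limbs(x, n):
    """Split a non-negative integer into n little-endian limbs."""
    return [(x >> (LIMB_BITS * i)) & LIMB_MASK for i in range(n)]

def from_limbs(limbs):
    """Reassemble little-endian limbs into an integer."""
    return sum(limb << (LIMB_BITS * i) for i, limb in enumerate(limbs))

def add_with_carry(a, b, carry_in):
    """Emulate one hardware add-with-carry: returns (sum limb, carry out)."""
    t = a + b + carry_in
    return t & LIMB_MASK, t >> LIMB_BITS

def bigint_add(a_limbs, b_limbs):
    """Add two equal-length limb arrays, propagating the carry limb by limb."""
    out = []
    carry = 0
    for a, b in zip(a_limbs, b_limbs):
        s, carry = add_with_carry(a, b, carry)
        out.append(s)
    return out, carry
```

A 256-bit field element is eight such limbs, so every field addition is a chain of eight dependent carries, and multiplication uses many more.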

Intel integrated GPUs also have unified memory, but they are not powerful enough. If we want to use them, we need to wait for an LLVM version whose SPIR-V support is no longer experimental; otherwise LLVM needs to be built from source along with a couple of other LLVM+SPIR-V translators.

hugo-blue · 2023-06-25T15:52:31Z

> Look into using the GPU to speed up certain prover work:
>
> - FFT
> - MSM
> - Custom gates?
>
> Libraries:
>
> - https://github.com/ingonyama-zk/icicle/
> - https://github.com/arkworks-rs (accel)

The evaluation parts of lookup and permutation also deserve optimization.

hugo-blue · 2023-06-25T15:56:13Z

As there are many Nvidia GPUs available in the crypto mining market, focusing on Nvidia GPUs should be enough.

For each ZKP project, there should also be a common memory management module shared by MSM, FFT, and so on, to reduce data-copy time and save memory.

mratsim · 2023-06-26T05:59:48Z

> As there are many Nvidia GPUs available in the crypto mining market, focusing on Nvidia GPUs should be enough.

Miners first focused on megahash per watt, which was dominated by AMD GPUs; then they moved to Nvidia GPUs. However, GPUs with a large amount of VRAM consume more power (and cost more) without the extra memory being useful for parallel SHA256 computation.

Concretely, they bought a lot of AMD RX 480 and Nvidia GTX 1080 Ti cards, but those had only 8GB and 11GB of RAM.

And Nvidia is still gimping the RAM of its GPUs (there are AMD consumer GPUs with 24GB).

> For each ZKP project, there should also be a common memory management module shared by MSM, FFT, and so on, to reduce data-copy time and save memory.

Do you have an example of this? Even on CPUs.

hugo-blue · 2023-06-26T07:34:49Z

> > As there are many Nvidia GPUs available in the crypto mining market, focusing on Nvidia GPUs should be enough.
>
> Miners first focused on megahash per watt, which was dominated by AMD GPUs; then they moved to Nvidia GPUs. However, GPUs with a large amount of VRAM consume more power (and cost more) without the extra memory being useful for parallel SHA256 computation.
>
> Concretely, they bought a lot of AMD RX 480 and Nvidia GTX 1080 Ti cards, but those had only 8GB and 11GB of RAM.
>
> And Nvidia is still gimping the RAM of its GPUs (there are AMD consumer GPUs with 24GB).

I see. So the challenge is to let low-end machines with GPUs like the 1080 do ZKP proving.

> > For each ZKP project, there should also be a common memory management module shared by MSM, FFT, and so on, to reduce data-copy time and save memory.
>
> Do you have an example of this? Even on CPUs.

On CPUs, the system DDR is shared by all computation, so there is no need to care about this. A GPU has limited memory, smaller than the system DDR, so memory management is essential.
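
As a rough illustration of the kind of shared memory management module discussed above, here is a minimal sketch of a fixed-budget buffer pool. Everything in it is an assumption made for illustration: `DevicePool` is a hypothetical name, and `bytearray` objects stand in for device allocations. The idea is that MSM and FFT stages request buffers from one pool, released buffers are reused instead of reallocated, and the pool refuses to exceed the device's memory budget.

```python
class DevicePool:
    """Hypothetical fixed-budget pool; bytearrays stand in for GPU buffers."""

    def __init__(self, capacity_bytes):
        self.capacity = capacity_bytes
        self.allocated = 0      # bytes handed out or cached, counted against capacity
        self.free = {}          # buffer size -> list of released buffers

    def alloc(self, size):
        """Return a buffer of exactly `size` bytes, reusing a released one if possible."""
        cached = self.free.get(size)
        if cached:
            return cached.pop()
        if self.allocated + size > self.capacity:
            raise MemoryError("device memory budget exceeded")
        self.allocated += size
        return bytearray(size)

    def release(self, buf):
        """Return a buffer to the pool so a later stage can reuse it."""
        self.free.setdefault(len(buf), []).append(buf)
```

For example, an FFT stage could release its scratch buffer and a following MSM stage requesting the same size would get that buffer back without a new allocation. A real module would also need to handle different sizes, fragmentation, and host-to-device transfers, none of which this sketch attempts.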

Moudytaiko added this to Taiko Project Board Nov 7, 2022

Moudytaiko added the area.zk-evm label Nov 7, 2022

dionysuzx added the status.needs-triage label May 1, 2023

dantaik assigned Brechtpd May 2, 2023

Brechtpd added meta.alpha-4 and removed status.needs-triage labels May 2, 2023

Brechtpd moved this to 📝 Todo in Taiko Project Board May 3, 2023

Brechtpd removed the meta.alpha-4 label Jul 15, 2023
