About A2B in Rep4 #1509

sycamoreeeee · 2024-09-29T12:27:02Z

Hi Keller, I'm studying the protocol and implementation of Fantastic Four.
In the paper, the share-splitting operation is used to convert arithmetic shares to boolean ones with a binary adder. However, I can't connect the described protocol with the code of Rep4::split, especially the parameter "regs" and the unit processing.
Can you help me understand the implementation? Thanks!

mkskeller · 2024-09-30T03:45:29Z

Rep4<T>::split implements a functionality that didn't make it into the paper. The functionality in the paper is found in Rep4Share2<K>::split. The difference between the two is akin to the two variants for bit decomposition in Section 5.3 of https://eprint.iacr.org/2018/403. In terms of parameters and techniques, however, the two implementations are quite similar.

regs comes from the virtual machine design: it is simply the arguments to the split instruction (apart from the length argument): https://mp-spdz.readthedocs.io/en/latest/instructions.html#Compiler.GC.instructions.split.

The unit processing is an optimization for the vectorized setting (i.e. processing many elements at once). Extracting single bits naively is expensive, but modern CPUs have instructions that speed up transposing matrices of bits. The split code translates the problem of extracting vectors of bits from vectors of elements into such a transposition. The maximum unit of this transposition is 64x64 bits, which is a natural choice for 64-bit processors.
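To make the transposition idea concrete, here is a minimal Python sketch. It is not the MP-SPDZ code (which relies on CPU instructions); the names `transpose64` and `split_bits` are made up for illustration, and the recursive block-swap method is just one well-known way to transpose a bit matrix. It takes 64 elements of 64 bits each and returns 64 words, where word j collects bit j of every element.

```python
import random

def transpose64(rows):
    """Transpose a 64x64 bit matrix; bit j of rows[i] is entry (i, j)."""
    a = list(rows)
    j, mask = 32, 0x00000000FFFFFFFF
    while j:
        k = 0
        while k < 64:
            # Swap the upper-right j-bit blocks of row k with the
            # lower-left j-bit blocks of row k + j.
            t = ((a[k] >> j) ^ a[k + j]) & mask
            a[k] ^= t << j
            a[k + j] ^= t
            k = (k + j + 1) & ~j  # skip rows already handled as partners
        j >>= 1
        mask ^= mask << j
    return a

def split_bits(elements):
    """Hypothetical helper: word j of the result holds bit j of all 64 elements."""
    assert len(elements) == 64
    return transpose64(elements)

random.seed(0)
elements = [random.getrandbits(64) for _ in range(64)]
bit_vectors = split_bits(elements)

# bit i of bit_vectors[j] is bit j of elements[i]
assert (bit_vectors[5] >> 9) & 1 == (elements[9] >> 5) & 1
```

After the transposition, one 64-bit word carries the same bit position of 64 different elements, so a single bitwise operation processes 64 elements at once.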

sycamoreeeee · 2024-12-12T08:12:12Z

Hi Keller, in the paper, after locally splitting [x]_{2^k} (x = x1 + x2 + x3 + x4) into { [x1_j]_2, [x2_j]_2, [x3_j]_2, [x4_j]_2 } for all j in [k], the final step of computing [x_j]_2 is to use a binary adder. How are these splits summed up concretely? Is it to first compute y1 = (x1 + x2) and y2 = (x3 + x4), then compute x = (y1 + y2)?
Also, the paper omits the formal detailed descriptions of mixed-circuit computation such as B2A conversion, MSB extraction, and bit injection, while Rep4 2k offers the basic mul, dotprod and split. Can you help me find the full implementation details? I would appreciate that, thank you!

mkskeller · 2024-12-13T01:48:27Z

> Hi Keller, in the paper, after locally splitting [x]_{2^k} (x = x1 + x2 + x3 + x4) into { [x1_j]_2, [x2_j]_2, [x3_j]_2, [x4_j]_2 } for all j in [k], the final step of computing [x_j]_2 is to use a binary adder. How are these splits summed up concretely? Is it to first compute y1 = (x1 + x2) and y2 = (x3 + x4), then compute x = (y1 + y2)?

We use Wallace trees, a technique from binary multiplication: https://en.wikipedia.org/wiki/Wallace_tree
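As a rough illustration of the Wallace-tree idea (a sketch on plaintext integers, not the circuit MP-SPDZ actually generates): a 3:2 compressor, also called a carry-save adder, turns three summands into two without propagating carries, so four summands need two compressor layers before a single carry-propagating adder. The names `carry_save` and `sum_four` and the width `K = 16` are chosen here for illustration. In the protocol, the XORs would be local operations on binary shares and each AND a secure multiplication.

```python
import random

K = 16                  # bit length, an arbitrary choice for this sketch
MASK = (1 << K) - 1

def carry_save(a, b, c):
    """3:2 compressor on K-bit words: a + b + c == s + carry (mod 2^K)."""
    s = a ^ b ^ c                               # bitwise sum without carries
    carry = ((a & b) ^ (a & c) ^ (b & c)) << 1  # bitwise majority, shifted up
    return s & MASK, carry & MASK

def sum_four(x1, x2, x3, x4):
    """Reduce four summands to two, then add them with one final adder."""
    s, c = carry_save(x1, x2, x3)
    s, c = carry_save(s, c, x4)
    return (s + c) & MASK  # stand-in for the final binary adder circuit

random.seed(1)
xs = [random.getrandbits(K) for _ in range(4)]
assert sum_four(*xs) == sum(xs) & MASK
```

The compressor layers have constant depth regardless of K; only the final addition needs a carry chain.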

> Also, the paper omits the formal detailed descriptions of mixed-circuit computation such as B2A conversion, MSB extraction, and bit injection, while Rep4 2k offers the basic mul, dotprod and split. Can you help me find the full implementation details? I would appreciate that, thank you!

Bit injection is very simple and explained in the text of Section 3. MSB extraction is done by simply computing the MSB in binary as above, followed by converting to arithmetic using bit injection. Lastly, B2A with more than one bit can either be done using edaBits (https://eprint.iacr.org/2020/338) or with a few tricks that aren't in the paper because they're not really relevant in applications.

sycamoreeeee · 2024-12-17T10:04:51Z

Thank you. I'm still studying this since I want to re-implement Rep4.
In the paper you mentioned that “The most common design would use k AND gates in a first step to compute k generate-propagate tuples, followed by tree-wise reduction where every step involves 2 AND gates, resulting in 3k AND gates overall.”
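One way to read the quoted sentence, sketched on plaintext bits: compute a (generate, propagate) tuple per bit position with one AND each, then merge neighbouring tuples pairwise in a tree, where each merge costs two ANDs. The function name `carry_out` and the gate counter are introduced here for illustration only; this computes just the carry out of a k-bit addition, which is what a comparison or MSB circuit needs.

```python
import random

def carry_out(a, b, k):
    """Carry out of the k-bit addition a + b, plus the number of AND gates used."""
    and_gates = 0
    tuples = []
    for i in range(k):
        ai, bi = (a >> i) & 1, (b >> i) & 1
        tuples.append((ai & bi, ai ^ bi))  # (generate, propagate): 1 AND
        and_gates += 1
    # Tree-wise reduction: merge neighbouring tuples, low position first.
    while len(tuples) > 1:
        merged = []
        for i in range(0, len(tuples) - 1, 2):
            (g_lo, p_lo), (g_hi, p_hi) = tuples[i], tuples[i + 1]
            # generate and propagate are never both 1, so XOR acts as OR here
            merged.append((g_hi ^ (p_hi & g_lo), p_hi & p_lo))  # 2 ANDs
            and_gates += 2
        if len(tuples) % 2:
            merged.append(tuples[-1])  # odd tuple moves up unchanged
        tuples = merged
    return tuples[0][0], and_gates

random.seed(3)
k = 16
a, b = random.getrandbits(k), random.getrandbits(k)
carry, gates = carry_out(a, b, k)
assert carry == (a + b) >> k
assert gates == 3 * k - 2  # k for the tuples, 2(k - 1) for the tree
```

The count of 3k - 2 is in line with the "3k AND gates overall" in the quote, and the tree has depth of about log k merges.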

Does the tree-wise reduction correspond to the Wallace tree?
If so, my understanding is that full adders first reduce x1 + x2 + x3 to c and s (as in ABY3), and the step is repeated for (c, s, x4) to obtain c', s', on which we can use a binary adder such as a PPA to compute the final result.
Is that correct?
If so, since the 2 reductions require about 2k full adders and each full adder requires 2 AND gates, would the overall number of AND gates and communication rounds be (5k, k + 2), or (4k + k log k, 2 + log k) with a PPA?

mkskeller · 2024-12-18T01:33:41Z

You can implement a full adder with only one AND gate when you use a MUX gate (which requires one AND): https://www.researchgate.net/figure/Full-adder-using-XOR-gates-and-a-MUX_fig6_234773872
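A minimal sketch of that construction on plaintext bits, with function names chosen here for illustration: the sum bit needs only XORs, and the carry is selected by a multiplexer controlled by a XOR b, which costs a single AND.

```python
from itertools import product

def mux(sel, if_one, if_zero):
    """2:1 multiplexer with one AND: returns if_one when sel == 1, else if_zero."""
    return if_zero ^ (sel & (if_one ^ if_zero))

def full_adder(a, b, c_in):
    """Full adder using XOR gates and one MUX (one AND gate in total)."""
    p = a ^ b               # 1 exactly when a and b differ
    s = p ^ c_in            # sum bit
    c_out = mux(p, c_in, a) # if a != b the carry is c_in, otherwise it is a
    return s, c_out

for a, b, c_in in product((0, 1), repeat=3):
    s, c_out = full_adder(a, b, c_in)
    assert s + 2 * c_out == a + b + c_in
```

When a and b are equal, the carry is that common value; when they differ, the carry-in decides, which is exactly the majority function.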
Furthermore, MP-SPDZ achieves a trade-off between AND gates and rounds using a carry-select adder rather than PPA: https://en.wikipedia.org/wiki/Carry-select_adder
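The carry-select idea can be sketched on plaintext integers as follows. The block size is a hypothetical parameter picked for illustration, not the choice MP-SPDZ makes, and `carry_select_add` is a made-up name. Each block computes its result for both possible carry-ins; the actual carry from the previous block then only has to select one of the two candidates.

```python
import random

def carry_select_add(a, b, k, block):
    """Add two k-bit integers block by block; returns (k-bit sum, carry out)."""
    result, carry = 0, 0
    for start in range(0, k, block):
        width = min(block, k - start)
        m = (1 << width) - 1
        xa, xb = (a >> start) & m, (b >> start) & m
        # In a circuit, both candidates of every block are computed in parallel.
        cand0 = xa + xb      # block result assuming carry-in 0
        cand1 = xa + xb + 1  # block result assuming carry-in 1
        # Only this selection depends on the previous block.
        chosen = cand1 if carry else cand0
        result |= (chosen & m) << start
        carry = chosen >> width
    return result, carry

random.seed(5)
k = 64
a, b = random.getrandbits(k), random.getrandbits(k)
total, carry = carry_select_add(a, b, k, block=8)
assert total == (a + b) & ((1 << k) - 1)
assert carry == (a + b) >> k
```

Larger blocks mean fewer selection steps but longer adders inside each block, which is where the trade-off between AND gates and rounds comes from.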

mkskeller closed this as completed Oct 8, 2024

mkskeller reopened this Dec 13, 2024
