Single-shot DKG #3776

lukasz-zimnoch · 2024-02-07T19:50:46Z

Refs: #3770
Depends on: #3775

The currently used DKG retry mechanism based on random exclusion turned out to be ineffective for a higher number of participating operators. Such retries have a very small chance of success and produce a lot of unnecessary network traffic that consumes bandwidth and CPU excessively.

Here we aim to improve the situation. First, we are making DKG a single-shot process that fails fast if the result cannot be produced during the first attempt. Second, we are doubling down the announcement period to maximize participation chances for all selected operators, even those at the edge of the network. Last but not least, we are reducing the submission delay that is preserved between operators attempting to submit the final result on-chain.

All those changes combined allow us to achieve shorter DKG iterations that can be timed out quicker. This way, we will be able to repeat DKG more often, with different operator sets.

Last but not least, we are also changing the re-transmission strategy for the resultSigningState which was still using StandardRetransmissionStrategy with retransmissions occurring on each tick. All DKG states use the BackoffRetransmissionStrategy strategy which leads to a sparse distribution of retransmissions and is considered more lightweight. There is no point in making an exception for the resultSigningState. This should reduce network load in case one of the participants fails at the end of the protocol.

The currently used DKG retry mechanism based on random exclusion turned out to be ineffective for a higher number of participating operators. Such retries have a very small chance for success and produce a lot of unnecessary network traffic that consumes bandwidth and CPU excessively. Here we aim to improve the situation. First, we are making DKG a single-shot process which fails fast if the result cannot be produced during the first attempt. Second, we are doubling down the announcement period to maximize participation chance for all selected operators, even those being at the edge of the network. Last but not least, we are reducing the submission delay that is preserved between operators attempting to submit the final result on-chain. All those changes combined allow to achieve shorter DKG iterations that can be timed out quicker. This way, we will be able to repeat DKG more often, with different operator sets.

github-actions · 2024-02-07T19:52:35Z

Solidity API documentation preview available in the artifacts of the https://github.com/keep-network/keep-core/actions/runs/7820364827 check.

All DKG states use the `BackoffRetransmissionStrategy` strategy. There is no point to make an exception for the `resultSigningState`. This should reduce network load in case one of the participants fails at the end of the protocol.

github-actions · 2024-02-07T20:10:05Z

Solidity API documentation preview available in the artifacts of the https://github.com/keep-network/keep-core/actions/runs/7820549852 check.

pdyraga

Looks good to me, left just one comment.

pdyraga · 2024-02-08T11:17:31Z

pkg/tbtc/dkg.go

@@ -28,7 +28,7 @@ const (
 	// is used to calculate the submission delay period that should be respected
 	// by the given member to avoid all members submitting the same DKG result
 	// at the same time.
-	dkgResultSubmissionDelayStepBlocks = 15
+	dkgResultSubmissionDelayStepBlocks = 3


I am concerned this is too low. The gas price bump happens after one minute by default so if the initial estimation was not successful, there will be not enough time to do even one price bump.

I tested that yesterday on Sepolia where gas prices were quite high and volatile. Haven't observed any problem so I think we should be good. It is important to reduce DKG timeout to the minimum and this factor strongly commits to that value. We will monitor the situation on mainnet and increase that if necessary.

Worth noting that the price bump will be actually done. The only downside is that the next member may front-run with their own transaction. This may lead to a collision sometimes but is not harmful to the protocol.

This pull request backports #3776 to the `releases/mainnet/v2.0.0-m7` branch.

lukasz-zimnoch self-assigned this Feb 7, 2024

lukasz-zimnoch added this to the v2.0.0-m7 milestone Feb 7, 2024

lukasz-zimnoch added 📟 client ⛓️ solidity labels Feb 7, 2024

lukasz-zimnoch mentioned this pull request Feb 7, 2024

Reduce overhead around DKG #3770

Closed

pdyraga reviewed Feb 8, 2024

View reviewed changes

Base automatically changed from fix-electrum-mem-leak to main February 8, 2024 15:37

lukasz-zimnoch marked this pull request as ready for review February 8, 2024 15:37

lukasz-zimnoch requested review from nkuba, dimpar and tomaszslabon as code owners February 8, 2024 15:37

tomaszslabon approved these changes Feb 8, 2024

View reviewed changes

tomaszslabon merged commit 17735c2 into main Feb 8, 2024
29 checks passed

tomaszslabon deleted the single-shot-dkg branch February 8, 2024 15:54

lukasz-zimnoch mentioned this pull request Feb 12, 2024

[Backport] Single-shot DKG #3782

Merged

lukasz-zimnoch added a commit that referenced this pull request Feb 12, 2024

[Backport] Single-shot DKG (#3782)

cbf53d5

This pull request backports #3776 to the `releases/mainnet/v2.0.0-m7` branch.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Single-shot DKG #3776

Single-shot DKG #3776

lukasz-zimnoch commented Feb 7, 2024 •

edited

Loading

github-actions bot commented Feb 7, 2024

github-actions bot commented Feb 7, 2024

pdyraga left a comment

pdyraga Feb 8, 2024

lukasz-zimnoch Feb 8, 2024

lukasz-zimnoch Feb 8, 2024

Single-shot DKG #3776

Single-shot DKG #3776

Conversation

lukasz-zimnoch commented Feb 7, 2024 • edited Loading

github-actions bot commented Feb 7, 2024

github-actions bot commented Feb 7, 2024

pdyraga left a comment

Choose a reason for hiding this comment

pdyraga Feb 8, 2024

Choose a reason for hiding this comment

lukasz-zimnoch Feb 8, 2024

Choose a reason for hiding this comment

lukasz-zimnoch Feb 8, 2024

Choose a reason for hiding this comment

lukasz-zimnoch commented Feb 7, 2024 •

edited

Loading