
DHT usage and future work #11

Closed · 2 tasks

Wondertan opened this issue Jun 6, 2021 · 2 comments

Comments

@Wondertan (Member)

Background

The Data Availability model we use requires data discovery. We rely on IPFS's Kademlia DHT, which allows any network participant to find hosts for a given piece of data by its hash.

Usage Description

To describe how we use it, let's introduce a simple pseudo-code interface:

interface DHT {
	// Find the peers nearest to the hash and ask them to keep a record of us hosting the data.
	// By default, records are stored for 24h.
	Provide(hash)
	// Find peers hosting the data by its hash.
	FindProviders(hash) []peer
	// Periodically execute Provide for a given hash to keep the record around.
	Reprovide(hash)
}
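As an illustration of the Reprovide semantics, here is a minimal Go sketch in a hypothetical discovery package; the type names, the package name, and the 22h interval are illustrative assumptions, not the real implementation:

package discovery

import (
	"context"
	"time"
)

// Hash and Peer are hypothetical stand-ins mirroring the
// pseudo-interface above.
type Hash []byte
type Peer struct{ ID string }

// DHT mirrors the Provide/FindProviders part of the pseudo-interface.
type DHT interface {
	Provide(Hash)
	FindProviders(Hash) []Peer
}

// reprovide re-executes Provide slightly before the default 24h
// record expiry, so the record stays continuously discoverable.
// The 22h interval is illustrative.
func reprovide(ctx context.Context, dht DHT, h Hash) {
	ticker := time.NewTicker(22 * time.Hour)
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return
		case <-ticker.C:
			dht.Provide(h)
		}
	}
}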

When a block producer creates a block, it saves it and calls Provide for every Data Availability root of the block, making the data discoverable and, in turn, available. Afterward, any other node that wants to get the block's data or validate its availability can call FindProviders, discover the block producer, and finally access the block data through Bitswap. Both the block producer and the block requester also call Reprovide. Overall, with the described flow, we aim for maximum confidence that the data of any particular block is always discoverable from the peers storing it.
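To make the flow concrete, here is a sketch in the same hypothetical package as the reprovide loop above; Block, DARoot, store, and bitswapGet are illustrative stand-ins, not the real celestia-node APIs:

// Hypothetical stand-ins for the real block, storage, and Bitswap
// components.
type DARoot struct{ hash Hash }

func (r DARoot) Hash() Hash { return r.hash }

type Block struct{ roots []DARoot }

func (b Block) DARoots() []DARoot { return b.roots }

func store(Block)                    {}
func bitswapGet([]Peer, Hash) []byte { return nil }

// produceBlock saves a new block, makes every DA root of it
// discoverable, and keeps the records alive via reprovide.
func produceBlock(ctx context.Context, dht DHT, b Block) {
	store(b)
	for _, root := range b.DARoots() {
		dht.Provide(root.Hash())
		go reprovide(ctx, dht, root.Hash())
	}
}

// fetchBlockData discovers providers for a DA root, fetches the data
// through Bitswap, and also helps keep the record discoverable.
func fetchBlockData(ctx context.Context, dht DHT, root DARoot) []byte {
	peers := dht.FindProviders(root.Hash())
	data := bitswapGet(peers, root.Hash())
	go reprovide(ctx, dht, root.Hash())
	return data
}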

What's Left

The current state of the implementation does not conform to the flow above, and the following tasks remain:

Pain Points

Node churn

Records of someone hosting data are stored on peers selected not by their qualities but purely by the XOR distance metric. Unfortunately, this means records often land on light clients, which store them unreliably, as light clients are not meant to be full-featured daemons. As a result, some data may become undiscoverable for a period of time.

Solutions

  • Reproviding largely helps here. However, we never know when a light client leaves, so data may remain undiscoverable for many hours until the next Reprovide happens and stores the records on another node.
  • A full-routing-table DHT client can keep records beyond the ones assigned to it by the XOR metric, and the nodes running it are expected to be reliable. Thus, they can fill the gap of undiscoverable hours (see the sketch below).
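For the second option, a rough sketch of constructing such a client with the fullrt package of go-libp2p-kad-dht might look as follows; the exact options are assumptions, as the package API has changed between versions, so check the package docs:

package main

import (
	"fmt"

	"github.com/libp2p/go-libp2p"
	dht "github.com/libp2p/go-libp2p-kad-dht"
	"github.com/libp2p/go-libp2p-kad-dht/fullrt"
)

func main() {
	// A libp2p host to run the DHT client on.
	host, err := libp2p.New()
	if err != nil {
		panic(err)
	}

	// The full-routing-table ("accelerated") client crawls the whole
	// DHT and keeps it in memory, trading resource usage for much
	// faster Provide calls.
	frt, err := fullrt.NewFullRT(host, dht.DefaultPrefix,
		fullrt.DHTOption(dht.BootstrapPeers(dht.GetDefaultBootstrapPeerAddrInfos()...)),
	)
	if err != nil {
		panic(err)
	}
	fmt.Println("full routing table client ready:", frt != nil)
}

A block producer would then provide DA roots through this client (it implements the usual routing interface, taking a CID) instead of the standard one.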

Providing Time

We need to ensure that providing takes less time than the interval between two subsequent block proposals by a node. Otherwise, DHT providing would not keep up with block production, creating an ever-growing providing queue. Unfortunately, with the standard DHT client, providing can take up to 3 minutes on a large-scale network.
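To illustrate (in the same hypothetical package as the sketches above, with context, log, and time imported), a providing worker could measure each Provide call against the block interval and flag when the queue would start to grow:

// provideWorker drains a queue of hashes to provide and warns when a
// single Provide takes longer than the block interval, i.e. the point
// at which the providing queue starts growing without bound.
func provideWorker(ctx context.Context, dht DHT, queue <-chan Hash, blockInterval time.Duration) {
	for {
		select {
		case <-ctx.Done():
			return
		case h := <-queue:
			start := time.Now()
			dht.Provide(h)
			if took := time.Since(start); took > blockInterval {
				log.Printf("providing lags block production: %v per record vs %v block interval", took, blockInterval)
			}
		}
	}
}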

From this also comes a rule: the bigger the committee, the more time a node has between its proposals to finish providing. Naturally, the larger the network, the larger the committee and the longer the providing time, so these can scale together organically without causing issues. But if slow providing still proves to be a problem, a full-routing-table DHT client for block producers would be a solution, as it significantly reduces providing time.

Other Possible Improvements

@liamsi transferred this issue from celestiaorg/celestia-core on Aug 16, 2021
@liamsi (Member) commented Apr 14, 2022

This issue is great historic context. But is it still relevant?

@Wondertan (Member, Author)

I don't think so. The DHT is still being used, but not for content discovery, so closing.
