wip

keep-starknet-strange · Oct 16, 2024 · ff890b2 · ff890b2
1 parent b873b01
commit ff890b2
Showing 1 changed file with 4 additions and 4 deletions.
diff --git a/docs/data.md b/docs/data.md
@@ -2,17 +2,17 @@
 In order to generate input to the [validate_and_apply](../packages/consensus/src/types/chain_state.cairo#L62) function, a lot of data needs to be gathered. 
 
 ## ChainState and Block
-Generating ChainState and Block data involves joining information between multiple blockes and transactions. Since this kind of operations is slow with Bitcoin RPC we use Google Bitcoin data set which allows us to export data with plain sql. Unfortunately due to the [missing transaction_index](https://github.com/blockchain-etl/bitcoin-etl/issues/47) bug in the data set it can't be the only source of data.
+Generating ChainState and Block data involves joining information between multiple blocks and transactions. Since this kind of operations is slow with Bitcoin RPC we use Google Bitcoin data set which allows us to export data with plain sql. Unfortunately due to the [missing transaction_index](https://github.com/blockchain-etl/bitcoin-etl/issues/47) bug in the data set it can't be the only source of data.
 
 <p align="center" width="100%">
   <img src="./img/data.svg" alt="client"/>
 </p>
 
-Steps:
+Input data is processed in multiple steps:
 1. [previous_timestamps.sql](../scripts/data/previous_timestamps.sql) and [previous_utxos.sql](../scripts/data/previous_utxos.sql) queries dump data into GCS
-2. Timestamp data dump is processed by [generate_timestamp_data.py](../scripts/data/generate_timestamp_data.py) script: data is downloaded from GCS and index files are created. Index maps block number to per block timestamp related data. Index is broken down into smaller files, in order to be quickly loaded into the memory.
+2. Timestamp data dump is processed by [generate_timestamp_data.py](../scripts/data/generate_timestamp_data.py) script. Data is downloaded from GCS and index files are created. Index maps block number to per block timestamp related data. Index is broken down into smaller files, in order to be quickly loaded into the memory.
 3. Utxo data dump is  by [generate_utxo_data.py](../scripts/data/generate_utxo_data.py) script: is downloaded from GCS, data files are broken down into smaller chunks, each chunk contains data about several blocks. Index files are created. Index maps block number to a chunk file. Index is broken down into smaller files.
-4. After data dump processing is complete functions `get_timestamp_data` and `get_utxo_set` in [generate_timestamp_data.py](../scripts/data/generate_timestamp_data.py) and [generate_utxo_data.py](../scripts/data/generate_utxo_data.py) give access to the per block data.
+4. After data dump processing is complete functions [`get_timestamp_data`](../scripts/data/generate_timestamp_data.py#L88) and [`get_utxo_set`](../scripts/data/generate_utxo_data.py#L125)  give access to the per block data.
 5. Script [generate_data](../scripts/data/generate_timestamp_data.py) generates data that can be consumed by the `validate_and_apply` function.
 
 ## UtxoSet