
feat: bundle and tag all related blobs #553

Open
wants to merge 5 commits into main

Conversation

islamaliev (Contributor)

Resolves recallnet/entanglement#25

After a blob is uploaded and entangled, a new hash sequence is created that bundles all related blob hashes: the original blob, the metadata blob, and the parity blobs.
Instead of returning the original blob's hash, we return the hash of the hash sequence.

During the upload process, all blobs are assigned auto tags, which are deleted at the end of the upload. The hash sequence is assigned a "temp-{hash_seq_hash}" tag that will eventually be replaced with "stored-{hash_seq_hash}" by the validator.
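For illustration, here is a minimal sketch of that flow. The blob-store calls are abstracted behind a hypothetical `BlobStore` trait (the real code goes through the iroh client and the entangler, whose APIs are not shown here); this is not the PR's `tag_entangled_data` implementation, just the described steps in code form.

```rust
// Sketch only: `BlobStore` is a hypothetical stand-in for the iroh/entangler
// calls used in this PR; names and signatures are illustrative.
use anyhow::Result;

trait BlobStore {
    /// Store a hash sequence built from `hashes` and return its hash.
    fn put_hash_seq(&self, hashes: &[String]) -> Result<String>;
    /// Attach a named tag to `hash` so it is protected from garbage collection.
    fn set_tag(&self, name: &str, hash: &str) -> Result<()>;
    /// Drop the per-blob auto tags created while the blobs were uploaded.
    fn delete_auto_tags(&self, hashes: &[String]) -> Result<()>;
}

/// Bundle the original, metadata, and parity blob hashes into one hash
/// sequence, tag it "temp-{hash_seq_hash}", and return that hash as the
/// object's hash.
fn bundle_and_tag<S: BlobStore>(
    store: &S,
    original: String,
    metadata: String,
    parity: Vec<String>,
) -> Result<String> {
    let mut hashes = vec![original, metadata];
    hashes.extend(parity);

    let hash_seq_hash = store.put_hash_seq(&hashes)?;

    // The "temp-" tag keeps the whole bundle alive until the validator
    // confirms storage and renames it to "stored-{hash_seq_hash}".
    store.set_tag(&format!("temp-{hash_seq_hash}"), &hash_seq_hash)?;

    // Once the bundle itself is tagged, the per-blob auto tags can go.
    store.delete_auto_tags(&hashes)?;

    Ok(hash_seq_hash)
}
```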

@islamaliev islamaliev self-assigned this Feb 28, 2025
islamaliev (Contributor, Author) commented Feb 28, 2025

here are some console outputs after uploading a blob:

$ iroh blobs list blobs
 pbqupxj2nul4pzli7sy6wuudhl2q44l77p5esbvt6pmezqi4ayaq (256.00 KiB) <- parity blob 1
 ps774yacwmkbhwmwaqdtmuru5sf5apco5uiplyfa2tiymwxkizka (256.00 KiB) <- parity blob 2
 qgxlv6zva5vgobqkfit6i6unbznxz7g7hjsiv2wjqlcpemr6tqka (256.00 KiB) <- parity blob 3
 uga6ypqnqyzpoozzuiznadi5f764dteh54tcekqyo3e6jxfgx2ra (160 B) <- hash_seq
 xmcorssflnxtuiocgbecra7xjpo2hvhy4irdb3nd6cu6jxk72g6q (328 B) <- metadata
 ytwgmhbmhfbrl7vpxhclxmwjbv3nkyesj4oxcfwnrr2f5o5ok3aa (255.01 KiB) <- original blob
$ iroh tags list
"stored-uga6ypqnqyzpoozzuiznadi5f764dteh54tcekqyo3e6jxfgx2ra": uga6ypqnqyzpoozzuiznadi5f764dteh54tcekqyo3e6jxfgx2ra (Raw)
$ recall bu query --address 0xff0000000000000000000000000000000000007f 
{
  "objects": [
    {
      "key": "cargo",
      "value": {
        "hash": "uga6ypqnqyzpoozzuiznadi5f764dteh54tcekqyo3e6jxfgx2ra",
        "size": 261128,
        "metadata": {
          "content-type": "application/octet-stream"
        }
      }
    }
  ],
  "common_prefixes": [],
  "next_key": null
}
$ recall bu  get --address 0xff0000000000000000000000000000000000007f cargo > bla.toml
✨  Downloaded object in 0 seconds (hash=vk7api6jjzykojjy6fgyj3phbvo6lfz37z7izfortszssmyo23la; size=261128)
$ ll Cargo.lock 
-rw-r--r--  1 islam  staff   255K Feb 24 17:22 Cargo.lock

sanderpick (Contributor) left a comment

This looks on the right track... I left some clarifying questions.

@@ -442,16 +442,76 @@ async fn handle_object_upload(
})
})?;

let hash_seq_hash = tag_entangled_data(&iroh, &ent, &metadata_hash)
sanderpick (Contributor)

Shouldn't the entangler tag its own data? I.e., it could return some list of temp tags (UUID or similar) instead of relying on the auto tags and having to scan all tags below. I think we need to avoid any design that does a full scan of tags.
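One possible shape for that suggestion (illustrative only; these names are not the entangler crate's actual API): the entangler returns the hash-sequence hash together with the temp tags it created, so the uploader never needs to list all tags.

```rust
// Hypothetical return type for the entangle step, as suggested above.
// The entangler would create and own its temp tags, then hand them back
// so the caller can delete or upgrade them without scanning the tag list.
struct EntangleOutcome {
    /// Hash of the hash sequence bundling original, metadata, and parity blobs.
    hash_seq_hash: String,
    /// Temp tag names (UUID-based or similar) protecting every blob the
    /// entangler created during this upload.
    temp_tags: Vec<String>,
}
```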

.try_filter_map(|tag| {
    let cloned_hashes = hashes.clone();
    async move {
        if cloned_hashes.contains(&tag.hash) {
sanderpick (Contributor)

what is the format of these "temp" tags?

islamaliev (Contributor, Author)

these are auto tags generated by iroh

@@ -681,6 +731,72 @@ async fn handle_object_download<F: QueryClient + Send + Sync>(
}
}

async fn extract_blob_hash_and_size(
sanderpick (Contributor)

Are there any corresponding changes we need to make to the SDK to accommodate this? Maybe not, just double checking.

islamaliev (Contributor, Author) commented Mar 3, 2025

It depends... I showed some terminal outputs above. There is the blob (or object) that is uploaded, but the hash that is displayed is the hash of the wrapping hash sequence. The size, though, is still that of the single blob, not of the whole hash-sequence bundle with parity blobs.

I'm not sure users should need to know the internal structure of the blobs. Let me know if it's fine like this; otherwise I can add whatever we need.
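For reference, this is the shape implied by the `recall bu query` output above (an illustrative struct, not the SDK's actual type): `hash` is the hash-sequence hash, while `size` remains the original blob's size.

```rust
// Illustrative only: mirrors the object value shown in the query output,
// where "hash" = uga6y... (the hash sequence) and "size" = 261128 (the
// original blob alone, excluding metadata and parity blobs).
struct ObjectValue {
    hash: String,         // hash of the wrapping hash sequence
    size: u64,            // size of the original blob only
    content_type: String, // e.g. "application/octet-stream"
}
```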

Successfully merging this pull request may close these issues:
Use Iroh tagging for all hashes