Fix 16k file bucket #143

bdon · 2024-02-21T07:14:14Z

Currently, archives smaller than 16 KB don't work with file buckets (possibly the same for HTTP, depending on the server).

The initial IO is for 16384 bytes to save roundtrips over the network. This is a useful optimization, and it doesn't seem worth special-casing non-network archives to do something different (do one IO for a few bytes to then get the exact length...)

However, doing a ReadAt for 16384 bytes on a smaller archive will read the entire archive and then return an EOF error. This PR special-cases range requests at offset 0 to allow shorter responses. I don't love this because it adds hidden hardcoded behavior based on offset=0.
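To illustrate the failure mode, here is a minimal sketch (not the code in this PR) of an initial 16384-byte read that tolerates a short result. `initialRead` and `initialReadSize` are hypothetical names, and an in-memory `bytes.Reader` stands in for the file. Per the `io.ReaderAt` contract, a read that reaches the end of the input returns the bytes it did get together with a non-nil error, usually `io.EOF`.

```go
package main

import (
	"bytes"
	"errors"
	"fmt"
	"io"
)

// initialReadSize is the size of the first read discussed above.
const initialReadSize = 16384

// initialRead is a hypothetical helper: it performs the fixed-size first read
// against an in-memory archive and treats io.EOF on a short read as success.
func initialRead(archive []byte) ([]byte, error) {
	r := bytes.NewReader(archive)
	buf := make([]byte, initialReadSize)
	n, err := r.ReadAt(buf, 0)
	// ReadAt reports io.EOF when fewer than len(buf) bytes were available.
	// The n bytes already copied into buf are still valid.
	if err != nil && !errors.Is(err, io.EOF) {
		return nil, err
	}
	return buf[:n], nil
}

func main() {
	b, err := initialRead(make([]byte, 1000)) // archive smaller than 16384 bytes
	fmt.Println(len(b), err)                  // prints: 1000 <nil>
}
```

Whether the short read should be allowed only at offset 0 or at any offset is the design question raised in this thread; the sketch takes no position on it.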

Alternative: add another boolean parameter to NewRangeReaderEtag that's like allowShortResponse bool to make this behavior explicit. Thoughts?

msbarry · 2024-02-21T10:44:47Z

It looks like HTTP/gocloud buckets don't care if your byte range request goes past the end of the file? To match that behavior, maybe we could just remove the && offset == 0 check?

Also make sure you update mockBucket, and you can remove the extra logic I added to fakeArchive to pad if <16kb.

bdon · 2024-02-22T06:44:07Z

> To match that behavior, maybe we could just remove the && offset == 0 check?

It should be OK for files; however, on HTTP we need to special-case it. Right now it's caught by isRefreshRequiredCode.

If we are treating ETags correctly and the server correctly implements conditional requests, we should never see a 416, only 412s when ETags have actually expired. The only case where we see a 416 is the <16 KB case, and we don't even get the N successfully read bytes. So our HTTP bucket should probably detect 416 and retry. A bit ugly...

msbarry · 2024-02-22T11:09:38Z

I didn't think it was an ETag thing; I meant that HTTP servers answer range requests with the overlap of the requested range and the actual data range. I've tried a few different HTTP servers, and when you request a range that starts within the file and extends past the end of it, they always return a 206. A 416 is returned only when the start of the range is at or past the end of the file. Are you saying that caddy returns that error when backed by an HTTP bucket or file bucket?

The spec for the 416 status code indicates that it should only be returned when none of the ranges "overlap the current extent".
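One way to check this behavior locally is against Go's standard-library file serving. The sketch below is only an illustration: `rangeStatus` is a hypothetical helper, the 100-byte body is made up, and `http.ServeContent` is just one server implementation, so other servers may behave differently. It uses `httptest.NewRecorder` so no network socket is needed.

```go
package main

import (
	"bytes"
	"fmt"
	"net/http"
	"net/http/httptest"
	"time"
)

// rangeStatus serves a body of the given size through http.ServeContent,
// sends one request with the given Range header, and returns the response
// status code and the number of body bytes written.
func rangeStatus(size int, rangeHeader string) (int, int) {
	body := bytes.Repeat([]byte{'x'}, size)
	req := httptest.NewRequest(http.MethodGet, "/archive.pmtiles", nil)
	req.Header.Set("Range", rangeHeader)
	rec := httptest.NewRecorder()
	http.ServeContent(rec, req, "archive.pmtiles", time.Time{}, bytes.NewReader(body))
	return rec.Code, rec.Body.Len()
}

func main() {
	// The range starts inside the file and runs past its end:
	// the server clamps it and answers 206 with the 100 available bytes.
	code, n := rangeStatus(100, "bytes=0-16383")
	fmt.Println(code, n) // prints: 206 100

	// The range starts at the end of the file, so nothing overlaps: 416.
	code, _ = rangeStatus(100, "bytes=100-16383")
	fmt.Println(code) // prints: 416
}
```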

bdon · 2024-02-25T14:49:46Z

@msbarry ok, updated

msbarry · 2024-02-25T14:52:18Z

pmtiles/bucket.go

@@ -60,11 +60,15 @@ func (m mockBucket) NewRangeReaderEtag(_ context.Context, key string, offset int
 	if len(etag) > 0 && resultEtag != etag {
 		return nil, "", 412, &RefreshRequiredError{}
 	}
-	if offset+length > int64(len(bs)) {
+	if offset > int64(len(bs)) {


I think this should be >=
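The difference between `>` and `>=` shows up when `offset` equals the length of the data: no byte of the request overlaps, yet `>` would let it through as an empty successful read. Below is a minimal sketch of the clamping logic under that reading. `readRange`, `errRangeNotSatisfiable`, and the 206/416 status values are hypothetical stand-ins chosen for illustration, not the actual `mockBucket` code.

```go
package main

import (
	"errors"
	"fmt"
)

// errRangeNotSatisfiable is a hypothetical sentinel error for this sketch.
var errRangeNotSatisfiable = errors.New("range not satisfiable")

// readRange returns the overlap of [offset, offset+length) with bs.
// It fails only when the range starts at or past the end of bs,
// mirroring how an HTTP server answers 416 for a non-overlapping range.
func readRange(bs []byte, offset, length int64) ([]byte, int, error) {
	size := int64(len(bs))
	if offset < 0 || length < 0 || offset >= size {
		return nil, 416, errRangeNotSatisfiable
	}
	end := offset + length
	if end > size {
		end = size // clamp: a short response is allowed
	}
	return bs[offset:end], 206, nil
}

func main() {
	data := make([]byte, 100)

	b, code, err := readRange(data, 0, 16384)
	fmt.Println(len(b), code, err) // prints: 100 206 <nil>

	_, code, err = readRange(data, 100, 10)
	fmt.Println(code, err) // prints: 416 range not satisfiable
}
```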

bdon mentioned this pull request Feb 21, 2024

Corrupted file #144

Closed

bdon added 3 commits February 25, 2024 22:34

fix typo on 'required'

b91a134

remove extraneous print

891f1bb

File bucket handles archives less than 16kb

e0f80f9

bdon force-pushed the fix-16k-file-bucket branch from 0567099 to e0f80f9 Compare February 25, 2024 14:34

file bucket can return fewer bytes than requested

08536b9

msbarry reviewed Feb 25, 2024

bdon merged commit 5e5daa7 into main Feb 25, 2024
4 checks passed

bdon deleted the fix-16k-file-bucket branch February 25, 2024 14:59
