Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow non compressed WARC files to be used while recording #915

Open
Lisias opened this issue Aug 19, 2024 · 0 comments
Open

Allow non compressed WARC files to be used while recording #915

Lisias opened this issue Aug 19, 2024 · 0 comments

Comments

@Lisias
Copy link

Lisias commented Aug 19, 2024

Modern file systems now have compression schemes that surpasses userland compression schemes in convenience and sometimes even on efficiency, rendering the current use of gzip inconvenient when such filesystems are used.

As a use case, BTRFS with zstd:15 gave me excellent results on compression, surpassing what gzip could do, with faster read access. At very least, I don't need to recompress the WARCball after a recording session.

Ideally it should be possible to allow the user to choose if they are want to use the warcio gzip support, or prefer to rely on the file system for such - preventing the user to uncompress the thing manually to take advantage of the file system compression.

Lisias added a commit to Lisias/pywb that referenced this issue Aug 19, 2024
…owadays (and to think this started with Stacker on MS-DOS!), so it makes sense to use **uncompressed** WARC files on a BTRFS or NTFS with it activated. This commit deactivates the WARCIO gzip support when the filename **does not** ends with `.gz`, allowing the user to use these filesystems to reach the compression he wants without having to deal with uncompressing the WARC on use.

for webrecorder#915
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant