
decrypt_file_iter, a generator yielding chunks that *would* be passed to on_data? Concise alternative to using on_data for streaming #246

Open
vergenzt opened this issue Jan 14, 2025 · 4 comments
Labels
enhancement New feature or request

Comments

@vergenzt commented Jan 14, 2025

Is your feature request related to a problem? Please describe.
It'd be nice to be able to iterate over streamed chunks in gpg.decrypt_file instead of having to set an on_data callback.

Describe the solution you'd like
Use one of the functions from https://stackoverflow.com/questions/9968592/turn-functions-with-a-callback-into-python-generators to wrap decrypt_file, yielding each chunk of the file as it streams.

Then, once the iterator terminates, perhaps a separate method could retrieve the result from the GPG object? I'm not sure of the best way to handle this. Or maybe just raise an exception on failure and ignore the result object otherwise?
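For reference, the callback-to-generator pattern from that Stack Overflow thread usually bridges the callback and the consumer with a queue. A minimal sketch (with `fake_decrypt` as a hypothetical stand-in for a `decrypt_file` call wired up to `on_data`; the exact wiring to python-gnupg is an assumption here):

```python
import queue
import threading

_DONE = object()  # sentinel marking end of stream


def callback_to_generator(start):
    """Run `start(on_data)` in a background thread and yield each chunk
    that it passes to the callback."""
    q = queue.Queue(maxsize=16)  # bounded, so the producer can't race far ahead

    def on_data(chunk):
        q.put(chunk)

    def runner():
        try:
            start(on_data)
        finally:
            q.put(_DONE)  # always unblock the consumer, even on error

    worker = threading.Thread(target=runner, daemon=True)
    worker.start()
    while (item := q.get()) is not _DONE:
        yield item
    worker.join()


# Hypothetical stand-in for a decrypt_file call that streams via on_data:
def fake_decrypt(on_data):
    for chunk in (b"one", b"two", b"three"):
        on_data(chunk)


chunks = list(callback_to_generator(fake_decrypt))
```

The consumer iterates on the caller's thread, which is the whole point of the feature request: the chunks arrive where the caller's thread-local state lives.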

Describe alternatives you've considered
Just using on_data and doing this myself. 🙂

@vergenzt vergenzt changed the title Iterator yielding chunks on decrypt_file? decrypt_file_iter, a generator yielding chunks that *would* be passed to on_data? Concise alternative to using on_data for streaming Jan 14, 2025
@vsajip (Owner) commented Jan 14, 2025

I'm sure it might be aesthetically pleasing from a "design purity" point of view, but does it give you anything you can't do with on_data? I imagine the users of on_data are a small subset of the users of this package - the functionality was only added relatively recently, because before that no one asked for it! So it would add complexity to the code beyond what is there now, for an as-yet unquantified (and perhaps unquantifiable) benefit.

@vergenzt (Author) commented Jan 27, 2025

Okay, I've just figured out what it would give me that I can't do with on_data: handling chunks of the file from the same thread I initiated the decryption from.

E.g. right now I'm trying to decrypt a large file with decrypt_file, process its lines into SQLAlchemy records, and then commit the result. I've got on_data set up to handle the chunks... but because on_data gets called from a background thread, my database session, which is thread-local by default, is throwing sqlalchemy.exc.InvalidRequestError: Object '<MyObject at 0x123456789>' is already attached to session '3' (this is '5'). 🙁

Would you be open to merging it if I add this, to simplify this use case?

vergenzt added a commit to vergenzt/python-gnupg that referenced this issue Jan 27, 2025
@vsajip (Owner) commented Jan 28, 2025

> Would you be open to merging it if I add this, to simplify this use case?

It depends on how the proposed changes look. After all, I would have to provide ongoing support indefinitely for an uncommon use case. Your particular use case could be addressed with the current setup by having the on_data callable send the chunks to a queue, which the session-owning thread can read from as a consumer.
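The queue hand-off suggested above can be sketched roughly as follows. To keep the example self-contained and runnable, `run_decrypt` is a hypothetical stand-in for the real python-gnupg call (roughly `gpg.on_data = on_data; gpg.decrypt_file(stream)`; the exact wiring is an assumption, not the library's confirmed API):

```python
import queue
import threading

chunk_queue = queue.Queue()  # bridges the reader thread and the main thread
_EOF = None  # sentinel signalling that decryption has finished


def on_data(chunk):
    # Runs on the background reader thread: do nothing here except
    # hand the chunk off; touch no thread-local state (e.g. DB sessions).
    chunk_queue.put(chunk)


def run_decrypt():
    # Hypothetical stand-in for the real decrypt_file call; here we just
    # feed chunks straight to the callback, then signal completion.
    for chunk in (b"record 1\n", b"record 2\n"):
        on_data(chunk)
    chunk_queue.put(_EOF)


worker = threading.Thread(target=run_decrypt)
worker.start()

received = []
while (chunk := chunk_queue.get()) is not _EOF:
    # Main (session-owning) thread: a safe place for SQLAlchemy session work,
    # since everything here runs on the thread that owns the session.
    received.append(chunk)
worker.join()
```

Because all consumption happens on the main thread, the thread-local session error described earlier cannot occur; the background thread only ever touches the queue.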

@vsajip (Owner) commented Jan 29, 2025

I've updated the documentation to talk about threading constraints when processing data.

@vsajip vsajip added the enhancement New feature or request label Feb 17, 2025