Automatically inject metadata into scrubbed papers #30

Open
kanzure opened this issue Jul 8, 2013 · 0 comments

I think pdfparanoia should inject metadata into each PDF it scrubs. The metadata would be a JSON blob recording, for example, which version of pdfparanoia scrubbed the file, when the scrubbing occurred, and any previous scrubbing history.

Scrubbing history should record which objects were removed from the document. Eventually this might be useful for debugging what happened to a PDF in a collection.
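
A rough sketch of what such a blob might look like, in Python. Nothing here exists in pdfparanoia: the function name `build_scrub_metadata`, the field names, and the placeholder version string are all assumptions for illustration, and the removed objects are represented as plain strings.

```python
import json
from datetime import datetime, timezone

# Placeholder only; a real implementation would read the installed version.
PDFPARANOIA_VERSION = "0.0.0"


def build_scrub_metadata(removed_objects, previous=None,
                         version=PDFPARANOIA_VERSION, now=None):
    """Return a JSON-serializable dict describing one scrubbing pass.

    `previous` is the blob found in an already-scrubbed PDF (or None), so
    earlier passes are carried forward instead of being overwritten.
    """
    # Copy the earlier history so the caller's blob is not mutated.
    history = list(previous.get("history", [])) if previous else []
    history.append({
        "pdfparanoia_version": version,
        "scrubbed_at": (now or datetime.now(timezone.utc)).isoformat(),
        "removed_objects": list(removed_objects),
    })
    return {"history": history}


first = build_scrub_metadata(["12 0 obj"])
second = build_scrub_metadata(["40 0 obj"], previous=first)
print(json.dumps(second, indent=2))
```

Keeping every pass as its own entry in a list means a file scrubbed several times still shows what each pass removed, which is the debugging use mentioned above.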

Also, this could be an interesting hook for storing metadata about a paper inside the paper itself. pdfparanoia could add a second JSON blob next to the scrubbing history, like the data returned from Zotero translation-server and Zotero translators.
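
One possible layout, sketched in Python, keeps the paper metadata under its own key beside the scrubbing history. The helper `attach_paper_metadata`, the `"paper"` key, and the example item are hypothetical; the item is only loosely modeled on Zotero-style fields and is not real translator output.

```python
import json


def attach_paper_metadata(scrub_blob, paper):
    """Return a JSON string holding the scrub history plus paper metadata.

    `scrub_blob` is whatever dict holds the scrubbing history; `paper` is a
    bibliographic dict. Both are stored as-is under separate keys.
    """
    merged = dict(scrub_blob)  # shallow copy; the input dict is left alone
    merged["paper"] = paper
    return json.dumps(merged, sort_keys=True)


# Illustrative values only, not output from any real translator.
scrub_blob = {"history": [{"removed_objects": ["12 0 obj"]}]}
paper = {
    "itemType": "journalArticle",
    "title": "Example title",
    "creators": [{"creatorType": "author", "lastName": "Example"}],
}
print(attach_paper_metadata(scrub_blob, paper))
```

Separate keys let a later scrubbing pass append to the history without touching the bibliographic data.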
