JSTOR watermark #24
JSTOR has been working since 0.0.10. Can you show me a sample that it fails on?
http://diyhpl.us/~bryan/papers2/paperbot/The%20New%20England%20Origins%20of%20Mormonism.pdf

(In reply to Bryan Bishop's message of Thu, Mar 28, 2013 at 9:45 PM.)
I am still seeing the same issue as of today. I have tested several JSTOR PDFs and cannot scrub the watermark from any of them with pdfparanoia.
The existing JSTOR scrubber stopped working because JSTOR are now adding [...]

The above patches remove watermark strings as before, but in the process, we're [...]
Here's what I think's happening:

A PDF object can be thought of as a hierarchy of objects; the most important of [...]

When we remove watermarks, we're changing the length of objects within the [...]

We could solve this by, after manipulating objects within [...]
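
To make the length problem concrete, here is a minimal sketch of how removing a watermark string shifts the byte offset of every object that follows it. This is not pdfparanoia code: the toy `original` document, the `WATERMARK` constant, and the `object_offsets` helper are all made up for illustration, and it assumes the breakage described above comes from recorded byte positions going stale once object lengths change.

```python
import re

# Hypothetical watermark text, modelled on the one quoted in this issue.
WATERMARK = b"This content downloaded from X at T"


def object_offsets(data: bytes) -> dict:
    """Map each object number to the byte offset of its 'N 0 obj' header."""
    return {
        int(m.group(1)): m.start()
        for m in re.finditer(rb"(?m)^(\d+) 0 obj", data)
    }


# A toy byte string laid out like two PDF objects (not a valid PDF).
original = (
    b"%PDF-1.4\n"
    b"1 0 obj\n(" + WATERMARK + b")\nendobj\n"
    b"2 0 obj\n(page text)\nendobj\n"
)

before = object_offsets(original)

# Naive scrub: delete the watermark bytes and touch nothing else.
scrubbed = original.replace(WATERMARK, b"")
after = object_offsets(scrubbed)

# Object 1 starts where it did; object 2 has moved by len(WATERMARK) bytes,
# so any offset recorded before the scrub now points at the wrong place.
shift = before[2] - after[2]
print(before, after, shift)
```

Any table of byte offsets written before the scrub would need to be recomputed from the scrubbed bytes, which is what `object_offsets(scrubbed)` does in this toy case.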
Further errors, now. A sample:
"This content downloaded from X at T" appears at the bottom of all pages.