[open-science] Removing watermarks from pdfs (pdfparanoia)

Bryan Bishop kanzure at gmail.com
Tue Feb 5 20:20:22 UTC 2013


How about removing those pesky watermarks from pdfs? Sometimes they
completely obfuscate the contents of a paper we're trying to read, or
sometimes they have more sinister purposes.

Working proof of concept:

https://github.com/kanzure/pdfparanoia
https://pypi.python.org/pypi/pdfparanoia

Discussion history:
https://groups.google.com/group/science-liberation-front/t/c68964cf55d8f6fa

People who could theoretically benefit from this:
http://scholar.google.com/scholar?q=%22Authorized+licensed+use+limited+to%22
http://scholar.google.com/scholar?q="Redistribution+subject+to+SEG+license+or+copyright"<http://scholar.google.com/scholar?q=%22Redistribution+subject+to+SEG+license+or+copyright%22>
http://scholar.google.com/scholar?q="Redistribution+subject+to+AIP"<http://scholar.google.com/scholar?q=%22Redistribution+subject+to+AIP%22>
http://scholar.google.com/scholar?q="Downloaded+from+http%3A%2F%2Fpubs.acs.org+on"<http://scholar.google.com/scholar?q=%22Downloaded+from+http%3A%2F%2Fpubs.acs.org+on%22>
http://scholar.google.com/scholar?q="Downloaded+*+*+2001..2013+to+*"<http://scholar.google.com/scholar?q=%22Downloaded+*+*+2001..2013+to+*%22>

To get source code:

git clone git://github.com/kanzure/pdfparanoia.git

To install:

sudo pip install pdfparanoia

or:

sudo easy_install pdfparanoia

Right now there's IEEE and AIP support. I need more samples to work with.

- Bryan
http://heybryan.org/
1 512 203 0507
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-science/attachments/20130205/f531dfd3/attachment.html>


More information about the open-science mailing list