On Sun, Apr 3, 2011 at 11:38, stef <stefan.marsiske at gmail.com> wrote:
> how are you parsing the pdfs? will it be manual labor?

Often a PDF simply contains the text, and pdftotext converts it just fine.
It depends on the individual case.

g
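
For what it's worth, here is a rough sketch of the "just run pdftotext" approach in Python. It assumes poppler's pdftotext command is installed and on the PATH. The helper names (build_command, looks_like_text, pdf_to_text) and the 50-character threshold are made up for illustration and are not taken from any existing script. PDFs that yield almost no text (scans, for example) are flagged so they can be handled by hand.

```python
import shutil
import subprocess
from pathlib import Path


def build_command(pdf_path, txt_path, keep_layout=True):
    """Return the argument list for poppler's pdftotext CLI."""
    cmd = ["pdftotext"]
    if keep_layout:
        cmd.append("-layout")  # keep the physical page layout (helps with tables)
    cmd += [str(pdf_path), str(txt_path)]
    return cmd


def looks_like_text(text, min_chars=50):
    """Hypothetical heuristic: image-only PDFs tend to yield little or no text."""
    # Count non-whitespace characters only (form feeds between pages are ignored).
    return len("".join(text.split())) >= min_chars


def pdf_to_text(pdf_path):
    """Convert one PDF; return its text, or None if it likely needs manual work."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found; it ships with poppler-utils")
    txt_path = Path(pdf_path).with_suffix(".txt")
    subprocess.run(build_command(pdf_path, txt_path), check=True)
    text = txt_path.read_text(encoding="utf-8", errors="replace")
    return text if looks_like_text(text) else None
```

Calling pdf_to_text("report.pdf") (a placeholder filename) writes report.txt next to the PDF and returns the text, or None when the file probably needs OCR or manual transcription.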