[School-of-data] PDF Extraction Tools (Michael Bauer)

Adam Stiles adam.d.stiles at gmail.com
Thu Dec 20 19:56:00 UTC 2012


I asked the same question recently to folks at the Guardian Data Blog. This
was the reply:

@adstiles <http://twitter.com/adstiles> Weirdly Adobe pro is very good at
exporting tables and this too: http://www.pdftoexcelonline.com/


I added the converter here: http://schoolofdata.org/online-resources/

Adam

>
>
> ------------------------------
>
> Message: 3
> Date: Thu, 20 Dec 2012 11:54:38 +0100
> From: Michael Bauer <michael.bauer at okfn.org>
> Subject: [School-of-data] PDF Extraction Tools
> To: school-of-data at lists.okfn.org
> Message-ID: <20121220105438.GY23406 at lenore.local>
> Content-Type: text/plain; charset=us-ascii
>
> Hi,
>
> For the next School of Data tutorial I would like to cover Data extraction
> from PDFs (and text) as well as OCR.
>
> Does anyone here have experience with Tools that do not require coding
> skills to extract data and text from PDFs?
>
> Which tools do you use?
>
> Michael
>
>
> --
> Data Wrangler with the Open Knowledge Foundation (OKFN.org)
> GPG/PGP key: http://tentacleriot.eu/mihi.asc
> Twitter: @mihi_tr Skype: mihi_tr
>
>
>
> ------------------------------
>
> Message: 4
> Date: Thu, 20 Dec 2012 11:27:20 +0000
> From: Tom Longley <tom at tacticaltech.org>
> Subject: Re: [School-of-data] PDF Extraction Tools
> To: school-of-data at lists.okfn.org
> Message-ID: <50D2F618.1020400 at tacticaltech.org>
> Content-Type: text/plain; charset=ISO-8859-1
>
>
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hi Michael --
>
> I wrote this earlier in the year:
>
> http://drawingbynumbers.org/data-design-basics/note-3-opening-open-data#anchor-5
> There's some stuff about different tools and no much complimentary to
> say about  OCR.
>
> Tom
>
> On 20/12/12 10:54, Michael Bauer wrote:
> > Hi,
> >
> > For the next School of Data tutorial I would like to cover Data
> > extraction from PDFs (and text) as well as OCR.
> >
> > Does anyone here have experience with Tools that do not require
> > coding skills to extract data and text from PDFs?
> >
> > Which tools do you use?
> >
> > Michael
> >
> >
>
> - --
> Tom Longley
> Program Advisor
> Tactical Technology Collective
> e: tom at tacticaltech.org
> w: www.tacticaltech.org
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.10 (GNU/Linux)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
>
> iQEcBAEBAgAGBQJQ0vYXAAoJEMZAm28GQNw+3BYH/AyeIVieIPJ0SG9Cm0WDSn7q
> BIScA+YLkIi6cyFwZdtBqn1Aft25g+ywJ1FpltpRES7M92VKbpcKxWNBLqLJ6AAp
> GiX94jKWardraGELMgYB+Jpy0PZGd0G99Za7pcb1+mF2fZE5svbvegqMhBtCTinP
> V/R+E7RRuzX4aA/mrLKPqADM6nONW8495uYe96+Yn4MmiEFrGIEUjCwGBNCG1VfA
> A/DOUZnOCcAwm9HkzFAywmbZ7zCcUtbgzbpSrYIXN0HcGhKBfWNU5sBhn/aZylY3
> xojwfVzLKkNPMSbvJAEUYKZQyDzhnbmCpudom2K1AOPyIXeNm2pvH/oyvABWMVg=
> =etSP
> -----END PGP SIGNATURE-----
>
>
>
>
> ------------------------------
>
> _______________________________________________
> School-of-data mailing list
> School-of-data at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/school-of-data
> Unsubscribe: http://lists.okfn.org/mailman/optionss/school-of-data
>
>
> End of School-of-data Digest, Vol 9, Issue 12
> *********************************************
>



-- 
Adam Stiles
510.280.4862
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.okfn.org/pipermail/school-of-data/attachments/20121220/ad5ee4ca/attachment.html>


More information about the school-of-data mailing list