[ddj] [School-of-data] 'Tabula helps you liberate data tables trapped inside evil PDFs'
M. Edward (Ed) Borasky
znmeb at znmeb.net
Sat Apr 13 17:17:55 UTC 2013
There are some people working on hosting their own versions - it's
open source. Attempting to put it in my toolset for Linux Mint, Ubuntu
and Fedora is on my to-do list, but I don't have a release date nailed
down yet.
There are a number of PDF manipulation tools already available in
https://github.com/znmeb/Computational-Journalism-Publishers-Workbench/blob/master/ScrapingTools/README.md
If I can get Tabula to work it will be added to that collection.
On Fri, Apr 5, 2013 at 4:49 PM, Stian Håklev <shaklev at gmail.com> wrote:
> Somebody needs to host an unrestricted version of this - it's too high a
> barrier for someone to fire up an EC2, just to try to extract some data from
> a PDF...
>
>
> On Fri, Apr 5, 2013 at 7:19 PM, Jonathan Gray <jonathan.gray at okfn.org>
> wrote:
>>
>> This might be useful/interesting to some of you!
>>
>> http://tabula.nerdpower.org/
>> http://source.mozillaopennews.org/en-US/articles/introducing-tabula/
>>
>> --
>> Jonathan Gray | @jwyg
>> Director of Policy and Ideas
>> The Open Knowledge Foundation | @okfn
>> Support our work: okfn.org/support
>>
>> _______________________________________________
>> School-of-data mailing list
>> School-of-data at lists.okfn.org
>> http://lists.okfn.org/mailman/listinfo/school-of-data
>> Unsubscribe: http://lists.okfn.org/mailman/options/school-of-data
>>
>
>
>
> --
> http://reganmian.net/blog -- Random Stuff that Matters
>
> _______________________________________________
> data-driven-journalism mailing list
> data-driven-journalism at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/data-driven-journalism
> Unsubscribe: http://lists.okfn.org/mailman/options/data-driven-journalism
>
--
Twitter: http://twitter.com/znmeb; Computational Journalism Publishers Workbench
http://j.mp/CompJournBench/
I am not an IP address! I am a free man!
More information about the data-driven-journalism
mailing list