[School-of-data] 'Tabula helps you liberate data tables trapped inside evil PDFs'

Peter Murray-Rust pm286 at cam.ac.uk
Sat Apr 6 07:16:42 UTC 2013


I've made initial contact with them - don't yet have email addresses. This
fits very well into our AMI2 PDF-2-XML tool which is automatable. Table
still (I think) needs tables fed one at a time


On Sat, Apr 6, 2013 at 1:02 AM, Eldan Goldenberg <eldang at gmail.com> wrote:

> I haven't tried it yet, but it doesn't look like the process for
> self-hosting on a local OSX or Linux box is too onerous:
> https://github.com/jazzido/tabula/blob/master/README.md#manual-installation-os-x-or-linux
>
> Eldan Goldenberg
> eldang at gmail.com | @eldang <https://twitter.com/#!/eldang> | eldan.co.uk |
> skype: eldang
> PGP public key: http://eldan.co.uk/eldang.asc
>
> On Apr 5, 2013, at 4:49 PM, Stian Håklev wrote:
>
> Somebody needs to host an unrestricted version of this - it's too high a
> barrier for someone to fire up an EC2, just to try to extract some data
> from a PDF...
>
>
> On Fri, Apr 5, 2013 at 7:19 PM, Jonathan Gray <jonathan.gray at okfn.org>wrote:
>
>> This might be useful/interesting to some of you!
>>
>> http://tabula.nerdpower.org/
>> http://source.mozillaopennews.org/en-US/articles/introducing-tabula/
>>
>>
> _______________________________________________
> School-of-data mailing list
> School-of-data at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/school-of-data
> Unsubscribe: http://lists.okfn.org/mailman/options/school-of-data
>
>


-- 
Peter Murray-Rust
Reader in Molecular Informatics
Unilever Centre, Dep. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/school-of-data/attachments/20130406/1ac86af6/attachment-0001.html>


More information about the school-of-data mailing list