[ckan-discuss] PDF to replace RDF as primary format for data.southampton

Hanssens Bart Bart.Hanssens at fedict.be
Fri Apr 1 10:57:55 BST 2011


Do I hear... tagged PDF ?
Accessibility needs parsing / semantics, so there you go.

Just use PDF/A, that's very close to RDFa, isn't it ;-)

Bart

-----Original Message-----
From: ckan-discuss-bounces at lists.okfn.org [mailto:ckan-discuss-bounces at lists.okfn.org] On Behalf Of Peter Krantz
Sent: vrijdag 1 april 2011 11:40
To: CKAN discuss
Subject: Re: [ckan-discuss] PDF to replace RDF as primary format for data.southampton

On Fri, Apr 1, 2011 at 11:26, Tim McNamara <paperless at timmcnamara.co.nz> wrote:
> If you publish
> data as PDF, e.g. remove all structure, you make it impossible to build
> tools with those data.

That is not true. PDF is definitely not unstructured. By parsing the
PDF format you get a machine readable representation. E.g. the
specification [1] privides a lot of parsing details on how to convert
a font specification or a color in the PDF source to e.g. a custom XML
element.

[1]: http://tiny.cc/april-1st

Regards,

Peter

_______________________________________________
ckan-discuss mailing list
ckan-discuss at lists.okfn.org
http://lists.okfn.org/mailman/listinfo/ckan-discuss



More information about the ckan-discuss mailing list