[okfn-labs] CSV and Datapackage Validation projects at the ODI

Leigh Dodds leigh at ldodds.com
Mon Jan 27 09:52:30 UTC 2014


Hi,

The ODI tech team has been working on some validation tools to support
people in publishing and sharing CSV files and Data Packages. I'm
sending this email to share some pointers to the work and to request
some feedback from the labs community.

We've looked at various tools and approaches for validating CSV files,
exploring the features each tool provides, the syntax it uses, etc.
There is some documentation and examples here:

https://github.com/theodi/csv-validation-research

There's also a proposal that summarises some specific improvements to
the CSV Dialect Description Format (CSV DDF) and JSON Table Schema
specifications to support additional validation use cases:

https://github.com/theodi/csv-validation-research/wiki/Extending-Data-Packages-to-Support-CSV-File-Validation

Based on this we've also been creating some validation tools. Here are
the early versions of our DataPackage and CSV validation libraries:

https://github.com/theodi/datapackage.rb
https://github.com/theodi/csvlint.rb

The CSV validator is being used to create a validation service:

https://github.com/theodi/csvlint
http://csvlint.io/

These are still very much works in progress, but we'd love to get
feedback from the community, particularly around features for the CSV
validation tools.

To help move the discussion forward on some of the suggested
improvements to CSV DDF and JSON Table Schema, I plan on opening some
tickets on the relevant projects on GitHub.

Cheers,

L.

-- 
Leigh Dodds
Freelance Technologist
Open Data, Linked Data Geek
t: @ldodds
w: ldodds.com
e: leigh at ldodds.com


