[okfn-labs] Using Git (and Github) for Data - a Data Pattern

Pieter Colpaert pieter.colpaert at okfn.org
Tue Jul 2 16:07:14 UTC 2013


Hi Rufus,

We are working very hard to get that working by the end of October. We'll
keep you posted :)

Kind regards,

Pieter


On Tue, Jul 2, 2013 at 6:04 PM, Rufus Pollock <rufus.pollock at okfn.org>wrote:

> Hi Pieter,
>
> Thanks for sharing this - I hadn't seen this before and it looks very
> interesting. Do you have a demo site using this by any chance - e.g. a web
> app where edits are versioned into the RDF database running rawbase on top
> of it?
>
> Rufus
>
>
> On 2 July 2013 15:20, Pieter Colpaert <pieter.colpaert at okfn.org> wrote:
>
>> Hi Rufus,
>>
>> You might also like the git for triples we're building in our lab at the
>> university of Ghent: http://rawbase.github.io
>>
>> Paper from WWW13 can be found here:
>> https://t.co/FxedWgR13Y
>>
>> Kind regards,
>>
>> Pieter
>>
>>
>> On Tue, Jul 2, 2013 at 4:12 PM, Rufus Pollock <rufus.pollock at okfn.org>wrote:
>>
>>> Hi folks,
>>>
>>> I wanted to give a heads up on a new post I've put out today under the
>>> title of "Git (and Github) for data":
>>>
>>> <http://blog.okfn.org/2013/07/02/git-and-github-for-data/>
>>>
>>> <excerpt>
>>> The ability to do “version control” for data is a big deal. There are
>>> various options but one of the most attractive is to reuse existing
>>> tools for doing this with code, like git and mercurial. This post
>>> describes a simple “data pattern” for storing and versioning data
>>> using those tools which we’ve been using for some time and found to be
>>> very effective.
>>> </excerpt>
>>>
>>> The basic pattern is very simple and probably familiar to lots of folks
>>> here:
>>>
>>> 1. Storing data as line-oriented text and specifically as CSV files.
>>> “Line oriented text” just indicates that individual units of the data
>>> such as a row of a table (or an individual cell) corresponds to one
>>> line.
>>>
>>> 2. Use best of breed (code) versioning like git mercurial to store and
>>> manage the data.
>>>
>>> As people know, this is exactly the model in use for a while with
>>> <https://github.com/datasets> and http://data.okfn.org/
>>>
>>> Regards,
>>>
>>> Rufus
>>>
>>> PS: I should also add that the appearance of this post at the same
>>> time as Max Ogden's recent Dat efforts is entirely fortuitous
>>> coincidence - the original draft of this post was done a while ago
>>> (polishing for actual publication always gets put off!), though I have
>>> sought to add in some additional links in light of recent
>>> developments!
>>>
>>> _______________________________________________
>>> okfn-labs mailing list
>>> okfn-labs at lists.okfn.org
>>> http://lists.okfn.org/mailman/listinfo/okfn-labs
>>> Unsubscribe: http://lists.okfn.org/mailman/options/okfn-labs
>>>
>>
>>
>
>
> --
> *
>
> Rufus Pollock
>
> Founder and Co-Director | skype: rufuspollock | @rufuspollock<https://twitter.com/rufuspollock>
>
> The Open Knowledge Foundation <http://okfn.org/>
>
> Empowering through Open Knowledge
> http://okfn.org/ | @okfn <http://twitter.com/OKFN> | OKF on Facebook<https://www.facebook.com/OKFNetwork>|
> Blog <http://blog.okfn.org/>  |  Newsletter<http://okfn.org/about/newsletter>
>
> *
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/okfn-labs/attachments/20130702/071003a3/attachment-0002.html>


More information about the okfn-labs mailing list