[ckan-dev] The future of resources.
Seb Bacon
seb.bacon at gmail.com
Tue Jan 25 09:26:11 UTC 2011
Pending being posted to ckan-discuss, just to add a big plus one to
the general gist of this proposal.
Seb
On 24 January 2011 09:37, David Raznick <david.raznick at okfn.org> wrote:
> Hello
> I have almost completed the work for ticket http://ckan.org/ticket/826. See
> https://bitbucket.org/kindly/ckan/changeset/5769ff6ac34f. (do not pull yet)
>
> I do not think anyone will be happy with it, but regardless I think it
> should stand. Hopefully it will displease all with equal measure.
> It adds a config option that means you can add more resource fields. i.e
> you could add a alternative_url field. The new fields act as if they were
> normal database fields to the developer/form designer. So they are
> attributes on an object the same way url, description and hash are. You can
> search on them too using the sql backend. Here are a list of the pros and
> cons I can think of.
> Pros.
> * Simple
> * The smallest change I can think of to complete the ticket, and give
> clients the custom extra fields they need in the short term.
> * The status quo has not changed concerning resources.
> Cons.
> * It will make relational purists unhappy as it uses a json (in fact just a
> dict jsononifed) field.
> * Will make nosql advocates unhappy as this is the thing they are designed
> to do.
> * Will make semantic web advocates unhappy as it adds nothing (and possibly
> even muddies) the classification of these resources.
> * Will make wiki style collaboration enthusiasts unhappy as it does not
> give the flexibility they need.
> * It makes me unhappy for the all the above reasons.
> I still think the pros outweigh the cons.
> So onto the topic what we need to do with resources in the long run. Here
> are my opinions.
> * Resources should be made first class citizens in ckan. For the simple
> reason that essentially THEY ARE THE DATA. I have added a
> ticket http://ckan.org/ticket/922 that outlines this.
> * They should at least have their own form. We can not squeeze all the
> information we need to describe them properly into a small table in
> packages.
> * There should be means of versioning and dating them
> properly amongst each other. e.g a way of saying this is the latest
> version of the csv file and it was from this date on this topic. I think
> manual versioning is better here than against a package (the packages
> version should just pick up the latest resource version).
> * We should give people the option of duplicating them.
> * We should be providing tools, access, guidance and lookups to
> ontologies, to help people classify the data/resources properly.
> * We should give tools beyond just previewing the data, to actually help
> people semantically analyse and convert the data itself. Stuffing in a link
> to a random excel file is not that great
> * Potentially provide basic visualisations of the resource.
> * Potentially develop, host and encourage data
> cleaning/augmenting/clustering tools (like google refine), to help people
> get their data in good state.
> So the work on them is not nearly over...
>
> Regards
> David
>
>
>
> _______________________________________________
> ckan-dev mailing list
> ckan-dev at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/ckan-dev
>
>
--
skype: seb.bacon
mobile: 07790 939224
land: 0207 183 9618
web: http://baconconsulting.co.uk
More information about the ckan-dev
mailing list