[ckan-dev] Datastore and linked-to resources that no longer exist

Sean Hammond sean.hammond at okfn.org
Thu Apr 18 13:37:43 UTC 2013


> Take a look at this dataset on publicdata.eu:
> 
> http://publicdata.eu/dataset/ministerial-data-cabinet-office
> 
> If you click on any of the resources you'll get an error:
> 
> Could not load preview: DataProxy returned an error (Data transformation
> failed. HTTPError: HTTP Error 404: Not Found)
> 
> and if you try to download any of the resource files from the source
> site you'll find they no longer exist, eg:
> 
> http://www.cabinetoffice.gov.uk/sites/default/files/resources/pm-meetings.csv
> 
> Related to the new work that's being done around the new datastorer
> service (data pusher is its current name I think) and the new datastorer
> paster command/cron job:
> 
> I'm not sure how we intend to deal with this problem in CKAN -- when a
> resource file is linked to, and then the source file on the remote site
> moves or disappears. Once we have the datastorer service and script
> stuff sorted out, then it can be deployed and a resource file like this
> would have been pulled into the datastore so could be previewed from the
> datastore. But what should the datastorer do, when it finds that the
> original source file is gone? Should it leave the data in the datastore,
> so that preview and data API keep working? Or should it delete the data
> in the datastore, and have the resource page display some clear error
> message that says the source file is no longer there?

Ping. This seems related to the datapusher discussion we had this
morning




More information about the ckan-dev mailing list