[ckan-dev] Trouble harvesting CSW

Guillaume SUEUR guillaume.sueur at neogeo-online.net
Wed Apr 10 19:29:52 UTC 2013


Hi Adrià,

Thanks for your answer. I can definitely switch to 2.0. I tried it first but had difficulty setting it up. With the experience I now have with CKAN, I should be able to get further with version 2.0.
I'll let you know how it goes. 

Best regards

Guillaume Sueur


On 10 Apr 2013, at 20:14, Adrià Mercader <adria.mercader at okfn.org> wrote:

> Hi Guillaume,
> 
> Is there any chance that you can use CKAN 2.0, or are you using 1.8 on
> an existing project?
> There have been several improvements to the harvesters in this version
> (including fixes for some of the errors you mention). Moreover, CKAN 2.0
> uses organization-based authorization by default, so you wouldn't need
> to use the publisher profile option.
> To use 2.0, just check out the release-v2.0 branch of ckan core,
> ckanext-spatial and ckanext-harvest, and change the harvest plugin
> from "gemini_csw_harvester" to "csw_harvester" [1].
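
A minimal sketch of the plugin rename described above, assuming the plugins are listed as a space-separated value (as in the `ckan.plugins` setting of a CKAN .ini file). The helper name `switch_csw_plugin` and the example plugin list are hypothetical and only for illustration; checking out the release-v2.0 branches is a separate step not shown here.

```python
def switch_csw_plugin(plugins_value):
    """Swap the old CSW harvester name for the new one in a
    space-separated plugin list, leaving other entries untouched."""
    old, new = "gemini_csw_harvester", "csw_harvester"
    names = plugins_value.split()
    return " ".join(new if name == old else name for name in names)


# Hypothetical plugin list; your own ckan.plugins value will differ.
before = "harvest spatial_metadata gemini_csw_harvester"
after = switch_csw_plugin(before)
print("ckan.plugins = " + after)
```

The function compares whole names rather than substrings, so running it again on an already updated value changes nothing.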
> 
> If it is not possible for you to use 2.0, we'll try to fix your issues
> as well, but switching would make things much easier.
> 
> Thanks,
> 
> Adrià
> 
> [1] https://github.com/okfn/ckanext-spatial/tree/release-v2.0#spatial-harvesters
> 
> On 10 April 2013 15:48, Guillaume SUEUR
> <guillaume.sueur at neogeo-online.net> wrote:
>> Hi again,
>> 
>> I have two different problems harvesting CSW catalogs:
>> 1. I can't use the 'publisher' mode in my config file, because if I do, the harvester requires a publisher_id that I'm not able to provide.
>> 2. Using the default mode, I can harvest my remote catalog, but there are two validation errors:
>>        • GUID ea022511-142a-4ee5-953c-521b481bdb6c
>> Validating against "ISO19139 XSD Schema" profile failed:
>> Dataset schema (gmx.xsd) Validation Error: (u"Element '{http://www.isotc211.org/2005/gmd}MD_DataIdentification', attribute 'namespace': The attribute 'namespace' is not allowed., line 94",)
>>        • GUID ea022511-142a-4ee5-953c-521b481bdb6c
>> Error importing Gemini document: Validation Error: {'__junk': 'The input field __junk was not expected.', 'Organizations': ['Please choose an organization to add the dataset to']}
>> 
>> For the first one, how can I configure the validator to accept the presence of the 'namespace' attribute?
>> For the second one, I guess it comes from the harvest source configuration options, but I can't figure out how to specify that all the records should be put in a specific group. Here are the configuration options used:
>>        { "api_version":"1", "default_tags":["Grand Lyon"], "default_groups":["grand-lyon"], "default_extras":{"new_extra":"Test","harvest_url":"{harvest_source_url}/dataset/{dataset_id}"}, "override_extras": true, "user":"admin", "read_only": true }
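
A quick way to rule out a malformed configuration is to parse it as JSON and check the shape of each option. The sketch below does only that: the expected types (lists for `default_tags` and `default_groups`, an object for `default_extras`) are assumptions inferred from the values above, not taken from the harvester's own validation, and `check_types` is a hypothetical helper. It does not address the 'Organizations' error itself.

```python
import json

# The configuration exactly as given above.
config_text = '''{ "api_version":"1", "default_tags":["Grand Lyon"], "default_groups":["grand-lyon"], "default_extras":{"new_extra":"Test","harvest_url":"{harvest_source_url}/dataset/{dataset_id}"}, "override_extras": true, "user":"admin", "read_only": true }'''

# json.loads raises ValueError if the text is not valid JSON.
config = json.loads(config_text)


def check_types(cfg):
    """Return a list of messages for options whose type looks wrong
    (assumed shapes, see the note above)."""
    problems = []
    for key in ("default_tags", "default_groups"):
        if key in cfg and not isinstance(cfg[key], list):
            problems.append(key + " should be a list")
    if "default_extras" in cfg and not isinstance(cfg["default_extras"], dict):
        problems.append("default_extras should be an object")
    return problems


print(json.dumps(config, indent=2, sort_keys=True))
print(check_types(config))
```

Here the configuration parses cleanly and `check_types` reports nothing, which suggests the error comes from how the records are assigned rather than from the JSON syntax.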
>> 
>> Thank you very much,
>> 
>> --------------------------------------------------
>> Guillaume SUEUR
>> 
>> _______________________________________________
>> ckan-dev mailing list
>> ckan-dev at lists.okfn.org
>> http://lists.okfn.org/mailman/listinfo/ckan-dev
>> Unsubscribe: http://lists.okfn.org/mailman/options/ckan-dev
> 



