[ckan-dev] Trouble harvesting CSW

Guillaume SUEUR guillaume.sueur at neogeo-online.net
Wed Apr 10 14:48:23 UTC 2013


Hi again,

I have two different problems harvesting CSW catalogs:
1. I can't use the 'publisher' mode in my config file: when I do, the harvester requires a publisher_id that I am not able to provide.
2. Using the default mode, I can harvest my remote catalog, but I get two validation errors:
	• GUID ea022511-142a-4ee5-953c-521b481bdb6c
Validating against "ISO19139 XSD Schema" profile failed:
Dataset schema (gmx.xsd) Validation Error: (u"Element '{http://www.isotc211.org/2005/gmd}MD_DataIdentification', attribute 'namespace': The attribute 'namespace' is not allowed., line 94",)
	• GUID ea022511-142a-4ee5-953c-521b481bdb6c
Error importing Gemini document: Validation Error: {'__junk': 'The input field __junk was not expected.', 'Organizations': ['Please choose an organization to add the dataset to']}

For the first one, how can I configure the validator so that it accepts the namespace attribute?
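
To make the first error concrete, here is a minimal sketch of how the offending attribute can be located in a record. It uses only the Python standard library and does not involve the CKAN validator at all. The SAMPLE document is a hypothetical, heavily trimmed record (not my real metadata), and find_attribute is a helper name I made up for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace URI taken from the validation error message.
GMD = "http://www.isotc211.org/2005/gmd"

# Hypothetical, heavily trimmed record used only for illustration.
SAMPLE = """<gmd:MD_Metadata xmlns:gmd="http://www.isotc211.org/2005/gmd">
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification namespace="example">
      <gmd:abstract/>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
</gmd:MD_Metadata>"""


def find_attribute(xml_text, local_name, attribute):
    """Return the tags of all gmd elements named local_name that carry attribute."""
    root = ET.fromstring(xml_text)
    tag = "{%s}%s" % (GMD, local_name)
    return [el.tag for el in root.iter(tag) if attribute in el.attrib]


offenders = find_attribute(SAMPLE, "MD_DataIdentification", "namespace")
print(len(offenders))
```

This only shows where the attribute sits; it does not change what the XSD profile accepts.
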
For the second one, I guess it comes from the harvest source configuration options, but I can't figure out how to specify that all the records should be put into a specific group. Here are the configuration options I used:
	{
	  "api_version": "1",
	  "default_tags": ["Grand Lyon"],
	  "default_groups": ["grand-lyon"],
	  "default_extras": {
	    "new_extra": "Test",
	    "harvest_url": "{harvest_source_url}/dataset/{dataset_id}"
	  },
	  "override_extras": true,
	  "user": "admin",
	  "read_only": true
	}
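
As a sanity check, the configuration above at least parses as JSON, so I don't think the problem is a plain syntax error. A minimal sketch of that check follows; the variable names are mine, and it only exercises Python's json module, not the harvester's own validation of these options.

```python
import json

# The harvest source configuration, copied from the message above.
CONFIG = """
{ "api_version":"1", "default_tags":["Grand Lyon"], "default_groups":["grand-lyon"],
  "default_extras":{"new_extra":"Test","harvest_url":"{harvest_source_url}/dataset/{dataset_id}"},
  "override_extras": true, "user":"admin", "read_only": true }
"""

# json.loads raises ValueError if the text is not well-formed JSON.
config = json.loads(CONFIG)

# Show which options are set and which group(s) the records should go to.
print(sorted(config))
print(config["default_groups"])
```
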

Thank you very much,

--------------------------------------------------
Guillaume SUEUR








