[Okfn-ca] Exemple de code Python

Peder Jakobsen pjakobsen at gmail.com
Thu Jul 11 22:44:07 UTC 2013


Sorry, I don't write French.   :(   

I do ETL with Python daily,  if you can show me a source data example (XML, csv, etc), perhaps I can offer some suggestions.

Peder Jakobsen
Data.gc.ca






On 2013-07-11, at 6:06, LIM <logementsinsalubresmontreal at gmail.com> wrote:

> Est-ce qu'il serait possible d'envoyer un exemple de code Python pour du ETL.
> 
> Merci
> 
> Pascal Robichaud
> 
> 
> Le 2013-07-10 à 07:00, okfn-ca-request at lists.okfn.org a écrit :
> 
>> Envoyez vos messages pour la liste Okfn-ca à
>>   okfn-ca at lists.okfn.org
>> 
>> Pour vous (dés)abonner par le web, consultez
>>   http://lists.okfn.org/mailman/listinfo/okfn-ca
>> 
>> ou, par email, envoyez un message avec 'help' dans le corps ou dans le
>> sujet à
>>   okfn-ca-request at lists.okfn.org
>> 
>> Vous pouvez contacter l'administrateur de la liste à l'adresse
>>   okfn-ca-owner at lists.okfn.org
>> 
>> Si vous répondez, n'oubliez pas de changer l'objet du message afin
>> qu'il soit plus spécifique que "Re: Contenu du digest de Okfn-ca..."
>> 
>> 
>> Thèmes du jour :
>> 
>>  1. Re: Alternatives to OpenRefine | Fwd: School-of-data    Digest,
>>     Vol 16, Issue 2 (Peder Jakobsen)
>> 
>> 
>> ----------------------------------------------------------------------
>> 
>> Message: 1
>> Date: Tue, 9 Jul 2013 11:05:26 -0400
>> From: Peder Jakobsen <pjakobsen at gmail.com>
>> Subject: Re: [Okfn-ca] Alternatives to OpenRefine | Fwd:
>>   School-of-data    Digest, Vol 16, Issue 2
>> To: diane.mercier at gmail.com
>> Cc: OKFN-ca <okfn-ca at lists.okfn.org>,
>>   open-data-montreal at googlegroups.com
>> Message-ID: <B200441B-6644-4E39-B3B1-84178A399365 at gmail.com>
>> Content-Type: text/plain; charset="iso-8859-1"
>> 
>> 
>> On 2013-07-09, at 8:39 AM, Diane Mercier <diane.mercier at gmail.com> wrote:
>> 
>>> It may depend on exactly what you want. Regularly on the OpenRefine list people post requests for functionality that tend to be answered 'look at dedicated ETL solutions'. Open Source ETL solutions mentioned include  Talend OpenStudio or Pentaho Data Integration
>>> 
>>> I don't think these are quite 'alternatives' to Refine but it may depend on what exactly you want to to do and the skills/resources you have available.
>> 
>> Owen is right, the answer to this question depends very much on what skills and resources are available.  
>> 
>> My personal preference is to use a scripting language for all ETL work.  There is no bizarre corner case or integration problem that cannot be easily dealt with a simple script.  Python is an obvious choice: tasks that would be a hassle with a tool like OpenRefine or in, say, Java  are a breeze, fast, and somewhat enjoyable to work on. 
>> 
>> ETL is my full time job, so I do grant that that not everyone has the luxury to figure out all the tricks of data manipulation with something like Python or Ruby.  But if possible, it's an investment that is worth making, and will pay big dividends  over the long term for any organization that needs to aggregate  data. 
>> 
>> Cheers,
>> 
>> Peder Jakobsen
>> Consultant, OKFN CKAN & data.gc.ca
>> 
>> 
>> -------------- section suivante --------------
>> Une pièce jointe HTML a été nettoyée...
>> URL: <http://lists.okfn.org/pipermail/okfn-ca/attachments/20130709/295742d8/attachment-0001.htm>
>> 
>> ------------------------------
>> 
>> _______________________________________________
>> Okfn-ca mailing list | Le groupe local au Canada de l'Open Knowledge Foundation Network (OKFN)
>> Okfn-ca at lists.okfn.org
>> http://lists.okfn.org/mailman/listinfo/okfn-ca
>> Site Web : http://ca.okfn.org
>> 
>> 
>> 
>> Fin de Lot Okfn-ca, Vol 3, Parution 12
>> **************************************
> 
> _______________________________________________
> Okfn-ca mailing list | Le groupe local au Canada de l'Open Knowledge Foundation Network (OKFN)
> Okfn-ca at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/okfn-ca
> Site Web : http://ca.okfn.org
> 




More information about the okfn-ca mailing list