[OpenSpending] Exploring Senegal Public Procurements : how we turned PDF files into browsable data ?
Marc Joffe
marc at publicsectorcredit.org
Fri Jul 5 16:07:47 UTC 2013
Pierre
I just wanted to add my thanks as well. My project with California city fiscal data also entails struggles with PDFs. While we address this with a combination of brute force and commercial tools like Abbyy FineReader and Able2Extract, it is great to see that you have been successful with open source alternatives. I was not aware of OpenRefine until seeing your post and am glad that you introduced me to it; in certain circumstances, it appears to offer substantial advantages over spreadsheets for data cleansing.
Thanks again for contributing such an informative blog post.
Cheers,
Marc Joffe
Public Sector Credit Solutions
http://www.publicsectorcredit.org/ca
From: openspending-bounces at lists.okfn.org [mailto:openspending-bounces at lists.okfn.org] On Behalf Of Julia Keseru
Sent: Thursday, July 04, 2013 10:42 AM
To: OpenSpending Discussion List
Cc: Tangui Morlier; Patt Nsukami
Subject: Re: [OpenSpending] Exploring Senegal Public Procurements : how we turned PDF files into browsable data ?
Hey Pierre, this is great!
Would you be interested in helping us map the global landscape of procurement disclosure practices?
Here`s a blog post that explains our efforts: http://sunlightfoundation.com/blog/2013/05/01/open-procuring-how-do-other-countries-perform/
Could you please fill this questionnaire so that we might have credible information on Senegal too?
https://docs.google.com/forms/d/1naM_f6_fDSMpgueRutOV886G0zqY0K3riV7XxfVUjSw/viewform
Thanks,
Julia
On Thu, Jul 4, 2013 at 6:18 PM, Pierre Chrzanowski <pierre.chrzanowski at gmail.com <mailto:pierre.chrzanowski at gmail.com> > wrote:
Hi All, I just shared a post on the Open Spending Blog on our exploration of Senegal Public Procurements Data.
We explain in this tutorial how we turned PDF files released by the Senegalese Authority for Public Procurement into a browsable dataset on Open Spending. You will also find all the tools, scripts we used.
We hope this will be helpful in your own work.
http://blog.openspending.org/2013/07/04/exploring-senegal-public-procurements-how-we-turned-pdf-files-into-browsable-data/
--
Pierre Chrzanowski
// Skype: pierre.chrzanowski // Twitter : piezanowski
_______________________________________________
openspending mailing list
openspending at lists.okfn.org <mailto:openspending at lists.okfn.org>
http://lists.okfn.org/mailman/listinfo/openspending
Unsubscribe: http://lists.okfn.org/mailman/options/openspending
--
Júlia Keserű
International Program Coordinator
1818 N Street NW, Suite 300
Washington, DC 20036
(1) 202-742-1520 *280
<http://sunlightfoundation.com/> <http://www.facebook.com/sunlightfoundation> <http://twitter.com/sunfoundation> <http://www.reddit.com/r/sunlight> <http://www.youtube.com/sunlightfoundation> <http://sunlightfoundation.com/feeds/latest/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/openspending/attachments/20130705/6ab61bab/attachment.html>
More information about the openspending
mailing list