[open-sustainability] Carbon Disclosure Project (CDP)

Fırat Gelbal firatgelbal at gmail.com
Tue Feb 5 17:56:07 UTC 2013


Hi Maya,

There are several ways to convert PDFs:
http://helpmeinvestigate.posterous.com/7-ways-to-get-data-out-of-pdfs

http://www.investintech.com/prod_a2e_pro.htm

Although I don't have experience with them myself, I have heard that none of
these methods is perfect. Unless you are very lucky, there will be some manual
labor involved.
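
As a rough illustration of the semi-automated route, here is a minimal sketch
in Python. It assumes the table text has already been pulled out of the PDF
with a tool such as `pdftotext -layout`, so that columns are separated by runs
of two or more spaces. The column layout (company, sector, scope 1, scope 2),
the function names and the sample rows are all hypothetical placeholders, not
the real CDP table format or real figures. Lines that do not fit the expected
shape are skipped, and those are where the manual checking would come in.

```python
import csv
import io
import re

# Hypothetical column layout; the real CDP tables may well differ.
COLUMNS = ["company", "sector", "scope1", "scope2"]


def parse_number(cell):
    """Turn '1,234,567' into an int; return None for blanks, dashes or text."""
    cleaned = cell.replace(",", "").strip()
    return int(cleaned) if cleaned.isdigit() else None


def rows_from_text(text):
    """Split layout-preserved text into rows on runs of two or more spaces."""
    rows = []
    for line in text.splitlines():
        cells = re.split(r"\s{2,}", line.strip())
        if len(cells) != len(COLUMNS):
            continue  # page numbers, wrapped lines, etc.: check these by hand
        company, sector, scope1, scope2 = cells
        s1, s2 = parse_number(scope1), parse_number(scope2)
        if s1 is None and s2 is None:
            continue  # no numeric value at all, most likely a header row
        rows.append([company, sector, s1, s2])
    return rows


def to_csv(rows):
    """Serialise the rows as CSV text with a header line."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(COLUMNS)
    writer.writerows(rows)  # None is written as an empty field
    return out.getvalue()


# Made-up placeholder rows, NOT real CDP figures.
SAMPLE = """\
Company          Sector        Scope 1      Scope 2
Example Corp A   Utilities     1,200,000    35,000
Example Corp B   Financials    4,500        -
Page 12
"""

if __name__ == "__main__":
    print(to_csv(rows_from_text(SAMPLE)))
```

Splitting on two or more spaces keeps company names that contain single
spaces intact, but it will fail on tables where a cell is empty or a name
wraps onto a second line, so the output would still need a manual pass.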

Cheers..

- firat



On Tue, Feb 5, 2013 at 6:22 PM, Maya Forstater <hiyamaya at gmail.com> wrote:

> I would nominate the CDP Global 500 Reports - It's the biggest data set as
> it goes back to 2003 (it was the original report) and of widest interest I
> would think.
>
> The particular data I am most interested in is the scope 1 and scope 2
> emissions by company and sector, but there might be some value in the other
> columns for others...
>
> Is there a more automated way to do this, or does it have to be done by
> hand?
>
>
>
> Maya Forstater
> ****************************************************
> Email:     maya at zadek.net
> Phone:   +44 (0)1727 833 200
> Mobile:   +44 (0) 7966676465
> Skype:   Maya Forstater
> Web:      www.hiyamaya.wordpress.com
> Twitter:   MForstater
>
>
>
>
> On 5 February 2013 17:12, Jonathan Gray <jonathan.gray at okfn.org> wrote:
>
>> Hmm.. we could set up a project with Crowdcrafting to transcribe PDFs if we
>> could identify an interesting set that we wanted to focus on:
>>
>> http://crowdcrafting.org/
>>
>>
>> On Tue, Feb 5, 2013 at 6:10 PM, Maya Forstater <hiyamaya at gmail.com> wrote:
>>
>>> It is high time that the CDP data was made more open and accessible.
>>>
>>> I think the CDP has done a great job in getting a first round of carbon
>>> disclosure over the past ten years, and making the case for mandatory
>>> disclosure.
>>>
>>> Some of the data is not released because companies report it on that
>>> condition, but even the data which is made public is made unusable because
>>> it is locked into tables in the back of the annual CDP reports (pdfs...)
>>> https://www.cdproject.net/en-US/Results/Pages/reports.aspx
>>>
>>> The CDP is perhaps constrained by a business model developed before the
>>> idea of open data really took off.
>>>
>>> But I think as a first step the existing public data should be made
>>> available in csv format for non-commercial and research purposes.
>>>
>>> I think that scraping the data from the pdfs and making it available on
>>> that basis would probably be in line (just about) with their T&Cs:
>>>
>>> "You may retrieve and view content from our site, print off one copy of
>>> individual pages, and may store these pages for research, your personal
>>> use, or other non-commercial use provided that wherever you cite the data,
>>> you credit “Carbon Disclosure Project” as the source. You may also use
>>> content from our site in editorial content, provided that you credit
>>> “Carbon Disclosure Project” as the source. You must not modify the paper or
>>> digital copies of any materials you have printed off or downloaded in any
>>> way, and you must not use any illustrations, photographs, video or audio
>>> sequences or any graphics separately from any accompanying text without
>>> prior written permission from CDP."
>>>
>>> ...?
>>>
>>> But I don't have the technical nous to do it. I have done a couple of
>>> years by cutting and pasting, but I am sure there is a better way! - If
>>> anyone wants help with this it would be a public service.
>>>
>>> More here:
>>> http://hiyamaya.wordpress.com/2012/12/16/a-little-tweak-to-the-broken-carbon-dashboard/
>>>
>>> Maya Forstater
>>>
>>> On 5 February 2013 16:59, Jonathan Gray <jonathan.gray at okfn.org> wrote:
>>>
>>>> Indeed aware of this:
>>>>
>>>> https://www.cdproject.net/en-US/Pages/Terms-And-Conditions.aspx
>>>>
>>>> But (i) it isn't openly licensed, (ii) I understand that a lot of data
>>>> goes through them which is not released...
>>>>
>>>> J.
>>>>
>>>>
>>>> On Tue, Feb 5, 2013 at 5:51 PM, Daniel Dietrich <
>>>> daniel.dietrich at okfn.org> wrote:
>>>>
>>>>> Just spotted this and thought I'd share it with those who didn't know it:
>>>>>
>>>>> https://www.cdproject.net/
>>>>>
>>>>> Daniel
>>>>>
>>>>> _______________________________________________
>>>>> open-sustainability mailing list
>>>>> open-sustainability at lists.okfn.org
>>>>> http://lists.okfn.org/mailman/listinfo/open-sustainability
>>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> Jonathan Gray <http://jonathangray.org/> | @jwyg<http://twitter.com/jwyg>
>>>> Director of Policy and Ideas
>>>> The Open Knowledge Foundation <http://okfn.org/> | @okfn<http://twitter.com/okfn>
>>>> Support our work: okfn.org/support
>>>
>>
>>
>