[ddj] 5811 datasets in beta EU data portal

Rufus Pollock rufus.pollock at okfn.org
Fri Jan 11 15:15:38 UTC 2013


You know this a CKAN based data catalog so there's an API from which
you can get all of this directly without needing to scrape ;-)

http://open-data.europa.eu/open-data/data/api

http://open-data.europa.eu/open-data/data/api/search/dataset?limit=2&all_fields=1

Rufus

On 11 January 2013 10:25, Michael Bauer <michael.bauer at okfn.org> wrote:
> The metadata:
>
> https://scraperwiki.com/scrapers/metadata_european_open_data_catalog/
>
> (hope this runs through...)
>
> You want to use:
>
> https://scraperwiki.com/docs/api?name=metadata_european_open_data_catalog#sqlite
>
> with the query:
>
> select license,count(dataset_url) from `swdata` group by license
>
> Michael
>
> On Fri, Jan 11, 2013 at 08:56:47AM +0100, Michael Bauer wrote:
>> Can we get the metadata out and do a statistic of this. This would mean
>> that the majority of the data in the open data portal are not open (as in
>> the opendefinition.org)
>>
>> Michael
>>
>> On Thu, Jan 10, 2013 at 09:29:28PM +0100, Benjamin Ooghe-Tabanou wrote:
>> > Like on most data catalogs, each dataset is released under a specific licence.
>> > In the EU Commission's case, there are three different ones used :
>> > cc-by for about 100 datasets, a few dozens under Europa Legal Notice,
>> > which seems complex but opendata-compatible, and the whole others,
>> > meaning more than 95%, are released under the Eurostat
>> > Copyright/Licence Policy
>> > <http://epp.eurostat.ec.europa.eu/portal/page/portal/about_eurostat/policies/copyright_licence_policy>
>> > This last one explicitly says that commercial reuse is possible except
>> > in a broad variety of situations which the user obviously has to
>> > identify for himself : this is the exact opposite of an opendata
>> > licence that would get rid of any legal uncertainty that could put the
>> > reuser at risk as defined in the opendefinition.
>> >
>> > Benjamin
>> >
>> > On Thu, Jan 10, 2013 at 6:04 PM, Ahmed ElAmin <elamin.ahmed at gmail.com> wrote:
>> > > I don't know if you are correct Benjamin. Where did you get the info
>> > > that it is for non-commercial use? Even if it says that, media can
>> > > analyse and publish the data (eg see UK gov transparency data
>> > > publishing, which is what this exercise by the Commission is based
>> > > on). Non-commercial use normally means companies cannot package and
>> > > resell the data. That's my understanding. I can always contact the
>> > > Commission to clarify if people are still unsure. Here is what the
>> > > site says (though it is badly put):
>> > >
>> > > 'This portal is about transparency, open government and innovation.
>> > > The European Commission Data Portal provides access to open public
>> > > data from the European Commission. It also provides access to data of
>> > > other Union institutions, bodies, offices and agencies at their
>> > > request. The published data can be downloaded by everyone interested
>> > > to facilitate reuse, linking and the creation of innovative services.
>> > > Moreover, this Data Portal promotes and builds literacy around
>> > > Europe’s data. The data publishers, application developers and the
>> > > general public can also use new functionalities enabled by the
>> > > semantic technologies.'
>> > >
>> > > On 10 January 2013 15:23, Benjamin Ooghe-Tabanou <b.ooghe at gmail.com> wrote:
>> > >> That is the issue... Does a media reusing data make a commercial
>> > >> reuse? No one knows, and everyone is in a high legal incertainty
>> > >> situation...
>> > >>
>> > >> Benjamin
>> > >>
>> > >>
>> > >> On Thu, Jan 10, 2013 at 3:10 PM, Robin Linderborg
>> > >> <robin.linderborg at gmail.com> wrote:
>> > >>> Just of curiosity, where does one draw the line with non-commercial data?
>> > >>> Surely, it must be allowed to reference the results, although not in great
>> > >>> detail.
>> > >>>
>> > >>> /Robin Linderborg
>> > >>>
>> > >>>
>> > >>> 2013/1/10 Benjamin Ooghe-Tabanou <b.ooghe at gmail.com>
>> > >>>>
>> > >>>> Warning : most of this data can not be used by medias as it is
>> > >>>> released under a non-commercial licence, which is quite surprising
>> > >>>> coming from the european commission repeating in loop the strong
>> > >>>> interest of OpenData for economic purposes...
>> > >>>>
>> > >>>> Benjamin
>> > >>>>
>> > >>>>
>> > >>>> On Thu, Jan 10, 2013 at 11:53 AM, Ahmed ElAmin <elamin.ahmed at gmail.com>
>> > >>>> wrote:
>> > >>>> > Very interesting and sometimes bizzare kind of microdata begin put
>> > >>>> > online as part of a Commission transparency exercise. Here is a sample
>> > >>>> > of what can be found begging for infographic kind of treatment.
>> > >>>> > http://open-data.europa.eu/open-data/data/
>> > >>>> >
>> > >>>> >
>> > >>>> >     Psychiatric care beds in hospitals was updated on 09/01/13.
>> > >>>> >     Total length of railway lines was updated on 09/01/13.
>> > >>>> >     Public electronic procurement systems was updated on 09/01/13.
>> > >>>> >     Enterprises using Internet for interaction with public authorities
>> > >>>> > (NACE Rev. 1.1) was updated on 09/01/13.
>> > >>>> >     Public services - Individuals was updated on 09/01/13.
>> > >>>> >     Enterprises using the Internet for submitting a proposal in a
>> > >>>> > public electronic tender system to public authorities was updated on
>> > >>>> > 09/01/13.
>> > >>>> >     Population connected to independent wastewater collecting systems:
>> > >>>> > with treatment was updated on 09/01/13.
>> > >>>> >     Gross value added - NACE Rev.1: L-P - current prices was updated
>> > >>>> > on 09/01/13.
>> > >>>> >     Water abstracted for public water supply was updated on 09/01/13.
>> > >>>> >     Compensation of employees by NACE Rev.1 was updated on 09/01/13.
>> > >>>> >
>> > >>>> > Cheers
>> > >>>> > Ahmed
>> > >>>> >
>> > >>>> > _______________________________________________
>> > >>>> > data-driven-journalism mailing list
>> > >>>> > data-driven-journalism at lists.okfn.org
>> > >>>> > http://lists.okfn.org/mailman/listinfo/data-driven-journalism
>> > >>>> > Unsubscribe:
>> > >>>> > http://lists.okfn.org/mailman/options/data-driven-journalism
>> > >>>>
>> > >>>> _______________________________________________
>> > >>>> data-driven-journalism mailing list
>> > >>>> data-driven-journalism at lists.okfn.org
>> > >>>> http://lists.okfn.org/mailman/listinfo/data-driven-journalism
>> > >>>> Unsubscribe: http://lists.okfn.org/mailman/options/data-driven-journalism
>> > >>>
>> > >>>
>> > >>
>> > >> _______________________________________________
>> > >> data-driven-journalism mailing list
>> > >> data-driven-journalism at lists.okfn.org
>> > >> http://lists.okfn.org/mailman/listinfo/data-driven-journalism
>> > >> Unsubscribe: http://lists.okfn.org/mailman/options/data-driven-journalism
>> >
>> > _______________________________________________
>> > data-driven-journalism mailing list
>> > data-driven-journalism at lists.okfn.org
>> > http://lists.okfn.org/mailman/listinfo/data-driven-journalism
>> > Unsubscribe: http://lists.okfn.org/mailman/options/data-driven-journalism
>>
>> --
>> Data Wrangler with the Open Knowledge Foundation (OKFN.org)
>> GPG/PGP key: http://tentacleriot.eu/mihi.asc
>> Twitter: @mihi_tr Skype: mihi_tr
>>
>> _______________________________________________
>> data-driven-journalism mailing list
>> data-driven-journalism at lists.okfn.org
>> http://lists.okfn.org/mailman/listinfo/data-driven-journalism
>> Unsubscribe: http://lists.okfn.org/mailman/options/data-driven-journalism
>
> --
> Data Wrangler with the Open Knowledge Foundation (OKFN.org)
> GPG/PGP key: http://tentacleriot.eu/mihi.asc
> Twitter: @mihi_tr Skype: mihi_tr
>
> _______________________________________________
> data-driven-journalism mailing list
> data-driven-journalism at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/data-driven-journalism
> Unsubscribe: http://lists.okfn.org/mailman/options/data-driven-journalism



-- 
Co-Founder, Open Knowledge Foundation
Promoting Open Knowledge in a Digital Age
http://www.okfn.org/ - http://blog.okfn.org/




More information about the data-driven-journalism mailing list