[Open-Legislation] oeil scraper

stef stefan.marsiske at gmail.com
Mon Feb 14 00:41:06 UTC 2011


hey all,

you might like this crawler/scraper:
https://github.com/stef/le-n-x/blob/master/brain/oeil.py

it scrapes every issue on oeil. it's currently completely independent from
the rest of le-n-x, so you can use it for your own purposes. it's agplv3.

i also have a json dump of all the data as of tonight, here:
http://www.ctrlc.hu/~stef/oeil.json.bz2 (approx. 30MB uncompressed)
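
a minimal sketch for reading the dump once it's downloaded. this assumes the
file decompresses to a single json document, which isn't spelled out above, so
check the layout first. load_dump is just an illustrative name, not something
from oeil.py.

```python
import bz2
import json


def load_dump(path):
    """Read a bz2-compressed JSON file and return the parsed data.

    Assumption: the decompressed file is one JSON document
    (not one JSON object per line).
    """
    # bz2.open in text mode decompresses on the fly while json parses
    with bz2.open(path, "rt", encoding="utf-8") as fh:
        return json.load(fh)
```

e.g. data = load_dump("oeil.json.bz2") after saving the file locally; what you
get back (list or dict) depends on how the dump is laid out.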

two output formats/backends are currently supported, mongodb and json.

coming up: rendering the db on the web as (xhtml|atom)+microformats/json

cheers,s

-- 
gpg: https://www.ctrlc.hu/~stef/stef.gpg
gpg fp: F617 AC77 6E86 5830 08B8  BB96 E7A4 C6CF A84A 7140



