[Open-Legislation] Fwd: Public domain US legal data and code

Daniel Dietrich daniel.dietrich at okfn.org
Sun Oct 7 17:48:47 UTC 2012


Begin forwarded message:

> From: Eric Mill <eric at sunlightfoundation.com>
> Subject: Public domain US legal data and code
> Date: 5 October 2012 18:32:51 CEST
> To: transparency-tech at googlegroups.com
> Reply-To: transparency-tech at googlegroups.com
> Hi all,
> I've been working for the last month or two with Josh Tauberer (of GovTrack.us) and Derek Willis on a project to produce a public domain scraper and dataset from THOMAS.gov, the official source for legislative information for the US Congress. 
> It's a reasonably well documented set of Python scripts, which you can find here:
> https://github.com/unitedstates/congress
> We just hit a great milestone - it gets everything important that THOMAS has on bills, back to the year THOMAS starts (1973). We've published and documented all of this data in bulk, and I've worked it into Sunlight's pipeline, so that searches for bills in Scout use data collected directly from this effort.
> The data and code are all hosted on Github on a "unitedstates" organization, which is right now co-owned by me, Josh, and Derek - the intent is to have this all exist in a common space. To the extent that the code needs a license at all, I'm using a public domain "unlicense" that should at least be sufficient for the US (other suggestions welcome).
> There's other great stuff in this organization, too - Josh made an amazing donation of his legislator dataset, and converted it to YAML for easy reuse. I've worked that dataset into Sunlight's products already as well. I've also moved my legal citation extractor into this organization -- and my colleague Thom Neale has an in-progress parser for the US Code, to convert it from binary typesetting codes into JSON.
> Github's organization structure actually makes possible a very neat commons. I'm hoping this model proves useful, both for us and for the public.
> -- Eric
> -- 
> Developer | sunlightfoundation.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-legislation/attachments/20121007/9db76c87/attachment.html>

More information about the open-legislation mailing list