[open-bibliography] Metadata aggregators, discovery tools and libraries

Jim Pitman pitman at stat.Berkeley.EDU
Mon Jan 24 18:16:15 UTC 2011


Peter Murray-Rust <pm286 at cam.ac.uk> wrote:

> Just got back from US  but have been talking witrh Sam Adams about our tools
> and we are fairly optimistic that we can technically scrape a lot. We may
> need per-publisher crowdsourcing of templates, etc.

BibSonomy has a suite of per-publisher scraping tools which are openly available, and also usable
by webservice 

http://www.bibsonomy.org/scraperinfo
http://scraper.bibsonomy.org/

I suggest integration of these tools with an OKFN supported effort.

> > We have more than that - we have ca 150,000 articles. 

Please can you expose this dataset with CC0 or whatever as a test dataset for
the community to exercise various tools?  

> I think we will have to rescrape the biblio but that's tractable.  
> I am becoming very excited about the ideas of community-scraping and I think it will scale. We won't
> get everything initially but we will get the stuff that is cared about. 

I strongly agree with that.  

--Jim P.
----------------------------------------------
Jim Pitman
Director, Bibliographic Knowledge Network Project
http://www.bibkn.org/

Professor of Statistics and Mathematics
University of California
367 Evans Hall # 3860
Berkeley, CA 94720-3860

ph: 510-642-9970  fax: 510-642-7892
e-mail: pitman at stat.berkeley.edu
URL: http://www.stat.berkeley.edu/users/pitman




More information about the open-bibliography mailing list