[open-science] Content Mining Workshop

Mr. Puneet Kishor punk.kish at gmail.com
Fri Nov 2 23:23:15 UTC 2012


(Wish I could be there).

When the workshop is done, please package up the proceedings and any tools/presentations you demonstrate or develop so they may be reused. I can well imagine conducting one here State-side, for paleo as well as neo folks.


On Nov 3, 2012, at 7:09 AM, Ross Mounce <ross.mounce at gmail.com> wrote:

> Likewise I think the workshop/meeting is a splendid and timely idea.
> 
> I've already had a few inquiries about textmining from
> *palaeontologists*(and why not?) after I talked about mining in a
> recent conference talk in
> North Carolina (one of whom is in Oxford actually, John Clarke might well
> be interested in this, I've cc'd him into this message).
> 
> As John and I were only just discussing earlier today, the scope and
> potential for mining techniques is massive and would be useful in so many
> different fields. e.g. anyone wanting to do a systematic
> (all-literature-encompassing) review on a broad topic
> 
> I'll definitely try and be there if I can. I can't do Feb 11th-15th though
> (at another conference).
> 
> Best,
> 
> Ross
> 
> On 2 November 2012 20:32, Peter Murray-Rust <pm286 at cam.ac.uk> wrote:
> 
>> I think this is a really good idea - I'd been thinking about suggesting it
>> to OKF. People getting into this area spend a lot of time hacking about
>> with substandard tools and ending up inefficient. I might be in UK end of
>> Feb onwards and would certainly want to help.
>> 
>> The vital need IMO is to list all the current tools and comment briefly on
>> them. There are several levels (off the top of my head):
>> * discovery - where is the content
>> * scraping - getting it onto your machine (or relevant cloud)
>> * extracting into "readable" form (e.g. PDF2Foo) - I am blogging about this
>> * finding tables
>> * searching for the bits you want (e.g. OSCAR for chemistry)
>> * linking to known ontologies (e.g. PubChem)
>> * fiinding somewhere to put the results
>> 
>> and, of course we need to touch on legal aspects and other procedural
>> matters.
>> 
>> I can contribute chemistry (which although it may not be immediately
>> relevant to everyone shows many of the aspects and most people know
>> slightly what it is about - they have heard of molecules and chemical
>> structures.
>> 
>> On Fri, Nov 2, 2012 at 4:05 PM, Jenny Molloy <jcmcoppice12 at gmail.com>wrote:
>> 
>>> Hi All
>>> 
>>> I've had a request from a grad student via the Oxford Open Science group
>>> to run a hands-on workshop on data/content mining from the scientific
>>> literature, which sounds like an excellent idea.
>>> 
>>> ...
>>> 
>> 
>> 
>> 
>> --
>> Peter Murray-Rust
>> Reader in Molecular Informatics
>> Unilever Centre, Dep. Of Chemistry
>> University of Cambridge
>> CB2 1EW, UK
>> +44-1223-763069
>> ..
> 
> 
> -- 
> -- 
> -/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-
> Ross Mounce
> PhD Student & Open Knowledge Foundation Panton Fellow
> Fossils, Phylogeny and Macroevolution Research Group
> University of Bath, 4 South Building, Lab 1.07
> http://about.me/rossmounce

--
Puneet Kishor
Science and Data Policy at Creative Commons






More information about the open-science mailing list