[open-science] Content Mining Workshop
Mr. Puneet Kishor
punk.kish at gmail.com
Fri Nov 2 23:23:15 UTC 2012
(Wish I could be there).
When the workshop is done, please package up the proceedings and any tools/presentations you demonstrate or develop so they may be reused. I can well imagine conducting one here State-side, for paleo as well as neo folks.
On Nov 3, 2012, at 7:09 AM, Ross Mounce <ross.mounce at gmail.com> wrote:
> Likewise I think the workshop/meeting is a splendid and timely idea.
>
> I've already had a few inquiries about textmining from
> *palaeontologists*(and why not?) after I talked about mining in a
> recent conference talk in
> North Carolina (one of whom is in Oxford actually, John Clarke might well
> be interested in this, I've cc'd him into this message).
>
> As John and I were only just discussing earlier today, the scope and
> potential for mining techniques is massive and would be useful in so many
> different fields. e.g. anyone wanting to do a systematic
> (all-literature-encompassing) review on a broad topic
>
> I'll definitely try and be there if I can. I can't do Feb 11th-15th though
> (at another conference).
>
> Best,
>
> Ross
>
> On 2 November 2012 20:32, Peter Murray-Rust <pm286 at cam.ac.uk> wrote:
>
>> I think this is a really good idea - I'd been thinking about suggesting it
>> to OKF. People getting into this area spend a lot of time hacking about
>> with substandard tools and ending up inefficient. I might be in UK end of
>> Feb onwards and would certainly want to help.
>>
>> The vital need IMO is to list all the current tools and comment briefly on
>> them. There are several levels (off the top of my head):
>> * discovery - where is the content
>> * scraping - getting it onto your machine (or relevant cloud)
>> * extracting into "readable" form (e.g. PDF2Foo) - I am blogging about this
>> * finding tables
>> * searching for the bits you want (e.g. OSCAR for chemistry)
>> * linking to known ontologies (e.g. PubChem)
>> * fiinding somewhere to put the results
>>
>> and, of course we need to touch on legal aspects and other procedural
>> matters.
>>
>> I can contribute chemistry (which although it may not be immediately
>> relevant to everyone shows many of the aspects and most people know
>> slightly what it is about - they have heard of molecules and chemical
>> structures.
>>
>> On Fri, Nov 2, 2012 at 4:05 PM, Jenny Molloy <jcmcoppice12 at gmail.com>wrote:
>>
>>> Hi All
>>>
>>> I've had a request from a grad student via the Oxford Open Science group
>>> to run a hands-on workshop on data/content mining from the scientific
>>> literature, which sounds like an excellent idea.
>>>
>>> ...
>>>
>>
>>
>>
>> --
>> Peter Murray-Rust
>> Reader in Molecular Informatics
>> Unilever Centre, Dep. Of Chemistry
>> University of Cambridge
>> CB2 1EW, UK
>> +44-1223-763069
>> ..
>
>
> --
> --
> -/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/-
> Ross Mounce
> PhD Student & Open Knowledge Foundation Panton Fellow
> Fossils, Phylogeny and Macroevolution Research Group
> University of Bath, 4 South Building, Lab 1.07
> http://about.me/rossmounce
--
Puneet Kishor
Science and Data Policy at Creative Commons
More information about the open-science
mailing list