[Open-access] Trying to index the malaria literature for BOAI-Openness - what has to be done paper-by-paper?

Daniel Mietchen daniel.mietchen at googlemail.com
Tue Mar 13 15:46:22 UTC 2012


Google does not give a simple list of results, and it currently yields
over 80k hits for malaria on PMC:
https://www.google.com/search?q=malaria+site%3Awww.ncbi.nlm.nih.gov%2Fpmc .

However, the Crawler currently being coded as part of the Open Access
Media Importer (cf.
http://wir.okfn.org/2012/03/10/open-access-media-importer-apology-frontend-usage/
) does almost what you are looking for, and so it should not be too
difficult to modify it accordingly.

I have copied Nils and Raphael - my partners on this - into this mail.

Cheers,

Daniel


On Tue, Mar 13, 2012 at 4:14 PM, Peter Murray-Rust <pm286 at cam.ac.uk> wrote:
>
>
> On Tue, Mar 13, 2012 at 3:09 PM, Daniel Mietchen
> <daniel.mietchen at googlemail.com> wrote:
>>
>> Just seen in
>>
>> http://blogs.ch.cam.ac.uk/pmr/2012/03/13/sparc2012-a-manifesto-in-absentia-for-open-data/
>> :
>> "Our recent @ccess group is trying to index the malaria literature for
>> BOAI-Openness and it has to be done paper-by-paper"
>>
>> I am not sure what PMR had in mind here that "has to be done
>> paper-by-paper", but albeit PMC / PMUK do indeed a bad job in
>> identifying papers by their licenses,
>
>
> That's exactly what I meant!
>>
>> Google does it fairly well:
>> 216 PMC articles under CC0 are being brought up in search for "malaria"
>>
>> https://www.google.com/search?q=malaria+site%3Awww.ncbi.nlm.nih.gov%2Fpmc+%22Creative+Commons+Public+Domain%22
>> as well as over 9000 under CC BY:
>>
>> https://www.google.com/search?q=malaria+site%3Awww.ncbi.nlm.nih.gov%2Fpmc+%22http%3A%2F%2Fcreativecommons.org%2Flicenses%2Fby%22+-%22by-sa%22++-%22by-nc%22++-%22by-nd%22
>> (for further stats, see
>>
>> http://wir.okfn.org/2011/07/14/a-wiki-approach-to-open-access-and-open-science/#comment-44
>> ).
>>
>> Of course, the relevance of these papers for Malaria World cannot be
>> established that way.
>>
> Malaria World gave us 70K references and the questions is how we establish
> the Openness of them. We can start with Google and maybe intersect the sets.
> Does google give a simple list of the 9000 papers?
>
>>
>> _______________________________________________
>> open-access mailing list
>> open-access at lists.okfn.org
>> http://lists.okfn.org/mailman/listinfo/open-access
>
>
>
>
> --
> Peter Murray-Rust
> Reader in Molecular Informatics
> Unilever Centre, Dep. Of Chemistry
> University of Cambridge
> CB2 1EW, UK
> +44-1223-763069




More information about the open-access mailing list