No subject


Sun Dec 12 18:29:16 UTC 2010


surmise that the NCBI taxonomy, presumably with a bias toward human
and model organisms, as well as its adoption by Uniprot, should be the
preferred choice for expressing taxon info in the context of
biomedical knowledge?

Next question (thinking that I know the answer) is how to integrate
datasets that use one of each: NCBI and Darwin? Of course, we can
shelve this and come back to it some other year, if we want to dig
into something more specifically biomedical. The issue will eventually
come back to haunt us in any case. For example, with chemical
identifiers..

Also, is taxon out of scope for Identifier.org ?

Cheers,
Scott

===================================
[Jerven]
Hi All,

Don't wish to spam the mailing list about which taxonomy to use. However if
you actually look at the darwin and uniprot taxonomy "schema" then they are
very similar. And even in the taxonomy world it doesn't have that many
controversies.

Its the instances that get hairy.
i.e. is it

Dugu is a rodentia
<purl.uniprot.org/taxonomy/10160> rdfs:subClassOf <
purl.uniprot.org/taxonomy/9989>
or
Dugu is a Caviomorpha
<purl.uniprot.org/taxonomy/10160> rdfs:subClassOf <
http://dbpedia.org/resource/Caviomorpha>

Which gets taxonomist all exited :)

Mapping schemas is easy to do here. Its mapping instances that get the
feuds started :D

Regards,
Jerven

============================================
PMR comment - this isn't spam, it's science!
============================================
PMR - thanks for this. If we can make progress on identifiers it makes *me*
happy!
-- 
Peter Murray-Rust
Reader in Molecular Informatics
Unilever Centre, Dep. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069

--0015173ff2d4dc66c304b31f8b60
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Jenny has highlighted that we shall be using this list to discuss the hacka=
thon. I suggest we use a separate title for each thread, prefaced by HACKAT=
HON-<br><br>My problem is how to create indentiers for (say) viruses. If yo=
u look at Wikipedia it doesn&#39;t give IDs. But I then discovered (by chan=
ce) taxid: which gives numbers. But the pages don&#39;t give static URIs (t=
hey contain cgi). I am cutting and pasing the discussion (and I shal;l refe=
r any others to here.<br>
<br><table class=3D"cf gJ" cellpadding=3D"0"><tbody><tr><td class=3D"gF gK"=
><table class=3D"cf ix" cellpadding=3D"0"><tbody><tr><td><div class=3D"iw">=
<span class=3D"ik"><img title=3D"" class=3D"de" id=3D"upi" name=3D"upi" src=
=3D"https://mail.google.com/mail/images/cleardot.gif" height=3D"16px" width=
=3D"16px"></span><span class=3D"gD" style=3D"color:#790619">Jerven Bollema<=
/span> </div>
</td></tr></tbody></table></td><td class=3D"gH"><br></td><td class=3D"gH"><=
br></td></tr></tbody></table>Hi Peter, All,<br>
<br>
All taxons in the UniProt taxonomy can be found via (<a href=3D"http://purl=
.uniprot.org/taxonomy/10305" target=3D"_blank">http://purl.uniprot.org/taxo=
nomy/10305</a>).
 This is synchronized with the NCBI taxonomy and is the same in the=20
public version (release delta excepted). Some limited NCBI taxonomy=20
curation happens at the Swiss-Prot group which also does the UniProt rdf
 work (Guess where I work ;).<br>
<br>
In this case you actually have an link in rdf from the herpes virus to=20
its hosts. The proteins it encodes (might not be all for each virus=20
isolate e.g. in this case only a single virion membrane protein is=20
known) and links to relevant papers as well as related virion proteins.<br>
Will love to show you all how you can get this data in RDF and work with it=
 using SPARQL.<br>
<br>
Regards,<br>
Jerven<br clear=3D"all">=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D<br><br><table class=3D"cf gJ" cellpadding=3D"0"><tbody><tr><td cl=
ass=3D"gF gK"><table class=3D"cf ix" cellpadding=3D"0"><tbody><tr><td><div =
class=3D"iw"><span class=3D"gD" style=3D"color:#c88900">M. Scott Marshall</=
span> <span class=3D"hb"><span class=3D"g2"></span> </span></div>
</td></tr></tbody></table></td><td class=3D"gH"><div class=3D"gK"><span cla=
ss=3D"iD">show details</span> <span id=3D":1vn" class=3D"g3" title=3D"Fri, =
Dec 2, 2011 at 11:36 AM" alt=3D"Fri, Dec 2, 2011 at 11:36 AM">11:36 AM (6 h=
ours ago)</span> <span></span></div>
</td><td class=3D"gH"><br></td></tr></tbody></table>Dear Peter and Jerven,<=
br>
<br>
Nice blog with dawg and frog!<br>
<br>
Thanks for the answer Jerven. I&#39;m looking forward to this.<br>
<br>
Not wanting to start (too much) commotion but also bumped into this<br>
for taxons: <a href=3D"http://rs.tdwg.org/dwc/index.htm" target=3D"_blank">=
http://rs.tdwg.org/dwc/index.htm</a><br>


More information about the open-science mailing list