[open-bibliography] Norwegian Personal Name Authority File
Tom Morris
tfmorris at gmail.com
Thu Apr 12 13:44:01 UTC 2012
Has anyone here attempted to work with the linked data version of the
Norwegian personal name authorities file?
http://www.bibsys.no/files/out/linked_data/autreg/index.html
http://openbiblio.net/2011/04/06/radata-na-norwegian-personal-name-authorities-as-linked-open-data/
I noticed that it had some owl:sameAs links to VIAF in it, so I
figured it would be pretty easy to link up with other data, but then I
discovered that they've got multiple VIAF links for a single name
authority record as well as a single VIAF record being linked to
multiple name authority records. I can just ignore anything which is
multiply linked, but it makes me suspicious of the quality of the rest
of the links. I'm wondering if perhaps they just did a simple name
match without any criteria. This would make it inadvisable to use
even the 1:1 matches.
Here's an example which is linked 17 times one way and 8 the other:
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x06016930>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x07019560>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x09068253>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x10038002>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x98024353>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90637787>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90883724>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90883799>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90886687>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90883648>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90883793>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90914430>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90935347>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x97001128>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x00004331>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x00032556>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs>
<http://viaf.org/viaf/103384140> .
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/20834817>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/73957208>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/33173455>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs>
<http://viaf.org/viaf/100194281> .
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs>
<http://d-nb.info/gnd/140075895> .
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/71457471>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs>
<http://dbpedia.org/resource/John_Harris_%28USN%29> .
Has anyone else tried to use these owl:sameAs links? Any advice to offer?
For anyone who's interested, here are some basic stats on the data set:
1.2 GB (it's an uncompressed(!) download, but compresses to 85 MB)
1,441,148 entries
primary name in normal order duplicated in foaf:name, rdfs:label,
primary name in inverted order in
http://def.bibsys.no/xmlns/radatana/1.0#catalogueName
311,447 VIAF links (owl:sameAs)
209,888 DNB links
30,422 DBpedia links
95,488 aliases (skos:altLabel)
57,218 birth year (whois:since http://www.kanzaki.com/ns/whois#since)
9,758 death year (whois:until)
Tom
More information about the open-bibliography
mailing list