[open-bibliography] Norwegian Personal Name Authority File

Tom Morris tfmorris at gmail.com
Thu Apr 12 13:44:01 UTC 2012


Has anyone here attempted to work with the linked data version of the
Norwegian personal name authorities file?

http://www.bibsys.no/files/out/linked_data/autreg/index.html
http://openbiblio.net/2011/04/06/radata-na-norwegian-personal-name-authorities-as-linked-open-data/

I noticed that it had some owl:sameAs links to VIAF in it, so I
figured it would be pretty easy to link up with other data, but then I
discovered that they've got multiple VIAF links for a single name
authority record as well as a single VIAF record being linked to
multiple name authority records.  I can just ignore anything which is
multiply linked, but it makes me suspicious of the quality of the rest
of the links.  I'm wondering if perhaps they just did a simple name
match without any criteria.  This would make it inadvisable to use
even the 1:1 matches.

Here's an example which is linked 17 times one way and 8 the other:

<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x06016930>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x07019560>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x09068253>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x10038002>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x98024353>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90637787>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90883724>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90883799>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90886687>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90883648>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90883793>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90914430>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x90935347>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x97001128>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x00004331>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x00032556>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.

<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs>
<http://viaf.org/viaf/103384140> .
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/20834817>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/42034050>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/73957208>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/33173455>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs>
<http://viaf.org/viaf/100194281> .
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs>
<http://d-nb.info/gnd/140075895> .
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs> <http://viaf.org/viaf/71457471>
.
<http://data.bibsys.no/data/notrbib/authorityentry/x03106769>
<http://www.w3.org/2002/07/owl#sameAs>
<http://dbpedia.org/resource/John_Harris_%28USN%29> .

Has anyone else tried to use these owl:sameAs links?  Any advice to offer?

For anyone who's interested, here are some basic stats on the data set:

1.2 GB (it's an uncompressed(!) download, but compresses to 85 MB)

1,441,148 entries
  primary name in normal order duplicated in foaf:name, rdfs:label,
  primary name in inverted order in
http://def.bibsys.no/xmlns/radatana/1.0#catalogueName

  311,447 VIAF links  (owl:sameAs)
  209,888 DNB links
   30,422 DBpedia links

   95,488 aliases (skos:altLabel)
   57,218 birth year (whois:since http://www.kanzaki.com/ns/whois#since)
    9,758 death year (whois:until)

Tom




More information about the open-bibliography mailing list