[open-bibliography] Place of Publication data from the BL dataset

Ben O'Steen bosteen at gmail.com
Thu Nov 25 11:16:55 UTC 2010


I've pulled out the place of publication(?) data (isbd:P1016) from the
BL BNB dataset and compiled it into a sorted spreadsheet, focussed on
the locations:

http://bit.ly/g2l2tM

Due to google's limitations, I've broken the spreadsheet up into 12.

sortedlocations0-12 ->

"0" - "00:0/0" to "Birmingham] ([Red House, Hill Lane, Great Barr,
Birmingham B43 6LZ])"

"1" - "Birmingham] ([Reference Library, Birmingham B3 3HQ])" to "Cheadle
(266 Councillor La., Cheadle, Cheshire)"

"2" - "Cheadle (266 Councillor La., Cheadle, Cheshire SK8 5PN)" to
"Dunmow (37 The Close, Dunmow, Essex CM6 1EN)"

"3" - "Dunmow]" to "Grand Rapids, Miss"

"4" - "Grand Rapids, MI, USA" to "Kirkcudbright (77 High St.,
Kirkcudbright DG6 4JW)"

"5" - "Kirkcudbright (Tongland, Kirkcudbright)" to "London (19 Cornwall
Terrace, NW1 4QP)"

"6" - "London (19 Douglas St., London, SW1P 4PA)" to "London (6 Hugh
St., Pimlico, London SW1V 1RP)"

"7" - "London (6 Hugo St., London, SW1V 1RP)" to "London (Scorpio House,
106 Church Rd., London SE19 2UB)"

"8" - "London (Scorpio House, 106 Church Rd., London SE19 2UB)" to
"Newport (National Foaling Bank, Meretown Stud, Newport, Salop)"

"9" - "Newport (Newbridge Works Ltd, Chepstow Rd, Newport, Gwent NP5
4TW)" to "Research Section, Dublin"

"10" - "Research Studies Press" to "St. Sampson"

"11" - "St. Sampson] Guernsey" to "Wolverhampton (50 Queen St.,
Wolverhampton WV1 3BU)"

"12" - "Wolverhampton (59 Waterloo Rd, Wolverhampton W1V 4QT)" to
"Zwolle"

The grouping of locations is done purely by exact string matching, so as
not to interfere with the data too much at this point.

Eg 'London' is not equal to 'London]' or 'London (14 C...)'

Also note, that many records have multiple P1016 values.

I can perform a full export of this, so that each row has a full list of
GB Ids in which the location occurs, but it will overload GDocs, and a
number of normal spreadsheet programs so best used programatically.

Note that this set is just to aid you in exploring the data held in
bnb.bibliographica.org and to help avoid DoS from too many location
lookups! :)

Ben





More information about the open-bibliography mailing list