[OpenGLAM] 2.5 million public domain images now available.

Mon Sep 1 08:53:10 UTC 2014

Hi everyone, welcome to September.

This already got a lot of attention over the weekend, but wanted to share
with you anyway cause it is really great. A research fellow has been
extracting over 2.5 million images from public domain books from the
internet archive. By using the OCR text that surround the images, it is
possible to quite accurately search for keywords. The metadata is of course
not perfect, but I've already seen some Wikimedians talking about ways to
improve this.
One could also think about the methods that the British Library used for
their 1 million public domain images where they show you the 'least tagged'
ones. This has resulted in every images at least tagged once by now. See:
https://secure.flickr.com/photos/britishlibrary/sets/72157640284615695/

For more information about the release see:

http://openglam.org/2014/08/30/the-internet-archive-joins-flickr-commons/

Cheers,

Joris
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-glam/attachments/20140901/a3ec68d0/attachment-0002.html>