[open-humanities] 25,000 early modern texts to be released CC0
syeates at gmail.com
Wed Dec 31 00:40:35 UTC 2014
From my point of view, the format that I'm most interested in seeing is
OpenURL metadata will allow a huge swathe of modern systems to 'just
work' with this data. The list on the wikipedia article includes "Ovid,
Web of Science, SciFinder, Modern Languages Association Bibliography and
Google Scholar" https://en.wikipedia.org/wiki/OpenURL
On 31/12/14 12:09, James Cummings wrote:
> Hi all,
> I don't want to pre-empt our more official announcement in the next day
> or so, which I'll make sure to send to this list, but all these texts
> will be released in a variety of forms.
> The total number is actually more than 25000 (even in the phase 1 CC0
> release from 1 Jan 2015). The TEI P5 XML is the modern 'source' that you
> should use for any future derivative projects: I would strongly
> recommend not starting from the TCP original SGML. (A lot of work has
> gone into this conversion to preserve intellectual content and
> intentions of the original encoders.) In addition to the TEI P5 XML, a
> basic HTML Web version of each text, EPUB version, and a link to
> ProQuest's EEBO images (but you institution needs to subscribe to that)
> will be available from the University of Oxford Text Archive. The
> 'source' link will go to a GitHub repository for each text. There are
> tools we will provide on github for those wishing to download all of the
> individual texts.
> Happy New Year.
> On 30/12/14 12:45, John Levin wrote:
>> On 30/12/2014 11:45, Andrea Zanni wrote:
>>> Wow, cool!
>>> It will be possible to download all texts + metadata? There is
>>> an API
>>> for the website?
>> Don't know about APIs, but the notice linked does state that the
>> whole lot will be available from box.com.
>>> On Tue, Dec 30, 2014 at 11:28 AM, John Levin <john at anterotesis.com
>>> <mailto:john at anterotesis.com>> wrote:
>>> Notice from the Text Creation partnership:
>>> Texts to be released on January 1st.
>>> I think these are the same as the ones I uploaded ages ago
>>> to CKAN
>>> (can't find URL right now), but with full xml markup.
>>> Uses? Not sure, and I don't have the time to do anything
>>> with them.
>>> Perhaps they could be treated a la Project Gitenberg?
>>> Happy new year!
More information about the open-humanities