[open-humanities] 25,000 early modern texts to be released CC0
James Cummings
James.Cummings at it.ox.ac.uk
Tue Dec 30 23:09:48 UTC 2014
Hi all,
I don't want to pre-empt our more official announcement in the
next day or so, which I'll make sure to send to this list, but
all these texts will be released in a variety of forms.
The total number is actually more than 25000 (even in the phase 1
CC0 release from 1 Jan 2015). The TEI P5 XML is the modern
'source' that you should use for any future derivative projects:
I would strongly recommend not starting from the TCP original
SGML. (A lot of work has gone into this conversion to preserve
intellectual content and intentions of the original encoders.) In
addition to the TEI P5 XML, a basic HTML Web version of each
text, EPUB version, and a link to ProQuest's EEBO images (but you
institution needs to subscribe to that) will be available from
the University of Oxford Text Archive. The 'source' link will go
to a GitHub repository for each text. There are tools we will
provide on github for those wishing to download all of the
individual texts.
Happy New Year.
-James
On 30/12/14 12:45, John Levin wrote:
> On 30/12/2014 11:45, Andrea Zanni wrote:
>> Wow, cool!
>> It will be possible to download all texts + metadata? There is
>> an API
>> for the website?
>>
>
> Don't know about APIs, but the notice linked does state that the
> whole lot will be available from box.com.
>
> John
>
>> Andrea
>>
>> On Tue, Dec 30, 2014 at 11:28 AM, John Levin <john at anterotesis.com
>> <mailto:john at anterotesis.com>> wrote:
>>
>> Notice from the Text Creation partnership:
>>
>> http://www.__textcreationpartnership.org/__2014/12/24/eebo-tcp-phase-i-__public-release-what-to-expect-__on-january-1/
>>
>>
>> <http://www.textcreationpartnership.org/2014/12/24/eebo-tcp-phase-i-public-release-what-to-expect-on-january-1/>
>>
>> Texts to be released on January 1st.
>>
>> I think these are the same as the ones I uploaded ages ago
>> to CKAN
>> (can't find URL right now), but with full xml markup.
>>
>> Uses? Not sure, and I don't have the time to do anything
>> with them.
>> Perhaps they could be treated a la Project Gitenberg?
>>
>> Happy new year!
>>
>> John
>
>
--
Dr James Cummings, James.Cummings at it.ox.ac.uk
Academic IT Services, University of Oxford
More information about the open-humanities
mailing list