[humanities-dev] Shakespeare Annotations

Nick Stenning nick at whiteink.com
Wed Apr 18 15:42:17 UTC 2012

On 12 Apr 2012, at 19:43, Andrew Magliozzi <andrew at finalsclub.org> wrote:

> Ah Nick,
> I think you have come across one of our "gotchas". I believe our system was done on word count not character count.

Hi Andrew,

Words vs. characters isn't the issue here, I'm afraid. The index in
square brackets identifies the scene, not the character offset within
the scene. The offset lives in the "{start,end}Offset" fields, which
David has already converted from words to characters. In the example
below, the "end" property should undoubtedly be "/span[19]", not
"/span[20]". That's easy to correct in this case, but I don't want to
futz up any annotations that genuinely do span scenes, hence the
request for David to check over the code.



>    {
>      "text": " '\"Hecate\" is also ... scene i). '",
>      "uri": "The Tragedy of Macbeth 7.html",
>      "ranges": [{
>        "start": "/span[19]",
>        "end": "/span[20]",
>        "startOffset": 49,
>        "endOffset": 55
>      }],
>      "quote": " 'HECATE ",
>      "finalsclub_id": 5029
>    }
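
The check described above could be sketched roughly as follows. This is
a minimal illustration, not David's conversion code: the function names
are hypothetical, and the quote-length comparison is an assumed
heuristic for telling a mistaken "end" path from a range that really
does span scenes. It only flags ranges for manual review and changes
nothing.

```python
import re

# Matches the "/span[N]" paths seen in the annotation's "ranges" entries.
SPAN_PATH = re.compile(r"^/span\[(\d+)\]$")


def scene_index(path):
    """Extract N from a "/span[N]" path (N identifies the scene)."""
    match = SPAN_PATH.match(path)
    if match is None:
        raise ValueError(f"unexpected range path: {path!r}")
    return int(match.group(1))


def cross_scene_ranges(annotation):
    """List ranges whose start and end scenes differ, for manual review."""
    # Assumption: the quote, minus surrounding spaces and quote marks,
    # is the text the character offsets are meant to cover.
    quote_len = len(annotation["quote"].strip(" '\""))
    flagged = []
    for rng in annotation["ranges"]:
        start, end = scene_index(rng["start"]), scene_index(rng["end"])
        if start == end:
            continue
        width = rng["endOffset"] - rng["startOffset"]
        flagged.append({
            "id": annotation.get("finalsclub_id"),
            "start": start,
            "end": end,
            # Heuristic only: if the offsets alone account for the quote,
            # the selection probably sits inside the start scene.
            "likely_single_scene": width == quote_len,
        })
    return flagged


if __name__ == "__main__":
    example = {
        "uri": "The Tragedy of Macbeth 7.html",
        "ranges": [{"start": "/span[19]", "end": "/span[20]",
                    "startOffset": 49, "endOffset": 55}],
        "quote": " 'HECATE ",
        "finalsclub_id": 5029,
    }
    print(cross_scene_ranges(example))
```

For the annotation quoted above, the offsets cover six characters,
which matches "HECATE", so the range is reported with
"likely_single_scene" set to True. A range that fails that comparison
would still be listed, just without the hint, so nothing is silently
rewritten.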

