[open-linguistics] LLOD diagram draft
Christian Chiarcos
christian.chiarcos at web.de
Thu Apr 24 15:20:57 UTC 2014
Dear Philipp,
> In Metashare the following top-level distinctions are made:
>
> 1) corpus (including written/text, oral/spoken, multimodal/multimedia
> corpora),
> 2) lexical/conceptual resource (including terminological resources, word
> lists, semantic lexica,
> ontologies, etc.),
> 3) language description (including grammars, typological databases,
> courseware, etc.),
...
> I propose we adopt the categories 1)-3) directly as they cover exactly
> the categories we had in mind. Reusing these names would contribute to
> overall interoperability.
I agree, these categories correspond to our top-level categories by
intention, and the majority just decided to adopt the lexical/conceptual
resource label in the diagram.
As for the "language description" category, I am more sceptical. In fact,
we *had* adopted Metashare's "language description" in the September 2013
edition of the diagram, but it was not received too well, mostly because
the structured data we have in this group is not what a linguist would
expect under the label "description". In particular, there are no text
books on grammar modeled in RDF (and I don't see any coming, even if, they
would probably be XML+RDFa rather than fully modelled as linked data, with
RDFa providing metadata only).
As a consequence, Bettina dissolved this group into "ontology", "language
database" and "other". For presentational reasons, I grouped them together
in the new diagram draft again, but the only acceptable generalization I
could come up with was "metadata". At least, I don't think that the
earlier issues with "language description" have been resolved.
I guess there is no chance to convince the Metashare holders to adopt our
current terminology, as all their data is "metadata" in some way. But I am
(and I guess everyone else, as well) absolutely open to discuss with them
to develop common terminology. But as said before, "language description"
caused irritations and should be avoided.
Best,
Christian
--
Christian Chiarcos
Applied Computational Linguistics
Johann Wolfgang Goethe Universität Frankfurt a. M.
60054 Frankfurt am Main, Germany
office: Robert-Mayer-Str. 10, #401b
mail: chiarcos at informatik.uni-frankfurt.de
web: http://acoli.cs.uni-frankfurt.de
tel: +49-(0)69-798-22463
fax: +49-(0)69-798-28931
More information about the open-linguistics
mailing list