[open-linguistics] LLOD diagram draft

Christian Chiarcos christian.chiarcos at web.de
Thu Apr 24 15:20:57 UTC 2014


Dear Philipp,

> In Metashare the following top-level distinctions are made:
>
> 1) corpus (including written/text, oral/spoken, multimodal/multimedia
> corpora),
> 2) lexical/conceptual resource (including terminological resources, word
> lists, semantic lexica,
> ontologies, etc.),
> 3) language description (including grammars,  typological databases,
> courseware, etc.),

...

> I propose we adopt the categories 1)-3) directly as they cover exactly
> the categories we had in mind. Reusing these names would contribute to
> overall interoperability.

I agree, these categories correspond to our top-level categories by  
intention, and the majority just decided to adopt the lexical/conceptual  
resource label in the diagram.

As for the "language description" category, I am more sceptical. In fact,  
we *had* adopted Metashare's "language description" in the September 2013  
edition of the diagram, but it was not received too well, mostly because  
the structured data we have in this group is not what a linguist would  
expect under the label "description". In particular, there are no text  
books on grammar modeled in RDF (and I don't see any coming, even if, they  
would probably be XML+RDFa rather than fully modelled as linked data, with  
RDFa providing metadata only).

As a consequence, Bettina dissolved this group into "ontology", "language  
database" and "other". For presentational reasons, I grouped them together  
in the new diagram draft again, but the only acceptable generalization I  
could come up with was "metadata". At least, I don't think that the  
earlier issues with "language description" have been resolved.

I guess there is no chance to convince the Metashare holders to adopt our  
current terminology, as all their data is "metadata" in some way. But I am  
(and I guess everyone else, as well) absolutely open to discuss with them  
to develop common terminology. But as said before, "language description"  
caused irritations and should be avoided.

Best,
Christian
-- 
Christian Chiarcos
Applied Computational Linguistics
Johann Wolfgang Goethe Universität Frankfurt a. M.
60054 Frankfurt am Main, Germany

office: Robert-Mayer-Str. 10, #401b
mail: chiarcos at informatik.uni-frankfurt.de
web: http://acoli.cs.uni-frankfurt.de
tel: +49-(0)69-798-22463
fax: +49-(0)69-798-28931



More information about the open-linguistics mailing list