[open-linguistics] open-linguistics Digest, Vol 33, Issue 8

Tue Jul 16 10:11:10 UTC 2013

Dear all,

I summarized the discussion in a wiki page  
(http://wiki.okfn.org/Llod-categories) together with a provisional  
alignment of OLAC, LRE and proposed LLOD categories.

@Paul: aligning with LRE Map categories is a great idea, but their terms  
are nowhere defined, and I personally have some difficulties to decide how  
to align them (hence the question marks at several points). In particular,  
I can imagine different meanings for lre:terminology, lre:lexicon and  
lre:ontology and their respective interpretation.

@John: As the subclassification of the older corpus category was something  
you brought up in the discussion (it was already applied at MLODE), I  
realized that features like multilingual (bitext) and multimodal  
(multimodal-corpus) are actually applicable also to the lexicon category  
and to typological data bases. Maybe, we should better model these as  
features independent from the resource categorization (similar to the LRE  
map). Then, we could provide alternative visualizations, one for the  
resource type, one for "modality" or whatever we'd like to call this  
feature.

Best,
Christian

On Mon, 15 Jul 2013 13:28:46 +0200, Paul Buitelaar  
<paul.buitelaar at deri.org> wrote:

> John, all, not sure if this has come up yet in respect to Categories for  
> data in the LLOD Cloud Diagram, but I would suggest to align with the  
> LRE map where possible:
>
> http://en.wikipedia.org/wiki/LRE_Map
>
>
> Paul
>
>
> On 13/07/2013 12:00, open-linguistics-request at lists.okfn.org wrote:
>> Send open-linguistics mailing list submissions to
>> 	open-linguistics at lists.okfn.org
>>
>> To subscribe or unsubscribe via the World Wide Web, visit
>> 	http://lists.okfn.org/mailman/listinfo/open-linguistics
>> or, via email, send a message with subject or body 'help' to
>> 	open-linguistics-request at lists.okfn.org
>>
>> You can reach the person managing the list at
>> 	open-linguistics-owner at lists.okfn.org
>>
>> When replying, please edit your Subject line so it is more specific
>> than "Re: Contents of open-linguistics digest..."
>>
>>
>> Today's Topics:
>>
>>     1. Re: Categories for data in the LLOD Cloud Diagram
>>        (Christian Chiarcos)
>>     2. Re: Categories for data in the LLOD Cloud Diagram (Jonathan Pool)
>>
>>
>> ----------------------------------------------------------------------
>>
>> Message: 1
>> Date: Fri, 12 Jul 2013 17:46:30 +0200
>> From: "Christian Chiarcos" <christian.chiarcos at web.de>
>> Subject: Re: [open-linguistics] Categories for data in the LLOD Cloud
>> 	Diagram
>> To: open-linguistics at lists.okfn.org, "Dave Lewis"
>> 	<dave.lewis at cs.tcd.ie>
>> Message-ID: <op.wz38rsefyoucrc at christian-pc>
>> Content-Type: text/plain; charset=utf-8; format=flowed; delsp=yes
>>
>> Hi Dave,
>>
>>> It raises an interesting question of whether bi-text should be a
>>> classification by itself, or is a characterization of the _link_  
>>> between
>>> two monolingual resources.
>>
>> We had this discussion yesterday, as well, I think that both views are
>> feasible and should probably be left to the data providers. There are
>> other multilingual ressources, e.g., interlinear glosses as harvested in
>> ODIN (http://linguistics-ontology.org/project/4?), where data for some
>> single language may be so marginal (say, 10 sentences or so, for a
>> selected language) that a language-by-language split wouldn't make much
>> sense. This is what prevents me from a general recommendation to  
>> represent
>> bitext (in whatever definition) as links between two monolingual  
>> resources.
>>
>> But maybe, this wouldn't bitext after all, would it?
>>
>> Best,
>> Christian
>>
>>
>>
>> ------------------------------
>>
>> Message: 2
>> Date: Fri, 12 Jul 2013 13:39:16 -0700
>> From: Jonathan Pool <pool at panlex.org>
>> Subject: Re: [open-linguistics] Categories for data in the LLOD Cloud
>> 	Diagram
>> To: open-linguistics at lists.okfn.org
>> Message-ID: <51E06974.3060204 at panlex.org>
>> Content-Type: text/plain; charset=UTF-8; format=flowed
>>
>>> The proposal is that we should divide language resources into 6 broad
>>> categories
>>
>> Sorry I missed the meeting today.
>>
>> If this is a partition, I think it is likely to create disagreement
>> about (1) how many categories should exist, (2) which categories should
>> exist, and (3) which category should be assigned to each resource.
>>
>> Less prescriptive solutions might be considered:
>>
>> A. Allowing multiple categories to be assigned to the same resource.
>>
>> B. Allowing additional categories to be created.
>>
>> C. Allowing multiple coexisting authorities to create sets of categories
>> and assign them to resources.
>>
>>
>>
>> ------------------------------
>>
>> _______________________________________________
>> open-linguistics mailing list
>> open-linguistics at lists.okfn.org
>> http://lists.okfn.org/mailman/listinfo/open-linguistics
>> Unsubscribe: http://lists.okfn.org/mailman/optionss/open-linguistics
>>
>>
>> End of open-linguistics Digest, Vol 33, Issue 8
>> ***********************************************
>>
>
> _______________________________________________
> open-linguistics mailing list
> open-linguistics at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/open-linguistics
> Unsubscribe: http://lists.okfn.org/mailman/options/open-linguistics
>

-- 
Christian Chiarcos
Applied Computational Linguistics
Johann Wolfgang Goethe Universität Frankfurt a. M.
60054 Frankfurt am Main, Germany

office: Robert-Mayer-Str. 10, #401b
mail: chiarcos at informatik.uni-frankfurt.de
web: http://acoli.cs.uni-frankfurt.de
tel: +49-(0)69-798-22463
fax: +49-(0)69-798-28931