[open-linguistics] LLOD diagram draft
Philipp Cimiano
cimiano at cit-ec.uni-bielefeld.de
Wed Apr 9 13:15:35 UTC 2014
Hi Marta,
thanks, very interesting. Given your input I am even more convinced
that reusing the upper category "LexicalConceptualResource" subsuming
all those things that you mention would be appropriate.
Philipp.
Am 09.04.14 15:10, schrieb Marta Villegas:
> Dear all,
>
> I'm afraid this is my first time at LLOD (sorry for not participating
> much). I'm sending you some few comments regarding Philpp''s mail
> about top categories.Currently, at UPF we follow MetaShare proposal
> and distinguish between:
>
> LexicalConceptualResource
> ComputationalLexicon
> Framenet
> Lexicon
> MachineReadableDictionary
> Ontology
> TerminologicalResource
> Thesaurus
> WordList
> Wordnet
> Corpus
> CorpusAudio
> CorpusCollection
> CorpusImage
> CorpusText
> CorpustextNgram
> CorpusVideo
>
> As you can see, Corpus sub-classes are defined according to Media Type.
> In MetaShare, things like 'parallel corpus' vs 'monolingual corpus'
> are encoded by means of multilinguality property
> which serve to distinguish between parallel, comparable and
> MultilingualSingleText.
> Similarly 'bilingual' vs 'monolingual' is encoded by means of
> linguality (for monolingual, bilingual and multilingual).
>
> You can have a look at the browser
> (http://lod.iula.upf.edu/types/Service). The ontology files are at
> http://purl.org/ms-lod/MetaShare.ttl
> http://purl.org/ms-lod/BioServices.ttl
> http://purl.org/ms-lod/UPF-MetadataRecords.ttl
>
> Please note that this is an ongoing project!!!!
>
> All the best!
>
>
> 2014-04-09 14:24 GMT+02:00 Philipp Cimiano
> <cimiano at cit-ec.uni-bielefeld.de
> <mailto:cimiano at cit-ec.uni-bielefeld.de>>:
>
> Dear all,
>
> apologies, but my connection here is very bad, so I can not
> follow the skype telco, so I provide my input here answering to
> the email of Christian.
>
> I like the top categories: corpus, lexicon metadata in principle.
> But I would recommend to reuse categories proposed by others. For
> example, the Metashare node of UPF uses the following categories
> (thanks to Jorge for providing them):
>
> *
>
> Lexical Conceptual Resource (94)
>
> o
>
> Lexicon (77)
>
> o
>
> Wordnet (6)
>
> o
>
> Terminological Resource (4)
>
> o
>
> Word List (4)
>
> o
>
> Ontology (3)
>
> *
>
> Corpus (30)
>
> * Tool Service (10)
>
>
> I think reusing these categories (except for Tool Service) would
> be fine. The numbers in brackets indicate the number of resoruces
> of the corresponding type available. Adding Metadata would be good.
>
> ParallelCorpus as subcategory of Corpus seems appropriate and
> useful aas just suggested in the telco (I picked that ;))
>
> Other than that, the subcategories of Corpus would be defined by
> the annotation layers the corpus contains, getting too fine-graned
> at the level of the cloud is difficult.
>
> In any case in the future I hope that we can dynamically generate
> different diagrams filtering by conditions, e.g. license,
> annotation layers available, language etc.
>
>
>
>
> Am 04.04.14 21:44, schrieb Christian Chiarcos:
>> Dear all,
>>
>> please find the first draft for the new LLOD cloud diagram attached.
>>
>> An important difference as compared to the last draft is that
>> *only datasets with links to other LLOD datasets are included*.
>> Data sets for which we could not read information from any of the
>> URLs given in Datahub responded were excluded.
>>
>> If you don't find your dataset displayed properly (or missing),
>> please check your Datahub entry!
>>
>> Differences as compared to last edition:
>> - Categories revised, now at two levels of granularity (feedback
>> please!)
>> - Novel data sets, including the datasets of LDL-2014
>> contributions and the associated data challenge
>> - Included linguistically relevant Datahub entries *not* marked
>> as ressources of the linguistics group (e.g., the Greek WordNet).
>> We extracted all Datahub entries with tags "llod",
>> "linguistics%20lod", "lexicon", "corpus", "thesaurus",
>> "linguistic", "linguistics", or "typology".
>> - Diagram pruning: Eliminate data sets not linked with other LLOD
>> data sets
>>
>> Known issues:
>> - Edge breadth and bubble size reflect the link/triple counts as
>> given in Datahub. Where this information is not found, edges are
>> missing or bubbles are equally sized.
>> - Datasets from the LREC Share Your Resources Initiative have not
>> been included yet. We can discuss at the telco next week whether
>> we want to prepare a May-2014 edition that covers this (and
>> other) data.
>>
>> All the best,
>> Christian
>>
>>
>> _______________________________________________
>> open-linguistics mailing list
>> open-linguistics at lists.okfn.org <mailto:open-linguistics at lists.okfn.org>
>> https://lists.okfn.org/mailman/listinfo/open-linguistics
>> Unsubscribe:https://lists.okfn.org/mailman/options/open-linguistics
>
>
> --
>
> Prof. Dr. Philipp Cimiano
>
> Phone:+49 521 106 12249 <tel:%2B49%20521%20106%2012249>
> Fax:+49 521 106 12412 <tel:%2B49%20521%20106%2012412>
> Mail:cimiano at cit-ec.uni-bielefeld.de <mailto:cimiano at cit-ec.uni-bielefeld.de>
>
> Forschungsbau Intelligente Systeme (FBIIS)
> Raum 2.307
> Universität Bielefeld
> Inspiration 1
> 33619 Bielefeld
>
>
> _______________________________________________
> open-linguistics mailing list
> open-linguistics at lists.okfn.org
> <mailto:open-linguistics at lists.okfn.org>
> https://lists.okfn.org/mailman/listinfo/open-linguistics
> Unsubscribe: https://lists.okfn.org/mailman/options/open-linguistics
>
>
>
>
> --
> Marta Villegas
> marta.villegas at gmail.com <mailto:marta.villegas at gmail.com>
>
>
> _______________________________________________
> open-linguistics mailing list
> open-linguistics at lists.okfn.org
> https://lists.okfn.org/mailman/listinfo/open-linguistics
> Unsubscribe: https://lists.okfn.org/mailman/options/open-linguistics
--
Prof. Dr. Philipp Cimiano
Phone: +49 521 106 12249
Fax: +49 521 106 12412
Mail: cimiano at cit-ec.uni-bielefeld.de
Forschungsbau Intelligente Systeme (FBIIS)
Raum 2.307
Universität Bielefeld
Inspiration 1
33619 Bielefeld
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-linguistics/attachments/20140409/d6e93e92/attachment-0003.html>
More information about the open-linguistics
mailing list