[open-linguistics] new colored LLOD diagram

Christian Chiarcos christian.chiarcos at web.de
Sat Sep 21 22:30:26 UTC 2013

Hi Tom,

> What's the criteria for inclusion?  It seems weird that DBpedia  
> qualifies,
> but Freebase doesn't when it's got everything DBpedia does plus Wordnet,
> data from the Long Now Foundation's Rosetta Project and other linguistic
> resources.

We had a brief discussion about criteria, without a clear result.* And  
right now, the criterion is just a pragmatic check of datahub entries and  
whether these are tagged for the group (except for obvious spam, and  
non-RDF data sets, look in the script). You're right, freebase could be  
included, as well. It only needs to be tagged as such on datahub.

* Personally, I think that DBpedia is somewhat borderline, too, as it  
contains very little specifically linguistic information, but it qualifies  
through its application in NLP and through its multi-lingual character:  
Even if originally not intended to be, the linking between different  
language versions yields something like a multilingual wordlist (which  
would be a linguistic resource in any case).

> Other things that caught my eye:
> - there seem to a lot of free-floating circles for a *linked* data  
> diagram.
>  Is this an issue with missing data or are these data sets really islands
> that are only linked internally?

Actually, the diagram is drawn from the datahub entries, and we did not  
verify the information given there. This is also true for the links. The  
metadata there is incomplete, and several times, I know that links should  
exist, but are not documented in the metadata, e.g., for the "simple"  
resources (cf. the MLOD submission:  
or for OLiA and lemonuby. Some are real islands, but if so, they should be  
linkable. John and me discussed whether these should be excluded, but as  
some of them are actually a concrete result of collaborations within the  
group whose linking was just not documented yet (think of simple), I  
thought we should keep them in there. The diagram built at MLODE-2012  
followed the same policy.
Of course, a better solution would be to be stricter and to work our way  
through the metadata. And this is something that really needs to be done,  
actually ...

> - DBpedia Live is redundant with DBpedia (English)

Possibly, but apparently, there are two datahub entries. Anything from the  
DBpedia folks on this ?

> - if the diagram is going to exist in an online version, it'd be cool to
> link the circles back to their CKAN entries (or their home pages)

Sure. In SVG, this is possible and we had this in the last hand-written  
diagram (Feb 2012). Right now, we generate GraphML and I haven't checked  
whether it supports hyperlinks.


More information about the open-linguistics mailing list