[open-linguistics] Creation of a joint linguistic LOD cloud

Pablo Mendes pablomendes at gmail.com
Mon Nov 21 15:56:56 UTC 2011


Sebastian,
Thanks for the pointer. I think it is a good idea to keep a moderated group.

However, I would still encourage people to tag *everything* related to NLP
as nlp, and if there is a need for another tag called "linguistics", or
whatever, people should tag as appropriate. Then, based on the datasets
with those tags, we could conduct a reviewing process that approves
datasets that pass our minimal requirements into the "linguistics" group.

This is the process followed by the general LOD Cloud, where everything
tagged "lod" gets reviewed for (manual) addition to the "lodcloud" group.
For the general case, we developed some guidelines and minimal requirements
here:
www4.wiwiss.fu-berlin.de/lodcloud/ckan/validator/levels.html

Cheers,
Pablo

On Mon, Nov 21, 2011 at 4:30 PM, Sebastian Hellmann <
hellmann at informatik.uni-leipzig.de> wrote:

> **
> Hi Pablo,
> Yes, that is our goal in the end. We already made a moderated group with
> open linguistics data sets:
> http://thedatahub.org/group/linguistics
> From time to time we are looking for data sets that are open to add it to
> this group. OpenCalais does not qualify for example as it is not open.
> The group is a little bit more strict than just tagging. We started with
> the criterion "open" for now.
> Currently there are 4 group moderators, but it should be more.
> The main reason why there are only 4 is probably because we don't know how
> to add people to be a moderator.
> @Jonathan: How can we add new moderators?
>
> Cheers,
> Sebastian
>
>
>
> On 11/21/2011 03:17 PM, Pablo Mendes wrote:
>
> Hi all,
> May I suggest that we use TheDataHub.org for cataloguing our datasets?
>
> I have already tagged a few datasets with NLP and LOD.http://thedatahub.org/dataset?q=nlp&tags=lod
>
> But there is a larger number of NLP data sets that are not yet LOD.
> http://thedatahub.org/dataset?q=nlp<http://thedatahub.org/dataset?q=nlp&tags=lod> <http://thedatahub.org/dataset?q=nlp&tags=lod>
>
>
> So, if you're interested in triplifying some data, that could be a good
> start. Say you are interested in part of speech data sets:http://thedatahub.org/dataset?q=part+of+speech
>
> Anybody can add entries to TheDataHub.org and it is the same platform used
> for the general-purpose LOD Cloud Diagram.
>
> Cheers,
> Pablo
>
>
> On Fri, Nov 18, 2011 at 11:12 AM, Sebastian Hellmann <hellmann at informatik.uni-leipzig.de> wrote:
>
>
>  Dear all,
> unfortunately I will not be able to attend the telco, as I am on holiday
> in December (1st-22nd) .
> The criteria for submitting data are not 100% strict. Here were is a
> summary of ideas:
> - Data which is not in RDF can be submitted to discuss about how to
> convert it to RDF.  The great advantage of Open resources is also that one
> person can provide the  data, while another person can triplify it.
> - The provided data should be "interlinkable" with other data sets. The
> exact definition of "interlinkable" was not yet tackled however. I think we
> have to refine based on the data that will be submitted.
> - I think there might also be the possibility to create a refinement chain
> with the different people that are in this group: 1. someone submits (open)
> data 2. somebody else triplifies it 3. somebody else interlinks it 4.
> somebody else hosts it 5. somebody else uses it to build an application ;)
>
> We also agreed that we will make a Linguistic LOD image in the style of:http://lod-cloud.net/
>
> All the best,
> Sebastian
>
>
>
> On 11/08/2011 10:53 AM, Sebastian Nordhoff wrote:
>
>
>  On Tue, 08 Nov 2011 08:55:25 +0100, Pablo Mendes <pablomendes at gmail.com> <pablomendes at gmail.com>
> wrote:
>
>  Hi all,
>
>   That's great news, and I'd really like to get a glimpse on the RDF data
>
>  ... but as no one so far has jumped on the discussion so far, and as you
> are not yet distributing the tool, I suggest that we continue to talk
> about this off-list.
>
>
>  I am also interested. :)
>
>
>  Hi Pablo,
> that's great
>
>  I am able to contribute a few datasets annotated with DBpedia resources,
>
>  but some of them were created by other researchers, and I'd have to
> double
> check licensing with them.
>
> I also know of the the NERD ontology, which maps entity types used by
> entity recognition and disambiguation systems such as Alchemy API,
> Zemanta
> and DBpedia Spotlight.
>
> We plant to generate a larger dataset from these, but Dec is too short
> of a
> time frame for us.
>
>
>  If you only have a smaller, preliminary set by December, that would also
> be welcome
> Best
> Sebastian
>
>
>
>
>  Best
> Pablo
>
>
>
> --
> Dipl. Inf. Sebastian Hellmann
> Department of Computer Science, University of Leipzig
> Projects: http://nlp2rdf.org , http://dbpedia.org
>
> Homepage: http://bis.informatik.uni-**leipzig.de/SebastianHellmann<http://bis.informatik.uni-leipzig.de/SebastianHellmann> <http://bis.informatik.uni-leipzig.de/SebastianHellmann>
> Research Group: http://aksw.org
>
>
>
> ______________________________**_________________
> open-linguistics mailing listopen-linguistics at lists.okfn.**org <open-linguistics at lists.okfn.org> <open-linguistics at lists.okfn.org>http://lists.okfn.org/mailman/**listinfo/open-linguistics<http://lists.okfn.org/mailman/listinfo/open-linguistics> <http://lists.okfn.org/mailman/listinfo/open-linguistics>
>
>
> _______________________________________________
> open-linguistics mailing listopen-linguistics at lists.okfn.orghttp://lists.okfn.org/mailman/listinfo/open-linguistics
>
>
>
> --
> Dipl. Inf. Sebastian Hellmann
> Department of Computer Science, University of Leipzig
> Projects: http://nlp2rdf.org , http://dbpedia.org
> Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
> Research Group: http://aksw.org
>
>
> _______________________________________________
> open-linguistics mailing list
> open-linguistics at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/open-linguistics
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-linguistics/attachments/20111121/52c64946/attachment-0001.html>


More information about the open-linguistics mailing list