[open-linguistics] Linking different Wiktionary dumps

Judith Eckle-Kohler eckle-kohler at tk.informatik.tu-darmstadt.de
Fri Dec 16 09:30:55 UTC 2011


Dear all,

In the telco on resource creation on Dec 14, the question came up if it is possible to link different Wiktionary dumps.

After consulting with Christian Meyer, the answer to that question turned out to be twofold:

1) On article page level, different Wiktionary dumps can be linked, because Wiktionary page IDs (that is, the ID of the article page corresponding to the lemma) are unique and stable across multiple different dumps.

2) However, on sense level, different Wiktionary dumps CAN NOT be linked, because Wiktionary sense IDs are unique for a certain dump, but NOT stable across multiple different dumps.
The generation of IDs for the different concepts (senses) has been a crucial point of discussion between Christian Meyer and Sebastian Hellmann some time ago - there is no solution for this yet (although Christian Meyer has collected some ideas).

I put Cristian Meyer in copy - if you have any further questions related to that topic you can contact him directly, his email address is chmeyer at tk.informatik.tu-darmstadt.de.

Regards
Judith Eckle-Kohler

--
-------------------------------------------------------------------

Dr. Judith Eckle-Kohler
Postdoctoral Researcher
Ubiquitous Knowledge Processing (UKP Lab)
FB 20 Computer Science Department    
Technische Universität Darmstadt
Hochschulstr. 10, D-64289 Darmstadt, Germany
phone [+49] (0)6151 16-6166, fax -5455, room S2/02/B115
eckle-kohler at tk.informatik.tu-darmstadt.de
www.ukp.tu-darmstadt.de 
Web Research at TU Darmstadt (WeRC) www.werc.tu-darmstadt.de

------------------------------------------------------------------- 



More information about the open-linguistics mailing list