[open-linguistics] Question: replacing language codes in a SPARQL BIND statement?
Christian Chiarcos
chiarcos at informatik.uni-frankfurt.de
Sun Mar 13 11:09:33 UTC 2016
Dear all,
this is a general technical question, albeit one specific to working with
multilinguality issues in multiple lemon/ontolex dictionaries, hence I'm
asking here in the first place.
Imagine the following situation: I use the Russian DBnary (provided in a
slightly extended variant of the old lemon) and an ontolex dictionary for
Chalkan (with Russian glosses). Both provided by third parties, and I do
not want to manipulate the data prior to querying. Now, I want to use
DBnary to retrieve an English gloss for the Chalkan words in a single
SPARQL query.
If both dictionaries use the same xml:lang representation, this works
rather well (I skip the query for reasons of brevity): I bind the Russian
gloss from the Chalkan dictionary to variable ?ru and start searching
DBnary for a data property that assigns ?ru as literal.
It is more complicated, though, if both files use different language
codes, e.g., ISO-639-3 (rus) and ISO-639-2 (ru) for Russian, or if a
language code with region sub-tag is used (e.g., ru-RU). Is there any way
to use, say, BIND to bind the string value of ?ru to a new variable which
uses ISO-639-2 codes instead of the original ISO-639-3 (resp.
ISO-639-2+ISO-3166) code?
At the moment, I see only one way to solve this problem, i.e., using
FILTER, str() and a string comparison of both variables. This should be
fairly inefficient, though, as I presume the FILTER is applied only after
all potential bindings for both variables for Russian terms have been
determined.
Am I overlooking anything?
Best,
Christian
--
Prof. Dr. Christian Chiarcos
Applied Computational Linguistics
Johann Wolfgang Goethe Universität Frankfurt a. M.
60054 Frankfurt am Main, Germany
office: Robert-Mayer-Str. 10, #401b
mail: chiarcos at informatik.uni-frankfurt.de
web: http://acoli.cs.uni-frankfurt.de
tel: +49-(0)69-798-22463
fax: +49-(0)69-798-28931
More information about the open-linguistics
mailing list