[open-linguistics] 2nd CfP: MomenT Workshop @ LREC 2018: Multilingualism at the intersection of Knowledge Bases and Machine Translation
Mihael
mihael.arcan at insight-centre.org
Thu Dec 14 08:23:11 UTC 2017
/[apologies for cross-postings]/
2nd Call for Papers: *Multilingualism at the intersection of Knowledge
Bases and Machine Translation* (1st MomenT Workshop)
**Date*: *May 12th, 2018. To be held as part of the 11th edition of the
Language Resources and Evaluation Conference (LREC), at the Phoenix
Seagaia Resort, Miyazaki, Japan.
*Website*: http://moment2018.insight-centre.org/
*Submission Deadline* : January 12th, 2018
*Submission page*: https://www.softconf.com/lrec2018/MomenT/
Workshop Description
The increasing availability of knowledge bases, generated by researchers
in the academia and industry, enables extensions of NLP applications
with the vast amount of background knowledge available in open formats.
However, most of the information that we find in knowledge bases, like
knowledge graphs, ontologies, taxonomies, thesauri, dictionaries or
terminological datasets, contain descriptions in natural language,
mostly only in English, which severely limits their exploitation.
Machine translation (MT) can be applied to automatically translate the
terminological expressionsinto multiple languages, but not without its
challenges. Usually, they contain domain-specific terminology, which is
notoriously difficult to deal with since it is underrepresented in the
MT training data.
On the other hand, growing attention has also been paid to the
integration of domain-specific knowledge into MT systems or CAT tools.
To reduce the post-editing effort involved in the translation process,
approaches that enhance the MT systems with existing
multilingualdomain-specific knowledge, (e.g. DBpedia, IATE or ETB) are
becoming increasingly popular. In this sense, the provision of
multilingual knowledge bases that could be integrated better into the MT
pipeline is a crucial step towards increasing the translation quality of
highly-specific texts.
We believe that in the same way that domain ontologies and terminologies
can benefit from NLP tools and MT systems that help to overcome the
language barrier, NLP tools and MT systems would also profit from the
specialized semantic knowledge and the terms that are captured in
multilingual ontologies or terminologies. The workshop, therefore, aims
at building a bridge between the knowledge base and machine translation
communities.
Objective
This workshop aims to offer a forum for discussions to researchers who
find themselves at the intersection of the two strands of research,
namely, knowledge representations of high-specialized domains
(represented by domain ontologies, terminologies, dictionaries) and MT,
where the intersection point is precisely multilingualism. On the one
hand, there is an impending need to cater for multilingual knowledge
resources that support NLP, content analytics or information extraction
across languages. On the other hand, there is a growing interest in
exploiting the knowledge captured in such resources for MT systems that
often lack the knowledge of domain-specific expressions.
We would like to bring together researchers from different communities
as well as other interested stakeholders from the industry and the
public sector in order to identify common interests, exploit synergies,
provide use cases, and share methods, tools and resources. We are
particularly interested to bring together researchers and practitioners
working on MT, specifically statistical (SMT) or neural (NMT), on the
one hand and experts on knowledge basesfrom various perspectives on the
other, in order to enable cross fertilisation and foster the creation of
innovative solutions that can only arise from interdisciplinary
collaborations.
Topics of Interest
The workshop invites the submission of papers reporting on research on
topics related to terminological/ontological resources processing in MT,
including, but not limited to:
*
Multilingual knowledge base generation with MT
*
Injection of multilingual knowledge into MT
*
Provenance of multiple multilingual resources for MT
*
Terminological variations and MT
*
MT used for cross-lingual information access, search and retrieval
*
Neural MT applied to terminology translation
*
Exploiting relational knowledge graphs for Neural MT
*
Morphological variations of terminological expressions in MT
*
Multilingual knowledge generation for under-resourced languages
*
Enriching MT with Linked Data (BabelNet, DBpedia, UMLS, ...)
*
Multilingual terminology and linguistic resource development,
integration and mapping in a Linked Data context
*
Multilingual Semantic Web and Linked Data use cases
Submission & Publication
We accept submission of both long (up to 8 pages) and short papers (up
to 4 pages) to be presented as long or short oral presentations at the
workshop. The papers of the workshop will be published as online
proceedings on the LREC 2018 website. Please check the LREC author’s kit
website <http://lrec2018.lrec-conf.org/en/submission/authors-kit/>for
the proceedings template.
In addition, we aim for a journal special issue as post-conference
proceedings in which a selected amount of papers presented at the
workshop will be published.
When submitting a paper from the START page
<https://www.softconf.com/lrec2018/MomenT/>, authors will be asked to
provide essential information about resources (in a broad sense, i.e.
also technologies, standards, evaluation kits, etc.) that have been used
for the work described in the paper or are a new result of your research.
For contact data, stylesheets, up-to-date details on submission and the
workshop itself, please consult the workshop website
(http://moment2018.insight-centre.org/).
Identify, Describe and Share your LRs!
Describing your LRs in the LRE Map is now a normal practice in the
submission procedure of LREC (introduced in 2010 and adopted by other
conferences). To continue the efforts initiated at LREC 2014 about
“Sharing LRs” (data, tools, web-services, etc.), authors will have the
possibility, when submitting a paper, to upload LRs in a special LREC
repository. This effort of sharing LRs, linked to the LRE Map for their
description, may become a new “regular” feature for conferences in our
field, thus contributing to creating a common repository where everyone
can deposit and share data.
As scientific work requires accurate citations of referenced work so as
to allow the community to understand the whole context and also
replicate the experiments conducted by other researchers, LREC 2018
endorses the need to uniquely Identify LRs through the use of the
International Standard Language Resource Number (ISLRN,www.islrn.org
<http://www.islrn.org>), a Persistent Unique Identifier to be assigned
to each Language Resource. The assignment of ISLRNs to LRs cited in LREC
papers will be offered at submission time.
Important Dates:
Paper submission deadline: January 12th2018
Notification of Acceptance: February 9th2018
Camera-ready paper: February 23rd2018
Workshop date: May 12th2018
Organizing Committee:
Mihael Arcan, National University of Ireland, Galway, Ireland
Elena Montiel-Ponsoda, Universidad Politécnica de Madrid, Spain
Darja Fiser, University of Ljubljana, Jožef Stefan Institute, CLARIN ERIC
Tatjana Gornostaja, Tilde, Latvia, ELRA, BDVA
For further information please contact: mihael.arcan at insight-centre.org
The workshop is supported by the CLARIN research infrastructure.
Programme Committee:
Guadalupe Aguado-de-Cea - Universidad Politécnica de Madrid, Spain
Paul Buitelaar - National University of Ireland, Galway, Ireland
Philipp Cimiano - Bielefeld University, Germany
Christian Chiarcos - Goethe-Universität, Germany
Christopher Crowhurst - United Language Group, USA
Béatrice Daille - University of Nantes, France
Brian Davis - Maynooth University, Ireland
Thierry Declerck - German Research Center for Artificial Intelligence
(DFKI), Germany
Arantza Diaz de Ilarraza Sanchez - University of the Basque Country, Spain
Mauro Dragoni - Fondazione Bruno Kessler (FBK), Italy
Tomaž Erjavec - Jožef Stefan Institute, Slovenia
Miquel Esplà-Gomis - Universitat d'Alacant, Spain
Gema Ramírez - Prompsit Language Engineering, Spain
Natalia Grabar - Université de Lille, France
Jorge Gracia - University of Zaragoza, Spain
Miloš Jakubiček - University in Brno / Lexical Computing Limited, Czech
Republic
John Judge - Dublin City University, Ireland
Ilan Kernerman - K Dictionaries, Israel
Iztok Kosem - Trojina, Institute for Applied Slovene Studies, Slovenia
Simon Krek - Jožef Stefan Institute, Slovenia
Els Lefever - Ghent University, Belgium
Gudrun Magnusdottir - ESTeam AB, Sweden / Coreon GmbH, Germany
Qun Liu - Dublin City University, Ireland & Chinese Academy of Sciences,
China
John P. McCrae - National University of Ireland, Galway, Ireland
Roberto Navigli - Sapienza University of Rome, Italy
Mārcis Pinnis - Tilde, Latvia
Laurette Pretorius - University of South Africa (UNISA), South Africa
Georg Rehm - German Research Center for Artificial Intelligence (DFKI),
Germany
Antonio Toral - University of Groningen, Netherlands
Marco Turchi - Fondazione Bruno Kessler (FBK), Italy
Špela Vintar - University of Ljubljana, Slovenia
--
Dr. Mihael Arcan
Postdoctoral Researcher at Unit for Natural Language Processing (UNLP)
Insight Centre for Data Analytics @ NUI Galway
http://nuig.insight-centre.org/unlp/people/members/mihael-arcan/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-linguistics/attachments/20171214/b60f4733/attachment-0002.html>
More information about the open-linguistics
mailing list