[open-linguistics] Fwd: CfP: 2nd Workshop on Technologies for MT of Low Resource Languages (LoResMT 2019 at MT Summit XVII)
Atul Kr. Ojha
shashwatup9k at gmail.com
Tue Mar 19 15:25:36 UTC 2019
Thanks and regards,
Atul
---------- Forwarded message ---------
From: Chao-Hong Liu <chaohong.liu at adaptcentre.ie>
Date: Tue, 19 Mar 2019 at 20:46
Subject: CfP: 2nd Workshop on Technologies for MT of Low Resource Languages
(LoResMT 2019 at MT Summit XVII)
To:
[Apologies for multiple postings]
--------------------------------------------------------------------------------
2nd Workshop on Technologies for MT of Low Resource Languages (LoResMT
2019)
The Helix, DCU, Dublin, August 20, 2019
https://sites.google.com/view/loresmt/
@ MT Summit 2019 (https://www.mtsummit2019.com/)
--------------------------------------------------------------------------------
BRIEF
1. Call for papers: Submission due on "May 24, 2019"
2. Please find below the proceeding and slides of LoResMT 2018. Works on
several low resource languages, e.g. Filipino, Finnish, Irish, Latvian,
Mongolian, Quechua, Tibetan, and Uyghur, are presented.
https://amtaweb.org/wp-content/uploads/2018/03/AMTA_2018_Workshop_Proceedings_LoResMT.pdf
https://sites.google.com/view/loresmt-2018/
SCOPES
Machine translation (MT) technologies have been improved significantly in
the last two decades, with the developments on phrased-based statistical MT
(SMT) and recently the neural MT (NMT). However, most of these methods rely
on the availability of large parallel data (millions to tens of millions
sentence pairs) in the training, which are resources that do not exist in
many language pairs.
In addition, MT methods still rely on a few natural language processing
(NLP) tools to help pre-process human generated texts in the forms that are
required as input for these methods, and/or post-process the output in
proper textual forms in target languages. In many MT systems, the
performance of these tools has great impacts on the quality of the
resulting translation. These NLP tools include, but not limited to, several
kinds of word tokenizers/de-tokenizers, word segmenters, morphology
analyzers, etc.
The workshop solicits papers on MT systems/methods for low resource
languages in general. We also solicit papers dedicated to these
supplementary NLP tools that are used in any language and especially in low
resource languages. We would like to have an overview of research on MT for
low resource languages and these NLP tools from our community.
TOPICS
We solicit original research papers, review papers, and position papers on
MT research for low resource languages in the workshop. Multilingual and/or
cross-lingual NLP tools for low resource languages are especially welcome.
Topics of the workshop include but are not limited to:
- Research and review papers of pre-processing and/or post-processing NLP
tools for MT
- Position papers on the development of pre-processing and/or
post-processing tools for MT
- Word tokenizers/de-tokenizers for specific languages
- Word/morpheme segmenters for specific languages
- Alignment/Re-ordering tools for specific language pairs
- Use of morphology analysers and/or morpheme segmenters in MT
- Multilingual/cross-lingual NLP tools for MT
- Re-usability of existing NLP tools for low resource languages
- Corpora creation and curation technologies for low resource languages
- Review of available parallel corpora for low resource languages
- Research and review papers of MT methods for low resource languages
- MT systems/methods (e.g. rule-based, PBMT, NMT) for low resource languages
- Pivot MT for low resource languages
- Zero-shot MT for low resource languages
- Fast building of MT systems for low resource languages
- Reusability of existing MT systems for low resource languages
- Machine translation for language preservation
SUBMISSION INFORMATION
Workshop papers should adhere to MT Summit 2019 style guide (LaTeX,
OpenOffice, Word).
https://www.mtsummit2019.com/submissions
There are two types of submissions in the workshop. For research, review
and position papers, the length of each paper should be at least four (4)
and not exceed eight (8) pages, plus unlimited pages for references. More
pages would be allowed as long as it could be justified. The review will be
double-blinded. For non-archival system demonstration abstracts, the limit
is four (4) pages. The review will be single-blind.
We would like to encourage authors to cite papers written in ANY language
that are related to the topics, as long as both original bibliographic
items and their corresponding English translations are provided.
IMPORTANT DATES
March 19, 2019: Call for papers
April 19, 2019: 2nd Call for papers
May 24, 2019: Submission deadline of workshop papers
June 21, 2019: Notification of acceptance
July 12, 2019: Camera-ready papers due
July 19, 2019: Workshop proceeding on-line
August 21, 2018: LoResMT workshop
ORGANIZERS (listed alphabetically)
Alina Karakanta FBK-Fondazione Bruno Kessler
Atul Kr. Ojha Panlingua Language Processing LLP/Jawaharlal Nehru
University
Chao-Hong Liu ADAPT Centre, Dublin City University
Jonathan Washington Swarthmore College
Nathaniel Oco National University (Philippines)
Surafel Melaku Lakew FBK-Fondazione Bruno Kessler
Valentin Malykh Moscow Institute of Physics and Technology
Xiaobing Zhao Minzu University of China
CONTACT
chaohong.liu at adaptcentre.ie
LoResMT @ MT Summit 2019
https://sites.google.com/view/loresmt/
LoResMT @ AMTA 2018
https://sites.google.com/view/loresmt-2018/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-linguistics/attachments/20190319/8418b436/attachment-0001.html>
More information about the open-linguistics
mailing list