Machine Translation Summit XIV, Nice, France
2 - 6 September 2013

Multi-word Units in Machine Translation and Translation Technology

Call for Papers

Workshop at MT Summit XIV in Nice, France
September 3, 2013

Submission deadline: 15 June 2013 at 24:00 CET (GMT+1)

Machine Translation (MT) has evolved along with different types of computer-assisted translation tools and a notable progress has been achieved in improving the quality of translations. However, in spite of the recent positive developments in translation technologies, not all problems have been solved and in particular the identification, interpretation and translation of multi-word units (MWUs) still represent open challenges, both from a theoretical and a practical point of view. The low standard of analysis and translation of MWUs in translation technologies suggest that there is the need to invest in further research with the goal of improving the performance of the various translation applications.

Multi-word units (MWUs) are a complex linguistic phenomenon, ranging from lexical units with a relatively high degree of internal variability to expressions that are frozen or semi-frozen. Such units are very frequent both in everyday language and in languages for special purposes. Their interpretation and translation sometimes present unexpected obstacles even to human translators, mainly because of intrinsic ambiguities, structural and lexical asymmetries between languages, and, finally, cultural differences.

The current theoretical work on this topic deals with different formalisms and techniques relevant for MWU processing in MT as well as other translation applications, such as: automatic recognition of MWUs in a monolingual or bilingual setting; alignment and paraphrasing methodologies; development, features and usefulness of handcrafted monolingual and bilingual linguistic resources and grammars; use of MWUs in Statistical Machine Translation (SMT) domain adaptation, as well as empirical work concerning their modelling accuracy and descriptive adequacy across various language pairs. At the practical level, the issue of MWU has been addressed in various MT approaches, whether knowledge-based, statistical (word-based, phrase-based or factored-based) or hybrid. In general, MWU identification and translation problems are far from being solved and there is still considerable room for improvement. There is a recent growing attention to MWU processing in MT and Translation Technologies, as it has been acknowledged that it is not possible to create large-scale applications without properly handling MWUs of all kinds.

The focus of this workshop is to address the MWU issue in a synergetic way, taking advantage of the recent developments in disciplines such as Linguistics, Translation Studies, Computational Linguistics, and Computational Phraseology.

The main aim of the Workshop is, therefore, to bring together researchers working on various aspects of MWU processing in different disciplines, in order to discuss and propose innovative ideas and methods in relation to MT and Translation Technologies. In particular, this workshop welcomes the exchange of interactions between researchers in NLP working on the computational treatment of multi-word units, experts in phraseology (including computational phraseology) working on challenging topics of their discipline, as well as translation practitioners, to the benefit of applying their latest results to advance the state of the art in MWU translation.

TOPICS

We invite the submissions of papers reporting original and unpublished research on multi-word unit (MWU) processing in Machine Translation and Translation Technologies. We encourage the representation of a broad range of areas including, but not limited to, the following:

Lexical, syntactic, semantic and translational aspects in MWU representation
Identification and acquisition of multi-word terms and their variants
Automatic extraction of (multilingual) MWU resources
MWUs in Computer-Assisted Translation
Development and use of handcrafted MWU linguistic resources in MT
Paraphrasing of MWUs applied to improving MT
MWUs and word alignment techniques
Identification and acquisition of non-compositional items
Learning semantic information about MWUs from monolingual, parallel or comparable corpora
Multilingualism and MWU processing
Integration of MWUs into rule-based and statistical MT
MT evaluation focused on MWUs and novel automatic metrics handling MWUs
Creation of MWU-annotated corpora with a focus on translation aspects

PROGRAMME COMMITTEE

Iñaki Alegria (University of the Basque Country, Spain)
Giuseppe Attardi (University of Pisa, Italy)
Doug Arnold (University of Essex, United Kingdom)
Francis Bond (Nanyang Technological University, Singapore)
Bruno Cartoni (University of Geneva, Switzerland)
Jean-Pierre Colson (Université catholique de Louvain, Belgium)
Béatrice Daille (Nantes University, France)
Mona Diab (Columbia University, USA)
Gaël Dias (University of Caen Basse-Normandie, France)
Dmitrij O. Dobrovol'skij (Russian Academy of Sciences, Russia)
Annibale Elia (University of Salerno, Italy)
Thierry Fontenelle (Translation Centre for the Bodies of the European Union, Luxembourg)
Roxana Girju (University of Illinois at Urbana-Champaign, USA)
Barry Haddow (University of Edinburgh, United Kingdom)
Ulrich Heid (Universität Hildesheim, Germany)
Kyo Kageura (University of Tokyo, Japan)
Valia Kordoni (Humboldt University Berlin, Germany)
Koenraad Kuiper (University of Canterbury, New Zealand)
Guy Lapalme (University of Montreal, Canada)
Preslav Nakov (Qatar Computing Research Institute, Qatar Foundation, Qatar)
Pavel Pecina (Charles University in Prague, Czech Republic)
Carlos Ramisch (University of Grenoble, France)
Johann Roturier (Symantec Ltd., Ireland)
Gilles Serasset (University of Grenoble, France)
Max Silberztein (University of Franche-Comté, France)
Dan Tufis (Romanian Academy, Romania)
Agnes Tutin (University of Grenoble, France)
Michael Zock (Aix-Marseille University, France)

WORKSHOP CHAIRS

Ruslan Mitkov (University of Wolverhampton, United Kingdom)
Johanna Monti (University of Sassari, Italy)
Gloria Corpas Pastor (University of Málaga, Spain)
Violeta Seretan (University of Geneva, Switzerland)

CONTACT

For any inquiries regarding the workshop please send an email to Johanna Monti (jmonti@uniss.it).

SUBMISSION OF PAPERS

The workshop will use electronic submission through the EasyChair conference tool. Please use the following link:
https://www.easychair.org/conferences/?conf=mumttt2013
Double submission policy: Parallel submission to other meetings or publications are possible but must be immediately notified to the workshop contact person (see above).

GUIDELINES FOR FORMATTING OF PAPERS

Submissions should be anonymised (no authors, affiliations or addresses, and no explicit self-reference)
Submissions/camera-ready should be no longer than eight (8) pages (A4 size)
Format required: PDF
Initial versions of papers must conform to the format defined by the EAMT templates available below.

FORMATTING TEMPLATES

Latex

Style file: mtsxiv.sty
Bibliography style: mtsxiv.bst
Sample document: mtsxiv.pdf
Source of the sample document: mtsxiv.tex

Microsoft Word

Document template: mtsxiv.dot for Papers
Sample document: mtsxiv.doc for Papers

IMPORTANT DATES

Submission deadline: 15 June 2013
Acceptance notification: 10 July 2013
Camera-ready version: 1 August 2013
Workshop: 3 September 2013