19th Workshop on Multiword Expressions (MWE 2023)

Colocated with: EACL 2023 (Dubrovnik, Croatia)

Date of the Workshop: 5 or 6 May 2023

Organised and sponsored by:
Special Interest Group on the Lexicon (SIGLEX) of the Association for Computational Linguistics (ACL)


News


Contents on this page

Proceedings and video recording

TBD


Program

Tentative schedule:

08:30–09:00 Registration
09:00–10:30 Session 1
10:30–11:15 Morning coffee break
11:15–12:45 Session 2
12:45–14:15 Lunch break
14:15–15:45 Session 3
15:45–16:30 Afternoon coffee break
16:30–18:00 Session 4

Keynote speakers

Leo Wanner, Universitat Pompeu Fabra

Bio: Leo Wanner earned his Diploma in Computer Science from the University of Karlsruhe and his PhD in Computational Linguistics from the University of The Saarland, Germany. Prior to joining ICREA he held positions at the German National Centre for Computer Science (GMD), University of Waterloo, the University of Stuttgart and the Pompeu Fabra University, Barcelona. As visiting researcher, he was affiliated with U of Montreal, U of Sydney, U of Southern California’s Institute for Information Sciences, U Paris 7, Columbia University, and U of Augsburg. Throughout his career, Leo has been involved as Principal Investigator in numerous of national and European research projects. He has published 10 volumes and more than 230 peer reviewed papers. He is Associate Editor of the Computational Intelligence and Frontiers in AI, Language and Computation journals and serves as regular reviewer for a number of high profile conferences and journals in the field.

TBD


Description

Multiword expressions (MWEs) are word combinations that exhibit lexical, syntactic, semantic, pragmatic, and/or statistical idiosyncrasies (Baldwin & Kim 2010), such as by and large, hot dog, pay a visit and pull one’s leg. The notion encompasses closely related phenomena: idioms, compounds, light-verb constructions, phrasal verbs, rhetorical figures, collocations, institutionalised phrases, etc. Their behaviour is often unpredictable; for example, their meaning often does not result from the direct combination of the meanings of their parts. Given their irregular nature, MWEs often pose complex problems in linguistic modelling (e.g. annotation), NLP tasks (e.g. parsing), and end-user applications (e.g. natural language understanding and MT), hence still representing an open issue for computational linguistics (Constant et al. 2017).

For almost two decades, modelling and processing MWEs for NLP has been the topic of the MWE workshop organised by the MWE section of SIGLEX in conjunction with major NLP conferences since 2003. Impressive progress has been made in the field, but our understanding of MWEs still requires much research considering their need and usefulness in NLP applications. This is also relevant to domain-specific NLP pipelines that need to tackle terminologies most often realised as MWEs. Following previous years, for this 19th edition of the workshop, we identified the following topics on which contributions are particularly encouraged:

Through this workshop, we would like to bring together and encourage researchers in various NLP subfields to submit MWE-related research, so that approaches that deal with processing of MWEs including processing for low-resource languages and for various applications can benefit from each other. We also intend to consolidate the converging effects of previous joint workshops LAW-MWE-CxG 2018, MWE-WN 2019 and MWE-LEX 2020, the joint MWE-WOAH panel in 2021, and the MWE-SIGUL 2022 joint session, extending our scope to MWEs in e-lexicons and WordNets, MWE annotation, as well as grammatical constructions. Correspondingly, we call for papers on research related (but not limited) to MWEs and constructions in:


Shared task

We do not have a shared task this year, but a new release of the PARSEME corpus of verbal MWEs is currently underway. We encourage submission of research papers that include analyses of the new edition of the PARSEME data and improvements over the results for PARSEME 2020 shared task as well as SemEval 2022 task 2 on idiomaticity prediction.


Special track on MWEs in clinical NLP

Pursuing the MWE Section’s tradition of synergies with other communities, this year, we are organizing a joint session with the Clinical NLP workshop for shared papers/poster presentations. Since clinical texts contain an important amount of multiword expressions (e.g. medical terms or domain-specific collocations), a joint session is deemed beneficial for both communities. The goal is to foster future synergies that could address scientific challenges in the creation of resources, models and applications to deal with multiword expressions and related phenomena in the specialised domain of ClinicalNLP. Submissions describing research on MWEs in the specialized domain of ClinicalNLP, especially introducing new datasets or new tools and resources, are welcome. Papers accepted in this track will have the option to present their work in the Clinical NLP workshop at ACL 2023 as well, after being presented at MWE 2023.


Best paper award

All full papers in the workshop will be considered by the program committee for a best paper award.


Submission formats

The workshop invites two types of submissions:


Paper submission and templates

Papers should be submitted via the workshop’s START submission page. Please choose the appropriate submission format (archival/non-archival). Archival papers with existing reviews will also be accepted through the ACL Rolling Review. Submissions must follow the ACL 2023 stylesheet.

Archival papers with existing reviews from ACL Rolling Review will also be considered. A paper may not be simultaneously under review through ARR and MWE. A paper that has or will receive reviews through ARR may not be submitted for review to MWE.


Important dates

What When
Paper submission deadline 13 February 2023
ARR commitment deadline 6 March 2023
Notification of acceptance 13 March 2023
Camera-ready papers due 27 March 2023
Workshop 5 or 6 May 2023

All deadlines are at 23:59 UTC-12 (Anywhere on Earth).


Organizing Committee

What Who
Program chairs Marcos Garcia, Voula Giouli, Shiva Taslimipoor, Lifeng Han
Publication chair Archna Bhatia
Coordination and communication chair Voula Giouli
Publicity chair Kilian Evang

Program Committee (Preliminary)


Anti-harassment policy

The workshop follows the ACL anti-harassment policy.


Contact

For any inquiries regarding the workshop please send an email to the Organizing Committee at mweworkshop2023@googlegroups.com

Please register to SIGLEX and check the “MWE Section” box to be registered to our mailing list.