Mostrar el registro sencillo del ítem
gApp: a text preprocessing system to improve the neural machine translation of discontinuous multiword expressions
dc.contributor.author | Hidalgo Ternero, Carlos Manuel | |
dc.contributor.author | Zhou Lian, Xiaoqing | |
dc.date.accessioned | 2022-12-20T12:28:41Z | |
dc.date.available | 2022-12-20T12:28:41Z | |
dc.date.issued | 2022 | |
dc.identifier.uri | https://hdl.handle.net/10630/25650 | |
dc.description.abstract | In this paper we present research results with gApp, a text-preprocessing system designed for automati-cally detecting and converting discontinuous multiword expressions (MWEs) into their continuous forms so as to improve the performance of current neural machine translation systems (NMT) (see Hidalgo-Ternero, 2021 & 2022, Hidalgo-Ternero & Corpas Pastor, 2020, 2022a & 2022b, Hidalgo-Ternero, Lista, and Corpas Pastor, 2022, and Hidalgo-Ternero and Zhou-Lian, 2022a & 2022b). To test its effectiveness, eight experiments with several NMT systems such as DeepL, Google Translate, ModernMT and VIP have been carried out in different language directionalities (ES/FR/IT > ES/EN/DE/FR/IT/PT/ZH) for the trans-lation of somatisms, i.e., MWEs containing lexemes referring to human or animal body parts (Mellado Blanco, 2004). More specifically, we have analysed both flexible verb-noun idiomatic constructions (VNICs) and flexible verb + prepositional phrase (VPP) constructions. In this regard, the promising results obtained for these typologies of MWEs throughout experiments 1-8 will shed some light on new avenues for enhancing MWE-aware NMT systems. | es_ES |
dc.description.sponsorship | Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tech. | es_ES |
dc.language.iso | eng | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.subject | Traducción automática | es_ES |
dc.subject.other | Neural machine translation | es_ES |
dc.subject.other | Text-preprocessing system | es_ES |
dc.subject.other | Multiword expressions | es_ES |
dc.title | gApp: a text preprocessing system to improve the neural machine translation of discontinuous multiword expressions | es_ES |
dc.type | info:eu-repo/semantics/conferenceObject | es_ES |
dc.centro | Facultad de Filosofía y Letras | es_ES |
dc.relation.eventtitle | Translating and the Computer conference — TC44 | es_ES |
dc.relation.eventplace | Luxemburgo, Luxemburgo | es_ES |
dc.relation.eventdate | 24/11/2022 | es_ES |
dc.rights.cc | Atribución 4.0 Internacional | * |