How can Paidiom improve the neural machine translation of idioms?

dc.centroFacultad de Filosofía y Letrases_ES
dc.contributor.authorHidalgo Ternero, Carlos Manuel
dc.date.accessioned2023-12-01T11:43:33Z
dc.date.available2023-12-01T11:43:33Z
dc.date.issued2023
dc.departamentoTraducción e Interpretación
dc.description.abstractIn this paper we present research results with Paidiom, a text-preprocessing algorithm designed for 1) converting discontinuous multiword expressions (MWEs) into their continuous forms and 2) translemmatising them, i.e., converting source-text MWEs into their target-text equivalents, in order to improve the performance of current neural machine translation (NMT) systems. To test its effectiveness, an experiment with the NMT systems of VIP, Google Translate and DeepL has been carried out in the ES>EN translation direction with Verb-Noun Idiomatic Constructions (VNICs) in Spanish. The performance of Paidiom was compared to both the one of our previous algorithm (gApp) and to the manual conversion (our gold standard). In this regard, the promising results yielded by this study, the first one analysing Paidiom’s performance, will shed some light on new avenues for enhancing MWE-aware NMT systems.es_ES
dc.description.sponsorshipUniversidad de Málaga. Campus de Excelencia Internacional Andalucía Tech.es_ES
dc.identifier.urihttps://hdl.handle.net/10630/28196
dc.language.isoenges_ES
dc.relation.eventdate20/11/2023-22/11/2023es_ES
dc.relation.eventplaceLuxemburgo (Luxemburgo)es_ES
dc.relation.eventtitleTranslating and the Computer conference — TC45es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectTraducción automáticaes_ES
dc.subjectModismos - Traducción automáticaes_ES
dc.subjectTraducción - Innovaciones tecnológicases_ES
dc.subject.otherNeural machine translationes_ES
dc.subject.otherText-preprocessing systemes_ES
dc.subject.otherMultiword expressionses_ES
dc.titleHow can Paidiom improve the neural machine translation of idioms?es_ES
dc.typeconference outputes_ES
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
HidalgoTernero_TC45.pdf
Size:
182.44 KB
Format:
Adobe Portable Document Format
Description: