Maraim Masoud, Daniel Torregrosa, Paul Buitelaar, Mihael Arčan
Refereed Conference Meeting Proceeding
Machine translation has recently made significant progress in translation quality. However, most research has focused on parallel corpora whose source and target sides are purely monolingual, even though code-switching is now very common in everyday communication. Despite the importance of handling code-switching in translation, existing machine translation systems fail to accommodate code-switched content. In this paper, we examine the phenomenon of code-switching in machine translation for low-resource languages. We evaluate the performance of our systems under different approaches and make some observations about the role of code-mixing in the available corpora.
27th AIAI Irish Conference on Artificial Intelligence and Cognitive Science
Digital Object Identifier (DOI):
National University of Ireland, Galway (NUIG)
Open access repository: