English-Catalan Neural Machine Translation in the Biomedical Domain through the cascade approach
Marta R. Costa-juss\`a, Noe Casas, Maite Melero

TL;DR
This paper presents a neural machine translation system for English-Catalan biomedical texts using a cascade approach via Spanish, addressing low-resource challenges and introducing a new test dataset.
Contribution
It introduces a cascade pivot strategy for English-Catalan biomedical translation and provides a new test dataset for evaluation.
Findings
Effective translation performance demonstrated
New English-Catalan biomedical test dataset created
Cascade approach improves low-resource translation quality
Abstract
This paper describes the methodology followed to build a neural machine translation system in the biomedical domain for the English-Catalan language pair. This task can be considered a low-resourced task from the point of view of the domain and the language pair. To face this task, this paper reports experiments on a cascade pivot strategy through Spanish for the neural machine translation using the English-Spanish SCIELO and Spanish-Catalan El Peri\'odico database. To test the final performance of the system, we have created a new test data set for English-Catalan in the biomedical domain which is freely available on request.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
