Identification of Fertile Translations in Medical Comparable Corpora: a Morpho-Compositional Approach
Estelle Delpech (LINA), Béatrice Daille (LINA), Emmanuel Morin (LINA), Claire Lemaire

TL;DR
This paper presents a morpho-compositional method for extracting translations, including fertile ones, from biomedical comparable corpora. It uses morpheme-level translation equivalences and improves the quality of the extracted English-to-French lexicon.
Contribution
It introduces a novel approach that generates fertile translations using compositional translation and morpheme-level equivalences, improving lexicon extraction in the biomedical domain.
Findings
Fertile translations increase the overall quality of the extracted lexicon.
The method generates translations for a large variety of morphologically constructed words.
The improvement is shown for English-to-French translation in the biomedical domain.
Abstract
This paper defines a method for lexicon extraction in the biomedical domain from comparable corpora. The method is based on compositional translation and exploits morpheme-level translation equivalences. It can generate translations for a large variety of morphologically constructed words and can also generate 'fertile' translations. We show that fertile translations increase the overall quality of the extracted lexicon for English to French translation.
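The idea of compositional translation over morphemes can be illustrated with a minimal Python sketch. It is not the authors' implementation. The morpheme table, the greedy segmentation, and the function names are all hypothetical. The sketch also assumes that a 'fertile' translation is one whose target term has more words than the source term, which the abstract does not spell out.

```python
from itertools import product

# Hypothetical toy table of English -> French morpheme equivalences.
# Entries are illustrative only and are not taken from the paper.
MORPHEME_TABLE = {
    "cyto": ["cyto", "cellule"],
    "toxic": ["toxique"],
}


def split_morphemes(word, table):
    """Greedy longest-match segmentation; returns None if the word cannot be covered."""
    parts = []
    i = 0
    while i < len(word):
        for j in range(len(word), i, -1):
            if word[i:j] in table:
                parts.append(word[i:j])
                i = j
                break
        else:
            # No known morpheme starts at position i.
            return None
    return parts


def candidate_translations(word, table):
    """Translate each morpheme, then recombine as one word or as several words."""
    parts = split_morphemes(word, table)
    if not parts:
        return []
    candidates = set()
    for combo in product(*(table[p] for p in parts)):
        candidates.add("".join(combo))   # single-word candidate
        candidates.add(" ".join(combo))  # multi-word candidate
    return sorted(candidates)


def is_fertile(source, target):
    """Assumed definition: the target term has more words than the source term."""
    return len(target.split()) > len(source.split())


if __name__ == "__main__":
    for cand in candidate_translations("cytotoxic", MORPHEME_TABLE):
        print(cand, "(fertile)" if is_fertile("cytotoxic", cand) else "")
```

A real system would still need to filter such candidates, for instance by keeping only those attested in the target-language corpus. That filtering step is left out here.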
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Translation Studies and Practices
