A Probabilistic Model of Machine Translation
G.E. Miram, V.K. Petrov

TL;DR
This paper introduces a probabilistic model for machine translation that leverages parallel corpora to generate bilingual dictionaries with translation probabilities, enhancing translation quality.
Contribution
It presents a novel probabilistic approach trained on aligned parallel texts to automatically build bilingual dictionaries for machine translation.
Findings
Successfully trained on English-Russian corpora
Generated bilingual dictionaries with translation probabilities
Improved translation accuracy expected
Abstract
A probabilistic model for computer-based generation of a machine translation system on the basis of English-Russian parallel text corpora is suggested. The model is trained using parallel text corpora with pre-aligned source and target sentences. The training of the model results in a bilingual dictionary of words and "word blocks" with relevant translation probability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Biomedical Text Mining and Ontologies
