Plan Optimization to Bilingual Dictionary Induction for Low-Resource Language Families
Arbi Haza Nasution, Yohei Murakami, Toru Ishida

TL;DR
This paper introduces a Markov Decision Process-based method to optimize the order of creating bilingual dictionaries for low-resource languages, significantly reducing costs and improving efficiency.
Contribution
It formalizes a plan optimization framework using MDP and Bayesian modeling to minimize costs in bilingual dictionary induction for low-resource languages.
Findings
Achieved 61.5% cost reduction over baseline plans.
Reduced total costs by 39.4% with the MDP approach.
Outperformed baseline methods in cost efficiency.
Abstract
Creating bilingual dictionary is the first crucial step in enriching low-resource languages. Especially for the closely-related ones, it has been shown that the constraint-based approach is useful for inducing bilingual lexicons from two bilingual dictionaries via the pivot language. However, if there are no available machine-readable dictionaries as input, we need to consider manual creation by bilingual native speakers. To reach a goal of comprehensively create multiple bilingual dictionaries, even if we already have several existing machine-readable bilingual dictionaries, it is still difficult to determine the execution order of the constraint-based approach to reducing the total cost. Plan optimization is crucial in composing the order of bilingual dictionaries creation with the consideration of the methods and their costs. We formalize the plan optimization for creating bilingual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Software Engineering Research · Speech and dialogue systems
