Anusaaraka: Overcoming the Language Barrier in India
Akshar Bharati, Vineet Chaitanya, Amba P. Kulkarni, Rajeev Sangal, G. Umamaheshwara Rao

TL;DR
Anusaaraka is a hybrid human-computer system that makes text in one Indian language accessible in another. The machine renders the source text in a language close to the target language, which users can learn to read in a short time and which a trained user can post-edit for grammatical correctness.
Contribution
It introduces a novel hybrid approach combining machine processing with human interpretation to bridge language barriers among Indian languages.
Findings
Users are expected to learn to read the output language in about 2 weeks, because the Indian languages are close
Supports post-editing for grammatical correctness and style customization
Currently supports multiple language pairs, including Telugu, Kannada, Marathi, Bengali, and Punjabi into Hindi
Abstract
The anusaaraka system makes text in one Indian language accessible in another Indian language. In the anusaaraka approach, the load is so divided between man and computer that the language load is taken by the machine, and the interpretation of the text is left to the man. The machine presents an image of the source text in a language close to the target language. In the image, some constructions of the source language (which do not have equivalents) spill over to the output. Some special notation is also devised. The user after some training learns to read and understand the output. Because the Indian languages are close, the learning time of the output language is short, and is expected to be around 2 weeks. The output can also be post-edited by a trained user to make it grammatically correct in the target language. Style can also be changed, if necessary. Thus, in this scenario, it…
Taxonomy
Topics: Multilingual Education and Policy · South Asian Studies and Conflicts · Linguistics, Language Diversity, and Identity
