Empowering OLAC Extension using Anusaaraka and Effective text processing using Double Byte coding
B Prabhulla Chandran Pillai

TL;DR
This paper discusses challenges in extending OLAC for Indian languages and explores solutions inspired by Chinese text processing and Anusaaraka systems to address these issues.
Contribution
It introduces novel approaches by analyzing Chinese text processing and Anusaaraka systems to overcome OLAC extension hurdles for Dravidian and Indian languages.
Findings
Identified key hurdles in OLAC extension for Indian languages
Analyzed Chinese text processing techniques for potential adaptation
Explored Anusaaraka system's applicability to Indian language processing
Abstract
The paper reviews the hurdles while trying to implement the OLAC extension for Dravidian / Indian languages. The paper further explores the possibilities which could minimise or solve these problems. In this context, the Chinese system of text processing and the anusaaraka system are scrutinised.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification · Speech and dialogue systems
