Empowering OLAC Extension using Anusaaraka and Effective text processing   using Double Byte coding

B Prabhulla Chandran Pillai

arXiv:0909.1147·cs.CL·September 8, 2009

Empowering OLAC Extension using Anusaaraka and Effective text processing using Double Byte coding

B Prabhulla Chandran Pillai

PDF

Open Access

TL;DR

This paper discusses challenges in extending OLAC for Indian languages and explores solutions inspired by Chinese text processing and Anusaaraka systems to address these issues.

Contribution

It introduces novel approaches by analyzing Chinese text processing and Anusaaraka systems to overcome OLAC extension hurdles for Dravidian and Indian languages.

Findings

01

Identified key hurdles in OLAC extension for Indian languages

02

Analyzed Chinese text processing techniques for potential adaptation

03

Explored Anusaaraka system's applicability to Indian language processing

Abstract

The paper reviews the hurdles while trying to implement the OLAC extension for Dravidian / Indian languages. The paper further explores the possibilities which could minimise or solve these problems. In this context, the Chinese system of text processing and the anusaaraka system are scrutinised.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Speech and dialogue systems