FastPOS: Language-Agnostic Scalable POS Tagging Framework Low-Resource Use Case
Md Abdullah Al Kafi, Sumit Kumar Banshal

TL;DR
FastPOS is a modular, language-agnostic transformer-based POS tagging framework that enables rapid adaptation to low-resource languages like Bangla and Hindi with minimal code changes, achieving high accuracy.
Contribution
The paper introduces a scalable, low-resource POS tagging framework that can be easily adapted across languages with minimal modifications, emphasizing its modular and open-source design.
Findings
Achieved over 96% token-level accuracy in Bangla and Hindi.
Demonstrated effective cross-lingual transfer with minimal code adjustments.
Identified challenges in dataset curation affecting POS category performance.
Abstract
This study proposes a language-agnostic transformer-based POS tagging framework designed for low-resource languages, using Bangla and Hindi as case studies. With only three lines of framework-specific code, the model was adapted from Bangla to Hindi, demonstrating effective portability with minimal modification. The framework achieves 96.85 percent and 97 percent token-level accuracy across POS categories in Bangla and Hindi while sustaining strong F1 scores despite dataset imbalance and linguistic overlap. A performance discrepancy in a specific POS category underscores ongoing challenges in dataset curation. The strong results stem from the underlying transformer architecture, which can be replaced with limited code adjustments. Its modular and open-source design enables rapid cross-lingual adaptation while reducing model design and tuning overhead, allowing researchers to focus on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsICT in Developing Communities · Natural Language Processing Techniques · Topic Modeling
