TeLCoS: OnDevice Text Localization with Clustering of Script
Rachit S Munjal, Manoj Goyal, Rutika Moharir, Sukumar Moharana

TL;DR
TeLCoS is a lightweight, real-time on-device text localization and script clustering method that reduces overhead by integrating script identification into the localization process, suitable for resource-constrained environments.
Contribution
It introduces a dual-branch CNN that performs text localization and script clustering simultaneously, with a novel channel pruning mechanism for efficiency.
Findings
Achieves state-of-the-art performance on benchmark datasets.
Runs in 60 ms on Exynos 990 chipset device.
Uses only 1.15 million parameters for the network.
Abstract
Recent research in the field of text localization in a resource constrained environment has made extensive use of deep neural networks. Scene text localization and recognition on low-memory mobile devices have a wide range of applications including content extraction, image categorization and keyword based image search. For text recognition of multi-lingual localized text, the OCR systems require prior knowledge of the script of each text instance. This leads to word script identification being an essential step for text recognition. Most existing methods treat text localization, script identification and text recognition as three separate tasks. This makes script identification an overhead in the recognition pipeline. To reduce this overhead, we propose TeLCoS: OnDevice Text Localization with Clustering of Script, a multi-task dual branch lightweight CNN network that performs real-time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
