MT3S: Mobile Turkish Scene Text-to-Speech System for the Visually Impaired

Muhammet Bastan; Hilal Kandemir; Busra Canturk

arXiv:1608.05054 · cs.MM · August 18, 2016 · 2 cites

TL;DR

This paper presents MT3S, a mobile Turkish scene text-to-speech system for the visually impaired that combines fast multi-scale text detection with OCR to enable real-time reading on mobile devices.

Contribution

The paper introduces a mobile system for reading Turkish scene text that runs much faster than existing systems while keeping OCR accuracy comparable to the state of the art.

Findings

01 Runs in real time on a mobile device

02 Achieves OCR accuracy comparable to state-of-the-art systems

03 Runs much faster than the state-of-the-art systems it was compared with
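
The listing does not say which metric the authors used for OCR accuracy. As an illustration only, one common choice is character-level accuracy derived from the Levenshtein edit distance between the recognized text and the ground truth. The function names `edit_distance` and `char_accuracy` below are hypothetical and are not taken from the paper.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum inserts, deletes, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def char_accuracy(recognized: str, truth: str) -> float:
    """1 minus the edit distance normalized by truth length, floored at 0.

    Assumption: this is one common definition, not necessarily the paper's.
    """
    if not truth:
        return 1.0 if not recognized else 0.0
    return max(0.0, 1.0 - edit_distance(recognized, truth) / len(truth))
```

With this definition, an OCR output that drops a single Turkish diacritic (for example reading "ÇIKIŞ" as "ÇIKIS") counts as one substitution error out of five characters.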

Abstract

Reading text is one of the essential needs of visually impaired people. We developed a mobile system that can read Turkish scene and book text, using a fast gradient-based multi-scale text detection algorithm for real-time operation and the Tesseract OCR engine for character recognition. We evaluated the OCR accuracy and running time of our system on a new, publicly available mobile Turkish scene text dataset that we constructed, and compared it with state-of-the-art systems. Our system proved to be much faster and able to run on a mobile device, with OCR accuracy comparable to the state of the art.
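
The abstract names the detection approach (gradient-based, multi-scale) but gives no algorithmic detail. The sketch below is not the authors' algorithm; it only illustrates the general idea that text rows tend to contain many strong horizontal intensity changes, and that repeating the test on downsampled copies lets the same detector respond to larger text. All function names, the thresholds, and the scale set are hypothetical choices made for this example.

```python
from typing import List, Tuple

Image = List[List[int]]  # grayscale, row-major, values 0..255


def downsample(img: Image, factor: int) -> Image:
    """Keep every `factor`-th pixel in both directions (no smoothing)."""
    return [row[::factor] for row in img[::factor]]


def horizontal_gradient(img: Image) -> Image:
    """Absolute central difference along x; border columns stay 0."""
    out = []
    for row in img:
        g = [0] * len(row)
        for x in range(1, len(row) - 1):
            g[x] = abs(row[x + 1] - row[x - 1])
        out.append(g)
    return out


def text_bands(img: Image, grad_thresh: int,
               density_thresh: float) -> List[Tuple[int, int]]:
    """Group consecutive rows whose share of strong-gradient pixels is high."""
    bands, start = [], None
    grad = horizontal_gradient(img)
    for y, row in enumerate(grad):
        density = sum(v > grad_thresh for v in row) / max(len(row), 1)
        if density >= density_thresh:
            if start is None:
                start = y          # a new candidate band begins here
        elif start is not None:
            bands.append((start, y - 1))
            start = None
    if start is not None:          # band runs to the last row
        bands.append((start, len(grad) - 1))
    return bands


def detect_multiscale(img: Image, scales=(1, 2), grad_thresh: int = 60,
                      density_thresh: float = 0.3):
    """Run the band detector at each scale.

    Returns (scale, top_row, bottom_row) tuples in original-image rows.
    """
    found = []
    for s in scales:
        small = downsample(img, s)
        for top, bottom in text_bands(small, grad_thresh, density_thresh):
            found.append((s, top * s, bottom * s + s - 1))
    return found
```

In a full pipeline, each detected band would be cropped and handed to an OCR engine. With Tesseract on a desktop this could be done through the `pytesseract` wrapper, for example `pytesseract.image_to_string(crop, lang="tur")`, assuming the wrapper and the Turkish language data are installed. How the authors invoke Tesseract on the mobile device is not described in the abstract.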

Taxonomy

Topics: Handwritten Text Recognition Techniques · Tactile and Sensory Interactions · Vehicle License Plate Recognition