Visual Words for Automatic Lip-Reading

Ahmad Basheer Hassanat

arXiv:1409.6689·cs.CV·September 24, 2014·23 cites

Visual Words for Automatic Lip-Reading

Ahmad Basheer Hassanat

PDF

Open Access

TL;DR

This paper introduces a new visual words-based approach for automatic lip reading, including novel face and lip localization techniques, aiming to improve visual speech recognition systems.

Contribution

It proposes a novel visual words framework with automatic face and lip localization methods for enhanced lip reading automation.

Findings

01

Developed a new face localization scheme

02

Created a lip localization method

03

Improved accuracy of visual speech recognition

Abstract

Lip reading is used to understand or interpret speech without hearing it, a technique especially mastered by people with hearing difficulties. The ability to lip read enables a person with a hearing impairment to communicate with others and to engage in social activities, which otherwise would be difficult. Recent advances in the fields of computer vision, pattern recognition, and signal processing has led to a growing interest in automating this challenging task of lip reading. Indeed, automating the human ability to lip read, a process referred to as visual speech recognition, could open the door for other novel applications. This thesis investigates various issues faced by an automated lip-reading system and proposes a novel "visual words" based approach to automatic lip reading. The proposed approach includes a novel automatic face localisation scheme and a lip localisation method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Face recognition and analysis · Face and Expression Recognition