ExpresSense: Exploring a Standalone Smartphone to Sense Engagement of   Users from Facial Expressions Using Acoustic Sensing

Pragma Kar; Shyamvanshikumar Singh; Avijit Mandal; Samiran; Chattopadhyay; Sandip Chakraborty

arXiv:2301.06762·cs.HC·January 18, 2023

ExpresSense: Exploring a Standalone Smartphone to Sense Engagement of Users from Facial Expressions Using Acoustic Sensing

Pragma Kar, Shyamvanshikumar Singh, Avijit Mandal, Samiran, Chattopadhyay, Sandip Chakraborty

PDF

Open Access

TL;DR

ExpresSense is a lightweight smartphone application that uses acoustic signals to detect facial expressions with about 75% accuracy, addressing limitations of image-based methods like occlusion and privacy concerns.

Contribution

This work introduces a novel acoustic sensing approach for facial expression detection on standalone smartphones, bypassing traditional image processing limitations.

Findings

01

Achieved ~75% classification accuracy on various expressions.

02

Validated effectiveness through lab and large-scale studies.

03

Demonstrated potential for privacy-preserving expression sensing.

Abstract

Facial expressions have been considered a metric reflecting a person's engagement with a task. While the evolution of expression detection methods is consequential, the foundation remains mostly on image processing techniques that suffer from occlusion, ambient light, and privacy concerns. In this paper, we propose ExpresSense, a lightweight application for standalone smartphones that relies on near-ultrasound acoustic signals for detecting users' facial expressions. ExpresSense has been tested on different users in lab-scaled and large-scale studies for both posed as well as natural expressions. By achieving a classification accuracy of ~75% over various basic expressions, we discuss the potential of a standalone smartphone to sense expressions through acoustic sensing.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Emotion and Mood Recognition · Face recognition and analysis