Continuous Affect Prediction Using Eye Gaze and Speech
Jonny O'Dwyer, Ronan Flynn, Niall Murray

TL;DR
This paper presents a novel real-time affect prediction system combining speech and eye gaze, improving accuracy without intrusive devices, useful for remote psychological assessments and audio-visual communication.
Contribution
It introduces a new eye gaze feature set and demonstrates the effectiveness of combining speech and eye gaze for continuous affect prediction.
Findings
Arousal prediction improved by 3.5% using combined modalities.
Valence prediction improved by 19.5% with speech and eye gaze.
System operates in real-time without specialized eye-tracking hardware.
Abstract
Affective computing research traditionally focused on labeling a person's emotion as one of a discrete number of classes e.g. happy or sad. In recent times, more attention has been given to continuous affect prediction across dimensions in the emotional space, e.g. arousal and valence. Continuous affect prediction is the task of predicting a numerical value for different emotion dimensions. The application of continuous affect prediction is powerful in domains involving real-time audio-visual communications which could include remote or assistive technologies for psychological assessment of subjects. Modalities used for continuous affect prediction may include speech, facial expressions and physiological responses. As opposed to single modality analysis, the research community have combined multiple modalities to improve the accuracy of continuous affect prediction. In this context,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
