EyeCue: Driver Cognitive Distraction Detection via Gaze-Empowered Egocentric Video Understanding

Lang Zhang; JinYi Yoon; Matthew Corbett; Abhijit Sarkar; and Bo Ji

arXiv:2605.07859·cs.CV·May 11, 2026

EyeCue: Driver Cognitive Distraction Detection via Gaze-Empowered Egocentric Video Understanding

Lang Zhang, JinYi Yoon, Matthew Corbett, Abhijit Sarkar, and Bo Ji

PDF

1 Repo

TL;DR

EyeCue is a novel framework that detects driver cognitive distraction by analyzing the interaction between eye gaze and egocentric video, supported by a new multi-scenario dataset, CogDrive.

Contribution

The paper introduces EyeCue, a gaze-empowered video understanding model, and CogDrive, a comprehensive dataset, advancing cognitive distraction detection in driving scenarios.

Findings

01

EyeCue achieves 74.38% accuracy, outperforming baselines by over 7%.

02

It maintains over 70% accuracy across diverse scenarios.

03

Modeling gaze-context interactions enhances detection performance.

Abstract

Driver cognitive distraction is a major cause of road collisions and remains difficult to detect. Unlike manual or visual distraction, cognitive distraction is diverted by thoughts unrelated to driving, even when the driver appears visually attentive and exhibits no explicit physical movements. In this work, we propose EyeCue, a gaze-empowered egocentric video understanding framework, to detect driver cognitive distraction. A key insight is that cognitive distraction manifests in the interaction between eye gaze and visual context. To capture this interaction, EyeCue integrates eye gaze with egocentric video to enable context-aware modeling of the driver's attention over time. Furthermore, to tackle the limited scale and diversity of existing datasets, we introduce CogDrive, a comprehensive multi-scenario dataset that augments four existing driving datasets with cognitive distraction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

langzhang2000/EyeCue
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.