TL;DR
EyeCue is a novel framework that detects driver cognitive distraction by analyzing the interaction between eye gaze and egocentric video, supported by a new multi-scenario dataset, CogDrive.
Contribution
The paper introduces EyeCue, a gaze-empowered video understanding model, and CogDrive, a comprehensive dataset, advancing cognitive distraction detection in driving scenarios.
Findings
EyeCue achieves 74.38% accuracy, outperforming baselines by over 7%.
It maintains over 70% accuracy across diverse scenarios.
Modeling gaze-context interactions enhances detection performance.
Abstract
Driver cognitive distraction is a major cause of road collisions and remains difficult to detect. Unlike manual or visual distraction, cognitive distraction is diverted by thoughts unrelated to driving, even when the driver appears visually attentive and exhibits no explicit physical movements. In this work, we propose EyeCue, a gaze-empowered egocentric video understanding framework, to detect driver cognitive distraction. A key insight is that cognitive distraction manifests in the interaction between eye gaze and visual context. To capture this interaction, EyeCue integrates eye gaze with egocentric video to enable context-aware modeling of the driver's attention over time. Furthermore, to tackle the limited scale and diversity of existing datasets, we introduce CogDrive, a comprehensive multi-scenario dataset that augments four existing driving datasets with cognitive distraction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
