Detecting Attended Visual Targets in Video

Eunji Chong; Yongxin Wang; Nataniel Ruiz; and James M. Rehg

arXiv:2003.02501·cs.CV·April 1, 2020·1 cites

Detecting Attended Visual Targets in Video

Eunji Chong, Yongxin Wang, Nataniel Ruiz, and James M. Rehg

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents a novel deep learning architecture for detecting where people are looking in videos, including out-of-frame targets, and introduces a new dataset for training and evaluation.

Contribution

The paper introduces a new model for dynamic attention detection in videos and a new annotated dataset, advancing the understanding of gaze behavior analysis.

Findings

01

Effective inference of dynamic attention in videos.

02

State-of-the-art performance on multiple gaze datasets.

03

First automatic classification of clinically-relevant gaze behavior.

Abstract

We address the problem of detecting attention targets in video. Our goal is to identify where each person in each frame of a video is looking, and correctly handle the case where the gaze target is out-of-frame. Our novel architecture models the dynamic interaction between the scene and head features and infers time-varying attention targets. We introduce a new annotated dataset, VideoAttentionTarget, containing complex and dynamic patterns of real-world gaze behavior. Our experiments show that our model can effectively infer dynamic attention in videos. In addition, we apply our predicted attention maps to two social gaze behavior recognition tasks, and show that the resulting classifiers significantly outperform existing methods. We achieve state-of-the-art performance on three datasets: GazeFollow (static images), VideoAttentionTarget (videos), and VideoCoAtt (videos), and obtain the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ejcgt/attention-target-detection
pytorch

Videos

Detecting Attended Visual Targets in Video· youtube

Taxonomy

TopicsGaze Tracking and Assistive Technology · Visual Attention and Saliency Detection · Neonatal and fetal brain pathology