CrowdFix: An Eyetracking Dataset of Real Life Crowd Videos

Memoona Tahira; Sobas Mehboob; Anis U. Rahman; Omar Arif

arXiv:1910.02618·cs.CV·October 10, 2019

CrowdFix: An Eyetracking Dataset of Real Life Crowd Videos

Memoona Tahira, Sobas Mehboob, Anis U. Rahman, Omar Arif

PDF

Open Access 1 Repo

TL;DR

This paper introduces CrowdFix, a new eye-tracking dataset of crowd videos with varying densities, to improve understanding of visual attention in crowded scenes and evaluate saliency models.

Contribution

It provides a novel eye-tracking dataset of crowd videos with annotations and assesses current saliency models on this data for future improvements.

Findings

01

State-of-the-art models perform variably across crowd densities.

02

The dataset reveals challenges in current saliency modeling for crowded scenes.

03

CrowdFix offers a new benchmark for saliency research in complex environments.

Abstract

Understanding human visual attention and saliency is an integral part of vision research. In this context, there is an ever-present need for fresh and diverse benchmark datasets, particularly for insight into special use cases like crowded scenes. We contribute to this end by: (1) reviewing the dynamics behind saliency and crowds. (2) using eye tracking to create a dynamic human eye fixation dataset over a new set of crowd videos gathered from the Internet. The videos are annotated into three distinct density levels. (3) Finally, we evaluate state-of-the-art saliency models on our dataset to identify possible improvements for the design and creation of a more robust saliency model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MemoonaTahira/CrowdFix
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Gaze Tracking and Assistive Technology