A Study of Human Gaze Behavior During Visual Crowd Counting
Raji Annadi, Yupei Chen, Viresh Ranjan, Dimitris Samaras, Gregory, Zelinsky, Minh Hoai

TL;DR
This study investigates how humans allocate visual attention during crowd counting using eye tracking, revealing common strategies and highlighting the differences in accuracy compared to computer algorithms.
Contribution
It provides detailed analysis of human gaze behavior during crowd counting and introduces a dataset of gaze data and images for further research.
Findings
Humans tend to enumerate or focus on sections depending on crowd size.
Participants generally undercount compared to computer algorithms.
Gaze patterns are similar for small crowds but differ for large crowds.
Abstract
In this paper, we describe our study on how humans allocate their attention during visual crowd counting. Using an eye tracker, we collect gaze behavior of human participants who are tasked with counting the number of people in crowd images. Analyzing the collected gaze behavior of ten human participants on thirty crowd images, we observe some common approaches for visual counting. For an image of a small crowd, the approach is to enumerate over all people or groups of people in the crowd, and this explains the high level of similarity between the fixation density maps of different human participants. For an image of a large crowd, our participants tend to focus on one section of the image, count the number of people in that section, and then extrapolate to the other sections. In terms of count accuracy, our human participants are not as good at the counting task, compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Retinal Imaging and Analysis · Visual Attention and Saliency Detection
