FGENet: Fine-Grained Extraction Network for Congested Crowd Counting

Hao-Yuan Ma; Li Zhang; Xiang-Yi Wei

arXiv:2401.01208·cs.CV·January 3, 2024·1 cites

FGENet: Fine-Grained Extraction Network for Congested Crowd Counting

Hao-Yuan Ma, Li Zhang, Xiang-Yi Wei

PDF

Open Access

TL;DR

FGENet is an end-to-end crowd counting model that directly localizes individuals with high accuracy, employing a fusion module and a robust loss function to outperform existing methods, especially in high-density scenarios.

Contribution

The paper introduces FGENet, a novel model that directly predicts individual coordinates, utilizing a new fusion module and loss function to improve accuracy over density map-based methods.

Findings

01

Achieves 3.14 MAE improvement on ShanghaiTech Part A dataset.

02

Surpasses previous benchmarks with 30.16 MAE improvement on UCF_CC_50.

03

Demonstrates robustness in high-density crowd images.

Abstract

Crowd counting has gained significant popularity due to its practical applications. However, mainstream counting methods ignore precise individual localization and suffer from annotation noise because of counting from estimating density maps. Additionally, they also struggle with high-density images.To address these issues, we propose an end-to-end model called Fine-Grained Extraction Network (FGENet). Different from methods estimating density maps, FGENet directly learns the original coordinate points that represent the precise localization of individuals.This study designs a fusion module, named Fine-Grained Feature Pyramid(FGFP), that is used to fuse feature maps extracted by the backbone of FGENet. The fused features are then passed to both regression and classification heads, where the former provides predicted point coordinates for a given image, and the latter determines the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Mobility and Location-Based Analysis · Anomaly Detection Techniques and Applications

MethodsMasked autoencoder