Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Shuyu Yang, Yaxiong Wang, Li Zhu, Zhedong Zheng

TL;DR
This paper introduces a large-scale benchmark and a novel framework for text-based person anomaly search, enabling the identification of abnormal behaviors in real-world scenarios using natural language descriptions.
Contribution
It presents a new task, the Pedestrian Anomaly Behavior benchmark, and a cross-modal pose-aware framework for improved abnormal behavior retrieval.
Findings
Synthetic data enhances behavior retrieval accuracy.
The pose-aware method achieves 84.93% recall@1.
The benchmark covers diverse normal and abnormal actions.
Abstract
Text-based person search aims to retrieve specific individuals across camera networks using natural language descriptions. However, current benchmarks often exhibit biases towards common actions like walking or standing, neglecting the critical need for identifying abnormal behaviors in real-world scenarios. To meet such demands, we propose a new task, text-based person anomaly search, locating pedestrians engaged in both routine or anomalous activities via text. To enable the training and evaluation of this new task, we construct a large-scale image-text Pedestrian Anomaly Behavior (PAB) benchmark, featuring a broad spectrum of actions, e.g., running, performing, playing soccer, and the corresponding anomalies, e.g., lying, being hit, and falling of the same identity. The training set of PAB comprises 1,013,605 synthesized image-text pairs of both normalities and anomalies, while the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNames, Identity, and Discrimination Research · Data-Driven Disease Surveillance
MethodsSparse Evolutionary Training
