# A Fusion Algorithm for Pedestrian Anomaly Detection and Tracking on Urban Roads Based on Multi-Module Collaboration and Cross-Frame Matching Optimization

**Authors:** Wei Zhao, Xin Gong, Lanlan Li, Luoyang Zuo

PMC · DOI: 10.3390/s26020400 · Sensors (Basel, Switzerland) · 2026-01-08

## TL;DR

This paper introduces a new algorithm that improves pedestrian anomaly detection and tracking in urban settings by combining detection and tracking modules with optimized features.

## Contribution

A novel fusion algorithm integrating YOLO-SGCF and BoT-SORT-ReID with cross-frame matching optimization for enhanced pedestrian anomaly detection and tracking.

## Key findings

- The proposed algorithm achieves 92.2% mAP@50% on a self-built dataset with four abnormal behaviors, outperforming the original model.
- Generalization testing on UCSD Ped1 dataset shows a 1.5% improvement in mAP score over the original model.
- The tracking algorithm achieves an MOTA of 90.8% and a 47.6% reduction in IDS compared to existing models.

## Abstract

Amid rapid advancements in artificial intelligence, the detection of abnormal human behaviors in complex traffic environments has garnered significant attention. However, detection errors frequently occur due to interference from complex backgrounds, small targets, and other factors. Therefore, this paper proposes a research methodology that integrates the anomaly detection YOLO-SGCF algorithm with the tracking BoT-SORT-ReID algorithm. The detection module uses YOLOv8 as the baseline model, incorporating Swin Transformer to enhance global feature modeling capabilities in complex scenes. CBAM and CA attention are embedded into the Neck and backbone, respectively: CBAM enables dual-dimensional channel-spatial weighting, while CA precisely captures object location features by encoding coordinate information. The Neck layer incorporates GSConv convolutional modules to reduce computational load while expanding feature receptive fields. The loss function is replaced with Focal-EIoU to address sample imbalance issues and precisely optimize bounding box regression. For tracking, to enhance long-term tracking stability, ReID feature distances are incorporated during the BoT-SORT data association phase. This integrates behavioral category information from YOLO-SGCF, enabling the identification and tracking of abnormal pedestrian behaviors in complex environments. Evaluations on our self-built dataset (covering four abnormal behaviors: Climb, Fall, Fight, Phone) show mAP@50%, precision, and recall reaching 92.2%, 90.75%, and 86.57% respectively—improvements of 3.4%, 4.4%, and 6% over the original model—while maintaining an inference speed of 328.49 FPS. Additionally, generalization testing on the UCSD Ped1 dataset (covering six abnormal behaviors: Biker, Skater, Car, Wheelchair, Lawn, Runner) yielded an mAP score of 92.7%, representing a 1.5% improvement over the original model and outperforming existing mainstream models. Furthermore, the tracking algorithm achieved an MOTA of 90.8% and an MOTP of 92.6%, with a 47.6% reduction in IDS, demonstrating superior tracking performance compared to existing mainstream algorithms.

## Full-text entities

- **Diseases:** Anomaly (MESH:D000013)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12845580/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12845580/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC12845580/full.md

---
Source: https://tomesphere.com/paper/PMC12845580