# Vision-Based Vehicle State and Behavior Analysis for Aircraft Stand Safety

**Authors:** Ke Tang, Liang Zeng, Tianxiong Zhang, Di Zhu, Wenjie Liu, Xinping Zhu

PMC · DOI: 10.3390/s26061821 · Sensors (Basel, Switzerland) · 2026-03-13

## TL;DR

This paper introduces a vision-based system to monitor ground vehicles at aircraft stands, improving safety by detecting abnormal behaviors using existing cameras.

## Contribution

A lightweight framework for vehicle state perception and behavior analysis using monocular cameras and improved detection algorithms.

## Key findings

- The framework achieves 0.32 m RMSE in physical localization and 90.4% mAP@50 for vehicle detection.
- It detects 'area intrusion' and 'abnormal stop' violations with 96.0% recall and 95.8% precision.
- The system uses existing cameras, making it cost-effective and easy to deploy.

## Abstract

With the continuous elevation of aviation safety standards, accurate monitoring of ground support vehicles in aircraft stand areas has become a critical task for enhancing overall aircraft stand operational safety. Given the limitations of existing surface movement radar and multi-camera surveillance systems in terms of cost, deployment complexity, and coverage, this paper proposes a lightweight vision-based framework for vehicle state perception and spatiotemporal behavior analysis oriented toward aircraft stand safety. Leveraging existing fixed monocular monitoring resources in the stand area, the framework first establishes a precise mapping from image pixel coordinates to the physical plane through self-calibration and homography transformation utilizing scene line features, thereby achieving unified spatial measurement of vehicle targets. Subsequently, it integrates an improved lightweight YOLO detector (incorporating Ghost modules and CBAM for noise suppression) with the ByteTrack tracking algorithm to enable stable extraction of vehicle trajectories under complex occlusion conditions. Finally, by combining functional zone division within the stand, a semantic map is constructed, and a behavior analysis method based on a spatiotemporal finite state machine is proposed. This method performs joint reasoning by fusing multi-dimensional constraints including position, zone, and time, enabling automatic detection of abnormal behaviors such as “intrusion into restricted areas” and “abnormal stop.” Quantitative evaluations demonstrate the framework’s efficacy: it achieves an average physical localization error (RMSE) of 0.32 m, and the improved detection model reaches an accuracy (mAP@50) of 90.4% for ground support vehicles. In tests simulating typical violation scenarios, the system achieved high recall (96.0%) and precision (95.8%) rates in detecting ‘area intrusion’ and ‘abnormal stop’ violations, respectively. These results, achieved using only existing surveillance cameras, validate its potential as a cost-effective and easily deployable tool to augment existing safety monitoring systems for airport ground operations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13030404/full.md

## Figures

22 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13030404/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC13030404/full.md

---
Source: https://tomesphere.com/paper/PMC13030404