# An auditory-visual cooperative perception method for honking vehicle localization

**Authors:** Fei Yuan, Junxi Kang, Jiao Yin, Jinli Cao

PMC · DOI: 10.1371/journal.pone.0337352 · PLOS One · 2025-11-21

## TL;DR

This paper introduces a method that combines audio and visual data to more accurately locate honking vehicles, improving over traditional sound-based approaches.

## Contribution

The novel AVCP method integrates ECAPA-TDNN and YOLO v9 for cooperative auditory-visual honking vehicle localization.

## Key findings

- The AVCP method is less affected by environmental noise compared to traditional methods.
- It achieves more accurate and distant vehicle identification and localization.
- Experiments on simulated and public datasets validated the method's effectiveness.

## Abstract

Locating honking vehicles is crucial for controlling arbitrary honking and reducing environmental noise. However, traditional methods for honking vehicle localization, which utilize sound source localization technology, suffer from inaccuracies and limited detection range due to the multipath effects of sound propagation and environmental noise interference. To address these challenges, an auditory-visual cooperative perception (AVCP) method for honking vehicle localization is proposed, and a detailed workflow of this method is presented. In the AVCP method workflow, the Emphasized Channel Attention, Propagation, and Aggregation in Time-Delay Neural Network (ECAPA-TDNN) is used to recognize honking vehicle models from captured audio signals, as different vehicle models exhibit distinct horn sound characteristics. Subsequently, YOLO v9 is employed to detect vehicles and recognize their corresponding models in the images captured by the camera. Thus, among the vehicles detected and identified using YOLO v9, the honking vehicle is determined as the one whose model matches the vehicle model recognized by ECAPA-TDNN. Additionally, experiments with simulated and public datasets were conducted to evaluate the performance of the AVCP method for honking vehicle localization. The experimental results show that the AVCP method is less susceptible to environmental noise and can more accurately identify and locate vehicles from greater distances compared to traditional methods based on sound source localization technology.

## Full-text entities

- **Diseases:** anxiety (MESH:D001007), sleep fragmentation (MESH:D012892), MFCC (MESH:D006316), hypertension (MESH:D006973), vascular dysfunction (MESH:D002561)
- **Chemicals:** triglyceride (MESH:D014280), SUVs (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12637988/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12637988/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/PMC12637988/full.md

---
Source: https://tomesphere.com/paper/PMC12637988