# A Terrain Perception Method for Quadruped Robots Based on Acoustic Signal Fusion

**Authors:** Meng Hong, Nian Wang, Xingyu Liu, Chao Huang, Ganchang Li, Zijian Li, Shuai Shu, Ruixuan Chen, Jincheng Sheng, Zhongren Wang, Sijia Guan, Min Guo

PMC · DOI: 10.3390/s26020594 · Sensors (Basel, Switzerland) · 2026-01-15

## TL;DR

This paper introduces a new method for quadruped robots to better sense different terrains by combining sound and movement data, improving their navigation in complex environments.

## Contribution

The novel contribution is a multimodal terrain perception approach that fuses acoustic and proprioceptive signals for improved classification accuracy.

## Key findings

- Integrating acoustic signals with proprioceptive data increased terrain classification accuracy from 78.28% to 82.52%.
- Optimizing SVM hyperparameters and time-window length further boosted accuracy to 99.53%.
- The method was tested on four terrains: concrete, gravel, sand, and carpet.

## Abstract

In unstructured environments, terrain perception is essential for stability and environmental awareness of Quadruped robot locomotion. Existing approaches primarily rely on visual or proprioceptive signals, but their effectiveness is limited under conditions of visual occlusion or ambiguous terrain features. To address this, this study proposes a multimodal terrain perception method that integrates acoustic features with proprioceptive signals. This terrain perception method collects environmental acoustic information through an externally mounted sound sensor, and combines the sound signal with proprioceptive sensor data from IMU and joint encoder of the quadruped robot. The method was deployed on the quadruped robot Lite2 platform developed by Deep Robotics, and experiments were conducted on four representative terrain types: concrete, gravel, sand, and carpet. Mel-spectrogram features are extracted from the acoustic signals and concatenated with the IMU and joint encoder to form feature vectors, which are subsequently fed into a support vector machine for terrain classification. For each terrain type, 400 s of data were collected. Experimental results show that the terrain classification accuracy reaches 78.28% without using acoustic signals, while increasing to 82.52% when acoustic features are incorporated. To further enhance the classification performance, this study performs a combined exploration of the SVM hyperparameters C and γ as well as the time-window length win. The final results demonstrate that the classification accuracy can be improved to as high as 99.53% across all four terrains.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12845680/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12845680/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/PMC12845680/full.md

---
Source: https://tomesphere.com/paper/PMC12845680