# Short-Time Homomorphic Deconvolution (STHD): A Novel 2D Feature for Robust Indoor Direction of Arrival Estimation

**Authors:** Yeonseok Park, Jun-Hwa Kim

PMC · DOI: 10.3390/s26020722 · Sensors (Basel, Switzerland) · 2026-01-21

## TL;DR

This paper introduces a new audio-based method for accurately determining sound source direction indoors, using a novel feature extraction technique that improves deep learning model performance.

## Contribution

The novel Short-Time Homomorphic Deconvolution feature captures temporal time-of-flight differences for robust indoor direction estimation.

## Key findings

- The proposed feature achieves a Mean Absolute Error of 1.99 degrees in real-world direction of arrival estimation.
- The system shows strong consistency between simulated and real-world results.
- A lightweight CNN with dual-stage attention improves spatial cue prioritization.

## Abstract

Accurate indoor positioning and navigation remain significant challenges, with audio sensor-based sound source localization emerging as a promising sensing modality. Conventional methods, often reliant on multi-channel processing or time-delay estimation techniques such as Generalized Cross-Correlation, encounter difficulties regarding computational complexity, hardware synchronization, and reverberant environments where time difference in arrival cues are masked. While machine learning approaches have shown potential, their performance depends heavily on the discriminative power of input features. This paper proposes a novel feature extraction method named Short-Time Homomorphic Deconvolution, which transforms multi-channel audio signals into a 2D Time × Time-of-Flight representation. Unlike prior 1D methods, this feature effectively captures the temporal evolution and stability of time-of-flight differences between microphone pairs, offering a rich and robust input for deep learning models. We validate this feature using a lightweight Convolutional Neural Network integrated with a dual-stage channel attention mechanism, designed to prioritize reliable spatial cues. The system was trained on a large-scale dataset generated via simulations and rigorously tested using real-world data acquired in an ISO-certified anechoic chamber. Experimental results demonstrate that the proposed model achieves precise Direction of Arrival estimation with a Mean Absolute Error of 1.99 degrees in real-world scenarios. Notably, the system exhibits remarkable consistency between simulation and physical experiments, proving its effectiveness for robust indoor navigation and positioning systems.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12846163/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12846163/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC12846163/full.md

---
Source: https://tomesphere.com/paper/PMC12846163