# Enhancing Bone Conduction Sensor Signals via Self-Supervised Acoustic Priors and Key-Value Memory

**Authors:** Changyan Zheng, Hao He, Xiaohu Fan, Lin Li, Yang Zhao, Ye Yan, Erwei Yin

PMC · DOI: 10.3390/s26041137 · 2026-02-10

## TL;DR

This paper introduces a method to enhance bone conduction sensor signals using self-supervised learning and memory modules to recover lost high-frequency speech components.

## Contribution

A novel time-domain framework combining SSL priors and a Key-Value Memory module to improve BC signal quality without reference signals.

## Key findings

- The proposed method achieves significant PESQ gains of over 51% and 73% on the ABCS and ESMB datasets.
- The Key-Value Memory module effectively bridges the sensor domain gap by retrieving high-fidelity acoustic priors.
- The compact architecture is optimized for real-world deployment while maintaining high performance.

## Abstract

Bone conduction (BC) sensors naturally resist ambient noise, but the captured speech suffers from severe high-frequency attenuation due to the low-pass filtering characteristics of body tissue. To compensate for this hardware-induced information deficiency, we propose a time-domain framework leveraging highly generalized representations from Self-Supervised Learning (SSL). Specifically, we employ a large-scale pre-trained SSL model to generate embeddings that function as robust acoustic priors. Subsequently, a Key-Value Memory module is integrated to bridge the sensor domain gap, enabling the retrieval of high-fidelity priors from BC queries in the absence of reference air conduction signals. These retrieved cues are then processed by a Gated Attention Projection and dynamically fused into the primary network’s bottleneck, effectively recovering the high-frequency harmonics attenuated by the physical transmission path and rectifying the spectral distortion inherent in BC signals. Experiments on the ABCS and ESMB datasets demonstrate that our method surpasses state-of-the-art baselines in both quality and efficiency. It achieves PESQ gains of over 51% and 73% relative to raw BC inputs, respectively, with a compact architecture optimized for real-world deployment.

## Full-text entities

- **Diseases:** blind (MESH:D001766), ESMB (MESH:D001847), hallucination (MESH:D006212), ABCS (MESH:D009378), injury to (MESH:D014947)
- **Chemicals:** N (MESH:D009584), EBEN (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944054/full.md

---
Source: https://tomesphere.com/paper/PMC12944054