# Hide and Speak: Towards Deep Neural Networks for Speech Steganography

**Authors:** Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet

arXiv: 1902.03083 · 2020-07-28

## TL;DR

This paper introduces a neural network-based speech steganography method that effectively conceals multiple messages with minimal perceptible changes, outperforming traditional techniques and robust under channel distortions.

## Contribution

The paper proposes a novel neural network model incorporating Fourier transforms for speech steganography, addressing limitations of vision-based models and enabling multi-message concealment.

## Key findings

- Effective concealment of multiple messages in speech
- Minimal perceptible changes to human listeners
- Robustness under various channel distortions

## Abstract

Steganography is the science of hiding a secret message within an ordinary public message, which is referred to as Carrier. Traditionally, digital signal processing techniques, such as least significant bit encoding, were used for hiding messages. In this paper, we explore the use of deep neural networks as steganographic functions for speech data. We showed that steganography models proposed for vision are less suitable for speech, and propose a new model that includes the short-time Fourier transform and inverse-short-time Fourier transform as differentiable layers within the network, thus imposing a vital constraint on the network outputs. We empirically demonstrated the effectiveness of the proposed method comparing to deep learning based on several speech datasets and analyzed the results quantitatively and qualitatively. Moreover, we showed that the proposed approach could be applied to conceal multiple messages in a single carrier using multiple decoders or a single conditional decoder. Lastly, we evaluated our model under different channel distortions. Qualitative experiments suggest that modifications to the carrier are unnoticeable by human listeners and that the decoded messages are highly intelligible.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.03083/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1902.03083/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/1902.03083/full.md

---
Source: https://tomesphere.com/paper/1902.03083