# A speech compression method without utilizing signal prediction

**Authors:** Ikuo Matsuo, Kazuo Ueda, Yoshitaka Nakajima

PMC · DOI: 10.1177/20416695251340236 · i-Perception · 2025-05-21

## TL;DR

A new speech compression method uses amplitude envelopes to preserve speech quality and intelligibility at a much lower bit rate.

## Contribution

The method uses amplitude envelopes in specific frequency bands without signal prediction, achieving high intelligibility at low bit rates.

## Key findings

- Intelligibility reached ~80% at 2,400 bps, compared to 256,000 bps for the original signal.
- The method preserves natural speech quality and is intuitive to understand.

## Abstract

Previous speech compression methods for practical purposes had been based on signal prediction, taking the auditory functions into account but overlooking features specific to speech signals. A new method was developed in which amplitude envelopes in four frequency bands corresponding to spectral factors common to different languages were used to modulate infinitely peak-clipped signals, which also had been revealed to contain useful linguistic information. In a pilot experiment, intelligibility reached ~80% with limited information of only 2,400 bits per second (bps), whereas the bit rate of the original signal was 256,000 bps. This algorithm preserves the naturalness of speech and is easy to grasp intuitively.

## Full-text entities

- **Diseases:** ORCID iDs (MESH:C535742)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12120533/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12120533/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/PMC12120533/full.md

---
Source: https://tomesphere.com/paper/PMC12120533