Optimizing Audio Compression Through Entropy-Controlled Dithering

Ellison Murray; Morriel Kasher; Predrag Spasojevic

arXiv:2501.02293·eess.AS·January 13, 2025

Optimizing Audio Compression Through Entropy-Controlled Dithering

Ellison Murray, Morriel Kasher, Predrag Spasojevic

PDF

Open Access

TL;DR

This paper investigates entropy-controlled dithering techniques in audio compression, demonstrating improved perceptual quality with TPDF-based methods and introducing a practical plugin for enhanced audio processing.

Contribution

It introduces entropy-controlled dithering methods using TPDFs and noise shaping, providing a practical plugin implementation and insights into optimizing audio compression.

Findings

01

TPDF-based dithering outperforms RPDF in perceptual quality

02

Optimal alpha conditions enhance dithering performance

03

Situational choice of TPDF distributions affects results

Abstract

This paper explores entropy-controlled dithering techniques in audio compression, examining the application of standard and modified TPDFs, combined with noise shaping and entropy-controlled parameters, across various audio contexts, including pitch, loudness, rhythm, and instrumentation variations. Perceptual quality metrics such as VISQOL and STOI were used to evaluate performance. The results demonstrate that TPDF-based dithering consistently outperforms RPDF, particularly under optimal alpha conditions, while highlighting performance variability based on signal characteristics. These findings suggest the situational appropriateness of using various TPDF distributions. This work emphasizes the trade-off between entropy and perceptual fidelity, offering insights into the potential of entropy-controlled dithering as a foundation for enhanced audio compression algorithms. A practical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Advanced Data Compression Techniques · Music and Audio Processing