Complex-Valued Restricted Boltzmann Machine for Direct Speech Parameterization from Complex Spectra
Toru Nakashika, Shinji Takaki, Junichi Yamagishi

TL;DR
This paper introduces the complex-valued restricted Boltzmann machine (CRBM), a novel model that directly encodes complex spectra for speech feature extraction, outperforming traditional methods in speech coding tasks.
Contribution
The paper presents the CRBM, extending RBMs to handle complex-valued data with real-imaginary connections, enabling direct complex spectrum encoding and decoding for speech processing.
Findings
CRBM outperforms traditional speech coding methods.
CRBM effectively encodes complex spectra into binary latent features.
The model improves speech feature extraction by utilizing phase information.
Abstract
This paper describes a novel energy-based probabilistic distribution that represents complex-valued data and explains how to apply it to direct feature extraction from complex-valued spectra. The proposed model, the complex-valued restricted Boltzmann machine (CRBM), is designed to deal with complex-valued visible units as an extension of the well-known restricted Boltzmann machine (RBM). Like the RBM, the CRBM learns the relationships between visible and hidden units without having connections between units in the same layer, which dramatically improves training efficiency by using Gibbs sampling or contrastive divergence (CD). Another important characteristic is that the CRBM also has connections between real and imaginary parts of each of the complex-valued visible units that help represent the data distribution in the complex domain. In speech signal processing, classification and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Data Compression Techniques
