A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Biao Ma, Chengben Xu, Ye Zhang

TL;DR
This paper introduces a novel speech feature fusion algorithm combining IVA and PCNN for text-independent speaker recognition, enhancing feature extraction and recognition accuracy.
Contribution
It proposes a new fusion method using IVA and PCNN to improve speaker recognition performance by effectively combining diverse speech features.
Findings
Improved recognition accuracy over baseline methods
Effective fusion of time and frequency domain features
Robustness to different speech conditions
Abstract
A novel speech feature fusion algorithm with independent vector analysis (IVA) and parallel convolutional neural network (PCNN) is proposed for text-independent speaker recognition. Firstly, some different feature types, such as the time domain (TD) features and the frequency domain (FD) features, can be extracted from a speaker's speech, and the TD and the FD features can be considered as the linear mixtures of independent feature components (IFCs) with an unknown mixing system. To estimate the IFCs, the TD and the FD features of the speaker's speech are concatenated to build the TD and the FD feature matrix, respectively. Then, a feature tensor of the speaker's speech is obtained by paralleling the TD and the FD feature matrix. To enhance the dependence on different feature types and remove the redundancies of the same feature type, the independent vector analysis (IVA) can be used to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Algorithms and Applications · Advanced Computational Techniques and Applications · Advanced Sensor and Control Systems
