Topological Deep Learning for Speech Data

Zhiwang Yu

arXiv:2505.21173·cs.LG·May 28, 2025

Topological Deep Learning for Speech Data

Zhiwang Yu

PDF

Open Access

TL;DR

This paper introduces topology-aware convolutional kernels inspired by topological data analysis to enhance speech recognition networks, demonstrating improved performance and cross-domain adaptability through novel mathematical and practical approaches.

Contribution

It presents a new class of topology-aware kernels and a fiber-bundle decomposition method, advancing the integration of topological data analysis with deep learning for speech data.

Findings

01

Orthogonal Feature layer outperforms existing methods in phoneme recognition

02

Proposed kernels improve robustness in low-noise environments

03

Demonstrates cross-domain adaptability of the approach

Abstract

Topological data analysis (TDA) offers novel mathematical tools for deep learning. Inspired by Carlsson et al., this study designs topology-aware convolutional kernels that significantly improve speech recognition networks. Theoretically, by investigating orthogonal group actions on kernels, we establish a fiber-bundle decomposition of matrix spaces, enabling new filter generation methods. Practically, our proposed Orthogonal Feature (OF) layer achieves superior performance in phoneme recognition, particularly in low-noise scenarios, while demonstrating cross-domain adaptability. This work reveals TDA's potential in neural network optimization, opening new avenues for mathematics-deep learning interdisciplinary studies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Music and Audio Processing