Score: 1

Topological Deep Learning for Speech Data

Published: May 27, 2025 | arXiv ID: 2505.21173v1

By: Zhiwang Yu

Potential Business Impact:

Makes computers understand voices better, even with noise.

Business Areas:
Speech Recognition Data and Analytics, Software

Topological data analysis (TDA) offers novel mathematical tools for deep learning. Inspired by Carlsson et al., this study designs topology-aware convolutional kernels that significantly improve speech recognition networks. Theoretically, by investigating orthogonal group actions on kernels, we establish a fiber-bundle decomposition of matrix spaces, enabling new filter generation methods. Practically, our proposed Orthogonal Feature (OF) layer achieves superior performance in phoneme recognition, particularly in low-noise scenarios, while demonstrating cross-domain adaptability. This work reveals TDA's potential in neural network optimization, opening new avenues for mathematics-deep learning interdisciplinary studies.

Repos / Data Links

Page Count
21 pages

Category
Computer Science:
Machine Learning (CS)