Topological Deep Learning for Speech Data
By: Zhiwang Yu
Potential Business Impact:
Makes computers understand voices better, even with noise.
Topological data analysis (TDA) offers novel mathematical tools for deep learning. Inspired by Carlsson et al., this study designs topology-aware convolutional kernels that significantly improve speech recognition networks. Theoretically, by investigating orthogonal group actions on kernels, we establish a fiber-bundle decomposition of matrix spaces, enabling new filter generation methods. Practically, our proposed Orthogonal Feature (OF) layer achieves superior performance in phoneme recognition, particularly in low-noise scenarios, while demonstrating cross-domain adaptability. This work reveals TDA's potential in neural network optimization, opening new avenues for mathematics-deep learning interdisciplinary studies.
Similar Papers
Improving Remote Sensing Classification using Topological Data Analysis and Convolutional Neural Networks
CV and Pattern Recognition
Helps computers better understand satellite pictures.
Commutative algebra-enhanced topological data analysis
Computational Geometry
Finds hidden patterns in data more deeply.
Topological Dictionary Learning
Signal Processing
Finds hidden patterns in connected data.