Voice Pathology Detection Using Phonation
By: Sri Raksha Siva , Nived Suthahar , Prakash Boominathan and more
Potential Business Impact:
Detects voice problems without touching you.
Voice disorders significantly affect communication and quality of life, requiring an early and accurate diagnosis. Traditional methods like laryngoscopy are invasive, subjective, and often inaccessible. This research proposes a noninvasive, machine learning-based framework for detecting voice pathologies using phonation data. Phonation data from the Saarbr\"ucken Voice Database are analyzed using acoustic features such as Mel Frequency Cepstral Coefficients (MFCCs), chroma features, and Mel spectrograms. Recurrent Neural Networks (RNNs), including LSTM and attention mechanisms, classify samples into normal and pathological categories. Data augmentation techniques, including pitch shifting and Gaussian noise addition, enhance model generalizability, while preprocessing ensures signal quality. Scale-based features, such as H\"older and Hurst exponents, further capture signal irregularities and long-term dependencies. The proposed framework offers a noninvasive, automated diagnostic tool for early detection of voice pathologies, supporting AI-driven healthcare, and improving patient outcomes.
Similar Papers
AI-Driven Acoustic Voice Biomarker-Based Hierarchical Classification of Benign Laryngeal Voice Disorders from Sustained Vowels
Sound
Helps doctors find voice problems using sound.
MVP: Multi-source Voice Pathology detection
Audio and Speech Processing
Helps doctors find voice sickness from talking.
The use of vocal biomarkers in the detection of Parkinson's disease: a robust statistical performance comparison of classic machine learning models
Machine Learning (CS)
Detects Parkinson's disease early using just your voice.