Detecting COPD Through Speech Analysis: A Dataset of Danish Speech and Machine Learning Approach
By: Cuno Sankey-Olsen , Rasmus Hvass Olesen , Tobias Oliver Eberhard and more
Potential Business Impact:
Spots lung disease early from voice
Chronic Obstructive Pulmonary Disease (COPD) is a serious and debilitating disease affecting millions around the world. Its early detection using non-invasive means could enable preventive interventions that improve quality of life and patient outcomes, with speech recently shown to be a valuable biomarker. Yet, its validity across different linguistic groups remains to be seen. To that end, audio data were collected from 96 Danish participants conducting three speech tasks (reading, coughing, sustained vowels). Half of the participants were diagnosed with different levels of COPD and the other half formed a healthy control group. Subsequently, we investigated different baseline models using openSMILE features and learnt x-vector embeddings. We obtained a best accuracy of 67% using openSMILE features and logistic regression. Our findings support the potential of speech-based analysis as a non-invasive, remote, and scalable screening tool as part of future COPD healthcare solutions.
Similar Papers
Enhancing Lung Disease Diagnosis via Semi-Supervised Machine Learning
Audio and Speech Processing
Listens to coughs to find lung sickness.
Severity Classification of Chronic Obstructive Pulmonary Disease in Intensive Care Units: A Semi-Supervised Approach Using MIMIC-III Dataset
Machine Learning (CS)
Helps doctors quickly tell how sick lung patients are.
Interpretable Early Detection of Parkinson's Disease through Speech Analysis
Machine Learning (CS)
Detects Parkinson's early using voice.