Score: 0

Comparative Analysis of Stroke Prediction Models Using Machine Learning

Published: May 14, 2025 | arXiv ID: 2505.09812v1

By: Anastasija Tashkova , Stefan Eftimov , Bojan Ristov and more

Potential Business Impact:

Helps doctors guess who might get a stroke.

Business Areas:
Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Stroke remains one of the most critical global health challenges, ranking as the second leading cause of death and the third leading cause of disability worldwide. This study explores the effectiveness of machine learning algorithms in predicting stroke risk using demographic, clinical, and lifestyle data from the Stroke Prediction Dataset. By addressing key methodological challenges such as class imbalance and missing data, we evaluated the performance of multiple models, including Logistic Regression, Random Forest, and XGBoost. Our results demonstrate that while these models achieve high accuracy, sensitivity remains a limiting factor for real-world clinical applications. In addition, we identify the most influential predictive features and propose strategies to improve machine learning-based stroke prediction. These findings contribute to the development of more reliable and interpretable models for the early assessment of stroke risk.

Country of Origin
🇲🇰 North Macedonia

Page Count
4 pages

Category
Computer Science:
Machine Learning (CS)