Oversampling techniques for predicting COVID-19 patient length of stay
By: Zachariah Farahany , Jiawei Wu , K M Sajjadul Islam and more
Potential Business Impact:
Predicts how sick COVID-19 patients will get.
COVID-19 is a respiratory disease that caused a global pandemic in 2019. It is highly infectious and has the following symptoms: fever or chills, cough, shortness of breath, fatigue, muscle or body aches, headache, the new loss of taste or smell, sore throat, congestion or runny nose, nausea or vomiting, and diarrhea. These symptoms vary in severity; some people with many risk factors have been known to have lengthy hospital stays or die from the disease. In this paper, we analyze patients' electronic health records (EHR) to predict the severity of their COVID-19 infection using the length of stay (LOS) as our measurement of severity. This is an imbalanced classification problem, as many people have a shorter LOS rather than a longer one. To combat this problem, we synthetically create alternate oversampled training data sets. Once we have this oversampled data, we run it through an Artificial Neural Network (ANN), which during training has its hyperparameters tuned using Bayesian optimization. We select the model with the best F1 score and then evaluate it and discuss it.
Similar Papers
Machine Learning and Statistical Insights into Hospital Stay Durations: The Italian EHR Case
Machine Learning (CS)
Predicts how long patients stay in hospitals.
A Hybrid Data-Driven Approach For Analyzing And Predicting Inpatient Length Of Stay In Health Centre
Machine Learning (CS)
Helps hospitals get patients out faster.
Statistical and Predictive Analysis to Identify Risk Factors and Effects of Post COVID-19 Syndrome
Machine Learning (CS)
Predicts who gets long COVID and how bad.