Score: 0

Hybrid(Penalized Regression and MLP) Models for Outcome Prediction in HDLSS Health Data

Published: December 2, 2025 | arXiv ID: 2512.02489v1

By: Mithra D K

Potential Business Impact:

Finds diabetes risk using health survey data.

Business Areas:
A/B Testing Data and Analytics

I present an application of established machine learning techniques to NHANES health survey data for predicting diabetes status. I compare baseline models (logistic regression, random forest, XGBoost) with a hybrid approach that uses an XGBoost feature encoder and a lightweight multilayer perceptron (MLP) head. Experiments show the hybrid model attains improved AUC and balanced accuracy compared to baselines on the processed NHANES subset. I release code and reproducible scripts to encourage replication.

Country of Origin
🇮🇳 India

Page Count
5 pages

Category
Computer Science:
Machine Learning (CS)