Score: 1

Feasibility-Guided Fair Adaptive Offline Reinforcement Learning for Medicaid Care Management

Published: September 11, 2025 | arXiv ID: 2509.09655v1

By: Sanjay Basu , Sadiq Y. Patel , Parth Sheth and more

Potential Business Impact:

Makes AI fairer and safer for everyone.

Business Areas:
Assisted Living Health Care

We introduce Feasibility-Guided Fair Adaptive Reinforcement Learning (FG-FARL), an offline RL procedure that calibrates per-group safety thresholds to reduce harm while equalizing a chosen fairness target (coverage or harm) across protected subgroups. Using de-identified longitudinal trajectories from a Medicaid population health management program, we evaluate FG-FARL against behavior cloning (BC) and HACO (Hybrid Adaptive Conformal Offline RL; a global conformal safety baseline). We report off-policy value estimates with bootstrap 95% confidence intervals and subgroup disparity analyses with p-values. FG-FARL achieves comparable value to baselines while improving fairness metrics, demonstrating a practical path to safer and more equitable decision support.

Repos / Data Links

Page Count
13 pages

Category
Computer Science:
Machine Learning (CS)