A Multi-Component Reward Function with Policy Gradient for Automated Feature Selection with Dynamic Regularization and Bias Mitigation

Published: October 9, 2025 | arXiv ID: 2510.09705v1

By: Sudip Khadka, L. S. Paudel

Potential Business Impact:

Helps make AI models fairer by automatically selecting input features that balance accuracy and equity.

Business Areas:
Machine Learning, Artificial Intelligence, Data and Analytics, Software

Static feature exclusion strategies often fail to prevent bias when hidden dependencies influence model predictions. To address this issue, we explore a reinforcement learning (RL) framework that integrates bias mitigation and automated feature selection within a single learning process. Unlike traditional heuristic-driven filter or wrapper approaches, our RL agent adaptively selects features using a reward signal that explicitly combines predictive performance with fairness considerations. This dynamic formulation allows the model to balance generalization, accuracy, and equity throughout the training process, rather than relying exclusively on pre-processing adjustments or post hoc correction mechanisms. In this paper, we describe the construction of a multi-component reward function, the specification of the agent's action space over feature subsets, and the integration of this system with ensemble learning. We aim to provide a flexible and generalizable way to select features in environments where predictors are correlated and biases can inadvertently re-emerge.
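To make the core idea concrete, the following is a minimal sketch (not the paper's implementation) of a policy-gradient feature selector with a multi-component reward. It uses REINFORCE over independent per-feature inclusion probabilities, and a reward that combines classifier accuracy with a demographic-parity gap as a fairness penalty. The synthetic data, the nearest-centroid classifier, and the hyperparameters (`lam`, `lr`, step count) are all illustrative assumptions, not details from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (illustrative): 3 informative features, 2 noise features,
# and 1 "proxy" feature correlated with a sensitive attribute.
n = 400
sensitive = rng.integers(0, 2, n)                   # protected group label
X = rng.normal(size=(n, 6))
y = (X[:, 0] + X[:, 1] - X[:, 2] > 0).astype(int)   # label from informative features
X[:, 5] = sensitive + 0.1 * rng.normal(size=n)      # biased proxy feature

def evaluate(mask):
    """Accuracy of a simple nearest-centroid classifier on the selected
    features, plus a demographic-parity gap used as a fairness penalty."""
    if mask.sum() == 0:
        return 0.0, 1.0                              # empty subset: worst case
    Xs = X[:, mask.astype(bool)]
    mu0, mu1 = Xs[y == 0].mean(0), Xs[y == 1].mean(0)
    pred = (np.linalg.norm(Xs - mu1, axis=1)
            < np.linalg.norm(Xs - mu0, axis=1)).astype(int)
    acc = (pred == y).mean()
    gap = abs(pred[sensitive == 0].mean() - pred[sensitive == 1].mean())
    return acc, gap

# REINFORCE over independent Bernoulli inclusion probabilities per feature.
theta = np.zeros(6)          # logits; sigmoid(theta) = P(include feature)
lam, lr = 2.0, 0.5           # fairness weight and learning rate (assumed values)
baseline = 0.0               # moving-average baseline to reduce gradient variance
for step in range(300):
    p = 1 / (1 + np.exp(-theta))
    mask = (rng.random(6) < p).astype(float)         # sample a feature subset
    acc, gap = evaluate(mask)
    reward = acc - lam * gap                         # multi-component reward
    baseline = 0.9 * baseline + 0.1 * reward
    grad = (mask - p) * (reward - baseline)          # score-function gradient
    theta += lr * grad

probs = 1 / (1 + np.exp(-theta))
print(np.round(probs, 2))    # learned inclusion probability per feature
```

The key design choice the abstract describes is visible in the single line computing `reward`: accuracy and bias are traded off inside the learning loop itself, so the agent can drop a feature like the proxy column whenever its fairness cost outweighs its predictive value, rather than relying on a fixed pre-processing exclusion list.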

Country of Origin
🇺🇸 United States

Page Count
14 pages

Category
Computer Science:
Machine Learning (CS)