Bias or Optimality? Disentangling Bayesian Inference and Learning Biases in Human Decision-Making
By: Prakhar Godara
Potential Business Impact:
Shows how brains learn from choices, not just bias.
Recent studies claim that human behavior in a two-armed Bernoulli bandit (TABB) task is described by positivity and confirmation biases, implying that humans do not integrate new information objectively. However, we find that even if the agent updates its belief via objective Bayesian inference, fitting the standard Q-learning model with asymmetric learning rates still recovers both biases. Bayesian inference cast as an effective Q-learning algorithm has symmetric, though decreasing, learning rates. We explain this by analyzing the stochastic dynamics of these learning systems using master equations. We find that both confirmation bias and unbiased but decreasing learning rates yield the same behavioral signatures. Finally, we propose experimental protocols to disentangle true cognitive biases from artifacts of decreasing learning rates.
Similar Papers
Optimal Information Combining for Multi-Agent Systems Using Adaptive Bias Learning
Machine Learning (CS)
Fixes computer mistakes caused by changing conditions.
Bayesian Decision Making around Experts
Machine Learning (CS)
Helps robots learn faster from experts.
On Mitigating Affinity Bias through Bandits with Evolving Biased Feedback
Machine Learning (Stat)
Reduces unfairness when hiring by spotting hidden favoritism.