Modelling bounded rational decision-making through Wasserstein constraints
By: Benjamin Patrick Evans, Leo Ardon, Sumitra Ganesh
Potential Business Impact:
Helps robots learn to make better choices.
Modelling bounded rational decision-making through information constrained processing provides a principled approach for representing departures from rationality within a reinforcement learning framework, while still treating decision-making as an optimization process. However, existing approaches are generally based on Entropy, Kullback-Leibler divergence, or Mutual Information. In this work, we highlight issues with these approaches when dealing with ordinal action spaces. Specifically, entropy assumes uniform prior beliefs, missing the impact of a priori biases on decision-makings. KL-Divergence addresses this, however, has no notion of "nearness" of actions, and additionally, has several well known potentially undesirable properties such as the lack of symmetry, and furthermore, requires the distributions to have the same support (e.g. positive probability for all actions). Mutual information is often difficult to estimate. Here, we propose an alternative approach for modeling bounded rational RL agents utilising Wasserstein distances. This approach overcomes the aforementioned issues. Crucially, this approach accounts for the nearness of ordinal actions, modeling "stickiness" in agent decisions and unlikeliness of rapidly switching to far away actions, while also supporting low probability actions, zero-support prior distributions, and is simple to calculate directly.
Similar Papers
Quantization-based Bounds on the Wasserstein Metric
Machine Learning (CS)
Makes computer math faster for pictures.
Unregularized limit of stochastic gradient method for Wasserstein distributionally robust optimization
Optimization and Control
Makes computer learning better with uncertain information.
Bayesian Distributionally Robust Merton Problem with Nonlinear Wasserstein Projections
Optimization and Control
Helps investors make smarter money choices with less risk.