Risk-Averse Learning with Varying Risk Levels
By: Siyi Wang, Zifan Wang, Karl H. Johansson
In safety-critical decision-making, the environment may evolve over time, and the learner adjusts its risk level accordingly. This work investigates risk-averse online optimization in dynamic environments with varying risk levels, using Conditional Value-at-Risk (CVaR) as the risk measure. To capture the dynamics of the environment and of the risk levels, we employ the function variation metric and introduce a novel risk-level variation metric. Two information settings are considered: a first-order setting, where the learner observes both function values and their gradients; and a zeroth-order setting, where only function evaluations are available. For both settings, we develop risk-averse learning algorithms with a limited sampling budget and analyze their dynamic regret bounds in terms of function variation, risk-level variation, and the total number of samples. The regret analysis demonstrates the adaptability of the algorithms in non-stationary and risk-sensitive settings. Finally, numerical experiments illustrate the efficacy of the methods.
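To make the risk measure concrete, the following is a minimal sketch of a sample-based CVaR estimate evaluated under a changing risk level. It is not the algorithm from the paper. It assumes the convention that the risk level alpha in (0, 1] is the fraction of worst-case outcomes being averaged, so a smaller alpha is more risk-averse; the function name `empirical_cvar`, the sample values, and the risk-level schedule are hypothetical and chosen only for illustration.

```python
import math

import numpy as np


def empirical_cvar(losses, alpha):
    """Average of the worst ceil(alpha * n) sampled losses.

    Assumed convention (not taken from the paper): alpha in (0, 1] is the
    tail fraction, so alpha = 1 recovers the plain sample mean.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    samples = np.sort(np.asarray(losses, dtype=float))
    # Number of tail samples to average; at least one sample is always kept.
    k = max(1, math.ceil(alpha * samples.size))
    return float(samples[-k:].mean())


# Hypothetical loss samples and a hypothetical time-varying risk level.
losses = [1.0, 2.0, 3.0, 4.0]
risk_levels = [1.0, 0.5, 0.25]
cvar_per_round = [empirical_cvar(losses, a) for a in risk_levels]
```

As alpha shrinks, the estimate moves from the sample mean toward the largest observed loss, which is the sense in which a lower risk level is more conservative under this convention. With few samples the tail average rests on very few points, which is one reason a limited sampling budget matters in the analysis.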
Similar Papers
On Design of Representative Distributionally Robust Formulations for Evaluation of Tail Risk Measures
Risk Management
Finds the worst possible money loss safely.
Safe Navigation in Uncertain Crowded Environments Using Risk Adaptive CVaR Barrier Functions
Robotics
Robots safely avoid moving crowds by guessing risks.
Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions
Mathematical Finance
Teaches computers to trade money safely and smartly.