Risk-aware Markov Decision Processes Using Cumulative Prospect Theory
By: Thomas Brihaye , Krishnendu Chatterjee , Stefanie Mohr and more
Potential Business Impact:
Helps computers make better choices over time.
Cumulative prospect theory (CPT) is the first theory for decision-making under uncertainty that combines full theoretical soundness and empirically realistic features [P.P. Wakker - Prospect theory: For risk and ambiguity, Page 2]. While CPT was originally considered in one-shot settings for risk-aware decision-making, we consider CPT in sequential decision-making. The most fundamental and well-studied models for sequential decision-making are Markov chains (MCs), and their generalization Markov decision processes (MDPs). The complexity theoretic study of MCs and MDPs with CPT is a fundamental problem that has not been addressed in the literature. Our contributions are as follows: First, we present an alternative viewpoint for the CPT-value of MCs and MDPs. This allows us to establish a connection with multi-objective reachability analysis and conclude the strategy complexity result that memoryless randomized strategies are necessary and sufficient for optimality. Second, based on this connection, we provide an algorithm for computing the CPT-value in MDPs with infinite-horizon objectives. We show that the problem is in EXPTIME and fixed-parameter tractable. Moreover, we provide a polynomial-time algorithm for the special case of MCs.
Similar Papers
Goal-Oriented Semantic Resource Allocation with Cumulative Prospect Theoretic Agents
Information Theory
Helps computers make smarter choices when things are uncertain.
Optimization for Semantic-Aware Resource Allocation under CPT-based Utilities
Information Theory
Helps computers share information smarter, considering risks.
Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty
Artificial Intelligence
Makes AI think like people when unsure.