MultiRisk: Multiple Risk Control via Iterative Score Thresholding

Published: December 31, 2025 | arXiv ID: 2512.24587v1

By: Sunay Joshi, Yan Sun, Hamed Hassani and more

As generative AI systems are increasingly deployed in real-world applications, regulating multiple dimensions of model behavior has become essential. We focus on test-time filtering: a lightweight mechanism for behavior control that compares performance scores to estimated thresholds, and modifies outputs when these bounds are violated. We formalize the problem of enforcing multiple risk constraints with user-defined priorities, and introduce two efficient dynamic programming algorithms that leverage this sequential structure. The first, MULTIRISK-BASE, provides a direct finite-sample procedure for selecting thresholds, while the second, MULTIRISK, leverages data exchangeability to guarantee simultaneous control of the risks. Under mild assumptions, we show that MULTIRISK achieves nearly tight control of all constraint risks. The analysis requires an intricate iterative argument, upper bounding the risks by introducing several forms of intermediate symmetrized risk functions, and carefully lower bounding the risks by recursively counting jumps in symmetrized risk functions between appropriate risk levels. We evaluate our framework on a three-constraint Large Language Model alignment task using the PKU-SafeRLHF dataset, where the goal is to maximize helpfulness subject to multiple safety constraints, and where scores are generated by a Large Language Model judge and a perplexity filter. Our experimental results show that our algorithm can control each individual risk at close to the target level.
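To make the test-time filtering idea concrete, here is a minimal sketch. It is not the paper's MULTIRISK or MULTIRISK-BASE procedure. It assumes that a higher score means a riskier output, that each loss lies in [0, loss_bound], and that a filtered output incurs zero loss. The function names `calibrate_threshold` and `apply_filters` are hypothetical. The calibration rule is the single-constraint finite-sample correction used in conformal risk control, swapped in for illustration; the abstract does not state how the paper calibrates its thresholds.

```python
from typing import Any, Optional, Sequence, Tuple


def calibrate_threshold(
    scores: Sequence[float],
    losses: Sequence[float],
    alpha: float,
    loss_bound: float = 1.0,
) -> float:
    """Pick the largest calibration score lam such that
    (sum of losses with score <= lam + loss_bound) / (n + 1) <= alpha.

    Assumption: outputs with score > lam are filtered and incur zero loss,
    so the risk is nondecreasing in lam. Returns -inf if no candidate works.
    """
    n = len(scores)
    pairs = sorted(zip(scores, losses))
    best = float("-inf")
    running = 0.0
    i = 0
    while i < n:
        j = i
        # Accumulate all losses that share the same score value.
        while j < n and pairs[j][0] == pairs[i][0]:
            running += pairs[j][1]
            j += 1
        if (running + loss_bound) / (n + 1) > alpha:
            break  # losses are nonnegative, so larger thresholds also fail
        best = pairs[i][0]
        i = j
    return best


def apply_filters(
    output: Any,
    scores: Sequence[float],
    thresholds: Sequence[float],
    fallback: Any,
) -> Tuple[Any, Optional[int]]:
    """Check constraints in priority order; replace the output with
    `fallback` at the first score that exceeds its threshold.

    Returns the (possibly replaced) output and the index of the violated
    constraint, or None if every score is within bounds.
    """
    for k, (s, lam) in enumerate(zip(scores, thresholds)):
        if s > lam:
            return fallback, k
    return output, None


if __name__ == "__main__":
    lam = calibrate_threshold([0.1, 0.2, 0.3, 0.4], [0, 0, 1, 1], alpha=0.5)
    print(lam)
    print(apply_filters("response", [0.1, 0.9], [lam, 0.5], "refusal"))
```

This sketch calibrates each threshold in isolation. The abstract describes a sequential procedure in which constraints are handled in a user-defined priority order, so thresholds chosen there would generally depend on one another.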

Joint Score-Threshold Optimization for Interpretable Risk Assessment Under Partial Supervision

Machine Learning (CS)

Improves doctor's risk scores for patients.

24 Oct 2025 1

88%

Risk-Aware Financial Forecasting Enhanced by Machine Learning and Intuitionistic Fuzzy Multi-Criteria Decision-Making

Statistical Finance

Helps predict money changes better, even with risks.

11 Dec 2025 0

87%

Risk-averse Fair Multi-class Classification

Machine Learning (Stat)

Helps computers learn from messy, incomplete data.

6 Sep 2025 0
