Uncertainty-Aware Decoding with Minimum Bayes Risk
By: Nico Daheim, Clara Meister, Thomas Möllenhoff, and more
Potential Business Impact:
Helps AI write more reliably and know when to abstain from answering.
Despite their outstanding performance in most scenarios, contemporary language models still occasionally generate undesirable outputs, for example, hallucinated text. While such behaviors have previously been linked to uncertainty, there is a notable lack of methods that actively consider uncertainty during text generation. In this work, we show how Minimum Bayes Risk (MBR) decoding, which selects model generations according to an expected risk, can be generalized into a principled uncertainty-aware decoding method. In short, we account for model uncertainty during decoding by incorporating a posterior over model parameters into MBR's computation of expected risk. We show that this modified expected risk is useful both for choosing outputs and for deciding when to abstain from generation, and that it can provide improvements without incurring additional overhead. We benchmark different methods for learning posteriors and show that performance improves with prediction diversity. We release our code publicly.
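The sketch below illustrates the general idea described in the abstract, not the authors' released implementation: expected utility is estimated by averaging over pseudo-references drawn from several models sampled from a posterior over parameters, and the decoder abstains when the best expected utility is low. The `posterior_models` list, its hypothetical `sample(prompt)` method, the `utility` callable, and the `abstain_threshold` parameter are all illustrative assumptions.

```python
# A minimal sketch of uncertainty-aware MBR decoding (assumptions noted above;
# this is not the paper's official code).
from typing import Callable, List, Optional

def uncertainty_aware_mbr(
    prompt: str,
    posterior_models: List,             # models sampled from a posterior over parameters (assumed interface)
    utility: Callable[[str, str], float],  # pairwise utility, e.g. chrF or BERTScore (stubbed here)
    num_samples_per_model: int = 8,
    abstain_threshold: Optional[float] = None,
) -> Optional[str]:
    """Pick the candidate with highest expected utility, averaging over
    pseudo-references from all posterior samples; optionally abstain."""
    # Draw pseudo-references from each posterior model, so the Monte Carlo
    # estimate of expected risk also averages over parameter uncertainty.
    references = [
        model.sample(prompt)
        for model in posterior_models
        for _ in range(num_samples_per_model)
    ]
    candidates = list(references)  # reuse the same samples as the candidate pool

    best_candidate, best_score = None, float("-inf")
    for cand in candidates:
        # Expected utility of this candidate under outputs from all posterior models.
        score = sum(utility(cand, ref) for ref in references) / len(references)
        if score > best_score:
            best_candidate, best_score = cand, score

    # A low best expected utility signals high risk: abstain instead of answering.
    if abstain_threshold is not None and best_score < abstain_threshold:
        return None
    return best_candidate
```

Because candidates and pseudo-references are pooled across posterior samples, the same quantity used to rank outputs doubles as an abstention signal, which mirrors the abstract's point that the modified expected risk supports both output selection and deciding when not to generate.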
Similar Papers
Agreement-Constrained Probabilistic Minimum Bayes Risk Decoding
Computation and Language
Improves machine translation quality and speed.
Structure-Conditional Minimum Bayes Risk Decoding
Computation and Language
Improves AI-generated text by conditioning on underlying structure.
Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition
Computation and Language
Improves the accuracy of speech-to-text systems.