DiVA: Fine-grained Factuality Verification with Agentic-Discriminative Verifier
By: Hui Huang, Muyun Yang, Yuki Arase
Potential Business Impact:
Helps AI tell true from false better.
Despite the significant advancements of Large Language Models (LLMs), their factuality remains a critical challenge, fueling growing interest in factuality verification. Existing research on factuality verification primarily makes binary judgments (e.g., correct or incorrect), which fail to distinguish varying degrees of error severity. This limits its utility for applications such as fine-grained evaluation and preference optimization. To bridge this gap, we propose the Agentic Discriminative Verifier (DiVA), a hybrid framework that combines the agentic search capabilities of generative models with the precise scoring aptitude of discriminative models. We also construct a new benchmark, FGVeriBench, as a robust testbed for fine-grained factuality verification. Experimental results on FGVeriBench demonstrate that DiVA significantly outperforms existing methods on factuality verification for both general and multi-hop questions.
Similar Papers
FineDialFact: A benchmark for Fine-grained Dialogue Fact Verification
Computation and Language
Helps computers tell if their answers are true.
FVA-RAG: Falsification-Verification Alignment for Mitigating Sycophantic Hallucinations
Computation and Language
Stops AI from believing fake news.
Agentic Verification for Ambiguous Query Disambiguation
Computation and Language
Helps computers give better answers from many sources.