Score: 0

VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures

Published: March 16, 2025 | arXiv ID: 2503.12651v1

By: Yoo Yeon Sung, Hannah Kim, Dan Zhang

Potential Business Impact:

Helps people understand why AI makes mistakes.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

AI practitioners increasingly use large language model (LLM) agents in compound AI systems to solve complex reasoning tasks, these agent executions often fail to meet human standards, leading to errors that compromise the system's overall performance. Addressing these failures through human intervention is challenging due to the agents' opaque reasoning processes, misalignment with human expectations, the complexity of agent dependencies, and the high cost of manual inspection. This paper thus introduces a human-centered evaluation framework for Verifying LLM Agent failures (VeriLA), which systematically assesses agent failures to reduce human effort and make these agent failures interpretable to humans. The framework first defines clear expectations of each agent by curating human-designed agent criteria. Then, it develops a human-aligned agent verifier module, trained with human gold standards, to assess each agent's execution output. This approach enables granular evaluation of each agent's performance by revealing failures from a human standard, offering clear guidelines for revision, and reducing human cognitive load. Our case study results show that VeriLA is both interpretable and efficient in helping practitioners interact more effectively with the system. By upholding accountability in human-agent collaboration, VeriLA paves the way for more trustworthy and human-aligned compound AI systems.

Toward a Human-Centered Evaluation Framework for Trustworthy LLM-Powered GUI Agents

Human-Computer Interaction

Protects your private info from smart computer helpers.

24 Apr 2025 2

88%

Advancing AI-Scientist Understanding: Multi-Agent LLMs with Interpretable Physics Reasoning

Artificial Intelligence

AI helps scientists solve physics problems better.

2 Apr 2025 0

87%

Plan Verification for LLM-Based Embodied Task Completion Agents

Artificial Intelligence

Makes robots learn better by fixing their mistakes.

2 Sep 2025 2

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

13 pages

VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures

Helps people understand why AI makes mistakes.

Technical Abstract

Toward a Human-Centered Evaluation Framework for Trustworthy LLM-Powered GUI Agents

Advancing AI-Scientist Understanding: Multi-Agent LLMs with Interpretable Physics Reasoning

Plan Verification for LLM-Based Embodied Task Completion Agents