
Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors

Published: December 15, 2025 | arXiv ID: 2512.13860v1

By: Henger Li, Shuangjie You, Flavio Di Palo and more

BigTech Affiliations: Amazon

Potential Business Impact:

Helps AI understand and use tools better.

Business Areas:

Semantic Search, Internet Services

Tool calling enables large language models (LLMs) to interact with external environments through tool invocation, providing a practical way to overcome the limitations of pretraining. However, the effectiveness of tool use depends heavily on the quality of the associated documentation and knowledge base context. These materials are usually written for human users and are often misaligned with how LLMs interpret information. This problem is even more pronounced in industrial settings, where hundreds of tools with overlapping functionality create challenges in scalability, variability, and ambiguity. We propose Verification-Guided Context Optimization (VGCO), a framework that uses LLMs as editors to automatically refine tool-related documentation and knowledge base context. VGCO works in two stages. First, Evaluation collects real-world failure cases and identifies mismatches between tools and their context. Second, Optimization performs hierarchical editing through offline learning with structure-aware, in-context optimization. The novelty of our LLM editors has three main aspects. First, they use a hierarchical structure that naturally integrates into the tool-calling workflow. Second, they are state-aware, action-specific, and verification-guided, which constrains the search space and enables efficient, targeted improvements. Third, they enable cost-efficient sub-task specialization, either by prompt engineering large editor models or by post-training smaller editor models. Unlike prior work that emphasizes multi-turn reasoning, VGCO focuses on the single-turn, large-scale tool-calling problem and achieves significant improvements in accuracy, robustness, and generalization across LLMs.
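The two-stage loop described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: `select_tool` is a toy word-overlap selector standing in for an LLM tool caller, `propose_edits` is a toy rule standing in for an LLM editor, and the names `Case`, `evaluate`, and `optimize` are hypothetical. The hierarchical editors and post-trained editor models are not modeled; the sketch only shows the idea of accepting an edit to tool documentation when verification against the collected cases shows fewer failures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Case:
    """A single-turn tool-calling example (hypothetical structure)."""
    query: str
    expected_tool: str


def select_tool(query, docs):
    # Toy stand-in for an LLM tool caller: pick the tool whose
    # description shares the most words with the query.
    words = set(query.lower().split())

    def overlap(name):
        return len(words & set(docs[name].lower().split()))

    # Ties go to the alphabetically first tool name.
    return max(sorted(docs), key=overlap)


def evaluate(cases, docs):
    # Stage 1 (Evaluation): collect the cases the selector gets wrong.
    return [c for c in cases if select_tool(c.query, docs) != c.expected_tool]


def propose_edits(failure, docs):
    # Toy stand-in for an LLM editor: for each longer query word missing
    # from the expected tool's description, propose appending that word.
    current = docs[failure.expected_tool]
    for word in failure.query.lower().split():
        if len(word) > 3 and word not in current.lower().split():
            edited = dict(docs)  # copy, so rejected edits leave docs intact
            edited[failure.expected_tool] = current + " " + word
            yield edited


def first_verified_edit(cases, docs, failures):
    # Verification: keep a candidate only if it strictly reduces failures.
    for failure in failures:
        for candidate in propose_edits(failure, docs):
            if len(evaluate(cases, candidate)) < len(failures):
                return candidate
    return None


def optimize(cases, docs, max_rounds=5):
    # Stage 2 (Optimization): repeat verified edits until nothing improves.
    for _ in range(max_rounds):
        failures = evaluate(cases, docs)
        if not failures:
            break
        candidate = first_verified_edit(cases, docs, failures)
        if candidate is None:
            break
        docs = candidate
    return docs


docs = {
    "get_weather": "Returns meteorological conditions for a city",
    "get_time": "Returns current clock reading for a city",
}
cases = [
    Case("what is the weather in paris", "get_weather"),
    Case("what time is it in tokyo", "get_time"),
]
optimized = optimize(cases, docs)
```

In this toy run, the first candidate edit (appending "what" to the `get_weather` description) fixes the weather query but breaks the time query, so verification rejects it; appending "weather" fixes the failure without side effects and is accepted. A real system would replace both stand-ins with model calls and would need a held-out set to check that accepted edits generalize.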

Hierarchical Contextual Grounding LVLM: Enhancing Fine-Grained Visual-Language Understanding with Robust Grounding

CV and Pattern Recognition

Helps computers understand pictures better and more accurately.

23 Aug 2025 1

88%

CoCoLex: Confidence-guided Copy-based Decoding for Grounded Legal Text Generation

Computation and Language

Makes AI write legal papers that are true.

7 Aug 2025 0

87%

ContextGuard-LVLM: Enhancing News Veracity through Fine-grained Cross-modal Contextual Consistency Verification

CV and Pattern Recognition

Finds fake news by checking pictures and words match.

8 Aug 2025 1


Country of Origin

🇺🇸 United States

Page Count

17 pages
