Score: 2

Distilling Feedback into Memory-as-a-Tool

Published: January 9, 2026 | arXiv ID: 2601.05960v1

By: Víctor Gallego

Potential Business Impact:

Makes AI learn faster and cheaper.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

We propose a framework that amortizes the cost of inference-time reasoning by converting transient critiques into retrievable guidelines, through a file-based memory system and agent-controlled tool calls. We evaluate this method on the Rubric Feedback Bench, a novel dataset for rubric-based learning. Experiments demonstrate that our augmented LLMs rapidly match the performance of test-time refinement pipelines while drastically reducing inference cost.


Page Count
15 pages

Category
Computer Science:
Computation and Language