
Aegis: Automated Error Generation and Identification for Multi-Agent Systems

Published: September 17, 2025 | arXiv ID: 2509.14295v3

By: Fanqi Kong, Ruijie Zhang, Huaxiao Yin and more

Potential Business Impact:

Finds mistakes in smart robot teams.

Business Areas:

Intelligent Systems Artificial Intelligence, Data and Analytics, Science and Engineering

As Multi-Agent Systems (MAS) become increasingly autonomous and complex, understanding their error modes is critical for ensuring their reliability and safety. However, research in this area has been severely hampered by the lack of large-scale, diverse datasets with precise, ground-truth error labels. To address this bottleneck, we introduce AEGIS, a novel framework for Automated Error Generation and Identification for Multi-Agent Systems. By systematically injecting controllable and traceable errors into initially successful trajectories, we create a rich dataset of realistic failures. This is achieved using a context-aware, LLM-based adaptive manipulator that performs sophisticated attacks like prompt injection and response corruption to induce specific, predefined error modes. We demonstrate the value of our dataset by exploring three distinct learning paradigms for the error identification task: Supervised Fine-Tuning, Reinforcement Learning, and Contrastive Learning. Our comprehensive experiments show that models trained on AEGIS data achieve substantial improvements across all three learning paradigms. Notably, several of our fine-tuned models demonstrate performance competitive with or superior to proprietary systems an order of magnitude larger, validating our automated data generation framework as a crucial resource for developing more robust and interpretable multi-agent systems. Our project website is available at https://kfq20.github.io/AEGIS-Website.
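The idea of injecting a traceable error into an initially successful trajectory can be illustrated with a minimal sketch. This is not the AEGIS implementation: the names `Step`, `ErrorLabel`, and `inject_error` are hypothetical, and a plain Python function stands in for the LLM-based adaptive manipulator described in the abstract. The sketch only shows how a corrupted trajectory can be paired with a ground-truth label recording which agent and step were altered and which error mode was intended.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Step:
    """One message in a multi-agent trajectory (hypothetical structure)."""
    agent: str
    content: str


@dataclass(frozen=True)
class ErrorLabel:
    """Ground-truth record of where and how an error was injected."""
    step_index: int
    agent: str
    error_mode: str


def inject_error(
    trajectory: List[Step],
    step_index: int,
    error_mode: str,
    corrupt: Callable[[str], str],
) -> Tuple[List[Step], ErrorLabel]:
    """Return a corrupted copy of the trajectory plus its error label.

    `corrupt` is a stand-in for the manipulator; in this sketch it is
    any function that rewrites a step's content.
    """
    if not 0 <= step_index < len(trajectory):
        raise IndexError("step_index is outside the trajectory")
    target = trajectory[step_index]
    corrupted = replace(target, content=corrupt(target.content))
    # Build a new list so the original successful trajectory is untouched.
    faulty = trajectory[:step_index] + [corrupted] + trajectory[step_index + 1:]
    return faulty, ErrorLabel(step_index, target.agent, error_mode)


# Usage: corrupt the solver's response in a two-step toy trajectory.
trajectory = [
    Step("planner", "Compute 2 + 2"),
    Step("solver", "The answer is 4"),
]
faulty, label = inject_error(
    trajectory,
    step_index=1,
    error_mode="response_corruption",  # assumed label string
    corrupt=lambda text: text.replace("4", "5"),
)
print(faulty[1].content)
print(label)
```

Because the label is produced at injection time, each faulty trajectory carries the responsible agent and error mode without manual annotation, which is the property the abstract describes as controllable and traceable.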

AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems

Artificial Intelligence

Helps self-driving cars see what's important.

8 Apr 2025 1

88%

AEGIS: Automated Co-Evolutionary Framework for Guarding Prompt Injections Schema

Cryptography and Security

Teaches AI to block bad instructions.

27 Aug 2025 0

88%

AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences

CV and Pattern Recognition

Finds fake videos made by computers.

14 Aug 2025 2


Page Count

31 pages
