Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems
By: Zherui Li , Yan Mi , Zhenhong Zhou and more
Potential Business Impact:
Stops fake news from fooling AI teams.
Large Language Model-based Multi-Agent Systems (MASs) have demonstrated strong advantages in addressing complex real-world tasks. However, due to the introduction of additional attack surfaces, MASs are particularly vulnerable to misinformation injection. To facilitate a deeper understanding of misinformation propagation dynamics within these systems, we introduce MisinfoTask, a novel dataset featuring complex, realistic tasks designed to evaluate MAS robustness against such threats. Building upon this, we propose ARGUS, a two-stage, training-free defense framework leveraging goal-aware reasoning for precise misinformation rectification within information flows. Our experiments demonstrate that in challenging misinformation scenarios, ARGUS exhibits significant efficacy across various injection attacks, achieving an average reduction in misinformation toxicity of approximately 28.17% and improving task success rates under attack by approximately 10.33%. Our code and dataset is available at: https://github.com/zhrli324/ARGUS.
Similar Papers
MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning
Artificial Intelligence
Finds fake news in pictures and words.
Argus: A Multi-Agent Sensitive Information Leakage Detection Framework Based on Hierarchical Reference Relationships
Cryptography and Security
Finds secret codes hidden in computer programs.
Aegis: Automated Error Generation and Identification for Multi-Agent Systems
Robotics
Finds mistakes in smart robot teams.