AgentGuardian: Learning Access Control Policies to Govern AI Agent Behavior
By: Nadya Abaev, Denis Klimov, Gerard Levinov, and more
Potential Business Impact:
Guards AI agents so they cannot take harmful or mistaken actions.
Artificial intelligence (AI) agents are increasingly used across domains to automate tasks, interact with users, and make decisions based on data inputs. Ensuring that AI agents perform only authorized actions and handle inputs appropriately is essential for maintaining system integrity and preventing misuse. In this study, we introduce AgentGuardian, a novel security framework that governs and protects AI agent operations by enforcing context-aware access-control policies. During a controlled staging phase, the framework monitors execution traces to learn legitimate agent behaviors and input patterns. From this phase, it derives adaptive policies that regulate the tool calls made by the agent, guided by both real-time input context and the control-flow dependencies of multi-step agent actions. Evaluation on two real-world AI agent applications demonstrates that AgentGuardian effectively detects malicious or misleading inputs while preserving normal agent functionality. Moreover, its control-flow-based governance mechanism mitigates hallucination-driven errors and other orchestration-level malfunctions.
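The paper's implementation is not reproduced here, but the mechanism the abstract describes, learning which tool calls are legitimate at each control-flow step during staging and denying everything else at runtime, can be sketched in a few lines of Python. All names below (ToolCall, PolicyEngine, the example tool names) are illustrative assumptions, not identifiers from AgentGuardian.

from __future__ import annotations
from dataclasses import dataclass

@dataclass(frozen=True)
class ToolCall:
    tool: str          # name of the tool being invoked, e.g. "send_email"
    arg_pattern: str   # coarse label for the input context, e.g. "internal_address"

class PolicyEngine:
    """Learns allowed tool calls per control-flow position from staging traces,
    then enforces them at runtime. A hypothetical stand-in, not the paper's code."""

    def __init__(self) -> None:
        # Maps the previous tool in the agent's control flow (None at the start)
        # to the set of tool calls observed to legitimately follow it.
        self.allowed: dict[str | None, set[ToolCall]] = {}

    def learn(self, traces: list[list[ToolCall]]) -> None:
        # Staging phase: record which call follows which across benign traces.
        for trace in traces:
            prev: str | None = None
            for call in trace:
                self.allowed.setdefault(prev, set()).add(call)
                prev = call.tool

    def check(self, prev_tool: str | None, call: ToolCall) -> bool:
        # Runtime phase: permit a call only if it was observed in this
        # control-flow context during staging; deny by default otherwise.
        return call in self.allowed.get(prev_tool, set())

# Usage: learn from one benign staging trace, then gate runtime tool calls.
engine = PolicyEngine()
engine.learn([[ToolCall("read_ticket", "ticket_id"),
               ToolCall("send_email", "internal_address")]])

print(engine.check(None, ToolCall("read_ticket", "ticket_id")))    # True: seen in staging
print(engine.check("read_ticket", ToolCall("delete_db", "any")))   # False: never observed

A real system in this vein would presumably learn richer policies, such as argument constraints and deeper control-flow histories, rather than the single-predecessor pairs used in this sketch.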
Similar Papers
Towards Automating Data Access Permissions in AI Agents
Cryptography and Security
Lets AI ask permission before acting.
Securing AI Agents: Implementing Role-Based Access Control for Industrial Applications
Artificial Intelligence
Keeps AI agents safe from hackers.
Secure and Efficient Access Control for Computer-Use Agents via Context Space
Cryptography and Security
Keeps AI from messing up your computer.