Score: 1

AgentSentinel: An End-to-End and Real-Time Security Defense Framework for Computer-Use Agents

Published: September 9, 2025 | arXiv ID: 2509.07764v1

By: Haitao Hu , Peng Chen , Yanpeng Zhao and more

Potential Business Impact:

Stops smart computer helpers from doing bad things.

Business Areas:

Network Security Information Technology, Privacy and Security

Large Language Models (LLMs) have been increasingly integrated into computer-use agents, which can autonomously operate tools on a user's computer to accomplish complex tasks. However, due to the inherently unstable and unpredictable nature of LLM outputs, they may issue unintended tool commands or incorrect inputs, leading to potentially harmful operations. Unlike traditional security risks stemming from insecure user prompts, tool execution results from LLM-driven decisions introduce new and unique security challenges. These vulnerabilities span across all components of a computer-use agent. To mitigate these risks, we propose AgentSentinel, an end-to-end, real-time defense framework designed to mitigate potential security threats on a user's computer. AgentSentinel intercepts all sensitive operations within agent-related services and halts execution until a comprehensive security audit is completed. Our security auditing mechanism introduces a novel inspection process that correlates the current task context with system traces generated during task execution. To thoroughly evaluate AgentSentinel, we present BadComputerUse, a benchmark consisting of 60 diverse attack scenarios across six attack categories. The benchmark demonstrates a 87% average attack success rate on four state-of-the-art LLMs. Our evaluation shows that AgentSentinel achieves an average defense success rate of 79.6%, significantly outperforming all baseline defenses.

Sentinel Agents for Secure and Trustworthy Agentic AI in Multi-Agent Systems

Artificial Intelligence

Protects smart systems from bad actors.

18 Sep 2025 0

91%

SENTINEL: A Multi-Level Formal Framework for Safety Evaluation of LLM-based Embodied Agents

Artificial Intelligence

Keeps robots from doing dangerous things.

14 Oct 2025 1

89%

SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection

Cryptography and Security

Protects AI teams from bad actors.

17 Oct 2025 1

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Repos / Data Links

github.com

Page Count

15 pages

AgentSentinel: An End-to-End and Real-Time Security Defense Framework for Computer-Use Agents

Stops smart computer helpers from doing bad things.

Technical Abstract

Sentinel Agents for Secure and Trustworthy Agentic AI in Multi-Agent Systems

SENTINEL: A Multi-Level Formal Framework for Safety Evaluation of LLM-based Embodied Agents

SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection