Score: 0

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

Published: October 14, 2025 | arXiv ID: 2510.15994v1

By: Dongsen Zhang , Zekun Li , Xu Luo and more

Potential Business Impact:

Tests if AI can use tools safely.

Business Areas:

Penetration Testing Information Technology, Privacy and Security

The Model Context Protocol (MCP) standardizes how large language model (LLM) agents discover, describe, and call external tools. While MCP unlocks broad interoperability, it also enlarges the attack surface by making tools first-class, composable objects with natural-language metadata, and standardized I/O. We present MSB (MCP Security Benchmark), the first end-to-end evaluation suite that systematically measures how well LLM agents resist MCP-specific attacks throughout the full tool-use pipeline: task planning, tool invocation, and response handling. MSB contributes: (1) a taxonomy of 12 attacks including name-collision, preference manipulation, prompt injections embedded in tool descriptions, out-of-scope parameter requests, user-impersonating responses, false-error escalation, tool-transfer, retrieval injection, and mixed attacks; (2) an evaluation harness that executes attacks by running real tools (both benign and malicious) via MCP rather than simulation; and (3) a robustness metric that quantifies the trade-off between security and performance: Net Resilient Performance (NRP). We evaluate nine popular LLM agents across 10 domains and 400+ tools, producing 2,000 attack instances. Results reveal the effectiveness of attacks against each stage of MCP. Models with stronger performance are more vulnerable to attacks due to their outstanding tool calling and instruction following capabilities. MSB provides a practical baseline for researchers and practitioners to study, compare, and harden MCP agents.

MCPSecBench: A Systematic Security Benchmark and Playground for Testing Model Context Protocols

Cryptography and Security

Finds security flaws in AI tools.

17 Aug 2025 2

92%

MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

Computation and Language

Tests AI safety with real-world tools.

17 Dec 2025 2

92%

Systematic Analysis of MCP Security

Cryptography and Security

Finds ways AI can be tricked by tools.

18 Aug 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

28 pages

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

Tests if AI can use tools safely.

Technical Abstract

MCPSecBench: A Systematic Security Benchmark and Playground for Testing Model Context Protocols

MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

Systematic Analysis of MCP Security