MCP Safety Audit: LLMs with the Model Context Protocol Allow Major Security Exploits
By: Brandon Radosevich, John Halloran
Potential Business Impact:
Finds and fixes AI security vulnerabilities before they can be exploited.
To reduce development overhead and enable seamless integration between the components of a generative AI application, the Model Context Protocol (MCP) (Anthropic, 2024) was recently released and has since been widely adopted. The MCP is an open protocol that standardizes API calls to large language models (LLMs), data sources, and agentic tools. By connecting multiple MCP servers, each defined with a set of tools, resources, and prompts, users can define automated workflows fully driven by LLMs. However, we show that the current MCP design carries a wide range of security risks for end users. In particular, we demonstrate that industry-leading LLMs may be coerced into using MCP tools to compromise an AI developer's system through various attacks, such as malicious code execution, remote access control, and credential theft. To proactively mitigate these and related attacks, we introduce a safety auditing tool, MCPSafetyScanner, the first agentic tool to assess the security of an arbitrary MCP server. MCPSafetyScanner uses several agents to (a) automatically determine adversarial samples given an MCP server's tools and resources; (b) search for related vulnerabilities and remediations based on those samples; and (c) generate a security report detailing all findings. Our work highlights serious security issues with general-purpose agentic workflows while also providing a proactive tool to audit MCP server safety and address detected vulnerabilities before deployment. The described MCP server auditing tool, MCPSafetyScanner, is freely available at: https://github.com/johnhalloran321/mcpSafetyScanner
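For context, an MCP server is typically defined by registering tools that an LLM client can invoke. The minimal sketch below uses the official MCP Python SDK's FastMCP interface (an assumption; the paper does not prescribe a particular SDK), and the server name and file-reading tool are hypothetical. It illustrates the kind of overly permissive tool definition that an auditor such as MCPSafetyScanner would probe with adversarial prompts.

```python
# Minimal sketch of an MCP server, assuming the official Python SDK's
# FastMCP interface; the server name and example tool are hypothetical.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("demo-files")  # hypothetical server name

@mcp.tool()
def read_file(path: str) -> str:
    """Return the contents of any file on the host.

    Deliberately unconstrained: no allow-list or path sandboxing,
    the kind of tool a safety audit should flag.
    """
    with open(path, "r", encoding="utf-8") as f:
        return f.read()

if __name__ == "__main__":
    mcp.run()  # serves the tool over stdio by default
```

A tool like this with no path restrictions is the sort of entry point through which the attacks described above (for example, credential theft by reading key files) could be coerced; the scanner's agents generate adversarial samples against such tool definitions and report vulnerabilities and remediations.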
Similar Papers
We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems
Machine Learning (CS)
Makes AI safer when it uses outside tools.
MCPGuard: Automatically Detecting Vulnerabilities in MCP Servers
Cryptography and Security
Fixes security holes in smart AI tools.
MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers
Computation and Language
Tests AI safety with real-world tools.