AgentSight: System-Level Observability for AI Agents Using eBPF
By: Yusheng Zheng , Yanpeng Hu , Tong Yu and more
Potential Business Impact:
Finds problems in AI programs by watching them.
Modern software infrastructure increasingly relies on LLM agents for development and maintenance, such as Claude Code and Gemini-cli. However, these AI agents differ fundamentally from traditional deterministic software, posing a significant challenge to conventional monitoring and debugging. This creates a critical semantic gap: existing tools observe either an agent's high-level intent (via LLM prompts) or its low-level actions (e.g., system calls), but cannot correlate these two views. This blindness makes it difficult to distinguish between benign operations, malicious attacks, and costly failures. We introduce AgentSight, an AgentOps observability framework that bridges this semantic gap using a hybrid approach. Our approach, boundary tracing, monitors agents from outside their application code at stable system interfaces using eBPF. AgentSight intercepts TLS-encrypted LLM traffic to extract semantic intent, monitors kernel events to observe system-wide effects, and causally correlates these two streams across process boundaries using a real-time engine and secondary LLM analysis. This instrumentation-free technique is framework-agnostic, resilient to rapid API changes, and incurs less than 3% performance overhead. Our evaluation shows AgentSight detects prompt injection attacks, identifies resource-wasting reasoning loops, and reveals hidden coordination bottlenecks in multi-agent systems. AgentSight is released as an open-source project at https://github.com/agent-sight/agentsight.
Similar Papers
AgentSight: System-Level Observability for AI Agents Using eBPF
Operating Systems
Finds AI mistakes and attacks in computer programs.
Reliable Weak-to-Strong Monitoring of LLM Agents
Artificial Intelligence
Finds ways AI spies on other AI.
An LLM-Powered AI Agent Framework for Holistic IoT Traffic Interpretation
Computation and Language
AI helps understand internet device problems faster.