Score: 1

AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs

Published: December 24, 2025 | arXiv ID: 2512.20986v1

By: Yihan Wang , Huanqi Yang , Shantanu Pal and more

Potential Business Impact:

Protects smart watches from sneaky instructions.

Business Areas:

Intelligent Systems Artificial Intelligence, Data and Analytics, Science and Engineering

The integration of Large Language Models (LLMs) into wearable sensing is creating a new class of mobile applications capable of nuanced human activity understanding. However, the reliability of these systems is critically undermined by their vulnerability to prompt injection attacks, where attackers deliberately input deceptive instructions into LLMs. Traditional defenses, based on static filters and rigid rules, are insufficient to address the semantic complexity of these new attacks. We argue that a paradigm shift is needed -- from passive filtering to active protection and autonomous reasoning. We introduce AegisAgent, an autonomous agent system designed to ensure the security of LLM-driven HAR systems. Instead of merely blocking threats, AegisAgent functions as a cognitive guardian. It autonomously perceives potential semantic inconsistencies, reasons about the user's true intent by consulting a dynamic memory of past interactions, and acts by generating and executing a multi-step verification and repair plan. We implement AegisAgent as a lightweight, full-stack prototype and conduct a systematic evaluation on 15 common attacks with five state-of-the-art LLM-based HAR systems on three public datasets. Results show it reduces attack success rate by 30\% on average while incurring only 78.6 ms of latency overhead on a GPU workstation. Our work makes the first step towards building secure and trustworthy LLM-driven HAR systems.

AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security

Machine Learning (CS)

Protects AI from bad instructions and secrets.

29 Apr 2025 1

89%

TraceAegis: Securing LLM-Based Agents via Hierarchical and Behavioral Anomaly Detection

Cryptography and Security

Protects smart computer helpers from being tricked.

13 Oct 2025 0

89%

Immunity memory-based jailbreak detection: multi-agent adaptive guard for large language models

Cryptography and Security

AI learns to remember and block bad instructions.

3 Dec 2025 1

View PDF Login to Bookmark

Page Count

16 pages

AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs

Protects smart watches from sneaky instructions.

Technical Abstract

AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security

TraceAegis: Securing LLM-Based Agents via Hierarchical and Behavioral Anomaly Detection

Immunity memory-based jailbreak detection: multi-agent adaptive guard for large language models