Trajectory Guard -- A Lightweight, Sequence-Aware Model for Real-Time Anomaly Detection in Agentic AI
By: Laksh Advani
Potential Business Impact:
Finds bad steps in AI plans before they cause problems.
Autonomous LLM agents generate multi-step action plans that can fail due to contextual misalignment or structural incoherence. Existing anomaly detection methods are ill-suited for this challenge: mean-pooling embeddings dilutes anomalous steps, while contrastive-only approaches ignore sequential structure. Standard unsupervised methods on pre-trained embeddings achieve F1-scores no higher than 0.69. We introduce Trajectory Guard, a Siamese Recurrent Autoencoder with a hybrid loss function that jointly learns task-trajectory alignment via contrastive learning and sequential validity via reconstruction. This dual objective enables unified detection of both "wrong plan for this task" and "malformed plan structure." On benchmarks spanning synthetic perturbations and real-world failures from security audits (RAS-Eval) and multi-agent systems (Who&When), we achieve F1-scores of 0.88-0.94 on balanced sets and recall of 0.86-0.92 on imbalanced external benchmarks. At 32 ms inference latency, our approach runs 17-27× faster than LLM Judge baselines, enabling real-time safety verification in production deployments.
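The hybrid objective described in the abstract can be illustrated with a minimal sketch that combines a contrastive term (task-trajectory alignment) with a reconstruction term (sequential validity). The abstract does not give the exact formulation, so everything specific here is an assumption made for illustration: the cosine-distance contrastive loss, the mean-squared reconstruction error, the `margin` and `alpha` parameters, and all function names. The sketch only evaluates the loss for vectors that are already computed; it does not implement the recurrent encoder or training.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def contrastive_loss(task_vec, traj_vec, is_match, margin=0.5):
    """Assumed form: pull matching task/trajectory pairs together,
    push mismatched pairs at least `margin` apart in cosine distance."""
    dist = 1.0 - cosine(task_vec, traj_vec)
    if is_match:
        return dist ** 2
    return max(0.0, margin - dist) ** 2


def reconstruction_loss(steps, reconstructed):
    """Assumed form: mean squared error between the original step
    embeddings and the autoencoder's reconstruction of them."""
    total, count = 0.0, 0
    for step, recon in zip(steps, reconstructed):
        for a, b in zip(step, recon):
            total += (a - b) ** 2
            count += 1
    return total / count if count else 0.0


def hybrid_loss(task_vec, traj_vec, steps, reconstructed, is_match,
                alpha=0.5, margin=0.5):
    """Weighted sum of the two terms; `alpha` is a hypothetical mixing weight."""
    return (alpha * contrastive_loss(task_vec, traj_vec, is_match, margin)
            + (1.0 - alpha) * reconstruction_loss(steps, reconstructed))


if __name__ == "__main__":
    task = [1.0, 0.0]
    steps = [[1.0, 0.0], [0.0, 1.0]]
    # Aligned task/trajectory pair with a perfect reconstruction.
    print(hybrid_loss(task, [1.0, 0.0], steps, steps, is_match=True))
    # Same pair, but the plan is reconstructed poorly (malformed structure).
    print(hybrid_loss(task, [1.0, 0.0], steps, [[0.0, 0.0], [0.0, 0.0]], is_match=True))
```

In this sketch, a "wrong plan for this task" raises the contrastive term and a "malformed plan structure" raises the reconstruction term, so a single score can reflect either failure mode. How the two terms are actually weighted and thresholded in Trajectory Guard is not stated in the abstract.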