Adversarial Augmentation and Active Sampling for Robust Cyber Anomaly Detection
By: Sidahmed Benabderrahmane, Talal Rahwan
Potential Business Impact:
Finds hidden computer attacks with less data.
Advanced Persistent Threats (APTs) present a considerable challenge to cybersecurity due to their stealthy, long-duration nature. Traditional supervised learning methods typically require large amounts of labeled data, which is often scarce in real-world scenarios. This paper introduces a novel approach that combines AutoEncoders for anomaly detection with active learning to iteratively enhance APT detection. By selectively querying an oracle for labels on uncertain or ambiguous samples, our method reduces labeling costs while improving detection accuracy, enabling the model to effectively learn with minimal data and reduce reliance on extensive manual labeling. We present a comprehensive formulation of the Attention Adversarial Dual AutoEncoder-based anomaly detection framework and demonstrate how the active learning loop progressively enhances the model's performance. The framework is evaluated on real-world, imbalanced provenance trace data from the DARPA Transparent Computing program, where APT-like attacks account for just 0.004\% of the data. The datasets, which cover multiple operating systems including Android, Linux, BSD, and Windows, are tested in two attack scenarios. The results show substantial improvements in detection rates during active learning, outperforming existing methods.
Similar Papers
Ranking-Enhanced Anomaly Detection Using Active Learning-Assisted Attention Adversarial Dual AutoEncoders
Machine Learning (CS)
Finds hidden computer attacks with less work.
Attackers Strike Back? Not Anymore -- An Ensemble of RL Defenders Awakens for APT Detection
Cryptography and Security
Finds sneaky computer hackers before they steal data.
Preliminary Investigation into Uncertainty-Aware Attack Stage Classification
Cryptography and Security
Helps computers guess hacker's next move.