Score: 0

Metric Matters: A Formal Evaluation of Similarity Measures in Active Learning for Cyber Threat Intelligence

Published: August 26, 2025 | arXiv ID: 2508.19019v1

By: Sidahmed Benabderrahmane, Talal Rahwan

Potential Business Impact:

Finds hidden computer spies faster by learning what's normal.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Advanced Persistent Threats (APTs) pose a severe challenge to cyber defense due to their stealthy behavior and the extreme class imbalance inherent in detection datasets. To address these issues, we propose a novel active learning-based anomaly detection framework that leverages similarity search to iteratively refine the decision space. Built upon an Attention-Based Autoencoder, our approach uses feature-space similarity to identify normal-like and anomaly-like instances, thereby enhancing model robustness with minimal oracle supervision. Crucially, we perform a formal evaluation of various similarity measures to understand their influence on sample selection and anomaly ranking effectiveness. Through experiments on diverse datasets, including DARPA Transparent Computing APT traces, we demonstrate that the choice of similarity metric significantly impacts model convergence, anomaly detection accuracy, and label efficiency. Our results offer actionable insights for selecting similarity functions in active learning pipelines tailored for threat intelligence and cyber defense.

A Problem-Oriented Taxonomy of Evaluation Metrics for Time Series Anomaly Detection

Artificial Intelligence

Helps find fake data in computer signals.

24 Nov 2025 0

87%

Ranking-Enhanced Anomaly Detection Using Active Learning-Assisted Attention Adversarial Dual AutoEncoders

Machine Learning (CS)

Finds hidden computer attacks with less work.

25 Nov 2025 2

87%

Adversarial Augmentation and Active Sampling for Robust Cyber Anomaly Detection

Cryptography and Security

Finds hidden computer attacks with less data.

5 Sep 2025 0

View PDF Login to Bookmark

Page Count

14 pages

Metric Matters: A Formal Evaluation of Similarity Measures in Active Learning for Cyber Threat Intelligence

Finds hidden computer spies faster by learning what's normal.

Technical Abstract

A Problem-Oriented Taxonomy of Evaluation Metrics for Time Series Anomaly Detection

Ranking-Enhanced Anomaly Detection Using Active Learning-Assisted Attention Adversarial Dual AutoEncoders

Adversarial Augmentation and Active Sampling for Robust Cyber Anomaly Detection