Isolation-based Spherical Ensemble Representations for Anomaly Detection
By: Yang Cao , Sikun Yang , Hao Tian and more
Potential Business Impact:
Finds weird patterns in data faster and better.
Anomaly detection is a critical task in data mining and management with applications spanning fraud detection, network security, and log monitoring. Despite extensive research, existing unsupervised anomaly detection methods still face fundamental challenges including conflicting distributional assumptions, computational inefficiency, and difficulty handling different anomaly types. To address these problems, we propose ISER (Isolation-based Spherical Ensemble Representations) that extends existing isolation-based methods by using hypersphere radii as proxies for local density characteristics while maintaining linear time and constant space complexity. ISER constructs ensemble representations where hypersphere radii encode density information: smaller radii indicate dense regions while larger radii correspond to sparse areas. We introduce a novel similarity-based scoring method that measures pattern consistency by comparing ensemble representations against a theoretical anomaly reference pattern. Additionally, we enhance the performance of Isolation Forest by using ISER and adapting the scoring function to address axis-parallel bias and local anomaly detection limitations. Comprehensive experiments on 22 real-world datasets demonstrate ISER's superior performance over 11 baseline methods.
Similar Papers
Sparse, self-organizing ensembles of local kernels detect rare statistical anomalies
Machine Learning (CS)
Finds hidden problems in complex data.
ERIS: An Energy-Guided Feature Disentanglement Framework for Out-of-Distribution Time Series Classification
Machine Learning (CS)
Helps computers learn patterns that don't change.
Text Anomaly Detection with Simplified Isolation Kernel
Computation and Language
Finds weird text faster with less computer power.