Score: 1

Randomized PCA Forest for Outlier Detection

Published: August 18, 2025 | arXiv ID: 2508.12776v2

By: Muhammad Rajabinasab , Farhad Pakdaman , Moncef Gabbouj and more

Potential Business Impact:

Finds weird data points in large groups.

We propose a novel unsupervised outlier detection method based on Randomized Principal Component Analysis (PCA). Inspired by the performance of Randomized PCA (RPCA) Forest in approximate K-Nearest Neighbor (KNN) search, we develop a novel unsupervised outlier detection method that utilizes RPCA Forest for outlier detection. Experimental results showcase the superiority of the proposed approach compared to the classical and state-of-the-art methods in performing the outlier detection task on several datasets while performing competitively on the rest. The extensive analysis of the proposed method reflects it high generalization power and its computational efficiency, highlighting it as a good choice for unsupervised outlier detection.

Country of Origin
🇩🇰 Denmark

Page Count
11 pages

Category
Computer Science:
Machine Learning (CS)