Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering
By: Fatemeh Moradi, Mehran Tarif, Mohammadhossein Homaei
Potential Business Impact:
Finds fake items in shipping faster.
Detecting fraud in modern supply chains is a growing challenge, driven by the complexity of global networks and the scarcity of labeled data. Traditional detection methods often struggle with class imbalance and limited supervision, reducing their effectiveness in real-world applications. This paper proposes a novel two-phase learning framework to address these challenges. In the first phase, the Isolation Forest algorithm performs unsupervised anomaly detection to identify potential fraud cases and reduce the volume of data requiring further analysis. In the second phase, a self-training Support Vector Machine (SVM) refines the predictions using both labeled and high-confidence pseudo-labeled samples, enabling robust semi-supervised learning. The proposed method is evaluated on the DataCo Smart Supply Chain Dataset, a comprehensive real-world supply chain dataset with fraud indicators. It achieves an F1-score of 0.817 while maintaining a false positive rate below 3.0%. These results demonstrate the effectiveness and efficiency of combining unsupervised pre-filtering with semi-supervised refinement for supply chain fraud detection under real-world constraints, though we acknowledge limitations regarding concept drift and the need for comparison with deep learning approaches.
Similar Papers
Advanced fraud detection using machine learning models: enhancing financial transaction security
Machine Learning (CS)
Finds fake credit card charges faster.
Unsupervised Detection of Fraudulent Transactions in E-commerce Using Contrastive Learning
Machine Learning (CS)
Finds fake online shopping deals without needing examples.
Unsupervised Anomaly Detection for Smart IoT Devices: Performance and Resource Comparison
Machine Learning (CS)
Finds hidden computer dangers on smart devices.