Big Data-Driven Fraud Detection Using Machine Learning and Real-Time Stream Processing
By: Chen Liu , Hengyu Tang , Zhixiao Yang and more
Potential Business Impact:
Finds fake money crimes faster than ever.
In the age of digital finance, detecting fraudulent transactions and money laundering is critical for financial institutions. This paper presents a scalable and efficient solution using Big Data tools and machine learning models. We utilize realtime data streaming platforms like Apache Kafka and Flink, distributed processing frameworks such as Apache Spark, and cloud storage services AWS S3 and RDS. A synthetic dataset representing real-world Anti-Money Laundering (AML) challenges is employed to build a binary classification model. Logistic Regression, Decision Tree, and Random Forest are trained and evaluated using engineered features. Our system demonstrates over 99% classification accuracy, illustrating the power of combining Big Data architectures with machine learning to tackle fraud.
Similar Papers
Advanced fraud detection using machine learning models: enhancing financial transaction security
Machine Learning (CS)
Finds fake credit card charges faster.
Deep Learning Approaches for Anti-Money Laundering on Mobile Transactions: Review, Framework, and Directions
Machine Learning (CS)
Finds hidden dirty money in fast, complex payments.
Anti-Money Laundering Machine Learning Pipelines; A Technical Analysis on Identifying High-risk Bank Clients with Supervised Learning
Artificial Intelligence
Finds bad guys using bank data.