Open Access Open Access  Restricted Access Subscription Access

FRAUD SHIELD AI INTELLIGENT FINANCIAL FRAUD DETECTION AND PREVENTION SYSTEM USING MACHINE LEARNING

Ashika V U, Gayathri N

Abstract


The rapid growth of digital financial transactions has led to a surge in fraudulent activities, necessitating intelligent and adaptive detection systems. This paper presents Fraud Shield AI — a production-grade, real-time financial fraud detection and prevention system built using Python, FastAPI, scikit-learn, and an event-driven streaming architecture. The system ingests payment transaction events via a Kafka-based pipeline, applies a trained machine learning classifier to score each transaction for fraud probability, and exposes predictions through a low-latency REST API with a live web dashboard. A baseline Random Forest classifier is trained on the Kaggle Credit Card Fraud dataset and deployed as a FastAPI service containerised with Docker. MLflow is integrated for experiment tracking and model metadata management, while Prometheus and Grafana provide real-time monitoring of prediction throughput and service health. The system is evaluated on two dataset schemas and demonstrates high accuracy in distinguishing fraudulent from legitimate transactions. The proposed platform provides a reproducible, extensible baseline for real-time fraud detection research and illustrates end-to-end MLOps practices applicable to the financial services domain.


Full Text:

PDF

References


A. Dal Pozzolo, O. Caelen, R. A. Johnson, and G. Bontempi, "Calibrating Probability with Undersampling for Unbalanced Classification," in Proc. IEEE Symposium on Computational Intelligence and Data Mining (CIDM), 2015, pp. 159–166.

A. C. Bahnsen, D. Aouada, A. Stojanovic, and B. Ottersten, "Feature Engineering Strategies for Credit Card Fraud Detection," Expert Systems with Applications, vol. 51, pp. 134–142, 2016.

S. Bhattacharyya, S. Jha, K. Tharakunnel, and J. C. Westland, "Data Mining for Credit Card Fraud: A Comparative Study," Decision Support Systems, vol. 50, no. 3, pp. 602–613, 2011.

T. Chen and C. Guestrin, "XGBoost: A Scalable Tree Boosting System," in Proc. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 785–794.

M. Kleppmann, Designing Data-Intensive Applications. Sebastopol, CA: O'Reilly Media, 2017.

M. Zaharia et al., "Apache Spark: A Unified Engine for Big Data Processing," Communications of the ACM, vol. 59, no. 11, pp. 56–65, 2016.

D. Sculley et al., "Hidden Technical Debt in Machine Learning Systems," in Advances in Neural Information Processing Systems (NeurIPS), vol. 28, 2015.

ULB Machine Learning Group, "Credit Card Fraud Detection," Kaggle, 2016. [Online]. Available: https://www.kaggle.com/mlg-ulb/creditcardfraud

F. Pedregosa et al., "Scikit-learn: Machine Learning in Python," Journal of Machine Learning Research, vol. 12, pp. 2825–

, 2011.

S. Raschka and V. Mirjalili, Python Machine Learning, 3rd ed. Birmingham: Packt Publishing, 2019.


Refbacks

  • There are currently no refbacks.