Development and Implementation of a Machine Learning-Based Framework for Credit Card Fraud Detection: A Comparative Study of Random Forest and Logistic Regression Models

Abstract

Credit card fraud remains a pervasive and evolving threat in the digital age, necessitating the development of sophisticated methods for early detection and prevention. This study provides a thorough examination of a machine learning-based credit card fraud detection system, employing two prominent algorithms: Random Forest and Logistic Regression. The research methodology involves preprocessing a diverse and extensive credit card transaction dataset, incorporating various transactional features. Through rigorous feature engineering, the dataset is meticulously prepared for model training and validation. The Random Forest model, an ensemble learning technique, aggregates multiple decision trees to improve predictive accuracy and mitigate the risk of over-fitting. In parallel, Logistic Regression—a classical statistical approach—models the probabilistic relationship between transaction features and the likelihood of fraud. A comparative analysis of these models offers valuable insights into their respective strengths and limitations, guiding the selection of the most suitable model for fraud detection. Model performance is evaluated using critical metrics, including accuracy, precision, recall, and F1-score, with a detailed examination of these indicators across different scenarios to assess each model's ability to distinguish between legitimate and fraudulent transactions. Furthermore, the study explores the practical implications of implementing these models in financial institutions, highlighting their potential to enhance security and reduce financial losses. Ethical considerations, including privacy concerns, model interpretability, and the adaptive nature of fraud patterns, are also discussed, providing a comprehensive perspective on the deployment of machine learning in fraud detection systems. Ultimately, this research contributes to the advancement of financial security, offering a robust analysis of Random Forest and Logistic Regression models and their real-world applications in combating credit card fraud.

Country : Nigeria

1 Anusiuba, Overcomer Ifeanyi Alex

  1. Department of Computer Science, Faculty of Physical Sciences, Nnamdi Azikiwe University, Awka, Nigeria

IRJIET, Volume 9, Issue 3, March 2025 pp. 52-66

doi.org/10.47001/IRJIET/2025.903008

References

  1. Aleskerov, E., Freisleben, B., AndRao. B( 1997) Card watch: A Neural Network Based Database Mining System For Credit Card Fraud Detection, Proceedings of IEEE/Iafe: Computational Intelligence For Financial Eng.. Pp. 220-226.
  2. Anusiuba O. I. A, (2025), Design and Implementation of a Credit Card Fraud Detection System Using Random Forest and Logistics Regression Models, International Research Journal of Innovations in Engineering and Technology-IRJIET, 9(2),152-166
  3. Anusiuba O. I. A, Okechukwu O. P., Ekwealor O. U, Anusiuba A. A.,(2022), The Application of Hidden Markov Model in Credit Card Fraud Detection System, iJournals: International Journal of Software & Hardware Research in Engineering (IJSHRE), 10(2), 1-20
  4. Brause, R., Langsdorf, T., Flepp, M. (1999), Neural Data Mining for Credit Card Fraud Detection, International Conference on Tools with Artificial Intelligence, IEEE, pp. 103-106.
  5. Chiu C., and Tsai, C., (2004). A Web Services-Based Collaborative Scheme for Credit Card Fraud Detection, Proceedings of IEEE International Conference e-Technology, e-Commerce and e-Service. pp. 177-181.
  6. Dornadul, V.N., & Geetha, S. (2019). Credit card fraud detection using machine learning algorithm 165 631-641 http://creativecommons.org/licenses/by-nc-nd/4.0/
  7. Ekwealor O.U, Anusiuba O. I. A, Ezuruka E. O, Uchefuna C.I., (2021). An Intelligent Credit Card Fraud Detection System, iJournals: International Journal of Software & Hardware Research in Engineering (IJSHRE) 9(2), 25-54
  8. Ghosh, S., Reilly, D.L. (1994). Credit Card Fraud Detection with a Neural- Network, Proc. 27th Hawaii Int’l Conf. System Sciences: Information Systems: Decision Support And Knowledge-Based Systems, Vol. 3, Pp. 621-630.
  9. Kim, M.T. and Kim. T.S. 2002. A Neural Classifier with Fraud Density Map for Effective Credit Card Fraud Detection, Proc. International Conference on Intelligent Data Engineering and Automated Learning, Lecture Notes in Computer Science, Springer Verlag, no. 2412, pp. 378-383.
  10. Kurshan, E., Shen, H., & Yu, H. (2020). Financial crime & fraud detection using graph computing: Application considerations & outlook, in: 2020 Second International Conference on Transdisciplinary AI (TransAI), IEEE, pp. 125–130.
  11. Lebichot, B., Siblini, G.M.P.W., & Bontempi, L.H.F.O.G. (2021). Incremental learning strategies for credit cards fraud detection: International Journal of Data Science and Analytics, 12(2), 165–174.
  12. Phua, C., Lee. V.. Smith. K. and Gayler R (2007) . A comprehensive Survey of Data Mining based Fraud Detection Research, Pp. 190-200.
  13. Sulaiman, B.R., Schetinin, V. & Sant, P. (2022). Review of Machine Learning Approach on Credit Card Fraud Detection: Human-Centered Intelligent Systems, 2, 55–68. https://doi.org/10.1007/s44230-022-00004-0
  14. Syeda. M. Zhang. Y.Q., Pan, Y. (2002). Parallel Granular Networks for Fast Credit Card Fraud Detection, Proc. IEEE Int’l Conf. Fuzzy Systems, Pp. 572-577.
  15. Stolfo, S. J., Fan. D. W, Lee, W„ Prodrorakis. A. And Chan, P. K.. (2000). Cost-Based Modeling for Fraud and Intrusion Detection Results from the Jam Project, Proceedings of Darpa Information Survivability Conference And Exposition Vol. 2 Pp. 130-144.     
  16. Vatsa.Y..Sural, S. and Majumdar, A.K.(2005). A Game-theoretic Approach to Credit Card Fraud Detection, Proc. 1st International Conference on Information Systems Security, Lecture Notes in Computer Science, Springer Verlag, pp. 263-276.