Credit Card Fraud Detection: Mitigating Extreme Class Imbalance Using Synthetic Oversampling and Ensemble Machine Learning

Abstract

The rapid proliferation of digital payment infrastructure has established credit card transactions as the backbone of the modern global economy, concurrently exposing financial networks to sophisticated fraudulent activities. The automated detection of such anomalies presents a significant algorithmic challenge due to extreme class imbalance, as fraudulent instances typically represent less than 0.5% of the overall transaction volume. This research proposes a robust, machine learning-based classification architecture utilizing a highly imbalanced dataset of 284,807 transactions, where the minority fraud class constitutes merely 0.17% of the data. To neutralize the statistical bias introduced by this skew, rigorous data preprocessing techniques including Z-score standardization and stratified splitting were implemented. The Synthetic Minority Over-sampling Technique (SMOTE) was deployed strictly within the training environment to synthetically balance the class distributions and prevent algorithmic convergence toward majority-class predictions. A comparative analysis was conducted evaluating a linear Logistic Regression classifier against a non-linear Random Forest ensemble. Empirical analysis demonstrates that while the linear model achieved high theoretical class separation, the Random Forest ensemble delivered superior operational performance. By optimizing the precision-recall trade-off, achieving a precision of 0.84 and a recall of 0.83, the ensemble model successfully minimized false negative rates without inflating false positive rates, proving its viability for real-world deployment in institutional financial security systems.

Country : India

1 Prathmesh Sunil Dhobe2 Anjaneya Kokre3 Suyash Kale4 Kartik Sabe5 Shahrukh Shaikh

  1. Student, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, India
  2. Student, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, India
  3. Student, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, India
  4. Student, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, India
  5. Guide / Supervisor, Professor, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, India

IRJIET, Volume 10, Issue 4, April 2026 pp. 61-65

doi.org/10.47001/IRJIET/2026.104007

References

  1. R. Bin Sulaiman, V. Schetinin & P. Sant, Review of Machine Learning Approach on Credit Card Fraud Detection, Human‑Centric Intelligent Systems, 2022. — comprehensive ML methods & challenges.
  2. K. Ghosh Dastidar, O. Caelen & M. Granitzer, Machine Learning Methods for Credit Card Fraud Detection: A Survey, IEEE Access, 2024.
  3. Y. A. Hassan & O. S. Kareem, Credit Card Fraud Detection: Comparative Study of ML & DL Methods, Engineering and Technology Journal, 2025. — recent ML vs deep learning comparison.
  4. E. Btoush et al., Achieving Excellence in Cyber Fraud Detection: A Hybrid ML+DL Ensemble Approach for Credit Cards, Applied Sciences, 2025. — ensemble and hybrid ML+DL strategies.
  5. E. Ileberi, Y. Sun & Z. Wang, ML‑based Credit Card Fraud Detection with GA Feature Selection, Journal of Big Data, 2022 — uses feature selection + multiple classifiers.
  6. E. Btoush et al., Resampling Methods for Imbalanced Credit Card Fraud Data, Applied Sciences, 2026 — analysis of sampling + ML classifiers (XGBoost, RF, etc.).
  7. Autonomous credit card fraud detection using LSTM‑RNN, Computers & Electrical Engineering, 2022 — RNN + deep learning methods.
  8. Frontiers in AI, Enhancing Credit Card Fraud Detection using Traditional and Deep Learning Models with Imbalance Mitigation, 2025 — modern deep learning evaluation.
  9. R. Jayalakshmi, R. G. S. Kumar & T. Thanushree, A Survey on Credit Card Fraud Detection Using Deep Learning Models, IRJAEM, 2025 — DL‑focused survey.
  10. R. Ali, Explainable AI Framework for Credit Card Fraud Detection using XGBoost & Deep Neural Networks, SSRN, 2026 — blends ML, DL & SHAP for interpretability.