Credit Card Fraud Detection: Mitigating Extreme Class Imbalance Using Synthetic Oversampling and Ensemble Machine Learning

Prathmesh Sunil Dhobe; Anjaneya Kokre; Suyash Kale; Kartik Sabe; Shahrukh Shaikh

doi:https://doi.org/10.47001/IRJIET/2026.104007

Credit Card Fraud Detection: Mitigating Extreme Class Imbalance Using Synthetic Oversampling and Ensemble Machine Learning

Prathmesh Sunil DhobeStudent, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, IndiaAnjaneya KokreStudent, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, IndiaSuyash KaleStudent, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, IndiaKartik SabeStudent, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, IndiaShahrukh ShaikhGuide / Supervisor, Professor, Department of Artificial Intelligence & Machine Learning, Ajeenkya D.Y. Patil School of Engineering, Maharashtra, India

Vol 10 No 4 (2026): Volume 10, Issue 4, April 2026 | Pages: 61-65

International Research Journal of Innovations in Engineering and Technology

OPEN ACCESS | Research Article | Published Date: 10-04-2026

doi.org/10.47001/IRJIET/2026.104007

Full Text PDF

Abstract

The rapid proliferation of digital payment infrastructure has established credit card transactions as the backbone of the modern global economy, concurrently exposing financial networks to sophisticated fraudulent activities. The automated detection of such anomalies presents a significant algorithmic challenge due to extreme class imbalance, as fraudulent instances typically represent less than 0.5% of the overall transaction volume. This research proposes a robust, machine learning-based classification architecture utilizing a highly imbalanced dataset of 284,807 transactions, where the minority fraud class constitutes merely 0.17% of the data. To neutralize the statistical bias introduced by this skew, rigorous data preprocessing techniques including Z-score standardization and stratified splitting were implemented. The Synthetic Minority Over-sampling Technique (SMOTE) was deployed strictly within the training environment to synthetically balance the class distributions and prevent algorithmic convergence toward majority-class predictions. A comparative analysis was conducted evaluating a linear Logistic Regression classifier against a non-linear Random Forest ensemble. Empirical analysis demonstrates that while the linear model achieved high theoretical class separation, the Random Forest ensemble delivered superior operational performance. By optimizing the precision-recall trade-off, achieving a precision of 0.84 and a recall of 0.83, the ensemble model successfully minimized false negative rates without inflating false positive rates, proving its viability for real-world deployment in institutional financial security systems.

Keywords

Credit Card Fraud, Class Imbalance, SMOTE, Random Forest, Logistic Regression, Anomaly Detection

Citation of this Article

Prathmesh Sunil Dhobe, Anjaneya Kokre, Suyash Kale, Kartik Sabe, & Shahrukh Shaikh. (2026). Credit Card Fraud Detection: Mitigating Extreme Class Imbalance Using Synthetic Oversampling and Ensemble Machine Learning. International Research Journal of Innovations in Engineering and Technology - IRJIET, 10(4), 61-65. Article DOI https://doi.org/10.47001/IRJIET/2026.104007

This work is licensed under Creative common Attribution Non Commercial 4.0 Internation Licence

References

R. Bin Sulaiman, V. Schetinin & P. Sant, Review of Machine Learning Approach on Credit Card Fraud Detection, Human‑Centric Intelligent Systems, 2022. — comprehensive ML methods & challenges.
K. Ghosh Dastidar, O. Caelen & M. Granitzer, Machine Learning Methods for Credit Card Fraud Detection: A Survey, IEEE Access, 2024.
Y. A. Hassan & O. S. Kareem, Credit Card Fraud Detection: Comparative Study of ML & DL Methods, Engineering and Technology Journal, 2025. — recent ML vs deep learning comparison.
E. Btoush et al., Achieving Excellence in Cyber Fraud Detection: A Hybrid ML+DL Ensemble Approach for Credit Cards, Applied Sciences, 2025. — ensemble and hybrid ML+DL strategies.
E. Ileberi, Y. Sun & Z. Wang, ML‑based Credit Card Fraud Detection with GA Feature Selection, Journal of Big Data, 2022 — uses feature selection + multiple classifiers.
E. Btoush et al., Resampling Methods for Imbalanced Credit Card Fraud Data, Applied Sciences, 2026 — analysis of sampling + ML classifiers (XGBoost, RF, etc.).
Autonomous credit card fraud detection using LSTM‑RNN, Computers & Electrical Engineering, 2022 — RNN + deep learning methods.
Frontiers in AI, Enhancing Credit Card Fraud Detection using Traditional and Deep Learning Models with Imbalance Mitigation, 2025 — modern deep learning evaluation.
R. Jayalakshmi, R. G. S. Kumar & T. Thanushree, A Survey on Credit Card Fraud Detection Using Deep Learning Models, IRJAEM, 2025 — DL‑focused survey.
R. Ali, Explainable AI Framework for Credit Card Fraud Detection using XGBoost & Deep Neural Networks, SSRN, 2026 — blends ML, DL & SHAP for interpretability.

For Authors

Publication Archives

Volume 1 - 2017

Volume 2 - 2018

Volume 3 - 2019

Volume 4 - 2020

Volume 5 - 2021

Volume 6 - 2022

Volume 7 - 2023

Volume 8 - 2024

Volume 9 - 2025

Volume 10 - 2026

For Board Members

Downloads

Research Areas

Credit Card Fraud Detection: Mitigating Extreme Class Imbalance Using Synthetic Oversampling and Ensemble Machine Learning

Abstract

Keywords

Citation of this Article

References

International Research Journal of Innovations in Engineering
and Technology - IRJIET

Editorial Policies

Quick Links