Ensemble Synthesized Minority Oversampling-Based Generative Adversarial Networks and Random Forest Algorithm for Credit Card Fraud Detection

Ghaleb, Fuad A. and Saeed, Faisal and Al-Sarem, Mohammed and Qasem, Sultan Noman and Al-Hadhrami, Tawfik (2023) Ensemble Synthesized Minority Oversampling-Based Generative Adversarial Networks and Random Forest Algorithm for Credit Card Fraud Detection. IEEE Access, 11. pp. 89694-89710. ISSN 2169-3536

[img]
Preview
Text
Ensemble_Synthesized_Minority_Oversampling-Based_Generative_Adversarial_Networks_and_Random_Forest_Algorithm_for_Credit_Card_Fraud_Detection.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB)

Abstract

The recent increase in credit card fraud is rapidly has caused huge monetary losses for individuals and financial institutions. Most credit card frauds are conducted online by illegally obtaining payment credentials through data breaches, phishing, or scamming. Many solutions have been suggested to address the credit card fraud problem for online transactions. However, the high-class imbalance is the major challenge that faces the existing solutions to construct an effective detection model. Most of the existing techniques used for class imbalance overestimate the distribution of the minority class, resulting in highly overlapped or noisy and unrepresentative features, which cause either overfitting or imprecise learning. In this study, a credit card fraud detection model (CCFDM) is proposed based on ensemble learning and a generative adversarial network (GAN) assisted by Ensemble Synthesized Minority Oversampling techniques (ESMOTE-GAN). Multiple subsets were extracted using under-sampling and SMOTE was applied to generate less skewed sets to prevent the GAN from modeling the noise. These subsets were used to train diverse sets of GAN models to generate the synthesized subsets. A set of Random Forest classifiers was then trained based on the proposed ESMOTE-GAN technique. The probabilistic outputs of the trained classifiers were combined using a weighted voting scheme for decision-making. The results show that the proposed model achieved 1.9%, and 3.2% improvements in overall performance and the detection rate, respectively, with a 0% false alarm rate. Due to the massive number of transactions, even a tiny false positive rate can overwhelm the analysis team. Thus, the proposed model has improved the detection performance and reduced the cost needed for manual analysis.

Item Type: Article
Identification Number: https://doi.org/10.1109/ACCESS.2023.3306621
Dates:
DateEvent
10 August 2023Accepted
18 August 2023Published Online
Uncontrolled Keywords: Class imbalance, credit card fraud detection, GAN, Random Forest, SMOTE
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
Divisions: Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Gemma Tonks
Date Deposited: 07 Sep 2023 13:48
Last Modified: 07 Sep 2023 13:50
URI: https://www.open-access.bcu.ac.uk/id/eprint/14751

Actions (login required)

View Item View Item

Research

In this section...