Resample-based Ensemble Framework for Drifting Imbalanced Data Streams

Zhang, Hang and Liu, Weike and Wang, Shuo and Shan, Jicheng and Liu, Qingbao (2019) Resample-based Ensemble Framework for Drifting Imbalanced Data Streams. IEEE Access, 7 (2019). ISSN 2169-3536

[img]
Preview
Text
IEEE Access.pdf - Accepted Version

Download (1MB)

Abstract

Machine learning in real-world scenarios is often challenged by concept drift and class imbalance. This paper proposes a Resample-based Ensemble Framework for Drifting Imbalanced Stream (RE-DI). The ensemble framework consists of a long-term static classifier to handle gradual and multiple dynamic classifiers to handle sudden concept drift. The weights of the ensemble classifier are adjusted from two aspects. First, a time-decayed strategy decreases the weights of the dynamic classifiers to make the ensemble classifier focus more on the new concept of the data stream. Second, a novel reinforcement mechanism is proposed to increase the weights of the base classifiers that perform better on the minority class and decrease the weights of the classifiers that perform worse. A resampling buffer is used for storing instances of the minority class to balance the imbalanced distribution over time. In our experiment, we compare the proposed method with other state-of-the-art algorithms on both real-world and synthetic data streams. The results show that the proposed method achieves the best performance in terms of both the Prequential AUC and accuracy.

Item Type: Article
Additional Information: (c) 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works
Identification Number: https://doi.org/10.1109/ACCESS.2019.2914725
Dates:
DateEvent
22 April 2019Accepted
6 May 2019Published
Uncontrolled Keywords: Online ensemble learning, resample learning ,reinforcement, concept drift, class imbalance
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-05 - artificial intelligence
Divisions: Faculty of Computing, Engineering and the Built Environment
Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Shuo Wang
Date Deposited: 09 May 2019 06:15
Last Modified: 03 Mar 2022 15:46
URI: https://www.open-access.bcu.ac.uk/id/eprint/7431

Actions (login required)

View Item View Item

Research

In this section...