EACD: evolutionary adaptation to concept drifts in data streams

Ghomeshi, Hossein and Gaber, Mohamed Medhat and Kovalchuk, Yevgeniya (2019) EACD: evolutionary adaptation to concept drifts in data streams. Data Mining and Knowledge Discovery, 33 (3). pp. 663-694. ISSN 1384-5810

[img]
Preview
Text
EACD__ECML_PKDD.PDF - Accepted Version

Download (591kB)

Abstract

This paper presents a novel ensemble learning method based on evolutionary algorithms to cope with different types of concept drifts in non-stationary data stream classification tasks. In ensemble learning, multiple learners forming an ensemble are trained to obtain a better predictive performance compared to that of a single learner, especially in non-stationary environments, where data evolve over time. The evolution of data streams can be viewed as a problem of changing environment, and evolutionary algorithms offer a natural solution to this problem. The method proposed in this paper uses random subspaces of features from a pool of features to create different classification types in the ensemble. Each such type consists of a limited number of classifiers (decision trees) that have been built at different times over the data stream. An evolutionary algorithm (replicator dynamics) is used to adapt to different concept drifts; it allows the types with a higher performance to increase and those with a lower performance to decrease in size. Genetic algorithm is then applied to build a two-layer architecture based on the proposed technique to dynamically optimise the combination of features in each type to achieve a better adaptation to new concepts. The proposed method, called EACD, offers both implicit and explicit mechanisms to deal with concept drifts. A set of experiments employing four artificial and five real-world data streams is conducted to compare its performance with that of the state-of-the-art algorithms using the immediate and delayed prequential evaluation methods. The results demonstrate favourable performance of the proposed EACD method in different environments.

Item Type: Article
Identification Number: https://doi.org/10.1007/s10618-019-00614-6
Dates:
DateEvent
14 January 2019Accepted
13 February 2019Published Online
Uncontrolled Keywords: data streams, ensemble learning, concept drifts, evolutionary algorithms, genetic algorithm, non-stationary environments
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-05 - artificial intelligence
Divisions: Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Mohamed Gaber
Date Deposited: 14 Feb 2019 13:53
Last Modified: 22 Mar 2023 12:01
URI: https://www.open-access.bcu.ac.uk/id/eprint/7085

Actions (login required)

View Item View Item

Research

In this section...