A Non-Canonical Hybrid Metaheuristic Approach to Adaptive Data Stream Classification

Ghomeshi, Hossein and Gaber, Mohamed Medhat and Kovalchuk, Yevgeniya (2019) A Non-Canonical Hybrid Metaheuristic Approach to Adaptive Data Stream Classification. Future Generation Computer Systems, 102 (Jan). pp. 127-139. ISSN 0167-739X

RED_PSO__Future_Generation_Computer_Systems.pdf - Accepted Version

Abstract

Data stream classification techniques have been playing an important role in big data analytics recently due to their diverse applications (e.g. fraud and intrusion detection, forecasting and healthcare monitoring systems) and the growing number of real-world data stream generators (e.g. IoT devices and sensors, websites and social network feeds). Streaming data is often prone to evolution over time. In this context, the main challenge for computational models is to adapt to changes, known as concept drifts, using data mining and optimisation techniques. We present a novel ensemble technique called RED-PSO that seamlessly adapts to different concept drifts in non-stationary data stream classification tasks. RED-PSO is based on a three-layer architecture that produces classification types of different sizes, each created by randomly selecting a certain percentage of features from the pool of features of the target data stream. An evolutionary algorithm, namely Replicator Dynamics (RD), is used to adapt to different concept drifts; it allows well-performing types to grow and poorly performing ones to shrink in size. In addition, the selected feature combinations in all classification types are optimised for each layer individually using a non-canonical version of the Particle Swarm Optimisation (PSO) technique. PSO allows the types in each layer to move towards local (within the same type) and global (across all types) optima with a specified velocity. A set of experiments is conducted to compare the performance of the proposed method with state-of-the-art algorithms using real-world and synthetic data streams in immediate and delayed prequential evaluation settings. The results show a favourable performance of our method in different environments.
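
The grow-and-shrink behaviour that the abstract attributes to Replicator Dynamics can be illustrated with the textbook discrete replicator update, in which each type's share is rescaled by its fitness relative to the population mean. This is only a minimal sketch under stated assumptions, not the update rule used in RED-PSO: the function name `replicator_step`, the use of accuracy as the fitness score, and the example numbers are all hypothetical and do not come from the paper.

```python
def replicator_step(shares, fitness):
    """Apply one textbook discrete replicator-dynamics update.

    shares:  current share of each classification type (non-negative, sums to 1).
    fitness: non-negative performance score per type (assumed here: accuracy).
    Both names are hypothetical; the paper's actual rule may differ.
    """
    if len(shares) != len(fitness):
        raise ValueError("shares and fitness must have the same length")
    # Population-average fitness, weighted by the current shares.
    mean_fitness = sum(s * f for s, f in zip(shares, fitness))
    if mean_fitness == 0:
        # No type scored anything; leave the shares unchanged.
        return list(shares)
    # Types above the mean grow, types below the mean shrink.
    return [s * f / mean_fitness for s, f in zip(shares, fitness)]


# Illustrative numbers only: four types that start with equal shares.
shares = [0.25, 0.25, 0.25, 0.25]
accuracy = [0.9, 0.7, 0.5, 0.3]
for _ in range(3):
    shares = replicator_step(shares, accuracy)
print([round(s, 3) for s in shares])
```

Because every share is divided by the same mean fitness, the shares keep summing to one, so the update redistributes a fixed budget towards the better-performing types rather than changing the total.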

Item Type: Article
Identification Number: https://doi.org/10.1016/j.future.2019.07.067
Dates:
26 July 2019: Accepted
1 August 2019: Published Online
Uncontrolled Keywords: Ensemble learning, Data stream mining, Concept drifts, Bio-inspired algorithms, Non-stationary environments, Particle swarm optimisation, Replicator dynamics
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-05 - artificial intelligence
Divisions: Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Mohamed Gaber
Date Deposited: 28 Jul 2019 15:25
Last Modified: 22 Mar 2023 12:01
URI: https://www.open-access.bcu.ac.uk/id/eprint/7782
