Leveraging Asynchronous Parallel Computing to Produce Simple Genetic Programming Computational Models

Sambo, Aliyu Sani and Azad, R. Muhammad Atif and Kovalchuk, Yevgeniya and Indramohan, Vivek and Shah, H. (2020) Leveraging Asynchronous Parallel Computing to Produce Simple Genetic Programming Computational Models. In: ACM Symposium on Applied Computing, March 30, 2020 - April 3, 2020, Brno, Czech Republic. (In Press)

[img] Text
V3_Leveraging_Asynchronous_Parallel_GP_to_Produce_Simple_Models-2.pdf - Accepted Version
Restricted to Repository staff only

Download (1MB) | Request a copy

Abstract

Traditionally, reducing complexity in Machine Learning promises benefits such as less overfitting. However, complexity control in Genetic Programming (GP) often means reducing the sizes of the evolving expressions, and past literature shows that size reduction does not necessarily reduce overfitting. In fact, whether size consistently represents complexity is itself debatable. Therefore, this paper proposes evaluation time of an evolving model -- the computational time required to evaluate a model on data -- as the estimate of its complexity. Evaluation time depends upon the size, but crucially also on the composition of an evolving model, and can thus distil its underlying complexity.

To discourage complexity, this paper takes an innovative approach that
asynchronously evaluates multiple models concurrently. These models race to their completion; thus, those models that finish earlier, join the population earlier to breed further in a steady-state fashion. Thus, the computationally simpler models, even if less accurate, get further chances to evolve before the more accurate yet expensive models join the population. Crucially, since evaluation times vary from one execution to another, this paper also shows how to significantly minimise this variation.

The paper compares the proposed method on six challenging symbolic regression problems with both standard GP and GP with an effective bloat control method. The results demonstrated that the proposed asynchronous parallel GP (APGP) indeed produces individuals that are smaller, faster and more accurate than those in standard GP. While GP with bloat control (GP+BC) produced smaller individuals, it did so at the cost of lower accuracy than APGP both on training and test data, thus questioning the overall benefits of bloat control. Also, while APGP took the fewest evaluations to match the training accuracy of GP, GP+BC took the most.

These results, and the portability of evaluation time as an estimate of complexity encourage further experimentation and fine-tuning of this hitherto unexplored style of GP.

Item Type: Conference or Workshop Item (Paper)
Date: 4 April 2020
Uncontrolled Keywords: genetic programming, model complexity, parallel computing
Subjects: G400 Computer Science
G700 Artificial Intelligence
Divisions: Faculty of Computing, Engineering and the Built Environment
Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Faculty of Health, Education and Life Sciences > Centre for Social Care, Health and Related Research (C-SHARR) > Health Sciences
Depositing User: Raja Azad
Date Deposited: 06 Dec 2019 09:35
Last Modified: 15 May 2020 05:30
URI: http://www.open-access.bcu.ac.uk/id/eprint/8522

Actions (login required)

View Item View Item

Research

In this section...