Efficient Interleaved Sampling of Training Data in Genetic Programming

Azad, R. Muhammad Atif and Medernach, David and Ryan, Conor (2014) Efficient Interleaved Sampling of Training Data in Genetic Programming. In: Proceedings of Genetic and Evolutionary Computation Conference, GECCO '14. ACM, pp. 127-128. ISBN 978-1-4503-2662-9

Full text not available from this repository. (Request a copy)

Abstract

The ability to generalize beyond the training set is important for Genetic Programming (GP). Interleaved Sampling is a recently proposed approach to improve generalization in GP. In this technique, GP alternates between using the entire data set and only a single data point. Initial results showed that the technique not only produces solutions that generalize well, but that it so happens at a reduced computational expense as half the number of generations only evaluate a single data point.

This paper further investigates the merit of interleaving the use of training set with two alternatives approaches. These are: the use of random search instead of a single data point, and simply minimising the tree size. Both of these alternatives are computationally even cheaper than the original setup as they simply do not invoke the fitness function half the time. We test the utility of these new methods on four, well cited, and high dimensional problems from the symbolic regression domain.

The results show that the new approaches continue to produce general solutions despite taking only half the fitness evaluations. Size minimisation also prevents bloat while producing competitive results on both training and test data sets. The tree sizes with size minisation are substantially smaller than the rest of the setups, which further brings down the training costs.

Item Type: Book Section
Identification Number: https://doi.org/10.1145/2598394.2598480
Dates:
DateEvent
2014Published
Uncontrolled Keywords: Genetic Programming, Over-fitting, Interleaved Sampling, Computational Efficiency, Speedup technique, Robustness of solutions
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
Divisions: Faculty of Computing, Engineering and the Built Environment
Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Oana-Andreea Dumitrascu
Date Deposited: 12 Jun 2017 12:53
Last Modified: 22 Mar 2023 12:02
URI: https://www.open-access.bcu.ac.uk/id/eprint/4603

Actions (login required)

View Item View Item

Research

In this section...