Open Access Te Herenga Waka-Victoria University of Wellington
Browse

Rademacher Complexity for Enhancing the Generalization of Genetic Programming for Symbolic Regression

Download (2.05 MB)
journal contribution
posted on 2022-08-23, 23:26 authored by Qi ChenQi Chen, Bing XueBing Xue, Mengjie ZhangMengjie Zhang

Model complexity has a close relationship with the generalization ability and the interpretability of the learned models. Simple models are more likely to generalize well and easy to interpret. However, too much emphasis on minimizing complexity can prevent the discovery of more complex yet more accurate solutions. Genetic programming (GP) has a trend of generating overcomplex models that are difficult to interpret while not being able to generalize well. This work proposes a novel complexity measure based on the Rademacher complexity for GP for symbolic regression. The complexity of an evolved model is measured by the maximum correlation between the model and the Rademacher variables on the selected training instances. Taking minimizing the training error and the Rademacher complexity of the models as the two objectives, the proposed GP method has shown to be much superior to the standard GP on generalization performance. Compared with GP equipped with two state-of-the-art complexity measures, the proposed method still has a notable advance on generating a better front consisting of individuals with lower generalization errors and being simpler in the behavioral complexity. Further analyses reveal that compared with the state-of-the-art methods, the proposed GP method evolves models that are much closer to the target models in the model structure, and have better interpretability.


© 2020 IEEE.  Personal use of this material is permitted.  Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works 

History

Preferred citation

Chen, Q., Xue, B. & Zhang, M. (2022). Rademacher Complexity for Enhancing the Generalization of Genetic Programming for Symbolic Regression. IEEE Transactions on Cybernetics, 52(4), 2382-2395. https://doi.org/10.1109/TCYB.2020.3004361

Journal title

IEEE Transactions on Cybernetics

Volume

52

Issue

4

Publication date

2022-04-01

Pagination

2382-2395

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Publication status

Published

Online publication date

2022-04-05

ISSN

2168-2267

eISSN

2168-2275

Language

en