Rademacher Complexity for Enhancing the Generalization of Genetic Programming for Symbolic Regression
journal contribution
posted on 2022-08-23, 23:26 authored by Qi Chen, Bing Xue, Mengjie Zhang
<p>Model complexity is closely related to the generalization ability and the interpretability of learned models. Simple models are more likely to generalize well and are easier to interpret. However, too much emphasis on minimizing complexity can prevent the discovery of more complex yet more accurate solutions. Genetic programming (GP) tends to generate overly complex models that are difficult to interpret and do not generalize well. This work proposes a novel complexity measure based on the Rademacher complexity for GP for symbolic regression. The complexity of an evolved model is measured by the maximum correlation between the model and the Rademacher variables on the selected training instances. With minimizing the training error and minimizing the Rademacher complexity of the models as its two objectives, the proposed GP method is shown to be much superior to standard GP in generalization performance. Compared with GP equipped with two state-of-the-art complexity measures, the proposed method still shows a notable advantage: it generates a better front of individuals that have lower generalization errors and are simpler in behavioral complexity. Further analyses reveal that, compared with the state-of-the-art methods, the proposed GP method evolves models that are much closer in structure to the target models and have better interpretability.</p>
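The complexity measure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the measure is estimated as the largest absolute average of a model's outputs multiplied by randomly drawn ±1 (Rademacher) variables. The function name, the number of draws, the normalization by sample size, and the use of all supplied instances (rather than a selected subset) are assumptions made for illustration only.

```python
import random


def rademacher_complexity_estimate(predictions, n_draws=200, seed=0):
    """Hypothetical estimate of a single model's Rademacher complexity.

    predictions: the model's outputs on the chosen training instances.
    Returns the largest |(1/n) * sum(sigma_i * prediction_i)| observed over
    n_draws random sign vectors sigma (assumed form, not the paper's exact one).
    """
    n = len(predictions)
    if n == 0:
        raise ValueError("predictions must not be empty")
    rng = random.Random(seed)  # seeded so the estimate is reproducible
    best = 0.0
    for _ in range(n_draws):
        # Rademacher variables: +1 or -1 with equal probability.
        sigma = [rng.choice((-1, 1)) for _ in range(n)]
        score = abs(sum(s * p for s, p in zip(sigma, predictions))) / n
        best = max(best, score)
    return best


# Usage: a model with small, smooth outputs versus one with large swings.
smooth_outputs = [0.10, 0.12, 0.11, 0.09, 0.10, 0.13]
erratic_outputs = [5.0, -4.0, 6.5, -7.0, 3.0, -5.5]
smooth_score = rademacher_complexity_estimate(smooth_outputs)
erratic_score = rademacher_complexity_estimate(erratic_outputs)
```

In a two-objective setting like the one the abstract describes, a score of this kind would be minimized alongside the training error; how the paper selects training instances and combines the objectives is not specified on this page.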
<p>© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.</p>
Preferred citation
Chen, Q., Xue, B. & Zhang, M. (2022). Rademacher Complexity for Enhancing the Generalization of Genetic Programming for Symbolic Regression. IEEE Transactions on Cybernetics, 52(4), 2382-2395. https://doi.org/10.1109/TCYB.2020.3004361
Journal title
IEEE Transactions on Cybernetics
Volume
52
Issue
4
Publication date
2022-04-01
Pagination
2382-2395
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Publication status
Published
Online publication date
2022-04-05
ISSN
2168-2267
eISSN
2168-2275
Language
en
Keywords
Complexity theory; Predictive models; Mathematical model; Training; Data models; Genetic programming; Measurement uncertainty; Generalization; genetic programming (GP); Rademacher complexity; Humans; Learning; Prevention; Artificial Intelligence & Image Processing; Applied Mathematics not elsewhere classified; Artificial Intelligence and Image Processing; Electrical and Electronic Engineering not elsewhere classified


