A New Representation in PSO for Discretization-Based Feature Selection
journal contribution
posted on 2021-03-24, 01:11 authored by B Tran, Bing XueBing Xue, Mengjie ZhangMengjie ZhangIn machine learning, discretization and feature selection (FS) are important techniques for preprocessing data to improve the performance of an algorithm on high-dimensional data. Since many FS methods require discrete data, a common practice is to apply discretization before FS. In addition, for the sake of efficiency, features are usually discretized individually (or univariate). This scheme works based on the assumption that each feature independently influences the task, which may not hold in cases where feature interactions exist. Therefore, univariate discretization may degrade the performance of the FS stage since information showing feature interactions may be lost during the discretization process. Initial results of our previous proposed method [evolve particle swarm optimization (EPSO)] showed that combining discretization and FS in a single stage using bare-bones particle swarm optimization (BBPSO) can lead to a better performance than applying them in two separate stages. In this paper, we propose a new method called potential particle swarm optimization (PPSO) which employs a new representation that can reduce the search space of the problem and a new fitness function to better evaluate candidate solutions to guide the search. The results on ten high-dimensional datasets show that PPSO select less than 5% of the number of features for all datasets. Compared with the two-stage approach which uses BBPSO for FS on the discretized data, PPSO achieves significantly higher accuracy on seven datasets. In addition, PPSO obtains better (or similar) classification performance than EPSO on eight datasets with a smaller number of selected features on six datasets. Furthermore, PPSO also outperforms the three compared (traditional) methods and performs similar to one method on most datasets in terms of both generalization ability and learning capacity.
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Funding
Large-scale Evolutionary Feature Selection for Classification
Royal Society of New Zealand
Find out more...History
Preferred citation
Tran, B., Xue, B. & Zhang, M. (2017). A New Representation in PSO for Discretization-Based Feature Selection. IEEE Transactions on Cybernetics, 48(6). https://doi.org/10.1109/TCYB.2017.2714145Publisher DOI
Journal title
IEEE Transactions on CyberneticsVolume
48Issue
6Publication date
2017-06-23Pagination
(14)Publisher
Institute of Electrical and Electronics Engineers (IEEE)Publication status
AcceptedContribution type
ArticleISSN
2168-2267eISSN
2168-2275Language
enUsage metrics
Keywords
Science & TechnologyTechnologyAutomation & Control SystemsComputer Science, Artificial IntelligenceComputer Science, CyberneticsComputer ScienceClassificationdiscretizationfeature selection (FS)high-dimensional dataparticle swarm optimization (PSO)PARTICLE SWARM OPTIMIZATIONCLASSIFICATIONALGORITHMArtificial Intelligence & Image ProcessingApplied MathematicsElectrical and Electronic EngineeringArtificial Intelligence and Image Processing
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC