{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,6]],"date-time":"2026-06-06T16:48:15Z","timestamp":1780764495984,"version":"3.54.1"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2021,4,20]],"date-time":"2021-04-20T00:00:00Z","timestamp":1618876800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["81472860"],"award-info":[{"award-number":["81472860"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61761001"],"award-info":[{"award-number":["61761001"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Key Research and Development project in Hunan Province","award":["2020DK2002"],"award-info":[{"award-number":["2020DK2002"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Gene expression profiling has played a significant role in the identification and classification of tumor molecules. In gene expression data, only a few feature genes are closely related to tumors. It is a challenging task to select highly discriminative feature genes, and existing methods fail to deal with this problem efficiently. This article proposes a novel metaheuristic approach for gene feature extraction, called variable neighborhood learning Harris Hawks optimizer (VNLHHO). First, the F-score is used for a primary selection of the genes in gene expression data to narrow down the selection range of the feature genes. Subsequently, a variable neighborhood learning strategy is constructed to balance the global exploration and local exploitation of the Harris Hawks optimization. Finally, mutation operations are employed to increase the diversity of the population, so as to prevent the algorithm from falling into a local optimum. In addition, a novel activation function is used to convert the continuous solution of the VNLHHO into binary values, and a naive Bayesian classifier is utilized as a fitness function to select feature genes that can help classify biological tissues of binary and multi-class cancers. An experiment is conducted on gene expression profile data of eight types of tumors. The results show that the classification accuracy of the VNLHHO is greater than 96.128% for tumors in the colon, nervous system and lungs and 100% for the rest. We compare seven other algorithms and demonstrate the superiority of the VNLHHO in terms of the classification accuracy, fitness value and AUC value in feature selection for gene expression data.<\/jats:p>","DOI":"10.1093\/bib\/bbab097","type":"journal-article","created":{"date-parts":[[2021,3,5]],"date-time":"2021-03-05T12:11:20Z","timestamp":1614946280000},"source":"Crossref","is-referenced-by-count":34,"title":["Improving feature selection performance for classification of gene expression data using Harris Hawks optimizer with variable neighborhood learning"],"prefix":"10.1093","volume":"22","author":[{"given":"Chiwen","family":"Qu","sequence":"first","affiliation":[{"name":"College of Mathematics and Statistics, Hunan Normal University, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lupeng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Pathology and Pathophysiology, Jishou University School of Medicine, Jishou University, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jinlong","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Pathology and Pathophysiology, Jishou University School of Medicine, Jishou University, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Fang","family":"Deng","sequence":"additional","affiliation":[{"name":"Department of Epidemiology and Health Statistics, Xiangya Public Health School, Central South University, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yifan","family":"Tang","sequence":"additional","affiliation":[{"name":"Department of Pathology and Pathophysiology, Hunan Normal University School of Medicine, Hunan Normal University, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaomin","family":"Zeng","sequence":"additional","affiliation":[{"name":"Department of Epidemiology and Health Statistics, Xiangya Public Health School, Central South University, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoning","family":"Peng","sequence":"additional","affiliation":[{"name":"Department of Pathology and Pathophysiology, Hunan Normal University School of Medicine, Hunan Normal University, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2021,4,20]]},"reference":[{"key":"2021090815320995600_ref1","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1016\/j.ins.2018.11.019","article-title":"Optimized time-lag differential method for constructing gene regulatory network","volume":"478","author":"Paul","year":"2019","journal-title":"Inf Sci"},{"issue":"53","key":"2021090815320995600_ref2","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1016\/j.swevo.2019.04.004","article-title":"Feature selection for classification of microarray gene expression cancers using Bacterial Colony Optimization with multi-dimensional population","volume":"48","author":"Wang","year":"2019","journal-title":"Swarm Evol Comput"},{"issue":"4","key":"2021090815320995600_ref3","doi-asserted-by":"crossref","first-page":"975","DOI":"10.1016\/j.bbe.2018.08.004","article-title":"A hybrid gene selection method for microarray recognition","volume":"38","author":"Shukla","year":"2018","journal-title":"Biocybern Biomed Eng"},{"key":"2021090815320995600_ref4","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1016\/j.ins.2018.12.008","article-title":"Predicting disease-genes based on network information loss and protein complexes in heterogeneous network","volume":"479","author":"Lei","year":"2019","journal-title":"Inf Sci"},{"issue":"6","key":"2021090815320995600_ref5","doi-asserted-by":"crossref","first-page":"1765","DOI":"10.1109\/TCBB.2016.2602263","article-title":"Feature selection for optimized high-dimensional biomedical data using an improved shuffled frog leaping algorithm","volume":"15","author":"Hu","year":"2018","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2021090815320995600_ref6","doi-asserted-by":"publisher","DOI":"10.1016\/j.swevo.2020.100661","article-title":"Gene selection for cancer types classification using novel hybrid metaheuristics approach","volume":"54","author":"Shukla","year":"2020","journal-title":"Swarm Evol Comput"},{"key":"2021090815320995600_ref7","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1007\/s13258-020-00916-w","article-title":"Detecting biomarkers from microarray data using distributed correlation based gene selection","volume":"42","author":"Shukla","year":"2020","journal-title":"Genes Genomics"},{"key":"2021090815320995600_ref8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.patrec.2014.02.013","article-title":"T-test feature selection approach based on term frequency for text categorization","volume":"45","author":"Wang","year":"2014","journal-title":"Pattern Recogn Lett"},{"issue":"3","key":"2021090815320995600_ref9","doi-asserted-by":"crossref","first-page":"3747","DOI":"10.1016\/j.eswa.2011.09.073","article-title":"A new hybrid ant colony optimization algorithm for feature selection","volume":"39","author":"Kabir","year":"2012","journal-title":"Expert Syst Appl"},{"key":"2021090815320995600_ref10","author":"Zhang","year":"2003"},{"issue":"13","key":"2021090815320995600_ref11","first-page":"1","article-title":"Knowledge discovery in medical and biological datasets by integration of relief-F and correlation feature selection techniques","volume":"38","author":"Shukl","year":"2020","journal-title":"J Intell Fuzzy Syst"},{"issue":"12","key":"2021090815320995600_ref12","doi-asserted-by":"crossref","first-page":"6909","DOI":"10.1007\/s13369-017-2905-4","article-title":"An efficient multi-layer ensemble framework with BPSOGSA-based feature selection for credit scoring data analysis","volume":"43","author":"Edla","year":"2018","journal-title":"Arab J Sci Eng"},{"key":"2021090815320995600_ref13","doi-asserted-by":"crossref","first-page":"922","DOI":"10.1016\/j.asoc.2015.10.037","article-title":"Two hybrid wrapper-filter feature selection algorithms applied to high-dimensional microarray experiments","volume":"38","author":"Apolloni","year":"2016","journal-title":"Appl Soft Comput"},{"issue":"3","key":"2021090815320995600_ref14","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1007\/s12065-019-00306-6","article-title":"A study on metaheuristics approaches for gene selection in microarray data: algorithms, applications and open challenges","volume":"13","author":"Shukla","year":"2020","journal-title":"Evol Intel"},{"issue":"21","key":"2021090815320995600_ref15","first-page":"889","article-title":"A genetic algorithm-based feature selection","volume":"4","author":"Babatunde","year":"2014","journal-title":"Br J Math Comput Sci"},{"issue":"8","key":"2021090815320995600_ref16","doi-asserted-by":"crossref","first-page":"3494","DOI":"10.1016\/j.asoc.2013.03.021","article-title":"Modified binary PSO for feature selection using SVM applied to mortality prediction of septic patients","volume":"13","author":"Vieira","year":"2013","journal-title":"Appl Soft Comput"},{"issue":"9","key":"2021090815320995600_ref17","doi-asserted-by":"crossref","first-page":"11515","DOI":"10.1016\/j.eswa.2011.03.028","article-title":"Feature subset selection using differential evolution and a statistical repair mechanism","volume":"38","author":"Khushaba","year":"2011","journal-title":"Expert Syst Appl"},{"issue":"4","key":"2021090815320995600_ref18","first-page":"119","article-title":"Combination of PSO algorithm and naive Bayesian classification for Parkinson disease diagnosis","volume":"4","author":"Ghanad","year":"2015","journal-title":"Adv Comp Sci"},{"issue":"5","key":"2021090815320995600_ref19","first-page":"1257","article-title":"Modified bat algorithm for feature selection with the Wisconsin diagnosis breast cancer (WDBC) dataset","volume":"18","author":"Jeyasingh","year":"2017","journal-title":"Asian Pac J Cancer Prev"},{"key":"2021090815320995600_ref20","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1016\/j.knosys.2017.04.004","article-title":"A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data","volume":"126","author":"Wang","year":"2017","journal-title":"Knowl-Based Syst"},{"issue":"4","key":"2021090815320995600_ref21","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1016\/j.ygeno.2018.04.004","article-title":"Gene selection using hybrid binary black hole algorithm and modified binary particle swarm optimization","volume":"111","author":"Pashaei","year":"2019","journal-title":"Genomics"},{"key":"2021090815320995600_ref22","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1016\/j.ins.2019.06.063","article-title":"A new hybrid wrapper TLBO and SA with SVM approach for gene expression data","volume":"503","author":"Shukla","year":"2019","journal-title":"Inf Sci"},{"issue":"7\u20138","key":"2021090815320995600_ref23","doi-asserted-by":"crossref","first-page":"452","DOI":"10.1080\/01969722.2018.1541597","article-title":"BMNABC: binary multi-neighborhood artificial bee colony for high-dimensional discrete optimization problems","volume":"49","author":"Beheshti","year":"2018","journal-title":"Cybern Syst"},{"key":"2021090815320995600_ref24","doi-asserted-by":"publisher","first-page":"106092","DOI":"10.1016\/j.asoc.2020.106092","article-title":"Quantum based whale optimization algorithm for wrapper feature selection","volume":"89","author":"Agrawal","year":"2020","journal-title":"Appl Soft Comput"},{"key":"2021090815320995600_ref25","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1111\/coin.12341","article-title":"Feature selection inspired by human intelligence for improving classification accuracy of cancer types","author":"Shukla","year":"2020","journal-title":"Comput Intell"},{"issue":"1","key":"2021090815320995600_ref26","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1109\/4235.585893","article-title":"No free lunch theorems for optimization","volume":"1","author":"Wolpert","year":"1997","journal-title":"IEEE Trans Evol Comput"},{"key":"2021090815320995600_ref27","doi-asserted-by":"crossref","first-page":"849","DOI":"10.1016\/j.future.2019.02.028","article-title":"Harris Hawks optimization: algorithm and applications","volume":"97","author":"Heidari","year":"2019","journal-title":"Futur Gener Comput Syst"},{"key":"2021090815320995600_ref28","article-title":"A novel hybrid model based on multi-objective Harris Hawks optimization algorithm for daily PM2.5 and PM10 forecasting","volume":"96","author":"Du","year":"2019","journal-title":"arXiv: Learning"},{"issue":"12","key":"2021090815320995600_ref29","doi-asserted-by":"crossref","first-page":"1421","DOI":"10.3390\/rs11121421","article-title":"Dynamic Harris Hawks optimization with mutation mechanism for satellite image segmentation","volume":"11","author":"Jia","year":"2019","journal-title":"Remote Sens"},{"key":"2021090815320995600_ref30","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1007\/s00366-019-00828-8","article-title":"A novel Harris hawks' optimization and k-fold cross-validation predicting slope stability","volume":"37","author":"Moayedi","year":"2021","journal-title":"Eng Comput"},{"issue":"16","key":"2021090815320995600_ref31","doi-asserted-by":"crossref","first-page":"3590","DOI":"10.3390\/s19163590","article-title":"Harris Hawks optimization: a novel swarm intelligence technique for spatial assessment of landslide susceptibility","volume":"19","author":"Bui","year":"2019","journal-title":"Sensors"},{"key":"2021090815320995600_ref32","doi-asserted-by":"crossref","first-page":"100824","DOI":"10.1109\/ACCESS.2019.2930831","article-title":"Harmonic overloading minimization of frequency-dependent components in harmonics polluted distribution systems using Harris Hawks optimization algorithm","volume":"7","author":"Aleem","year":"2019","journal-title":"IEEE Access"},{"key":"2021090815320995600_ref33","doi-asserted-by":"crossref","first-page":"106656","DOI":"10.1016\/j.compchemeng.2019.106656","article-title":"A novel hybrid Harris Hawks optimization and support vector machines for drug design and discovery","volume":"133","author":"Houssein","year":"2020","journal-title":"Comput Chem Eng"},{"issue":"8","key":"2021090815320995600_ref34","doi-asserted-by":"crossref","first-page":"735","DOI":"10.3139\/120.111378","article-title":"A new hybrid Harris Hawks-Nelder-Mead optimization algorithm for solving design and manufacturing problems","volume":"61","author":"Yildiz","year":"2019","journal-title":"Mater Test"},{"key":"2021090815320995600_ref35","doi-asserted-by":"crossref","first-page":"118778","DOI":"10.1016\/j.jclepro.2019.118778","article-title":"Parameters identification of photovoltaic cells and modules using diversification-enriched Harris Hawks optimization with chaotic drifts","volume":"244","author":"Chen","year":"2020","journal-title":"J Clean Prod"},{"key":"2021090815320995600_ref36","doi-asserted-by":"crossref","DOI":"10.1016\/j.engappai.2019.103370","article-title":"Performance analysis of chaotic multi-verse Harris Hawks optimization: a case study on solving engineering problems","volume":"88","author":"Ewees","year":"2020","journal-title":"Eng Appl Artif Intell"},{"key":"2021090815320995600_ref37","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1016\/j.future.2020.04.008","article-title":"Multi-population differential evolution-assisted Harris Hawks optimization: framework and case studies","volume":"111","author":"Chen","year":"2020","journal-title":"Futur Gener Comput Syst"},{"key":"2021090815320995600_ref38","doi-asserted-by":"publisher","first-page":"117804","DOI":"10.1016\/j.energy.2020.117804","article-title":"Orthogonally adapted Harris Hawk Optimization for parameter estimation of photovoltaic models","volume":"203","author":"Jiao","year":"2020","journal-title":"Energy"},{"key":"2021090815320995600_ref39","doi-asserted-by":"crossref","first-page":"136","DOI":"10.1016\/j.asoc.2015.01.035","article-title":"Distributed feature selection: an application to microarray data classification","volume":"30","author":"Bol\u00f3n-Canedo","year":"2015","journal-title":"Appl Soft Comput"},{"key":"2021090815320995600_ref40","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.compbiomed.2016.12.002","article-title":"Wrapper-based gene selection with Markov blanket","volume":"81","author":"Wang","year":"2017","journal-title":"Comput Biol Med"},{"key":"2021090815320995600_ref41","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1016\/j.chemolab.2018.11.010","article-title":"Hybrid binary coral reefs optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical datasets","volume":"184","author":"Yan","year":"2019","journal-title":"Chemom Intell Lab Syst"},{"issue":"3","key":"2021090815320995600_ref42","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1016\/j.bbe.2016.05.001","article-title":"A novel feature extraction approach based on ensemble feature selection and modified discriminant independent component analysis for microarray data classification","volume":"36","author":"Mollaee","year":"2016","journal-title":"Biocybern Biomed Eng"},{"issue":"2","key":"2021090815320995600_ref43","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1007\/s10489-011-0325-9","article-title":"Parallel multi-swarm optimizer for gene selection in DNA microarrays","volume":"37","author":"Garc\u00ed a-Nieto","year":"2012","journal-title":"Appl Intell"},{"key":"2021090815320995600_ref44","doi-asserted-by":"crossref","first-page":"136","DOI":"10.1016\/j.asoc.2015.01.035","article-title":"Distributed featureselection: an application to microarray data classification","volume":"30","author":"Bol\u00f3n-Canedo","year":"2015","journal-title":"Appl Soft Comput"},{"issue":"6","key":"2021090815320995600_ref45","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1007\/s00521-007-0110-1","article-title":"Classification consistency analysis for bootstrapping gene selection","volume":"16","author":"Pang","year":"2007","journal-title":"Neural Comput Applic"},{"issue":"2","key":"2021090815320995600_ref46","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1016\/j.compbiomed.2009.11.014","article-title":"Tumor classification by combining PNN classifier ensemble with neighborhood rough set based gene reduction","volume":"40","author":"Wang","year":"2010","journal-title":"Comput Biol Med"},{"issue":"3","key":"2021090815320995600_ref47","doi-asserted-by":"crossref","first-page":"2752","DOI":"10.1016\/j.eswa.2010.08.065","article-title":"Colon cancer prediction with genetics profiles using evolutionary techniques","volume":"38","author":"Kulkarni","year":"2011","journal-title":"Expert Syst Appl"},{"issue":"6870","key":"2021090815320995600_ref48","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/415436a","article-title":"Prediction of central nervous system embryonal tumour outcome based on gene expression","volume":"415","author":"Pomeroy","year":"2002","journal-title":"Nature"},{"issue":"51","key":"2021090815320995600_ref49","doi-asserted-by":"crossref","first-page":"31470","DOI":"10.1074\/jbc.271.49.31470","article-title":"Molecular characterization of human Zyxin","volume":"269","author":"Macalma","year":"1996","journal-title":"J Biol Chem"},{"key":"2021090815320995600_ref50","article-title":"Knowledge Discovery Approaches to Gene Expression Data Interpretation","author":"Aguilar-Ruiz","year":"2013","journal-title":"Appl Mach Learn"},{"issue":"4","key":"2021090815320995600_ref51","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1016\/j.jbi.2004.07.009","article-title":"Cancer classification and prediction using logistic regression with Bayesian gene selection","volume":"37","author":"Zhou","year":"2004","journal-title":"J Biomed Inform"},{"issue":"1","key":"2021090815320995600_ref52","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-4-24","article-title":"A data review and re-assessment of ovarian cancer serum proteomic profiling","volume":"4","author":"Sorace","year":"2003","journal-title":"BMC Bioinformatics"},{"issue":"5","key":"2021090815320995600_ref53","doi-asserted-by":"crossref","first-page":"e190","DOI":"10.3324\/haematol.2014.115337","article-title":"MLL partial tandem duplication leukemia cells are sensitive to small molecule DOT1L inhibition","volume":"100","author":"Kuhn","year":"2015","journal-title":"Haematologica"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab097\/40260240\/bbab097.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab097\/40260240\/bbab097.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,8]],"date-time":"2021-09-08T15:33:00Z","timestamp":1631115180000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab097\/6238587"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,20]]},"references-count":53,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,9,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab097","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,9]]},"published":{"date-parts":[[2021,4,20]]},"article-number":"bbab097"}}