{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T10:58:01Z","timestamp":1772449081008,"version":"3.50.1"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Feature selection approaches have been widely applied to deal with the small sample size problem in the analysis of micro-array datasets. For the multiclass problem, the proposed methods are based on the idea of selecting a gene subset to distinguish all classes. However, it will be more effective to solve a multiclass problem by splitting it into a set of two-class problems and solving each problem with a respective classification system.<\/jats:p>\n               <jats:p>Results: We propose a genetic programming (GP)-based approach to analyze multiclass microarray datasets. Unlike the traditional GP, the individual proposed in this article consists of a set of small-scale ensembles, named as sub-ensemble (denoted by SE). Each SE consists of a set of trees. In application, a multiclass problem is divided into a set of two-class problems, each of which is tackled by a SE first. The SEs tackling the respective two-class problems are combined to construct a GP individual, so each individual can deal with a multiclass problem directly. Effective methods are proposed to solve the problems arising in the fusion of SEs, and a greedy algorithm is designed to keep high diversity in SEs. This GP is tested in five datasets. The results show that the proposed method effectively implements the feature selection and classification tasks.<\/jats:p>\n               <jats:p>Contact: \u00a0lkhqz@163.com; khliu1977@gmail.com<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn644","type":"journal-article","created":{"date-parts":[[2008,12,17]],"date-time":"2008-12-17T03:38:42Z","timestamp":1229485122000},"page":"331-337","source":"Crossref","is-referenced-by-count":60,"title":["A genetic programming-based approach to the classification of multiclass microarray datasets"],"prefix":"10.1093","volume":"25","author":[{"given":"Kun-Hong","family":"Liu","sequence":"first","affiliation":[{"name":"1 School of Software, Xiamen University, Xiamen, Fujian, 361005, china, 2Intelligent Computing Lab, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, P.O. Box 1130, Hefei, Anhui, 230031, China, 3Department of Automation and 4School of Life Science, University of Science and Technology of China, Hefei, Anhui 230026, China"},{"name":"1 School of Software, Xiamen University, Xiamen, Fujian, 361005, china, 2Intelligent Computing Lab, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, P.O. Box 1130, Hefei, Anhui, 230031, China, 3Department of Automation and 4School of Life Science, University of Science and Technology of China, Hefei, Anhui 230026, China"},{"name":"1 School of Software, Xiamen University, Xiamen, Fujian, 361005, china, 2Intelligent Computing Lab, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, P.O. Box 1130, Hefei, Anhui, 230031, China, 3Department of Automation and 4School of Life Science, University of Science and Technology of China, Hefei, Anhui 230026, China"}]},{"given":"Chun-Gui","family":"Xu","sequence":"additional","affiliation":[{"name":"1 School of Software, Xiamen University, Xiamen, Fujian, 361005, china, 2Intelligent Computing Lab, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, P.O. Box 1130, Hefei, Anhui, 230031, China, 3Department of Automation and 4School of Life Science, University of Science and Technology of China, Hefei, Anhui 230026, China"},{"name":"1 School of Software, Xiamen University, Xiamen, Fujian, 361005, china, 2Intelligent Computing Lab, Hefei Institute of Intelligent Machines, Chinese Academy of Sciences, P.O. Box 1130, Hefei, Anhui, 230031, China, 3Department of Automation and 4School of Life Science, University of Science and Technology of China, Hefei, Anhui 230026, China"}]}],"member":"286","published-online":{"date-parts":[[2008,12,16]]},"reference":[{"key":"2023013110005094700_B1","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random Forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"2023013110005094700_B2","volume-title":"Classification and Regression Trees.","author":"Breiman","year":"1984"},{"key":"2023013110005094700_B3","doi-asserted-by":"crossref","first-page":"13790","DOI":"10.1073\/pnas.191502998","article-title":"Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses","volume":"98","author":"Bhattacharjee","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013110005094700_B4","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1016\/j.artmed.2003.06.001","article-title":"A constrained-syntax genetic programming system for discovering classification rules: application to medical data sets","volume":"30","author":"Bojarczuk","year":"2004","journal-title":"Artif. Intell. Med."},{"key":"2023013110005094700_B5","doi-asserted-by":"crossref","first-page":"1957","DOI":"10.1016\/j.patcog.2004.03.016","article-title":"Learning effective classifiers with Z-value measure based on genetic programming","volume":"37","author":"Chien","year":"2004","journal-title":"Pattern Recognit."},{"key":"2023013110005094700_B6","doi-asserted-by":"crossref","first-page":"837","DOI":"10.1093\/bioinformatics\/btl008","article-title":"Genetic test bed for feature selection","volume":"22","author":"Choudhary","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013110005094700_B7","doi-asserted-by":"crossref","first-page":"3583","DOI":"10.1093\/bioinformatics\/bth447","article-title":"BagBoosting for tumor classification with gene expression data","volume":"20","author":"Dettling","year":"2004","journal-title":"Bioinformatics"},{"key":"2023013110005094700_B8","doi-asserted-by":"crossref","first-page":"822","DOI":"10.1038\/35090585","article-title":"Delineation of prognostic biomarkers in prostate cancer","volume":"412","author":"Dhanasekaran","year":"2001","journal-title":"Nature"},{"key":"2023013110005094700_B9","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023013110005094700_B10","doi-asserted-by":"crossref","first-page":"993","DOI":"10.1109\/34.58871","article-title":"Neural network ensembles","volume":"12","author":"Hansen","year":"1990","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023013110005094700_B11","doi-asserted-by":"crossref","first-page":"832","DOI":"10.1109\/34.709601","article-title":"The random subspace method for constructing decision forests","volume":"20","author":"Ho","year":"1998","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023013110005094700_B12","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1016\/j.artmed.2005.06.002","article-title":"The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming","volume":"36","author":"Hong","year":"2006","journal-title":"Artif. Intell. Med."},{"key":"2023013110005094700_B13","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1186\/1471-2105-6-148","article-title":"Feature selection and classification for microarray data analysis: evolutionary methods for identifying predictive genes","volume":"6","author":"Jirapech-Umpai","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023013110005094700_B14","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1038\/89044","article-title":"Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks","volume":"7","author":"Khan","year":"2001","journal-title":"Nat. Med."},{"key":"2023013110005094700_B15","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1109\/4235.873235","article-title":"Application of genetic programming for multicategory pattern classification","volume":"4","author":"Kishore","year":"2000","journal-title":"IEEE Trans. Evol. Comput."},{"key":"2023013110005094700_B16","volume-title":"Genetic Programming: On the Programming of Computers by Means of Natural Selection.","author":"Koza","year":"1992"},{"key":"2023013110005094700_B17","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/S0031-3203(99)00041-2","article-title":"Comparison of algorithms that select features for pattern classifiers","volume":"33","author":"Kudo","year":"2000","journal-title":"Pattern Recognit."},{"key":"2023013110005094700_B18","doi-asserted-by":"crossref","DOI":"10.1002\/0471660264","volume-title":"Combining pattern classifiers: methods and algorithms.","author":"Kuncheva","year":"2004"},{"key":"2023013110005094700_B19","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1023\/B:GENP.0000030196.55525.f7","article-title":"Genetic programming for mining DNA chip data from cancer patients","volume":"5","author":"Langdon","year":"2004","journal-title":"Genet. Program. Evol. Mach."},{"key":"2023013110005094700_B20","doi-asserted-by":"crossref","first-page":"869","DOI":"10.1016\/j.csda.2004.03.017","article-title":"An extensive comparison of recent classification tools applied to microarray data","volume":"48","author":"Lee","year":"2005","journal-title":"Comput. Stat. Data Anal."},{"key":"2023013110005094700_B21","doi-asserted-by":"crossref","first-page":"1131","DOI":"10.1093\/bioinformatics\/17.12.1131","article-title":"Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA\/KNN method","volume":"17","author":"Li","year":"2001","journal-title":"Bioinformatics"},{"key":"2023013110005094700_B22","doi-asserted-by":"crossref","first-page":"1638","DOI":"10.1093\/bioinformatics\/bth098","article-title":"Application of the GA\/KNN method to SELDI proteomics data","volume":"20","author":"Li","year":"2004","journal-title":"Bioinformatics"},{"key":"2023013110005094700_B23","doi-asserted-by":"crossref","first-page":"2426","DOI":"10.1016\/j.patcog.2006.01.004","article-title":"Pattern classification in DNA microarray data of multiple tumor types","volume":"39","author":"Lin","year":"2006","journal-title":"Pattern Recognit."},{"key":"2023013110005094700_B24","doi-asserted-by":"crossref","first-page":"2691","DOI":"10.1093\/bioinformatics\/bti419","article-title":"Multiclass cancer classification and biomarker discovery using GA-based algorithms","volume":"21","author":"Liu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013110005094700_B25","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1109\/TEVC.2004.825567","article-title":"A novel approach to design classifiers using genetic programming","volume":"8","author":"Muni","year":"2004","journal-title":"IEEE Trans. Evol. Comput."},{"key":"2023013110005094700_B26","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1093\/bioinformatics\/19.1.37","article-title":"Genetic algorithms applied to multi-class prediction for the analysis of gene expression data","volume":"19","author":"Ooi","year":"2003","journal-title":"Bioinformatics"},{"key":"2023013110005094700_B27","doi-asserted-by":"crossref","first-page":"3185","DOI":"10.1093\/bioinformatics\/bth383","article-title":"Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reduction","volume":"20","author":"Pochet","year":"2004","journal-title":"Bioinformatics"},{"key":"2023013110005094700_B28","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1038\/73432","article-title":"Systematic variation in gene expression patterns in human cancer cell lines","volume":"24","author":"Ross","year":"2000","journal-title":"Nat. Genet."},{"key":"2023013110005094700_B29","doi-asserted-by":"crossref","first-page":"1776","DOI":"10.1007\/3-540-45110-2_69","article-title":"Dynamic maximum tree depth \u2013 a simple technique for avoiding bloat in tree-based GP","volume-title":"Genetic and Evolutionary Computation Conference 2003.","author":"Silva","year":"2003"},{"key":"2023013110005094700_B30","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1007\/s10994-006-9449-2","article-title":"An analysis of diversity measures","volume":"65","author":"Tang","year":"2006","journal-title":"Mach. Learn."},{"key":"2023013110005094700_B31","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1593\/neo.07121","article-title":"Feature selection and molecular classification of cancer using genetic programming","volume":"9","author":"Yu","year":"2007","journal-title":"Neoplasia"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/3\/331\/48983240\/bioinformatics_25_3_331.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/3\/331\/48983240\/bioinformatics_25_3_331.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T18:44:36Z","timestamp":1675190676000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/3\/331\/244785"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,12,16]]},"references-count":31,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2009,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn644","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,2,1]]},"published":{"date-parts":[[2008,12,16]]}}}