{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T11:09:02Z","timestamp":1775646542278,"version":"3.50.1"},"reference-count":67,"publisher":"Oxford University Press (OUP)","issue":"8","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The microarray report measures the expressions of tens of thousands of genes, producing a feature vector that is high in dimensionality and that contains much irrelevant information. This dimensionality degrades classification performance. Moreover, datasets typically contain few samples for training, leading to the \u2018curse of dimensionality\u2019 problem. It is essential, therefore, to find good methods for reducing the size of the feature set.<\/jats:p>\n               <jats:p>Results: In this article, we propose a method for gene microarray classification that combines different feature reduction approaches for improving classification performance. Using a support vector machine (SVM) as our classifier, we examine an SVM trained using a set of selected genes; an SVM trained using the feature set obtained by Neighborhood Preserving Embedding feature transform; a set of SVMs trained using a set of orthogonal wavelet coefficients of different wavelet mothers; a set of SVMs trained using texture descriptors extracted from the microarray, considering it as an image; and an ensemble that combines the best feature extraction methods listed above. The positive results reported offer confirmation that combining different features extraction methods greatly enhances system performance. The experiments were performed using several different datasets, and our results [expressed as both accuracy and area under the receiver operating characteristic (ROC) curve] show the goodness of the proposed approach with respect to the state of the art.<\/jats:p>\n               <jats:p>Availability: The MATHLAB code of the proposed approach is publicly available at bias.csr.unibo.it\/nanni\/micro.rar<\/jats:p>\n               <jats:p>Contact: \u00a0loris.nanni@unipd.it<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts108","type":"journal-article","created":{"date-parts":[[2012,3,6]],"date-time":"2012-03-06T04:17:59Z","timestamp":1331007479000},"page":"1151-1157","source":"Crossref","is-referenced-by-count":37,"title":["Combining multiple approaches for gene microarray classification"],"prefix":"10.1093","volume":"28","author":[{"given":"Loris","family":"Nanni","sequence":"first","affiliation":[]},{"given":"Sheryl","family":"Brahnam","sequence":"additional","affiliation":[]},{"given":"Alessandra","family":"Lumini","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2012,3,5]]},"reference":[{"key":"2023012711533411100_B1","doi-asserted-by":"crossref","first-page":"6745","DOI":"10.1073\/pnas.96.12.6745","article-title":"Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays","volume":"96","author":"Alon","year":"1999","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533411100_B2","doi-asserted-by":"crossref","first-page":"816","DOI":"10.1038\/nm733","article-title":"Gene-expression profiles predict survival of patients with lung adenocarcinoma","volume":"8","author":"Beer","year":"2002","journal-title":"Nat. Med."},{"key":"2023012711533411100_B3","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1089\/106652700750050943","article-title":"Tissue classification with gene expression profiles","volume":"7","author":"Ben-Dor","year":"2003","journal-title":"J. Comput. Biol."},{"key":"2023012711533411100_B4","first-page":"60","article-title":"Classification of DNA microarray data with random projection ensembles of polynomial","volume-title":"18th Italian Workshop on Neural Networks.","author":"Bertoni","year":"2009"},{"key":"2023012711533411100_B5","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1016\/j.patcog.2011.06.006","article-title":"An ensemble of filters and classifiers for microarray data classification","volume":"45","author":"Bol\u00f3n-Canedo","year":"2012","journal-title":"Pattern Recognit."},{"key":"2023012711533411100_B6","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1109\/TCBB.2007.1014","article-title":"A blocking strategy to improve gene selection for classification of gene expression data","volume":"4","author":"Bontempi","year":"2007","journal-title":"IEEE\/ACM Trans. Comput. Biol. Biofrom."},{"key":"2023012711533411100_B7","doi-asserted-by":"crossref","first-page":"2562","DOI":"10.1093\/bioinformatics\/bth285","article-title":"A mixture model-based strategy for selecting sets of genes in multiclass response microarray experiments","volume":"20","author":"Broet","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012711533411100_B8","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1073\/pnas.97.1.262","article-title":"Knowledge-based analysis of microarray gene expression data by using support vector machines","volume":"97","author":"Brown","year":"2000","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533411100_B9","doi-asserted-by":"crossref","first-page":"610","DOI":"10.1126\/science.274.5287.610","article-title":"Assessing genetic information with high-density dna arrays","volume":"274","author":"Chee","year":"1996","journal-title":"Science"},{"key":"2023012711533411100_B10","doi-asserted-by":"crossref","first-page":"3209","DOI":"10.1016\/j.eswa.2010.09.009","article-title":"A novel support vector sampling technique to improve classification accuracy and to identify key genes of leukaemia and prostrate cancer","volume":"38","author":"Chen","year":"2011","journal-title":"Expert Syst Appl"},{"key":"2023012711533411100_B11","doi-asserted-by":"crossref","first-page":"636","DOI":"10.1109\/TCBB.2009.8","article-title":"A sparse learning machine for high-dimensional data with application to microarray gene analysis","volume":"7","author":"Cheng","year":"2010","journal-title":"IEEE\/ACM Trans. Comput. Biol. Biofrom."},{"key":"2023012711533411100_B12","first-page":"1","article-title":"Statistical comparisons of classifiers over multiple data sets","volume":"7","author":"Demsar","year":"2006","journal-title":"J. Mach. Learn. Res."},{"key":"2023012711533411100_B13","doi-asserted-by":"crossref","first-page":"15623","DOI":"10.1073\/pnas.95.26.15623","article-title":"Identification of genes differently regulated by interferon alpha, beta, or gamma using oligonucleotide arrays","volume":"95","author":"Der","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533411100_B14","volume-title":"Statistics: the Exploration and Analysis of Data.","author":"Devore","year":"1997"},{"key":"2023012711533411100_B15","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1198\/016214502753479248","article-title":"Comparison of discrimination methods for the classification of tumors using gene expression data","volume":"97","author":"Dudoit","year":"2002","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012711533411100_B16","first-page":"I","article-title":"Baysean learning of sparse classifiers","volume-title":"Computer Vision and Pattern Recognition (CVPR '01).","author":"Figuiredo","year":"2001"},{"key":"2023012711533411100_B17","doi-asserted-by":"crossref","first-page":"6503","DOI":"10.1158\/0008-5472.CAN-04-0452","article-title":"Gene expression profiling of gliomas strongly predicts survival","volume":"64","author":"Freije","year":"2004","journal-title":"Cancer Res."},{"key":"2023012711533411100_B18","first-page":"64","article-title":"Data selection for support vector machine classifiers","volume-title":"Association for Computing Machinery Special Interest Group on Knowledge Discovery and Data Mining.","author":"Fung","year":"2000"},{"key":"2023012711533411100_B19","doi-asserted-by":"crossref","first-page":"906","DOI":"10.1093\/bioinformatics\/16.10.906","article-title":"Support vector machine classification and validation of cancer tissue samples using microarray expression data","volume":"16","author":"Furey","year":"2000","journal-title":"Bioinformatics."},{"key":"2023012711533411100_B20","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1109\/TCBB.2010.36","article-title":"Cancer classification from gene expression data by NPPC ensemble","volume":"8","author":"Ghorai","year":"2011","journal-title":"IEEE\/ACM Trans. Comput. Biol. Biofrom."},{"key":"2023012711533411100_B21","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1155\/JBB.2005.147","article-title":"Classification and selection of biomarkers in genomic data using LASSO","volume":"2","author":"Ghosh","year":"2005","journal-title":"J. Biomed. Biotechnol."},{"key":"2023012711533411100_B22","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class predition by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science."},{"key":"2023012711533411100_B23","first-page":"4963","article-title":"Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma","volume":"62","author":"Gordon","year":"2002","journal-title":"Cancer Res."},{"key":"2023012711533411100_B24","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1012487302797","article-title":"Gene selection for cancer classification using support vector machines","volume":"46","author":"Guyon","year":"2002","journal-title":"Mach. Learn."},{"key":"2023012711533411100_B25","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-84858-7","volume-title":"The Elements of Statistical Learning.","author":"Hastie","year":"2009"},{"key":"2023012711533411100_B26","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1016\/j.jtbi.2010.11.017","article-title":"Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition","volume":"271","author":"Hayat","year":"2011","journal-title":"J. Theor. Biol."},{"key":"2023012711533411100_B27","article-title":"Neighborhood preserving embedding","volume-title":"Tenth IEEE International Conference on Computer Vision (ICCV'2005)","author":"He","year":"2005"},{"key":"2023012711533411100_B28","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1016\/j.compbiomed.2010.06.004","article-title":"Decision forest for clssification of gene expression data","volume":"40","author":"Huang","year":"2010","journal-title":"Comput. Biol. Med."},{"key":"2023012711533411100_B29","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1016\/j.artmed.2005.01.006","article-title":"Gene extraction for cancer diagnosis by support vector machines-an improvement","volume":"40","author":"Huang","year":"2005","journal-title":"Artif. Intel. Med."},{"key":"2023012711533411100_B30","doi-asserted-by":"crossref","first-page":"2375","DOI":"10.1016\/j.neucom.2010.03.024","article-title":"A hybrid LDA and genetic algorithm for gene selection and classification of microarray data","volume":"73","author":"Huerta","year":"2010","journal-title":"Neurocomputing"},{"key":"2023012711533411100_B31","doi-asserted-by":"crossref","DOI":"10.1109\/ICDM.2005.113","article-title":"Orthogonal Neighborhood Preserving Projections","volume-title":"IEEE International conference on Data Mining.","author":"Kokiopoulou","year":"2005"},{"key":"2023012711533411100_B32","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/S0031-3203(99)00041-2","article-title":"Comparison of algorithms that select features for pattern classifiers","volume":"33","author":"Kudo","year":"2000","journal-title":"Pattern Recognit."},{"key":"2023012711533411100_B33","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1023\/A:1022859003006","article-title":"Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy","volume":"51","author":"Kuncheva","year":"2003","journal-title":"Mach. Learn."},{"key":"2023012711533411100_B34","doi-asserted-by":"crossref","first-page":"3146","DOI":"10.1093\/bioinformatics\/bth379","article-title":"Statistical method for identifying diferential gene-gene coexpression patterns","volume":"20","author":"Lai","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012711533411100_B35","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1109\/TCBB.2008.36","article-title":"Investigating the efficiacy of nonlinear dimensionality reduction schemes in classifying gene- and protein-expression studies","volume":"5","author":"Lee","year":"2008","journal-title":"IEEE\/ACM Trans. Comput. Biol. Biofrom."},{"key":"2023012711533411100_B36","first-page":"51","article-title":"A comparative study on feature selection and classification methods using gene expression profiles and proteomic patterns","volume":"13","author":"Liu","year":"2002","journal-title":"Genome Inform."},{"key":"2023012711533411100_B37","doi-asserted-by":"crossref","first-page":"985","DOI":"10.1016\/j.neucom.2008.04.010","article-title":"Wavelet feature extraction for high dimensional microarray data","volume":"72","author":"Liu","year":"2009","journal-title":"Neurocomputing"},{"key":"2023012711533411100_B38","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1016\/j.compbiomed.2011.04.005","article-title":"Methods of forward feature selection based on the aggregation of classifiers generated by single attribute","volume":"41","author":"Luo","year":"2011","journal-title":"Comput Biol Med."},{"key":"2023012711533411100_B39","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1016\/j.artmed.2006.06.002","article-title":"Selection of relevant genes in cancer diagnosis based on their prediction accuracy","volume":"40","author":"Maglietta","year":"2007","journal-title":"Artif. Intel. Med."},{"key":"2023012711533411100_B40","doi-asserted-by":"crossref","first-page":"990","DOI":"10.1016\/j.eswa.2010.07.104","article-title":"Wavelet selection for disease classification by DNA microarray data","volume":"38","author":"Nanni","year":"2011","journal-title":"Expert Syst Appl."},{"key":"2023012711533411100_B41","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1016\/j.artmed.2010.02.006","article-title":"Local binary patterns variants as texture descriptors for medical image analysis","volume":"49","author":"Nanni","year":"2010","journal-title":"Artif. Intel. Med."},{"key":"2023012711533411100_B42","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1109\/TPAMI.2002.1017623","article-title":"Multiresolution gray-scale and rotation invariant texture classification with local binary patterns","volume":"24","author":"Ojala","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023012711533411100_B43","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1007\/978-3-540-69905-7_27","article-title":"Blur insensitive texture classification using local phase quantization","volume-title":"International Conference on Image and Signal Processing.","author":"Ojansivu","year":"2008"},{"key":"2023012711533411100_B44","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1016\/j.patrec.2011.09.016","article-title":"An effective double-bounded tree-connected isomap algorithm for microarray data classification","volume":"33","author":"Orsenigo","year":"2011","journal-title":"Pattern Recognit. Lett."},{"key":"2023012711533411100_B45","doi-asserted-by":"crossref","first-page":"2489","DOI":"10.1016\/j.patrec.2010.08.003","article-title":"Improved direct LDA and its application to DNA microarray gene expression data","volume":"31","author":"Paliwal","year":"2010","journal-title":"Pattern Recognit. Lett."},{"key":"2023012711533411100_B46","doi-asserted-by":"crossref","first-page":"1226","DOI":"10.1109\/TPAMI.2005.159","article-title":"Feature selection on mutual information: criteria of max-dependency, max-relevance, and min-redundancy","volume":"27","author":"Peng","year":"2005","journal-title":"IEEE Trans. Pattern Anal. Mach. Intel."},{"key":"2023012711533411100_B47","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1016\/j.compbiomed.2005.04.001","article-title":"A novel ensemble machine learning for robust microarray data classification","volume":"36","author":"Peng","year":"2006","journal-title":"Comput. Biol. Med."},{"key":"2023012711533411100_B48","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1016\/S0140-6736(02)07746-2","article-title":"Use of proteomic patterns in serum to identify ovarian cancer","volume":"359","author":"Petricoin","year":"2002","journal-title":"Lancet"},{"key":"2023012711533411100_B49","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/415436a","article-title":"Prediction of central nervous system embryonal tumour outcome based on gene expression","volume":"415","author":"Pomeroy","year":"2002","journal-title":"Nature"},{"key":"2023012711533411100_B50","first-page":"888","article-title":"A study of performance on microarray data sets for a classifier based on information theoretic learning","volume":"24","author":"Porto-D\u00edaz","year":"2011","journal-title":"Neural Netw."},{"key":"2023012711533411100_B51","doi-asserted-by":"crossref","first-page":"1119","DOI":"10.1016\/0167-8655(94)90127-9","article-title":"Floating search methods in feature selection","volume":"5","author":"Pudil","year":"1994","journal-title":"Pattern Recognit. Lett."},{"key":"2023012711533411100_B52","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1109\/TNN.2003.809398","article-title":"The generalized LASSO","volume":"15","author":"Roth","year":"2004","journal-title":"IEEE Trans. Neural Netw."},{"key":"2023012711533411100_B53","doi-asserted-by":"crossref","first-page":"2323","DOI":"10.1126\/science.290.5500.2323","article-title":"Nonlinear dimensionality reduction by locally linear embedding","volume":"290","author":"Roweis","year":"2000","journal-title":"Science"},{"key":"2023012711533411100_B54","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1109\/TCBB.2005.22","article-title":"Dimension reduction-based penalized logistic regression for cancer classification using microarray data","volume":"2","author":"Shen","year":"2005","journal-title":"IEEE\/ACM Trans. Comput. Biol. Biofrom."},{"key":"2023012711533411100_B55","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1186\/1471-2105-12-375","article-title":"Top scoring pairs for feature selection in machine learning and applications to cancer outcome prediction","volume":"12","author":"Shi","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023012711533411100_B56","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nm0102-68","article-title":"Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning","volume":"8","author":"Shipp","year":"2002","journal-title":"Nat. Med."},{"key":"2023012711533411100_B57","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/S1535-6108(02)00030-2","article-title":"Gene expression correlates of clinical prostate cancer behavior","volume":"1","author":"Singh","year":"2002","journal-title":"Cancer Cell"},{"key":"2023012711533411100_B58","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1186\/1471-2105-9-319","article-title":"A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification","volume":"9","author":"Statnikov","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012711533411100_B59","doi-asserted-by":"crossref","first-page":"2346","DOI":"10.1016\/j.neucom.2010.02.024","article-title":"Finding optimal classifiers for small feature sets in genomics and protoemics","volume":"73","author":"Stiglic","year":"2010","journal-title":"Neurocomputing"},{"key":"2023012711533411100_B60","article-title":"Protein subcellular localization of fluorescence imagery using spatial and transform domain features","author":"Tahir","year":"2011","journal-title":"Bioinformatics."},{"key":"2023012711533411100_B61","doi-asserted-by":"crossref","first-page":"2907","DOI":"10.1073\/pnas.96.6.2907","article-title":"Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation","volume":"96","author":"Tamayo","year":"1999","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533411100_B62","first-page":"75","article-title":"Ensemble machine learning on gene expression data for cancer classification","volume":"2","author":"Tan","year":"2003","journal-title":"Appl. Bioinformatics"},{"key":"2023012711533411100_B63","doi-asserted-by":"crossref","first-page":"6567","DOI":"10.1073\/pnas.082099299","article-title":"Diagnosis of multiple cancer types by shrunken centroids of gene expression","volume":"99","author":"Tibshirani","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533411100_B64","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1214\/ss\/1056397488","article-title":"Class predition by nearest shrunken centroids, with application to DNA microarrays","volume":"18","author":"Tibshirani","year":"2003","journal-title":"Stat. Sci."},{"key":"2023012711533411100_B65","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1186\/1471-2407-7-55","article-title":"Novel markers for differentiation of lobular and ductal invasive breast carcinomas by laser microdissection and microarry analysis","volume":"7","author":"Turashvilli","year":"2007","journal-title":"BMC Cancer"},{"key":"2023012711533411100_B66","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"van 't Veer","year":"2002","journal-title":"Nature"},{"key":"2023012711533411100_B67","first-page":"483","article-title":"Characterization of a family of algorithms for generalized discriminant analysis on undersampled problems","volume":"6","author":"Ye","year":"2005","journal-title":"J. Mach. Learn. Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/8\/1151\/48930277\/bioinformatics_28_8_1151.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/8\/1151\/48930277\/bioinformatics_28_8_1151.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T12:21:52Z","timestamp":1674822112000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/8\/1151\/196256"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,3,5]]},"references-count":67,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2012,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts108","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2012,4]]},"published":{"date-parts":[[2012,3,5]]}}}