{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T12:59:42Z","timestamp":1773925182883,"version":"3.50.1"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"9","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Given the thousands of genes and the small number of samples, gene selection has emerged as an important research problem in microarray data analysis. Support Vector Machine\u2014Recursive Feature Elimination (SVM-RFE) is one of a group of recently described algorithms which represent the stat-of-the-art for gene selection. Just like SVM itself, SVM-RFE was originally designed to solve binary gene selection problems. Several groups have extended SVM-RFE to solve multiclass problems using one-versus-all techniques. However, the genes selected from one binary gene selection problem may reduce the classification performance in other binary problems.<\/jats:p><jats:p>Results: In the present study, we propose a family of four extensions to SVM-RFE (called MSVM-RFE) to solve the multiclass gene selection problem, based on different frameworks of multiclass SVMs. By simultaneously considering all classes during the gene selection stages, our proposed extensions identify genes leading to more accurate classification.<\/jats:p><jats:p>Contact: \u00a0david.tuck@yale.edu<\/jats:p><jats:p>Supplementary information: Supplementary materials, including a detailed review of both binary and multiclass SVMs, and complete experimental results, are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm036","type":"journal-article","created":{"date-parts":[[2007,5,9]],"date-time":"2007-05-09T17:34:16Z","timestamp":1178732056000},"page":"1106-1114","source":"Crossref","is-referenced-by-count":210,"title":["MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data"],"prefix":"10.1093","volume":"23","author":[{"given":"Xin","family":"Zhou","sequence":"first","affiliation":[]},{"given":"David P.","family":"Tuck","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2007,5,1]]},"reference":[{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"6745","DOI":"10.1073\/pnas.96.12.6745","article-title":"Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays","volume":"96","author":"Alon","year":"1999","journal-title":"Pro. Nat Acad. Sci"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"6562","DOI":"10.1073\/pnas.102102699","article-title":"Selection bias in gene extraction on the basis of microarray gene-expression data","volume":"99","author":"Ambroise","year":"2002","journal-title":"Pro. Nat Acad. Sci"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1038\/ng765","article-title":"MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia","volume":"30","author":"Armstrong","year":"2002","journal-title":"Nat. Genet"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"13790","DOI":"10.1073\/pnas.191502998","article-title":"Classification of human lung carcinomas by mrna expression profiling reveals distinct adenocarcinoma subclasses","volume":"98","author":"Bhattacharjee","year":"2001","journal-title":"Pro. Nat Acad. Sci"},{"key":"2023041209122013800_","first-page":"77","article-title":"Comparison of classifier methods: A case study in handwriting digit recognition","author":"Bottou","year":"1994"},{"key":"2023041209122013800_","volume-title":"Classification and Regression Trees","author":"Breiman","year":"1984"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1073\/pnas.97.1.262","article-title":"Knowledge-based analysis of microarray gene expression data by using support vector machines","volume":"97","author":"Brown","year":"2000","journal-title":"Pro. Nat Acad. Sci"},{"key":"2023041209122013800_","first-page":"3","article-title":"An evaluation of gene selection methods for multi-class microarray data classification","author":"Chai","year":"2004"},{"key":"2023041209122013800_","author":"Chang","year":"2001","journal-title":"LIBSVM: a library for support vector machines"},{"key":"2023041209122013800_","first-page":"265","article-title":"On the algorithmic implementation of multiclass kernel-based vector machines","volume":"2","author":"Crammer","year":"2001","journal-title":"J. Mach. Learn. Res"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"10713","DOI":"10.1073\/pnas.0602949103","article-title":"Computational prediction of methylation status in human genomic sequences","volume":"103","author":"Das","year":"2006","journal-title":"Pro. Nat Acad. Sci"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1198\/016214502753479248","article-title":"Comparison of discrimination methods for the classification of tumors using gene expression data","volume":"97","author":"Dudoit","year":"2002","journal-title":"J. Am. Stat. Assoc"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"1400","DOI":"10.1038\/sj.leu.2404270","article-title":"Gene expression signature of primary imatinib-resistant chronic myeloid leukemia patients","volume":"20","author":"Frank","year":"2006","journal-title":"Leukemia"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"906","DOI":"10.1093\/bioinformatics\/16.10.906","article-title":"Support vector machine classification and validation of cancer tissue samples using microarray expression data","volume":"16","author":"Furey","year":"2000","journal-title":"Bioinformatics"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1016\/0022-247X(68)90201-1","article-title":"Proper efficiency and the theory of vector maximization","volume":"22","author":"Geoffrion","year":"1968","journal-title":"J. Math. Anal. Appl"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1145\/641876.641880","article-title":"Object-oriented software for quadratic programming","volume":"29","author":"Gertz","year":"2003","journal-title":"ACM T. Math. Software"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"2840","DOI":"10.1158\/1078-0432.CCR-04-1935","article-title":"Colocalization of the tetraspanins, CO-029 and CD151, with integrins in human pancreatic adenocarcinoma: Impact on cell motility","volume":"11","author":"Gesierich","year":"2005","journal-title":"Clinical Cancer Res"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","author":"Golub","year":"1999","journal-title":"Science, 286"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1012487302797","article-title":"Gene selection for cancer classification using support vector machines","volume":"46","author":"Guyon","year":"2002","journal-title":"Mach. Learn"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1186\/1471-2407-6-186","article-title":"The claudin gene family: expression in normal and neoplastic tissues","volume":"6","author":"Hewitt","year":"2006","journal-title":"BMC Cancer"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"0277","DOI":"10.1002\/mas.20072","article-title":"Processing and classification of protein mass spectra","volume":"25","author":"Hilario","year":"2006","journal-title":"Mass Spectrom. Rev"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1109\/72.991427","article-title":"A comparison of methods for multiclass support vector machines","volume":"13","author":"Hsu","year":"2002","journal-title":"IEEE T. Neural Networ"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"4730","DOI":"10.1074\/jbc.M410362200","article-title":"Galectin-4 binds to sulfated glycosphingolipids and carcinoembryonic antigen in patches on the cell surface of human colon adenocarcinoma cells.","volume":"280","author":"Ideo","year":"2005","journal-title":"J. Biol. Chem"},{"key":"2023041209122013800_","first-page":"1137","article-title":"A study of cross-validation and bootstrap for accuracy estimation and model selection","author":"Kohavi","year":"1995"},{"key":"2023041209122013800_","first-page":"255","article-title":"Pairwise classification and support vector machines","volume-title":"Advances in Kernel Methods: Support Vector Learning","author":"Kre\u00dfel","year":"1999"},{"key":"2023041209122013800_","first-page":"598","article-title":"Optimal brain damage","volume-title":"Advances in Neural Information Processing Systems II","author":"LeCun","year":"1990"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1093\/bioinformatics\/19.1.90","article-title":"Gene selection: a Bayesian variable selection approach","volume":"19","author":"Lee","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1198\/016214504000000098","article-title":"Multicategory support vector machines: theory and application to the classification of microarray data and satellite radiance data","volume":"99","author":"Lee","year":"2004","journal-title":"J. Am. Stat. Assoc"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"2429","DOI":"10.1093\/bioinformatics\/bth267","article-title":"A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression","volume":"20","author":"Li","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1007\/0-306-47815-3_9","article-title":"Classifying microarray data using support vector machines","volume-title":"A Practical Approach to Microarray Data Analysis","author":"Mukherjee","year":"2003"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"1656","DOI":"10.1200\/JCO.2005.03.2755","article-title":"Estrogen-regulated genes predict survival in hormone receptor-positive breast cancers","volume":"24","author":"Oh","year":"2006","journal-title":"J. Clin. Oncol"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1007\/978-1-4615-0813-7_11","volume-title":"Uncertainty in Geometric Computations","author":"Poggio","year":"2002"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"15149","DOI":"10.1073\/pnas.211566398","article-title":"Multiclass cancer diagnosis using tumor gene expression signatures","volume":"98","author":"Ramaswamy","year":"2001","journal-title":"Pro. Nat. Acad. Sci"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"706","DOI":"10.1137\/S0036144502411986","article-title":"An analytical method for multiclass molecular cancer classification","volume":"45","author":"Rifkin","year":"2003","journal-title":"SIAM Review"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1038\/73432","article-title":"Systematic variation in gene expression patterns in human cancer cell","volume":"24","author":"Ross","year":"2000","journal-title":"Nat. Genet"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1093\/bioinformatics\/bti033","article-title":"A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis","volume":"21","author":"Statnikov","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"10787","DOI":"10.1073\/pnas.191368598","article-title":"Chemosensitivity prediction by transcriptional profiling","volume":"98","author":"Staunton","year":"2001","journal-title":"Pro. Nat. Acad. Sci"},{"key":"2023041209122013800_","first-page":"7388","article-title":"Molecular classification of human carcinomas by use of gene expression signatures","volume":"61","author":"Su","year":"2001","journal-title":"Cancer Res"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"6567","DOI":"10.1073\/pnas.082099299","article-title":"Diagnosis of multiple cancer types by shrunken centroids of gene expression","volume":"99","author":"Tibshirani","year":"2002","journal-title":"Proc. Nat. Acad. Sci"},{"key":"2023041209122013800_","volume-title":"Statistical Learning Theory\\\/","author":"Vapnik","year":"1998"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"11462","DOI":"10.1073\/pnas.201162998","article-title":"Predicting the clinical status of human breast cancer by using gene expression profiles","volume":"98","author":"West","year":"2001","journal-title":"Proc. Nat. Acad. Sci"},{"key":"2023041209122013800_","article-title":"Support vector machines for multiclass pattern recognition","author":"Weston","year":"1999"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"S316","DOI":"10.1093\/bioinformatics\/17.suppl_1.S316","article-title":"Molecular classification of multiple tumor types","volume":"17","author":"Yeang","year":"2001","journal-title":"Bioinformatics"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1093\/bioinformatics\/bti736","article-title":"Gene selection using support vector machines with non-convex penalty","volume":"22","author":"Zhang","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"1028","DOI":"10.1038\/sj.leu.2404227","article-title":"Gene expression profiling of CD34+ cells identifies a molecular signature of chronic myeloid leukemia blast crisis","volume":"20","author":"Zheng","year":"2006","journal-title":"Leukemia"},{"key":"2023041209122013800_","first-page":"414","article-title":"Gene selection of DNA microarray data based on Regularization Networks","volume-title":"IDEAL","author":"Zhou","year":"2005"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"1559","DOI":"10.1093\/bioinformatics\/bti216","article-title":"LS Bound based gene selection for DNA microarray data","volume":"21","author":"Zhou","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041209122013800_","doi-asserted-by":"crossref","first-page":"2507","DOI":"10.1093\/bioinformatics\/btl438","article-title":"The ties problem resulting from counting-based error estimators and its impact on gene selection algorithms.","volume":"22","author":"Zhou","year":"2006","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/9\/1106\/49859049\/bioinformatics_23_9_1106.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/9\/1106\/49859049\/bioinformatics_23_9_1106.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,14]],"date-time":"2024-02-14T04:58:01Z","timestamp":1707886681000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/9\/1106\/272173"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,5,1]]},"references-count":48,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2007,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm036","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,5,1]]},"published":{"date-parts":[[2007,5,1]]}}}