{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T15:14:47Z","timestamp":1769181287913,"version":"3.49.0"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3015,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Recurrent chromosomal alterations provide cytological and molecular positions for the diagnosis and prognosis of cancer. Comparative genomic hybridization (CGH) has been useful in understanding these alterations in cancerous cells. CGH datasets consist of samples that are represented by large dimensional arrays of intervals. Each sample consists of long runs of intervals with losses and gains.<\/jats:p>\n               <jats:p>In this article, we develop novel SVM-based methods for classification and feature selection of CGH data. For classification, we developed a novel similarity kernel that is shown to be more effective than the standard linear kernel used in SVM. For feature selection, we propose a novel method based on the new kernel that iteratively selects features that provides the maximum benefit for classification. We compared our methods against the best wrapper-based and filter-based approaches that have been used for feature selection of large dimensional biological data. Our results on datasets generated from the Progenetix database, suggests that our methods are considerably superior to existing methods.<\/jats:p>\n               <jats:p>Availability: All software developed in this article can be downloaded from http:\/\/plaza.ufl.edu\/junliu\/feature.tar.gz<\/jats:p>\n               <jats:p>Contact: \u00a0juliu@cise.ufl.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn145","type":"journal-article","created":{"date-parts":[[2008,6,27]],"date-time":"2008-06-27T07:43:13Z","timestamp":1214552593000},"page":"i86-i95","source":"Crossref","is-referenced-by-count":29,"title":["Classification and feature selection algorithms for multi-class CGH data"],"prefix":"10.1093","volume":"24","author":[{"given":"Jun","family":"Liu","sequence":"first","affiliation":[{"name":"Computer and Information Science and Engineering, University of Florida, Gainesville, FL 32611, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sanjay","family":"Ranka","sequence":"additional","affiliation":[{"name":"Computer and Information Science and Engineering, University of Florida, Gainesville, FL 32611, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tamer","family":"Kahveci","sequence":"additional","affiliation":[{"name":"Computer and Information Science and Engineering, University of Florida, Gainesville, FL 32611, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,7,1]]},"reference":[{"key":"2023020210393579700_B1","doi-asserted-by":"crossref","DOI":"10.2144\/000112102","article-title":"An online database and bioinformatics toolbox to support data mining in cancer cytogenetics","author":"Baudis","year":"2006","journal-title":"Biotechniques"},{"key":"2023020210393579700_B2","doi-asserted-by":"crossref","first-page":"1228","DOI":"10.1093\/bioinformatics\/17.12.1228","article-title":"Progenetix.net: an online repository for molecular cytogenetic aberration data","volume":"17","author":"Baudis","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210393579700_B3","doi-asserted-by":"crossref","first-page":"1437","DOI":"10.1182\/blood.V88.4.1437.bloodjournal8841437","article-title":"High incidence of chromosomal imbalances and gene amplifications in the classical follicular variant of follicle center lymphoma","volume":"88","author":"Bentz","year":"1996","journal-title":"Blood"},{"key":"2023020210393579700_B4","article-title":"MATLAB support vector machine toolbox (v0.55\u03b2)","author":"Cawley","year":"2000"},{"key":"2023020210393579700_B5","first-page":"3","article-title":"An evaluation of gene selection methods for multi-class microarray data classification. In","author":"Chai","year":"2004","journal-title":"Proceedings of the Second European Workshop on Data Mining and Text Mining in Bioinformatics"},{"key":"2023020210393579700_B6","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1162\/neco.2007.19.5.1155","article-title":"Training a support vector machine in the primal","volume":"19","author":"Chapelle","year":"2007","journal-title":"Neural Comput."},{"key":"2023020210393579700_B7","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1089\/cmb.1999.6.37","article-title":"Inferring tree models for oncogenesis from comparative genome hybridization data","volume":"6","author":"Desper","year":"1999","journal-title":"J. Comput. Biol"},{"key":"2023020210393579700_B8","first-page":"127","article-title":"Analysis of gene expression profiles: class discovery and leaf ordering. In","volume-title":"RECOMB","author":"Ding"},{"key":"2023020210393579700_B9","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1142\/S0219720005001004","article-title":"Minimum redundancy feature selection from microarray gene expression data","volume":"3","author":"Ding","year":"2005","journal-title":"J. Bioinform. Comput. Biol"},{"key":"2023020210393579700_B10","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1109\/TNB.2005.853657","article-title":"Multiple SVM-RFE for gene selection in cancer classification with expression data","volume":"4","author":"Duan","year":"2005","journal-title":"IEEE Trans. Nanobiosci"},{"key":"2023020210393579700_B11","article-title":"Colon Cancer, Adenocarcinoma","author":"El-Deiry","year":"2006","journal-title":"emedicine"},{"key":"2023020210393579700_B12","volume-title":"International Classification of Diseases for Oncology (ICD-O), Third Edition","author":"Fritz","year":"2000"},{"key":"2023020210393579700_B13","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-6-67","article-title":"Evaluation of gene importance in microarray data based upon probability of selection","volume":"6","author":"Fu","year":"2005","journal-title":"BMC Bioinform"},{"key":"2023020210393579700_B14","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023020210393579700_B15","doi-asserted-by":"crossref","first-page":"645","DOI":"10.1101\/SQB.1994.059.01.074","article-title":"Molecular cytogenetics of human breast cancer","volume":"59","author":"Gray","year":"1994","journal-title":"Cold Spring Harb. Symp. Quant. Biol"},{"key":"2023020210393579700_B16","first-page":"1157","article-title":"An introduction to variable and feature selection","volume":"3","author":"Guyon","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"2023020210393579700_B17","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1012487302797","article-title":"Gene selection for cancer classification using support vector machines","volume":"46","author":"Guyon","year":"2002","journal-title":"Mach. Learn"},{"key":"2023020210393579700_B18","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1002\/gcc.20143","article-title":"Statistical Behavior of Complex Cancer Karyotypes","volume":"42","author":"Hoglund","year":"2005","journal-title":"Genes Chromosomes Cancer"},{"key":"2023020210393579700_B19","doi-asserted-by":"crossref","first-page":"10","DOI":"10.3322\/canjclin.55.1.10","article-title":"Cancer Statistics, 2005","volume":"55","author":"Jemal","year":"2005","journal-title":"CA Cancer J. Clin"},{"key":"2023020210393579700_B20","first-page":"169","article-title":"Making large-scale support vector machine learning practical","author":"Joachims","year":"1999","journal-title":"Adv. Kernel Methods: Support Vector Learn"},{"key":"2023020210393579700_B21","doi-asserted-by":"crossref","first-page":"1381","DOI":"10.1182\/blood.V99.4.1381","article-title":"Classical hodgkin lymphoma is characterized by recurrent copy number gains of the short arm of chromosome 2","volume":"99","author":"Joos","year":"2002","journal-title":"Blood"},{"key":"2023020210393579700_B22","doi-asserted-by":"crossref","first-page":"818","DOI":"10.1126\/science.1359641","article-title":"Comparative genomic hybridization for molecular cytogenetic analysis of solid tumors","volume":"258","author":"Kallioniemi","year":"1992","journal-title":"Science"},{"key":"2023020210393579700_B23","doi-asserted-by":"crossref","first-page":"2429","DOI":"10.1093\/bioinformatics\/bth267","article-title":"A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression","volume":"20","author":"Li","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020210393579700_B24","doi-asserted-by":"crossref","first-page":"1971","DOI":"10.1093\/bioinformatics\/btl185","article-title":"Distance-based clustering of CGH data","volume":"22","author":"Liu","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210393579700_B25","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1155\/2001\/852674","article-title":"Cluster analysis of comparative genomic hybridization (CGH) data using self-organizing maps: Application to prostate carcinomas","volume":"23","author":"Mattfeldt","year":"2001","journal-title":"Anal. Cell. Pathol"},{"key":"2023020210393579700_B26","author":"Mitelman","year":"1995","journal-title":"International System for Cytogenetic Nomenclature"},{"key":"2023020210393579700_B27","doi-asserted-by":"crossref","first-page":"1340","DOI":"10.1126\/science.176.4041.1340","article-title":"Tumor etiology and chromosome pattern","volume":"176","author":"Mitelman","year":"1972","journal-title":"Science"},{"key":"2023020210393579700_B28","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/17.suppl_1.S157","article-title":"Feature selection for dna methylation based cancer classification","volume":"17","author":"Model","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210393579700_B29","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1145\/369133.369228","article-title":"Gene functional classification from heterogeneous data. In","author":"Pavlidis","year":"2001","journal-title":"RECOMB"},{"key":"2023020210393579700_B30","doi-asserted-by":"crossref","DOI":"10.1038\/ng1569","article-title":"Array comparative genomic hybridization and its applications in cancer","volume":"37","author":"Pinkel","year":"2005","journal-title":"Nat. Genet"},{"key":"2023020210393579700_B31","first-page":"1357","article-title":"Variable selection using SVM based criteria","volume":"3","author":"Rakotomamonjy","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"2023020210393579700_B32","doi-asserted-by":"crossref","first-page":"15149","DOI":"10.1073\/pnas.211566398","article-title":"Multiclass cancer diagnosis using tumor gene expression signatures","volume":"98","author":"Ramaswamy","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210393579700_B33","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1002\/mc.2940070303","article-title":"How many mutations are required for tumorigenesis? implications from human cancer data","volume":"7","author":"Renan","year":"1993","journal-title":"Mol. Carcinog"},{"key":"2023020210393579700_B34","author":"Tan","year":"2005","journal-title":"Introduction to Data Mining, (First Edn)"},{"key":"2023020210393579700_B35","doi-asserted-by":"crossref","first-page":"2280","DOI":"10.1200\/JCO.2005.06.104","article-title":"Unequivocal Delineation of Clinicogenetic Subgroups and Development of a New Model for Improved Outcome Prediction in Neuroblastoma","volume":"23","author":"Vandesompele","year":"2005","journal-title":"J. Clin. Oncol"},{"key":"2023020210393579700_B36","author":"Vapnik","year":"1998","journal-title":"Statistical Learning Theory"},{"key":"2023020210393579700_B37","first-page":"668","article-title":"Feature selection for SVMs. In","author":"Weston","year":"2000","journal-title":"NIPS"},{"key":"2023020210393579700_B38","doi-asserted-by":"crossref","first-page":"737","DOI":"10.1145\/1014052.1014149","article-title":"Redundancy based feature selection for microarray data. In","volume-title":"KDD","author":"Yu","year":"2004"},{"key":"2023020210393579700_B39","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1186\/1471-2105-7-197","article-title":"Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data","volume":"7","author":"Zhang","year":"2006","journal-title":"BMC Bioinform"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i86\/49052375\/bioinformatics_24_13_i86.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i86\/49052375\/bioinformatics_24_13_i86.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T12:22:39Z","timestamp":1675340559000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/13\/i86\/227270"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,7,1]]},"references-count":39,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2008,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn145","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,7,1]]},"published":{"date-parts":[[2008,7,1]]}}}