{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T14:25:27Z","timestamp":1770992727487,"version":"3.50.1"},"reference-count":51,"publisher":"Oxford University Press (OUP)","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: An important application of gene expression microarray data is the classification of samples into categories. Accurate classification depends upon the method used to identify the most relevant genes. Owing to the large number of genes and relatively small sample size, the selection process can be unstable. Modification of existing methods for achieving better analysis of microarray data is needed.<\/jats:p>\n               <jats:p>Results: We propose a Bayesian stochastic variable selection approach for gene selection based on a probit regression model with a generalized singular g-prior distribution for regression coefficients. Using simulation-based Markov chain Monte Carlo methods for simulating parameters from the posterior distribution, an efficient and dependable algorithm is implemented. It is also shown that this algorithm is robust to the choices of initial values, and produces posterior probabilities of related genes for biological interpretation. The performance of the proposed approach is compared with other popular methods in gene selection and classification via the well-known colon cancer and leukemia datasets in microarray literature.<\/jats:p>\n               <jats:p>Availability: A free Matlab code to perform gene selection is available at http:\/\/www.sta.cuhk.edu.hk\/xysong\/geneselection\/.<\/jats:p>\n               <jats:p>Contact: \u00a0ajyang81@gmail.com; xysong@sta.cuhk.edu.hk.<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp638","type":"journal-article","created":{"date-parts":[[2009,11,19]],"date-time":"2009-11-19T01:13:16Z","timestamp":1258593196000},"page":"215-222","source":"Crossref","is-referenced-by-count":53,"title":["Bayesian variable selection for disease classification using gene expression data"],"prefix":"10.1093","volume":"26","author":[{"given":"Yang","family":"Ai-Jun","sequence":"first","affiliation":[{"name":"Department of Statistics, The Chinese University of Hong Kong, Hong Kong, P.R.China"}]},{"given":"Song","family":"Xin-Yuan","sequence":"additional","affiliation":[{"name":"Department of Statistics, The Chinese University of Hong Kong, Hong Kong, P.R.China"}]}],"member":"286","published-online":{"date-parts":[[2009,11,17]]},"reference":[{"key":"2023012508215251100_B1","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1080\/01621459.1993.10476321","article-title":"Bayesian analysis of binary and polychotomous response data","volume":"88","author":"Albert","year":"1993","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012508215251100_B2","doi-asserted-by":"crossref","first-page":"6745","DOI":"10.1073\/pnas.96.12.6745","article-title":"Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays","volume":"96","author":"Alon","year":"1999","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508215251100_B3","doi-asserted-by":"crossref","first-page":"6562","DOI":"10.1073\/pnas.102102699","article-title":"Selection bias in gene extraction on the basis of microarray gene-expression data","volume":"99","author":"Ambroise","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508215251100_B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/bioinformatics\/btg062","article-title":"Effective dimension reduction methods for tumor classification using gene expression data","volume":"19","author":"Antoniadis","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B5","doi-asserted-by":"crossref","first-page":"644","DOI":"10.1093\/bioinformatics\/btg462","article-title":"Optimization models for cancer classification: extracting gene interaction information from microarray expression data","volume":"20","author":"Antonov","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B6","doi-asserted-by":"crossref","first-page":"3423","DOI":"10.1093\/bioinformatics\/bth419","article-title":"Gene selection using a two-level hierarchical Bayesian model","volume":"20","author":"Bae","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B7","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1089\/106652700750050943","article-title":"Tissue classification with gene expression profiles","volume":"7","author":"Ben-Dor","year":"2000","journal-title":"J. Comput. Biol."},{"key":"2023012508215251100_B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/gb-2002-3-4-research0017","article-title":"New feature subset selection procedures for classification of expression profiles","volume":"3","author":"Bo","year":"2002","journal-title":"Genome Biol."},{"key":"2023012508215251100_B9","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1111\/1467-9868.00144","article-title":"Multivariate Bayesian variable selection and prediction","volume":"60","author":"Brown","year":"1998","journal-title":"J. R. Stat. Soc. B"},{"key":"2023012508215251100_B10","doi-asserted-by":"crossref","first-page":"3583","DOI":"10.1093\/bioinformatics\/bth447","article-title":"BagBoosting for tumor classification with gene expression data","volume":"20","author":"Dettling","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B11","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1093\/bioinformatics\/btf867","article-title":"Boosting for tumor classification with gene expression data","volume":"19","author":"Dettling","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B12","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4613-8643-8","volume-title":"Non-Uniform Random Variate Generation","author":"Devroye","year":"1986"},{"key":"2023012508215251100_B13","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1002\/cfg.62","article-title":"Small sample issues for microarray-based classification","volume":"2","author":"Dougherty","year":"2001","journal-title":"Comp. Funct. Genomics"},{"key":"2023012508215251100_B14","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1198\/016214502753479248","article-title":"Comparison of discrimination methods for the classification of tumors using gene expression data","volume":"97","author":"Dudoit","year":"2002","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012508215251100_B15","doi-asserted-by":"crossref","first-page":"906","DOI":"10.1093\/bioinformatics\/16.10.906","article-title":"Support vector machine classification and validation of cancer tissue samples using microarray expression data","volume":"16","author":"Furey","year":"2000","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B16","doi-asserted-by":"crossref","first-page":"881","DOI":"10.1080\/01621459.1993.10476353","article-title":"Variable selection via Gibbs sampling","volume":"88","author":"George","year":"1993","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012508215251100_B17","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1109\/TPAMI.1984.4767596","article-title":"Stochastic relaxation, Gibbs distribution, and the Bayesian restoration of images","volume":"6","author":"Geman","year":"1984","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023012508215251100_B18","volume-title":"Markov Chain Monte Carlo in Practise","author":"Gilks","year":"1996"},{"key":"2023012508215251100_B19","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023012508215251100_B20","doi-asserted-by":"crossref","first-page":"867","DOI":"10.1198\/016214507000000068","article-title":"Variable selection in regression mixture modeling for the discovery of gene regulatory networks","volume":"102","author":"Gupta","year":"2007","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012508215251100_B21","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1056\/NEJM200102223440801","article-title":"Gene expression profiles in hereditary breast cancer","volume":"344","author":"Hedelfank","year":"2001","journal-title":"N. Eng. J. Med."},{"key":"2023012508215251100_B22","first-page":"53","article-title":"Improved gene selection for classification of microarrays","volume":"8","author":"Jaeger","year":"2003","journal-title":"Pac. Symp. Biocomput."},{"key":"2023012508215251100_B23","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1038\/89044","article-title":"Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks","volume":"7","author":"Khan","year":"2001","journal-title":"Nat. Med."},{"key":"2023012508215251100_B24","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1016\/S0962-8924(97)01219-1","article-title":"Integrins and GTPases in tumour cell growth, motility and invasion","volume":"8","author":"Keely","year":"1998","journal-title":"Trends Cell Biol."},{"key":"2023012508215251100_B25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/00401706.1968.10490530","article-title":"Estimation of error rates in discriminant analysis","volume":"10","author":"Lachenbruch","year":"1968","journal-title":"Technometrics"},{"key":"2023012508215251100_B26","doi-asserted-by":"crossref","first-page":"592","DOI":"10.1198\/jcgs.2009.08027","article-title":"Transdimensional sampling algorithms for Bayesian variable selection in classification problems with many more variables than observations","volume":"18","author":"Lamnisos","year":"2009","journal-title":"J. Comput. Graph. Stat."},{"key":"2023012508215251100_B27","first-page":"1","article-title":"ofw: an R package to selection continuous variables for multiclass classification with a stochastic wrapper method","volume":"28","author":"Le Cao","year":"2008","journal-title":"J. Stat. Softw."},{"key":"2023012508215251100_B28","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1390","article-title":"A sparse PLS for variable selection when integrating omics data","volume":"7","author":"Le Cao","year":"2008","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023012508215251100_B29","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1093\/bioinformatics\/19.1.90","article-title":"Gene selection: a Bayesian variable selection approach","volume":"19","author":"Lee","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B30","doi-asserted-by":"crossref","first-page":"1332","DOI":"10.1093\/bioinformatics\/18.10.1332","article-title":"Bayesian automatic relevance determination algorithms for classifying gene expression data","volume":"18","author":"Li","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B31","doi-asserted-by":"crossref","first-page":"1471","DOI":"10.1186\/1471-2105-8-60","article-title":"Supervised group Lasso with applications to microarray data analysis","volume":"8","author":"Ma","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012508215251100_B32","doi-asserted-by":"crossref","first-page":"31470","DOI":"10.1074\/jbc.271.49.31470","article-title":"Molecular characterization of human zyxin","volume":"271","author":"Maccalma","year":"1996","journal-title":"J. Biol. Chem."},{"key":"2023012508215251100_B33","doi-asserted-by":"crossref","first-page":"342","DOI":"10.1002\/0471725293","volume-title":"Discriminant Analysis and Statistical Pattern Recognition","author":"McLachlan","year":"1992"},{"key":"2023012508215251100_B34","doi-asserted-by":"crossref","DOI":"10.1002\/047172842X","volume-title":"Analyzing Microarray Gene Expression Data.","author":"McLachlan","year":"2004"},{"key":"2023012508215251100_B35","first-page":"383","article-title":"Correcting for selection bias via cross-validation in the classification of microarray data","volume-title":"Beyond Parametrics in Interdisciplinary Research: Festschrift in Honour of Professor Pranab K. Sen.","author":"McLachlan","year":"2008"},{"key":"2023012508215251100_B36","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1093\/bioinformatics\/18.1.39","article-title":"Tumor classification by partial least squares using microarray gene expression data","volume":"18","author":"Nguyen","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B37","first-page":"3124","article-title":"Transcriptional gene expression profiles of colorectal adenoma, adenocarcinoma, and normal tissue examined by oligonucleotidearrays","volume":"61","author":"Notterman","year":"2001","journal-title":"Cancer Res."},{"key":"2023012508215251100_B38","doi-asserted-by":"crossref","first-page":"546","DOI":"10.1093\/bioinformatics\/18.4.546","article-title":"A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments","volume":"18","author":"Pan","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B39","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1016\/j.jeconom.2007.10.003","article-title":"Bayesian identification, selection and estimation of semiparametric functions in high dimensional additive models","volume":"143","author":"Panagiotelisa","year":"2008","journal-title":"J. Econom."},{"key":"2023012508215251100_B40","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1093\/bioinformatics\/btp038","article-title":"Papers on normalization, variable selection, classification or clustering of microarray data","volume":"25","author":"Rocke","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B41","first-page":"5151","article-title":"Uroguanylin treatment suppresses polyp formation in the Apc(Min\/+) mouse and induces apoptosis in human colon adenocarcinoma cells via cyclic GMP","volume":"60","author":"Shailubhai","year":"2000","journal-title":"Cancer Res."},{"key":"2023012508215251100_B42","doi-asserted-by":"crossref","first-page":"1339","DOI":"10.1016\/0161-5890(95)00113-1","article-title":"Antibodies against human CD63 activate transfected rat basophilic leukemia (RBL-2H3) cells","volume":"32","author":"Smith","year":"1995","journal-title":"Mol. Immunol."},{"key":"2023012508215251100_B43","doi-asserted-by":"crossref","first-page":"1111","DOI":"10.1056\/NEJM198704303161802","article-title":"Clinical importance of myeloid antigen expression in adult acute lymphoblastic leukemia","volume":"316","author":"Sobol","year":"1987","journal-title":"N. Eng. J. Med."},{"key":"2023012508215251100_B44","doi-asserted-by":"crossref","first-page":"23833","DOI":"10.1074\/jbc.272.38.23833","article-title":"Identification and characterization of cathepsin B as the cellular MARCKS cleaving enzyme","volume":"272","author":"Spizz","year":"1997","journal-title":"J. Biol. Chem."},{"key":"2023012508215251100_B45","doi-asserted-by":"crossref","first-page":"6567","DOI":"10.1073\/pnas.082099299","article-title":"Diagnosis of multiple cancer types by shrunken centroids of gene expression","volume":"99","author":"Tibshirani","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508215251100_B46","doi-asserted-by":"crossref","first-page":"1454","DOI":"10.1093\/bioinformatics\/18.11.1454","article-title":"Nonparametric methods for identifying differentially expressed genes in microarray data","volume":"18","author":"Troyanskaya","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B47","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1016\/S0167-4889(02)00349-X","article-title":"Zyxin and paxillin proteins: focal adhesion plaque LIM domain proteins go nuclear","volume":"1593","author":"Wang","year":"2003","journal-title":"Biochim. Biophys. Acta"},{"key":"2023012508215251100_B48","first-page":"733","article-title":"Bayesian factor regression models in the large p small n paradigm","author":"West","year":"2000","journal-title":"Bayesian Statistics 7."},{"key":"2023012508215251100_B49","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1038\/sj.onc.1203982","article-title":"Suppression of the tumorigenicity of mutant p53- transformed rat embryo fibroblasts through expression of a newly cloned rat nonmuscle myosin heavy chain-B","volume":"20","author":"Yam","year":"2001","journal-title":"Oncogene"},{"key":"2023012508215251100_B50","doi-asserted-by":"crossref","first-page":"2394","DOI":"10.1093\/bioinformatics\/bti319","article-title":"Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data","volume":"21","author":"Yeung","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012508215251100_B51","first-page":"233","article-title":"On assessing prior distributions and Bayesian regression analysis with g-prior distributions","volume-title":"Bayesian Inference and Decision Techniques: Essays in Honor of Bruno de Finetti.","author":"Zellner","year":"1986"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/2\/215\/48857547\/bioinformatics_26_2_215.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/2\/215\/48857547\/bioinformatics_26_2_215.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:22:45Z","timestamp":1674634965000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/2\/215\/209617"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,11,17]]},"references-count":51,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2010,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp638","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,1,15]]},"published":{"date-parts":[[2009,11,17]]}}}