{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T02:50:14Z","timestamp":1776307814236,"version":"3.50.1"},"reference-count":137,"publisher":"Oxford University Press (OUP)","issue":"19","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Feature selection techniques have become an apparent need in many bioinformatics applications. In addition to the large pool of techniques that have already been developed in the machine learning and data mining fields, specific applications in bioinformatics have led to a wealth of newly proposed techniques.<\/jats:p><jats:p>In this article, we make the interested reader aware of the possibilities of feature selection, providing a basic taxonomy of feature selection techniques, and discussing their use, variety and potential in a number of both common as well as upcoming bioinformatics applications.<\/jats:p><jats:p>Contact: \u00a0yvan.saeys@psb.ugent.be<\/jats:p><jats:p>Supplementary information: \u00a0http:\/\/bioinformatics.psb.ugent.be\/supplementary_data\/yvsae\/fsreview<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm344","type":"journal-article","created":{"date-parts":[[2007,8,25]],"date-time":"2007-08-25T00:30:08Z","timestamp":1188001808000},"page":"2507-2517","source":"Crossref","is-referenced-by-count":4072,"title":["A review of feature selection techniques in bioinformatics"],"prefix":"10.1093","volume":"23","author":[{"given":"Yvan","family":"Saeys","sequence":"first","affiliation":[{"name":"1 Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium and Bioinformatics and Evolutionary Genomics group, Department of Molecular Genetics, Ghent University, B-9052 Ghent, Belgium and 2Department of Computer Science and Artificial Intelligence, Computer Science Faculty, University of the Basque Country, Paseo Manuel de Lardizabal 1, 20018 Donostia - San Sebasti\u00e1n, Spain"}]},{"given":"I\u00f1aki","family":"Inza","sequence":"additional","affiliation":[{"name":"1 Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium and Bioinformatics and Evolutionary Genomics group, Department of Molecular Genetics, Ghent University, B-9052 Ghent, Belgium and 2Department of Computer Science and Artificial Intelligence, Computer Science Faculty, University of the Basque Country, Paseo Manuel de Lardizabal 1, 20018 Donostia - San Sebasti\u00e1n, Spain"}]},{"given":"Pedro","family":"Larra\u00f1aga","sequence":"additional","affiliation":[{"name":"1 Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium and Bioinformatics and Evolutionary Genomics group, Department of Molecular Genetics, Ghent University, B-9052 Ghent, Belgium and 2Department of Computer Science and Artificial Intelligence, Computer Science Faculty, University of the Basque Country, Paseo Manuel de Lardizabal 1, 20018 Donostia - San Sebasti\u00e1n, Spain"}]}],"member":"286","published-online":{"date-parts":[[2007,8,24]]},"reference":[{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"6745","DOI":"10.1073\/pnas.96.12.6745","article-title":"Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays","volume":"96","author":"Alon","year":"1999","journal-title":"Proc. Nat. Acad. Sci. USA"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"195","DOI":"10.2165\/00822942-200504030-00004","article-title":"Feature selection and the class imbalance problem in predicting protein function from sequence","volume":"4","author":"Al-Shahib","year":"2005","journal-title":"Appl. Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"6562","DOI":"10.1073\/pnas.102102699","article-title":"Selection bias in gene extraction on the basis of microarray gene-expression data","volume":"99","author":"Ambroise","year":"2002","journal-title":"Proc. Nat. Acad. Sci. USA"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1093\/bioinformatics\/17.6.509","article-title":"A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes","volume":"17","author":"Baldi","year":"2001","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1093\/bioinformatics\/18.3.395","article-title":"An integrated approach utilizing artificial neural networks and SELDI mass spectrometry for the classification of human tumours and rapid identification of potential biomarkers","volume":"18","author":"Ball","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","first-page":"773","article-title":"Pattern recognition and reduction of dimensionality. In","volume-title":"Handbook of Statistics II","author":"Ben-Bassat","year":"1982"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1089\/106652700750050943","article-title":"Tissue classification with gene expression profiles","volume":"7","author":"Ben-Dor","year":"2000","journal-title":"J. Comput. Biol."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"592","DOI":"10.1002\/pmic.200500192","article-title":"A robust meta classification strategy for cancer detection from MS data","volume":"6","author":"Bhanot","year":"2006","journal-title":"Proteomics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1373","DOI":"10.1142\/S0218001404003800","article-title":"Gene selection for cancer classification using wrapper approaches","volume":"18","author":"Blanco","year":"2004","journal-title":"Int. J. Pattern Recognit. Artif. Intell."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-4-research0017","article-title":"New feature subset selection procedures for classification of expression profiles","volume":"3","author":"B\u00f8","year":"2002","journal-title":"Genome Biol."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1093\/bioinformatics\/btg419","article-title":"Is cross-validation valid for small-sample microarray classification?","volume":"20","author":"Braga-Neto","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.febslet.2004.07.055","article-title":"Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments","volume":"573","author":"Breitling","year":"2004","journal-title":"FEBS Lett."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1093\/bioinformatics\/bti760","article-title":"PCP: a program for supervised classification of gene expression profiles","volume":"22","author":"Buturovic","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","article-title":"SVM and Kernel Methods Matlab Toolbox. In","volume-title":"Perception Syst\u00e8mes et Information","author":"Canu","year":"2003"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1086\/381000","article-title":"Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium","volume":"74","author":"Carlson","year":"2004","journal-title":"Am. J. Hum. Genet."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/14.2.139","article-title":"Feature selection for genetic sequence classification","volume":"14","author":"Chuzhanova","year":"1998","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1093\/bib\/6.1.57","article-title":"A survey of current work in biomedical text mining","volume":"6","author":"Cohen","year":"2005","journal-title":"Brief. Bioinformatics"},{"key":"2023041208441392600_","first-page":"54","article-title":"A comparative study on feature selection for E.coli promoter recognition","volume":"11","author":"Conilione","year":"2005","journal-title":"Int. J. Inf. Technol"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1007\/978-0-387-47509-7_4","article-title":"Pre-processing mass spectrometry data. In","volume-title":"Fundamentals of Data Mining in Genomics and Proteomics","author":"Coombes","year":"2007"},{"key":"2023041208441392600_","first-page":"84","article-title":"Combined optimization of feature selection and algorithm parameter interaction in machine learning of language","author":"Daelemans","year":"2003"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1038\/ng1001-229","article-title":"High-resolution haplotype structure in the human genome","volume":"29","author":"Daly","year":"2001","journal-title":"Nat. Genet."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1186\/1471-2105-6-173","article-title":"Normal uniform mixture differential gene expression detection in cDNA microarrays","volume":"6","author":"Dean","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1093\/bioinformatics\/18.suppl_2.S75","article-title":"Feature subset selection for splice site prediction","volume":"18","author":"Degroeve","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"4636","DOI":"10.1093\/nar\/27.23.4636","article-title":"Improved microbial gene identification with GLIMMER","volume":"27","author":"Delcher","year":"1999","journal-title":"Nucleic Acids Res."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1186\/1471-2105-7-3","article-title":"Gene selection and classification of microarray data using random forest","volume":"7","author":"D\u00edaz-Uriarte","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","first-page":"523","article-title":"Minimum redundancy feature selection from microarray gene expression data","author":"Ding","year":"2003"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1093\/bioinformatics\/btg1011","article-title":"Combining NLP and probabilistic categorisation for document and term selection for Swiss-Prot medical annotation","volume":"19","author":"Dobrokhotov","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","volume-title":"Pattern Classification","author":"Duda","year":"2001"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1198\/016214502753479248","article-title":"Comparison of discriminant methods for the classification of tumors using gene expression data","volume":"97","author":"Dudoit","year":"2002","journal-title":"J. Am. Stat. Assoc"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1214\/ss\/1056397487","article-title":"Multiple hypothesis testing in microarray experiments","volume":"18","author":"Dudoit","year":"2003","journal-title":"Stat. Sci."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1198\/016214501753382129","article-title":"Empirical Bayes analysis of a microarray experiment","volume":"96","author":"Efron","year":"2001","journal-title":"J. Am. Stat. Assoc."},{"key":"2023041208441392600_","first-page":"216","article-title":"PubMiner:machine learning-based text mining for biomedical information analysis. In","volume":"Vol. 3192","author":"Eom","year":"2000","journal-title":"Lecture Notes in Artificial Intelligence"},{"key":"2023041208441392600_","first-page":"403","author":"Ferri","year":"1994","journal-title":"Pattern Recognition in Practice IV, Multiple Paradigms, Comparative Studies and Hybrid Systems"},{"key":"2023041208441392600_","first-page":"1289","article-title":"An extensive empirical study of feature selection metrics for text classification","volume":"3","author":"Forman","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"126","DOI":"10.1186\/1471-2105-7-126","article-title":"A two-sample Bayesian t-test for microarray data","volume":"7","author":"Fox","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2225","DOI":"10.1126\/science.1069424","article-title":"The structure of haplotype blocks in the human genome","volume":"296","author":"Gabriel","year":"2002","journal-title":"Science"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"3138","DOI":"10.1093\/bioinformatics\/bti494","article-title":"Proteomic mass spectra classification using decision tree based ensemble methods","volume":"21","author":"Geurts","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"e184","DOI":"10.1093\/bioinformatics\/btl230","article-title":"Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks","volume":"22","author":"Gevaert","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1155\/JBB.2005.147","article-title":"Classification and selection of biomarkers in genomic data using LASSO","volume":"2005","author":"Ghosh","year":"2005","journal-title":"J. Biomed. Biotechnol."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023041208441392600_","first-page":"830","article-title":"Application of genetic algorithm\u2014support vector machine hybrid for prediction of clinical phenotypes based on geneome-wide SNP profiles of sib pairs. In","volume-title":"Lecture Notes in Computer Science 3614","author":"Gong","year":"2005"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1924","DOI":"10.1093\/bioinformatics\/btl196","article-title":"Comparative gene marker selection suite","volume":"22","author":"Gould","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","first-page":"1157","article-title":"An introduction to variable and feature selection","volume":"3","author":"Guyon","year":"2003","journal-title":"J. Mach Learn Res."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1012487302797","article-title":"Gene selection for cancer classification using support vector machines","volume":"46","author":"Guyon","year":"2002","journal-title":"Mach. Learn."},{"key":"2023041208441392600_","article-title":"Correlation-based feature selection for machine learning","volume-title":"PhD Thesis","author":"Hall","year":"1999"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"i195","DOI":"10.1093\/bioinformatics\/bti1021","article-title":"Tag SNP selection in genotype data for maximizing SNP prediction accuracy","volume":"21","author":"Halperin","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2136","DOI":"10.1093\/bioinformatics\/btl350","article-title":"Substring selection for biomedical document classification","volume":"22","author":"Han","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2558","DOI":"10.1093\/bioinformatics\/btl420","article-title":"MLR-tagging: informative SNP selection for unphased genotypes based on multiple linear regression","volume":"22","author":"He","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"409","DOI":"10.1002\/mas.20072","article-title":"Processing and classification of protein mass spectra","volume":"25","author":"Hilario","year":"2006","journal-title":"Mass Spectrom. Rev."},{"key":"2023041208441392600_","volume-title":"Adaptation in Natural and Artificial Systems","author":"Holland","year":"1975"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/S0004-3702(00)00052-7","article-title":"Feature subset selection by Bayesian networks based optimization","volume":"123","author":"Inza","year":"2000","journal-title":"Artif. Intell."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/j.artmed.2004.01.007","article-title":"Filter versus wrapper gene selection approaches in DNA microarray domains","volume":"31","author":"Inza","year":"2004","journal-title":"Artif. Intell. Med."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/1472-6947-6-27","article-title":"An assessment of recently published gene expression data analyses: reporting experimental design and statistical factors","volume":"6","author":"Jafari","year":"2006","journal-title":"BMC Med. Inform. Decis. Mak."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1038\/nrg1768","article-title":"Literature mining for the biologist: from information retrieval to biological discovery","volume":"7","author":"Jensen","year":"2006","journal-title":"Nat. Rev. Genet."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1186\/1471-2105-5-81","article-title":"Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes","volume":"5","author":"Jiang","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1186\/1471-2105-6-148","article-title":"Feature selection and classification for microarray data analysis: evolutionary methods for identifying predictive genes","volume":"6","author":"Jirapech-Umpai","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","first-page":"41","article-title":"Feature selection in proteomic pattern data with support vector machines","author":"Jong","year":"2004"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1167","DOI":"10.1093\/bioinformatics\/18.9.1167","article-title":"Identification of regulatory elements using a feature selection method","volume":"18","author":"Keles","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1186\/1471-2105-7-411","article-title":"miTarget: microRNA target gene prediction using a support vector machine","volume":"7","author":"Kim","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1007\/978-94-009-9941-1_3","volume-title":"Pattern Recognition and Signal Processing, Chapter Feature Set Search Algorithms","author":"Kittler","year":"1978"},{"key":"2023041208441392600_","first-page":"234","article-title":"Data mining using MLC++: a machine learning library in C++. In","volume-title":"Tools with Artificial Intelligence","author":"Kohavi","year":"1996"},{"key":"2023041208441392600_","first-page":"284","article-title":"Toward optimal feature selection. In","volume-title":"Proceedings of the Thirteenth International Conference on Machine Learning","author":"Koller","year":"1996"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1038\/85776","article-title":"Variation in the spice of life","volume":"27","author":"Kruglyak","year":"2001","journal-title":"Nat. Genet"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"e211","DOI":"10.1093\/bioinformatics\/btl233","article-title":"BNTagger: improved tagging SNP selection using Bayesian networks","volume":"22","author":"LeeP","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1093\/bioinformatics\/btg458","article-title":"CHOISS for selection on single nucleotide polymorphism markers on interval regularity","volume":"20","author":"Lee","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"869","DOI":"10.1016\/j.csda.2004.03.017","article-title":"An extensive comparison of recent classification tools applied to microarray data","volume":"48","author":"Lee","year":"2005","journal-title":"Comput. Stat. and Data Anal."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1093\/bioinformatics\/19.1.90","article-title":"Gene selection: a Bayesian variable selection approach","volume":"19","author":"Lee","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1093\/bioinformatics\/btk005","article-title":"EDGE: extraction and analysis of differential gene expression","volume":"22","author":"Leek","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1186\/1471-2105-6-68","article-title":"Feature selection and nearest centroid classification for protein mass spectrometry","volume":"6","author":"Levner","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1131","DOI":"10.1093\/bioinformatics\/17.12.1131","article-title":"Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA\/KNN method","volume":"17","author":"Li","year":"2001","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1638","DOI":"10.1093\/bioinformatics\/bth098","article-title":"Applications of the GA\/KNN method to SELDI proteomics data","volume":"20","author":"Li","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2429","DOI":"10.1093\/bioinformatics\/bth267","article-title":"A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression","volume":"20","author":"Li","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","first-page":"137","article-title":"How many genes are needed for a discriminant microarray data analysis? In","author":"Li"},{"key":"2023041208441392600_","first-page":"1184","article-title":"Large-scale ensemble decision analysis of sib-pair ibd profiles for identification of the relevant molecular signatures for alcoholism. In","volume-title":"Lecture Notes in Computer Science 3614","author":"Li","year":"2005"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"850","DOI":"10.1086\/425587","article-title":"Finding haplotype tagging SNPs by use of principal components analysis","volume":"73","author":"Lin","year":"2004","journal-title":"Am. J. Hum. Genet."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4615-5689-3","volume-title":"Feature Selection for Knowledge Discovery and Data Mining","author":"Liu","year":"1998"},{"key":"2023041208441392600_","first-page":"51","article-title":"A comparative study on feature selection and classification methods using gene expression profiles and proteomic patterns","volume":"13","author":"Liu","year":"2002","journal-title":"Genome Inform."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"255","DOI":"10.3233\/ISB-00132","article-title":"Using amino acid patterns to accurately predict translation initiation sites. In","volume":"4","author":"Liu","year":"2004","journal-title":"Silico Biol"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1186\/1471-2105-5-110","article-title":"Tests for finding complex patterns of differential expression in cancers: towards individualized medicine","volume":"5","author":"Lyons-Weiler","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2393","DOI":"10.1016\/j.patcog.2006.07.010","article-title":"Selecting features in microarray classification using ROC curves","volume":"39","author":"Mamitsuka","year":"2006","journal-title":"Pattern Recognit."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"4356","DOI":"10.1093\/bioinformatics\/bti724","article-title":"Regularized ROC method for disease classification and biomarker selection with microarray data","volume":"21","author":"Ma","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"390","DOI":"10.1093\/bioinformatics\/btl602","article-title":"Prophet, a web-based tool for class prediction using microarray data","volume":"23","author":"Medina","year":"2007","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"3301","DOI":"10.1093\/bioinformatics\/bti499","article-title":"Prediction error estimation: a comparison of resampling methods","volume":"21","author":"Molinaro","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1089\/106652701300099074","article-title":"On differential variability of expression ratios: improving statistical inference about gene expression changes from microarray data","volume":"8","author":"Newton","year":"2001","journal-title":"J. Comput. Biol."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1093\/bioinformatics\/19.1.37","article-title":"Genetic algorithms applied to multi-class prediction for the analysis of gene expression data","volume":"19","author":"Ooi","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1333","DOI":"10.1093\/bioinformatics\/btg167","article-title":"On the use of permutation in and the performance of a class of nonparametric methods to detect differential gene expression","volume":"19","author":"Pan","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","first-page":"52","article-title":"A nonparametric scoring algorithm for identifying informative genes from microarray data","volume":"6","author":"Park","year":"2001","journal-title":"Pac. Symp. on Biocompu"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1186\/1471-2105-7-345","article-title":"Individualized markers optimize class prediction of microarray data","volume":"7","author":"Pavlidis","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1373\/49.4.533","article-title":"Mass spectometry-based diagnostic: the upcoming revolution in disease detection","volume":"49","author":"Petricoin","year":"2003","journal-title":"Clin. Chem."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1016\/S0140-6736(02)07746-2","article-title":"Use of proteomics patterns in serum to identify ovarian cancer","volume":"359","author":"Petricoin","year":"2002","journal-title":"The Lancet"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"556","DOI":"10.1093\/bioinformatics\/btk013","article-title":"Multidimensional local false discovery rate for microarray studies","volume":"22","author":"Ploner","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1737","DOI":"10.1093\/bioinformatics\/bth160","article-title":"Improving false discovery rate estimation","volume":"20","author":"Pounds","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2320","DOI":"10.1002\/pmic.200400857","article-title":"Mining mass-spectra for diagnosis and biomarker discovery of cerebral accidents","volume":"4","author":"Prados","year":"2004","journal-title":"Proteomics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"4039","DOI":"10.1093\/bioinformatics\/bti670","article-title":"Analysis of mass spectral serum profiles for biomarker selection","volume":"21","author":"Ressom","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"619","DOI":"10.1093\/bioinformatics\/btl678","article-title":"Peak selection from MALDI-TOF mass spectra using ant colony optimization","volume":"23","author":"Ressom","year":"2007","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1038\/73432","article-title":"Systematic variation in gene expression patterns in human cancer cell lines","volume":"24","author":"Ross","year":"2000","journal-title":"Nat. Genet."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2383","DOI":"10.1016\/j.patcog.2005.11.001","article-title":"Incremental wrapper-based gene selection from microarray data for cancer classification","volume":"39","author":"Ruiz","year":"2006","journal-title":"Pattern Recognit."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1186\/1471-2105-5-64","article-title":"Feature selection for splice site prediction: a new method using EDA-based feature ranking","volume":"5","author":"Saeys","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"414","DOI":"10.1093\/bioinformatics\/btl639","article-title":"In search of the small ones: improved prediction of short exons in vertebrates, plants, fungi, and protists","volume":"23","author":"Saeys","year":"2007","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"544","DOI":"10.1093\/nar\/26.2.544","article-title":"Microbial gene identification using interpolated markov models","volume":"26","author":"Salzberg","year":"1998","journal-title":"Nucleic Acids Res."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2921","DOI":"10.1093\/bioinformatics\/bti436","article-title":"twilight; a Bioconductor package for estimating the local false discovery rate","volume":"21","author":"Scheid","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/j.artmed.2004.04.002","article-title":"Data mining and genetic algorithm based gene\/SNP selection","volume":"31","author":"Shah","year":"2004","journal-title":"Artif. Intell. Med."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1093\/bioinformatics\/btl532","article-title":"Combining functional and linkage disequilibrium information in the selection of tag snps","volume":"23","author":"Sham","year":"2007","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/j.jbi.2005.04.002","article-title":"A machine learning perspective on the development of clinical decision support systems utilizing mass spectra of blood samples","volume":"39","author":"Shin","year":"2006","journal-title":"J. Biomed. Inform."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1142\/S0218001488000145","article-title":"On automatic feature selection","volume":"2","author":"Siedelecky","year":"1998","journal-title":"Int. J. Pattern Recogni."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2430","DOI":"10.1093\/bioinformatics\/btl407","article-title":"What should be expected from feature selection in small-sample settings","volume":"22","author":"Sima","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1046","DOI":"10.1093\/bioinformatics\/bti081","article-title":"Superior feature-set ranking for small samples using bolstered error estimation","volume":"21","author":"Sima","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1089\/10665270360688219","article-title":"Discriminative motifs","volume":"10","author":"Sinha","year":"2003","journal-title":"J. Comput. Biol."},{"key":"2023041208441392600_","first-page":"293","article-title":"Prototype and feature selection by sampling and random mutation hill climbing algorithms","author":"Skalak","year":"1994"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1027","article-title":"Linear models and empirical Bayes methods for assessing differential expression in microarray experiments","volume":"3","author":"Smyth","year":"2004","journal-title":"Stat. Appl. in Genet. and Mol. Biol."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1484","DOI":"10.1093\/bioinformatics\/btg182","article-title":"Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions","volume":"19","author":"Somorjai","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1093\/bioinformatics\/bti033","article-title":"A comprhensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis","volume":"21","author":"Statnikov","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1111\/1467-9868.00346","article-title":"A direct approach to false discovery rates","volume":"64","author":"Storey","year":"2002","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1587","DOI":"10.1093\/bioinformatics\/btg179","article-title":"RankGene: identification of diagnostic genes based on expression data","volume":"19","author":"Su","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2553","DOI":"10.1093\/bioinformatics\/bth282","article-title":"Identification of DNA regulatory motifs using Bayesian variable selection","volume":"20","author":"Tadesse","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1227","DOI":"10.1101\/gr.165101","article-title":"An efficient and robust statistical modeling approach to discover differentially expressed genes using genomic expression profiles","volume":"11","author":"Thomas","year":"2001","journal-title":"Genome Res."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"3034","DOI":"10.1093\/bioinformatics\/bth357","article-title":"Sample classification from protein mass spectrometry, by \u2018peak probability contrast\u2019","volume":"20","author":"Tibshirani","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1154","DOI":"10.1093\/bioinformatics\/btl074","article-title":"GALGO: an R package for multivariate variable selection using genetic algorithms","volume":"22","author":"Trevino","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1454","DOI":"10.1093\/bioinformatics\/18.11.1454","article-title":"Nonparametric methods for identifying differentially expressed genes in microarray data","volume":"18","author":"Troyanskaya","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1073\/pnas.091062498","article-title":"Significance analysis of microarrays applied to ionizing radiation response. In","volume":"Vol. 98","author":"Tusher","year":"2001","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"e507","DOI":"10.1093\/bioinformatics\/btl214","article-title":"Novel unsupervised feature filtering of biological data","volume":"22","author":"Varshavsky","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1016\/j.compbiolchem.2004.11.001","article-title":"Gene selection from microarray data for cancer classification\u2013a machine learning approach","volume":"29","author":"Wang","year":"2005","journal-title":"Comput. Biol. Chem."},{"key":"2023041208441392600_","first-page":"1057","article-title":"Tumor classification based on DNA copy number aberrations determined using SNPS arrays","volume":"5","author":"Wang","year":"2006","journal-title":"Oncol. Rep."},{"key":"2023041208441392600_","first-page":"1439","article-title":"Use of the zero-norm with linear models and kernel methods","volume":"3","author":"Weston","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"2023041208441392600_","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques","author":"Witten","year":"2005"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1636","DOI":"10.1093\/bioinformatics\/btg210","article-title":"Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data","volume":"19","author":"Wu","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","first-page":"601","article-title":"Feature selection for high-dimensional genomic microarray data","author":"Xing","year":"2001"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1878","DOI":"10.1101\/gr.190001","article-title":"Biomarker identification by feature wrappers","volume":"11","author":"Xiong","year":"2001","journal-title":"Genome Res."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"1084","DOI":"10.1093\/bioinformatics\/bti108","article-title":"Identifying differentially expressed genes from microarray experiments via statistic synthesis","volume":"21","author":"Yang","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1016\/S1535-6108(02)00032-6","article-title":"Classification, subtype discovery, and prediction of outcome in pediatric lymphoblastic leukemia by gene expression profiling","volume":"1","author":"Yeoh","year":"2002","journal-title":"Cancer Cell"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"R83","DOI":"10.1186\/gb-2003-4-12-r83","article-title":"Multiclass classification of microarray data with repeated measurements: application to cancer","volume":"4","author":"Yeung","year":"2003","journal-title":"Genome Biol."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2394","DOI":"10.1093\/bioinformatics\/bti319","article-title":"Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data","volume":"21","author":"Yeung","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"i487","DOI":"10.1093\/bioinformatics\/bti1030","article-title":"Bayesian neural network approaches to ovarian cancer identification from high-resolution mass spectrometry data","volume":"21","author":"Yu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"2200","DOI":"10.1093\/bioinformatics\/bti370","article-title":"Ovarian cancer identification based on dimensionality reduction for high-throughput mass spectrometry data","volume":"21","author":"Yu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","first-page":"1205","article-title":"Efficient feature selection via analysis of relevance and redundancy","volume":"5","author":"Yu","year":"2004","journal-title":"J. Mach. Learn. Res."},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"689","DOI":"10.1093\/bioinformatics\/18.5.689","article-title":"Support vector machines with selective kernel scaling for protein classification and identification of key amino acid positions","volume":"18","author":"Zavaljevsky","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041208441392600_","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1186\/1471-2105-7-197","article-title":"Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data","volume":"7","author":"Zhang","year":"2006","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/19\/2507\/49857541\/bioinformatics_23_19_2507.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/19\/2507\/49857541\/bioinformatics_23_19_2507.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,20]],"date-time":"2025-01-20T16:58:39Z","timestamp":1737392319000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/19\/2507\/185254"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,8,24]]},"references-count":137,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2007,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm344","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,10,1]]},"published":{"date-parts":[[2007,8,24]]}}}