{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T01:05:05Z","timestamp":1775869505791,"version":"3.50.1"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: We propose a method for studying the stability of biomarker lists obtained from functional genomics studies. It is common to adopt resampling methods to tune and evaluate marker-based diagnostic and prognostic systems in order to prevent selection bias. Such caution promotes honest estimation of class prediction, but leads to alternative sets of solutions. In microarray studies, the difference in lists may be bewildering, also due to the presence of modules of functionally related genes. Methods for assessing stability understand the dependency of the markers on the data or on the predictor's type and help selecting solutions.<\/jats:p>\n               <jats:p>Results: A computational framework for comparing sets of ranked biomarker lists is presented. Notions and algorithms are based on concepts from permutation group theory. We introduce several algebraic indicators and metric methods for symmetric groups, including the Canberra distance, a weighted version of Spearman's footrule. We also consider distances between partial lists and an aggregation of sets of lists into an optimal list based on voting theory (Borda count). The stability indicators are applied in practical situations to several synthetic, cancer microarray and proteomics datasets. The addressed issues are predictive classification, presence of modules, comparison of alternative biomarker lists, outlier removal, control of selection bias by randomization techniques and enrichment analysis.<\/jats:p>\n               <jats:p>Availability: Supplementary Material and software are available at the address http:\/\/biodcv.fbk.eu\/listspy.html<\/jats:p>\n               <jats:p>Contact: \u00a0furlan@fbk.eu<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm550","type":"journal-article","created":{"date-parts":[[2007,11,17]],"date-time":"2007-11-17T01:25:29Z","timestamp":1195262729000},"page":"258-264","source":"Crossref","is-referenced-by-count":70,"title":["Algebraic stability indicators for ranked lists in molecular profiling"],"prefix":"10.1093","volume":"24","author":[{"given":"Giuseppe","family":"Jurman","sequence":"first","affiliation":[{"name":"1 FBK, via Sommarive 18, I-38100 Povo (Trento), 2DISI, University of Genova, via Dodecaneso 35, I-16146 Genova and 3DIT, University of Trento, via Sommarive 14, I-38100 Povo (Trento), Italy"}]},{"given":"Stefano","family":"Merler","sequence":"additional","affiliation":[{"name":"1 FBK, via Sommarive 18, I-38100 Povo (Trento), 2DISI, University of Genova, via Dodecaneso 35, I-16146 Genova and 3DIT, University of Trento, via Sommarive 14, I-38100 Povo (Trento), Italy"}]},{"given":"Annalisa","family":"Barla","sequence":"additional","affiliation":[{"name":"1 FBK, via Sommarive 18, I-38100 Povo (Trento), 2DISI, University of Genova, via Dodecaneso 35, I-16146 Genova and 3DIT, University of Trento, via Sommarive 14, I-38100 Povo (Trento), Italy"},{"name":"1 FBK, via Sommarive 18, I-38100 Povo (Trento), 2DISI, University of Genova, via Dodecaneso 35, I-16146 Genova and 3DIT, University of Trento, via Sommarive 14, I-38100 Povo (Trento), Italy"}]},{"given":"Silvano","family":"Paoli","sequence":"additional","affiliation":[{"name":"1 FBK, via Sommarive 18, I-38100 Povo (Trento), 2DISI, University of Genova, via Dodecaneso 35, I-16146 Genova and 3DIT, University of Trento, via Sommarive 14, I-38100 Povo (Trento), Italy"},{"name":"1 FBK, via Sommarive 18, I-38100 Povo (Trento), 2DISI, University of Genova, via Dodecaneso 35, I-16146 Genova and 3DIT, University of Trento, via Sommarive 14, I-38100 Povo (Trento), Italy"}]},{"given":"Antonio","family":"Galea","sequence":"additional","affiliation":[{"name":"1 FBK, via Sommarive 18, I-38100 Povo (Trento), 2DISI, University of Genova, via Dodecaneso 35, I-16146 Genova and 3DIT, University of Trento, via Sommarive 14, I-38100 Povo (Trento), Italy"}]},{"given":"Cesare","family":"Furlanello","sequence":"additional","affiliation":[{"name":"1 FBK, via Sommarive 18, I-38100 Povo (Trento), 2DISI, University of Genova, via Dodecaneso 35, I-16146 Genova and 3DIT, University of Trento, via Sommarive 14, I-38100 Povo (Trento), Italy"}]}],"member":"286","published-online":{"date-parts":[[2007,11,16]]},"reference":[{"key":"2023020209490700800_B1","doi-asserted-by":"crossref","first-page":"6562","DOI":"10.1073\/pnas.102102699","article-title":"Selection bias in gene extraction on the basis of microarray gene-expression data","volume":"99","author":"Ambroise","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020209490700800_B2","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1186\/1471-2105-7-407","article-title":"Identifying genes that contribute most to good classification in microarrays","volume":"7","author":"Baker","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020209490700800_B3","first-page":"941","article-title":"Proteome profiling without selection bias","volume-title":"Proceedings of IEEE-CBMS 2006","author":"Barla","year":"2006"},{"key":"2023020209490700800_B4","article-title":"M\u00e9moire sur les \u00e9lections au scrutin","author":"Borda","year":"1781","journal-title":"Histoire de l'Acad\u00e9mie Royale des Sciences"},{"key":"2023020209490700800_B5","first-page":"481","article-title":"Permutation editing and matching via embeddings","volume-title":"Proceedings of ICALP 01","author":"Cormode","year":"2001"},{"key":"2023020209490700800_B6","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4612-1106-8","volume-title":"Metric Methods for Analyzing Partially Ranked Data. LNS 34","author":"Critchlow","year":"1985"},{"key":"2023020209490700800_B7","doi-asserted-by":"crossref","first-page":"2356","DOI":"10.1093\/bioinformatics\/btl400","article-title":"Reliable gene signatures for microarray classification: assessment of stability and performance","volume":"22","author":"Davis","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020209490700800_B8","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1204","article-title":"Combining results of microarray experiments: a rank aggregation approach","volume":"5","author":"DeConde","year":"2006","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023020209490700800_B9","article-title":"Group representations in probability and statistics","author":"Diaconis","year":"1988","journal-title":"Institute of Mathematical Statistics Lecture Notes \u2013 Monograph Series 11"},{"key":"2023020209490700800_B10","first-page":"613","article-title":"Rank aggregation for the web","volume-title":"Proceedings of IWWWCC-WWW10","author":"Dwork","year":"2001"},{"key":"2023020209490700800_B11","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1137\/S0895480102412856","article-title":"Comparing top-k lists","volume":"17","author":"Fagin","year":"2003","journal-title":"SIAM J. Discrete Math."},{"key":"2023020209490700800_B12","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1186\/1471-2105-4-54","article-title":"Entropy-based gene ranking without selection bias for the predictive classification of microarray data","volume":"4","author":"Furlanello","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023020209490700800_B13","article-title":"Algorithms on Strings, Trees and Sequences","author":"Gusfield","year":"1997","journal-title":"CUP"},{"key":"2023020209490700800_B14","first-page":"218","article-title":"Stability of feature selection algorithms","volume-title":"Proceedings of IEEE-ICDM 05","author":"Kalousis","year":"2005"},{"key":"2023020209490700800_B15","doi-asserted-by":"crossref","first-page":"340","DOI":"10.1080\/0025570X.2006.11953430","article-title":"Spectral analysis of the supreme court","volume":"79","author":"Lawson","year":"2006","journal-title":"Math. Mag."},{"key":"2023020209490700800_B16","doi-asserted-by":"crossref","first-page":"2315","DOI":"10.1093\/bioinformatics\/btl385","article-title":"OrderedList \u2013 a Bioconductor package for detecting similarity in ordered gene lists","volume":"22","author":"Lottaz","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020209490700800_B17","article-title":"Feature selection, SVM-based classification and application to mass spectrometry data analysis","author":"Marchiori","year":"2006","journal-title":"Bioinformatics Data Analysis and Tools. Lecture Notes"},{"key":"2023020209490700800_B18","first-page":"72","article-title":"Deriving the kernel from training data","volume":"4472","author":"Merler","year":"2007","journal-title":"Proceedings MCS 2007, LNCS"},{"key":"2023020209490700800_B19","doi-asserted-by":"crossref","first-page":"3301","DOI":"10.1093\/bioinformatics\/bti499","article-title":"Prediction error estimation: a comparison of resampling methods","volume":"21","author":"Molinaro","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020209490700800_B20","volume-title":"Chaotic Elections! A Mathematician Looks at Voting","author":"Saari","year":"2001"},{"key":"2023020209490700800_B21","doi-asserted-by":"crossref","first-page":"1169","DOI":"10.1093\/jnci\/djj364","article-title":"Development and evaluation of therapeutically relevant predictive classifiers using gene expression profiling","volume":"98","author":"Simon","year":"2006","journal-title":"J. Natl Cancer Inst."},{"key":"2023020209490700800_B22","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1093\/jnci\/djj052","article-title":"Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis","volume":"98","author":"Sotiriou","year":"2006","journal-title":"J. Natl Cancer Inst."},{"key":"2023020209490700800_B23","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020209490700800_B24","article-title":"A stability metric for typological features","author":"Wichmann","year":"2006","journal-title":"Sprachtypologie und Universalienforschung"},{"key":"2023020209490700800_B25","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1177\/117693510600200031","article-title":"Ovarian cancer classification based on mass spectrometry analysis of sera","volume":"2","author":"Wu","year":"2006","journal-title":"Cancer Inform."},{"key":"2023020209490700800_B26","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1142\/S0219720006002120","article-title":"Similarities of ordered gene lists","volume":"4","author":"Yang","year":"2006","journal-title":"J. Bioinform. Comput. Biol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/2\/258\/49045109\/bioinformatics_24_2_258.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/2\/258\/49045109\/bioinformatics_24_2_258.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T10:29:23Z","timestamp":1675333763000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/2\/258\/226884"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,11,16]]},"references-count":26,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2008,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm550","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,1,15]]},"published":{"date-parts":[[2007,11,16]]}}}