{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,19]],"date-time":"2026-01-19T06:35:16Z","timestamp":1768804516632,"version":"3.49.0"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Hierarchical clustering is a common approach to study protein and gene expression data. This unsupervised technique is used to find clusters of genes or proteins which are expressed in a coordinated manner across a set of conditions. Because of both the biological and technical variability, experimental repetitions are generally performed. In this work, we propose an approach to evaluate the stability of clusters derived from hierarchical clustering by taking repeated measurements into account.<\/jats:p>\n               <jats:p>Results: The method is based on the bootstrap technique that is used to obtain pseudo-hierarchies of genes from resampled datasets. Based on a fast dynamic programming algorithm, we compare the original hierarchy to the pseudo-hierarchies and assess the stability of the original gene clusters. Then a shuffling procedure can be used to assess the significance of the cluster stabilities. Our approach is illustrated on simulated data and on two microarray datasets. Compared to the standard hierarchical clustering methodology, it allows to point out the dubious and stable clusters, and thus avoids misleading interpretations.<\/jats:p>\n               <jats:p>Availability: The programs were developed in C and R languages.<\/jats:p>\n               <jats:p>Contact: \u00a0brehelin@lirmm.fr<\/jats:p>\n               <jats:p>Supplementary information: Supplementary Material and source code are available at address http:\/\/www.lirmm.fr\/~brehelin\/Stability\/<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn017","type":"journal-article","created":{"date-parts":[[2008,1,20]],"date-time":"2008-01-20T01:13:55Z","timestamp":1200791635000},"page":"682-688","source":"Crossref","is-referenced-by-count":10,"title":["Using repeated measurements to validate hierarchical gene clusters"],"prefix":"10.1093","volume":"24","author":[{"given":"Laurent","family":"Br\u00e9h\u00e9lin","sequence":"first","affiliation":[{"name":"1 M\u00e9thodes et Algorithmes pour la Bioinformatique, LIRMM, CNRS - University Montpellier II, 2INRA, Unit\u00e9 prot\u00e9omique, 2 Place Viala, 34060 Montpellier C\u00e9dex 1 and 3INRA, Unit\u00e9 Biostatistique et Processus Spatiaux, 84914 Avignon C\u00e9dex 9, France"}]},{"given":"Olivier","family":"Gascuel","sequence":"additional","affiliation":[{"name":"1 M\u00e9thodes et Algorithmes pour la Bioinformatique, LIRMM, CNRS - University Montpellier II, 2INRA, Unit\u00e9 prot\u00e9omique, 2 Place Viala, 34060 Montpellier C\u00e9dex 1 and 3INRA, Unit\u00e9 Biostatistique et Processus Spatiaux, 84914 Avignon C\u00e9dex 9, France"}]},{"given":"Olivier","family":"Martin","sequence":"additional","affiliation":[{"name":"1 M\u00e9thodes et Algorithmes pour la Bioinformatique, LIRMM, CNRS - University Montpellier II, 2INRA, Unit\u00e9 prot\u00e9omique, 2 Place Viala, 34060 Montpellier C\u00e9dex 1 and 3INRA, Unit\u00e9 Biostatistique et Processus Spatiaux, 84914 Avignon C\u00e9dex 9, France"},{"name":"1 M\u00e9thodes et Algorithmes pour la Bioinformatique, LIRMM, CNRS - University Montpellier II, 2INRA, Unit\u00e9 prot\u00e9omique, 2 Place Viala, 34060 Montpellier C\u00e9dex 1 and 3INRA, Unit\u00e9 Biostatistique et Processus Spatiaux, 84914 Avignon C\u00e9dex 9, France"}]}],"member":"286","published-online":{"date-parts":[[2008,1,19]]},"reference":[{"key":"2023020210114328400_B1","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/bth088","article-title":"Gostat: find statistically overrepresented gene ontologies within a group of genes","volume":"20","author":"Beissbarth","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020210114328400_B2","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1191\/1471082X05st096oa","article-title":"Mixture of linear mixed models for clustering gene expression profiles from repeated microarray experiments","volume":"5","author":"Celeux","year":"2005","journal-title":"Stat. Modelling"},{"key":"2023020210114328400_B3","doi-asserted-by":"crossref","first-page":"54","DOI":"10.2307\/2683591","article-title":"Lowess: a program for smoothing scatterplots by robust locally weigted regression","volume":"35","author":"Cleveland","year":"1981","journal-title":"Am. Stat"},{"key":"2023020210114328400_B4","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-7-research0036","article-title":"A prediction-based resampling method for estimating the number of cluster in a dataset","volume":"3","author":"Dudoit","year":"2002","journal-title":"Genome Biol"},{"key":"2023020210114328400_B5","first-page":"54","article-title":"Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy","volume":"1","author":"Efron","year":"1986","journal-title":"Stat. Sci"},{"key":"2023020210114328400_B6","doi-asserted-by":"crossref","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","article-title":"Cluster analysis and display of genome-wide expression patterns","volume":"95","author":"Eisen","year":"1998","journal-title":"PNAS"},{"key":"2023020210114328400_B7","doi-asserted-by":"crossref","first-page":"126","DOI":"10.1093\/bioinformatics\/17.2.126","article-title":"A hierarchical unsupervised growing neural network for clustering gene expression patterns","volume":"17","author":"Herrero","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210114328400_B8","doi-asserted-by":"crossref","first-page":"14732","DOI":"10.1073\/pnas.261293398","article-title":"A transcriptional roadmap to wood formation","volume":"98","author":"Hertzberg","year":"2001","journal-title":"PNAS"},{"key":"2023020210114328400_B9","doi-asserted-by":"crossref","first-page":"783","DOI":"10.2307\/2408678","article-title":"Confidence limites on phylogenies: an approach using the bootstrap","volume":"39","author":"Felsenstein","year":"1985","journal-title":"Evolution"},{"key":"2023020210114328400_B10","first-page":"203","article-title":"Statistical analysis of a gene expression microarray experiment with replication","volume":"12","author":"Kerr","year":"2002","journal-title":"Stat. Sinica"},{"key":"2023020210114328400_B11","doi-asserted-by":"crossref","first-page":"8961","DOI":"10.1073\/pnas.161273698","article-title":"Bootstrapping cluster analysis: assessing the reliability of conclusions from microarray experiments","volume":"98","author":"Kerr","year":"2001","journal-title":"Proc. Natl. Acad. Sci"},{"key":"2023020210114328400_B12","first-page":"617","article-title":"Stability based model selection","volume-title":"Advances in Neural Information Processing Systems.","author":"Lange","year":"2003"},{"key":"2023020210114328400_B13","doi-asserted-by":"crossref","first-page":"1462","DOI":"10.1093\/bioinformatics\/18.11.1462","article-title":"Methods for assessing reproductibility of clustering patterns oberved in analyses of microarray data","volume":"18","author":"McShane","year":"2002","journal-title":"Bioinformatics"},{"key":"2023020210114328400_B14","doi-asserted-by":"crossref","first-page":"1222","DOI":"10.1093\/bioinformatics\/bth068","article-title":"Bayesian mixture model based clustering of replicated microarray data","volume":"20","author":"Medvedovic","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020210114328400_B15","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/A:1023949509487","article-title":"Consensus clustering: A resampling-based method for class discovery and visualization of gene expression microarray data","volume":"52","author":"Monti","year":"2003","journal-title":"Machine Learning"},{"key":"2023020210114328400_B16","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-4-36","article-title":"Cluster stability scores for microarray data in cancer studies","volume":"4","author":"Smolkin","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023020210114328400_B17","doi-asserted-by":"crossref","first-page":"2907","DOI":"10.1073\/pnas.96.6.2907","article-title":"Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation","volume":"96","author":"Tamayo","year":"1999","journal-title":"PNAS"},{"key":"2023020210114328400_B18","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1038\/10343","article-title":"Systematic determination of genetic network architecture","volume":"22","author":"Tavazoie","year":"1999","journal-title":"Nat. Genetic"},{"key":"2023020210114328400_B19","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1093\/bioinformatics\/bti817","article-title":"Clusterv: a tool for assessing the reliability of clusters discovered in DNA microarray data","volume":"22","author":"Valentini","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210114328400_B20","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1080\/01621459.1963.10500845","article-title":"Hierarchical clustering to optimise an objective function","volume":"58","author":"Ward","year":"1963","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020210114328400_B21","doi-asserted-by":"crossref","first-page":"977","DOI":"10.1093\/bioinformatics\/17.10.977","article-title":"Model-based clustering and data transformations for gene expression data","volume":"17","author":"Yeung","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210114328400_B22","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1093\/bioinformatics\/17.4.309","article-title":"Validating clustering for gene expression data","volume":"17","author":"Yeung","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210114328400_B23","doi-asserted-by":"crossref","first-page":"R34","DOI":"10.1186\/gb-2003-4-5-r34","article-title":"Clustering gene-expression data with repeated measurements","volume":"4","author":"Yeung","year":"2003","journal-title":"Genome Biol"},{"key":"2023020210114328400_B24","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1007\/s101420000019","article-title":"Assessing reliability of gene clusters from gene expression data","volume":"1","author":"Zhang","year":"2000","journal-title":"Funct. Integrat. Genomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/5\/682\/49050385\/bioinformatics_24_5_682.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/5\/682\/49050385\/bioinformatics_24_5_682.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T11:49:35Z","timestamp":1675338575000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/5\/682\/202947"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,1,19]]},"references-count":24,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2008,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn017","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,3,1]]},"published":{"date-parts":[[2008,1,19]]}}}