{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,3]],"date-time":"2025-11-03T22:57:04Z","timestamp":1762210624160},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: It is far from trivial to select the most effective clustering method and its parameterization, for a particular set of gene expression data, because there are a very large number of possibilities. Although many researchers still prefer to use hierarchical clustering in one form or another, this is often sub-optimal. Cluster ensemble research solves this problem by automatically combining multiple data partitions from different clusterings to improve both the robustness and quality of the clustering result. However, many existing ensemble techniques use an association matrix to summarize sample-cluster co-occurrence statistics, and relations within an ensemble are encapsulated only at coarse level, while those existing among clusters are completely neglected. Discovering these missing associations may greatly extend the capability of the ensemble methodology for microarray data clustering.<\/jats:p>\n               <jats:p>Results: The link-based cluster ensemble (LCE) method, presented here, implements these ideas and demonstrates outstanding performance. Experiment results on real gene expression and synthetic datasets indicate that LCE: (i) usually outperforms the existing cluster ensemble algorithms in individual tests and, overall, is clearly class-leading; (ii) generates excellent, robust performance across different types of data, especially with the presence of noise and imbalanced data clusters; (iii) provides a high-level data matrix that is applicable to many numerical clustering techniques; and (iv) is computationally efficient for large datasets and gene clustering.<\/jats:p>\n               <jats:p>Availability: Online supplementary and implementation are available at: http:\/\/users.aber.ac.uk\/nii07\/bioinformatics2010<\/jats:p>\n               <jats:p>Contact: \u00a0nii07@aber.ac.uk; natthakan@mfu.ac.th<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq226","type":"journal-article","created":{"date-parts":[[2010,5,6]],"date-time":"2010-05-06T00:50:39Z","timestamp":1273107039000},"page":"1513-1519","source":"Crossref","is-referenced-by-count":102,"title":["LCE: a link-based cluster ensemble method for improved gene expression data analysis"],"prefix":"10.1093","volume":"26","author":[{"given":"Natthakan","family":"Iam-on","sequence":"first","affiliation":[{"name":"1 Department of Computer Science, Aberystwyth University, Aberystwyth, Ceredigion, UK and 2 Department of Mathematics and Computer Science, Royal Thai Air Force Academy, Thailand"}]},{"given":"Tossapon","family":"Boongoen","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, Aberystwyth University, Aberystwyth, Ceredigion, UK and 2 Department of Mathematics and Computer Science, Royal Thai Air Force Academy, Thailand"},{"name":"1 Department of Computer Science, Aberystwyth University, Aberystwyth, Ceredigion, UK and 2 Department of Mathematics and Computer Science, Royal Thai Air Force Academy, Thailand"}]},{"given":"Simon","family":"Garrett","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, Aberystwyth University, Aberystwyth, Ceredigion, UK and 2 Department of Mathematics and Computer Science, Royal Thai Air Force Academy, Thailand"}]}],"member":"286","published-online":{"date-parts":[[2010,5,5]]},"reference":[{"key":"2023012508060725700_B1","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1038\/ng765","article-title":"MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia","volume":"30","author":"Armstrong","year":"2002","journal-title":"Nat. Genet."},{"key":"2023012508060725700_B2","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1016\/j.artmed.2008.07.014","article-title":"Fuzzy ensemble clustering based on random projections for DNA microarray data analysis","volume":"45","author":"Avogadri","year":"2009","journal-title":"Artif. Intell. Med."},{"key":"2023012508060725700_B3","doi-asserted-by":"crossref","first-page":"8679","DOI":"10.1158\/0008-5472.CAN-05-1204","article-title":"Functional network analysis reveals extended gliomagenesis pathway maps and three novel MYC-interacting genes in human gliomas","volume":"65","author":"Bredel","year":"2005","journal-title":"Cancer Res."},{"key":"2023012508060725700_B4","doi-asserted-by":"crossref","first-page":"4164","DOI":"10.1073\/pnas.0308531101","article-title":"Metagenes and molecular pattern discovery using matrix factorization","volume":"101","author":"Brunet","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508060725700_B5","doi-asserted-by":"crossref","first-page":"1929","DOI":"10.1091\/mbc.02-02-0023","article-title":"Gene expression patterns in human liver cancers","volume":"13","author":"Chen","year":"2002","journal-title":"Mol. Biol. Cell."},{"key":"2023012508060725700_B6","doi-asserted-by":"crossref","first-page":"31","DOI":"10.2353\/jmoldx.2006.050056","article-title":"Prognostic gene expression signatures can be measured in tissues collected in RNAlater preservative","volume":"8","author":"Chowdary","year":"2006","journal-title":"J. Mol. Diagn."},{"key":"2023012508060725700_B7","doi-asserted-by":"crossref","first-page":"497","DOI":"10.1186\/1471-2105-9-497","article-title":"Clustering cancer gene expression data: a comparative study","volume":"9","author":"de Souto","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012508060725700_B8","volume-title":"Pattern Classification.","author":"Duda","year":"2000","edition":"2"},{"key":"2023012508060725700_B9","doi-asserted-by":"crossref","first-page":"1090","DOI":"10.1093\/bioinformatics\/btg038","article-title":"Bagging to improve the accuracy of a clustering procedure","volume":"19","author":"Dudoit","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012508060725700_B10","first-page":"36","article-title":"Solving cluster ensemble problems by bipartite graph partitioning","volume-title":"Proceedings of International Conference on Machine Learning","author":"Fern","year":"2004"},{"key":"2023012508060725700_B11","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1109\/TPAMI.2005.113","article-title":"Combining multiple clusterings using evidence accumulation","volume":"27","author":"Fred","year":"2005","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023012508060725700_B12","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023012508060725700_B13","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1016\/j.inffus.2005.01.008","article-title":"Moderate diversity for better cluster ensembles","volume":"7","author":"Hadjitodorov","year":"2006","journal-title":"Inform. Fusion"},{"key":"2023012508060725700_B14","doi-asserted-by":"crossref","first-page":"3201","DOI":"10.1093\/bioinformatics\/bti517","article-title":"Computational cluster validation in post-genomic data analysis","volume":"21","author":"Handl","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012508060725700_B15","first-page":"222","article-title":"Refining pairwise similarity matrix for cluster ensemble problem with cluster relations","volume-title":"Proceedings of Eleventh International Conference on Discovery Science","author":"Iam-on","year":"2008"},{"key":"2023012508060725700_B16","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1006\/jpdc.1997.1404","article-title":"Multilevel k-way partitioning scheme for irregular graphs","volume":"48","author":"Karypis","year":"1998","journal-title":"J. Parallel Distrib. Comput."},{"key":"2023012508060725700_B17","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1109\/92.748202","article-title":"Multilevel hypergraph partitioning: applications in VLSI domain","volume":"7","author":"Karypis","year":"1999","journal-title":"IEEE Trans. VLSI Syst."},{"key":"2023012508060725700_B18","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1038\/89044","article-title":"Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks","volume":"7","author":"Khan","year":"2001","journal-title":"Nat. Med."},{"key":"2023012508060725700_B19","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1186\/1471-2105-10-260","article-title":"MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering","volume":"10","author":"Kim","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012508060725700_B20","first-page":"1214","article-title":"Using diversity in cluster ensembles","volume-title":"Proceedings of the IEEE International Conference on Systems, Man & Cybernetics","author":"Kuncheva","year":"2004"},{"key":"2023012508060725700_B21","doi-asserted-by":"crossref","first-page":"1798","DOI":"10.1109\/TPAMI.2006.226","article-title":"Evaluation of stability of k-means cluster ensembles with respect to random initialization","volume":"28","author":"Kuncheva","year":"2006","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023012508060725700_B22","first-page":"105","article-title":"Experimental comparison of cluster ensemble methods","volume-title":"Proceedings of International Conference on Fusion","author":"Kuncheva","year":"2006"},{"key":"2023012508060725700_B23","doi-asserted-by":"crossref","first-page":"13167","DOI":"10.1073\/pnas.1733249100","article-title":"Robust singular value decomposition analysis of microarray data","volume":"100","author":"Liu","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508060725700_B24","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","article-title":"A tutorial on spectral clustering","volume":"17","author":"Luxburg","year":"2007","journal-title":"Stat. Comput."},{"key":"2023012508060725700_B25","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1093\/bioinformatics\/18.3.413","article-title":"A mixture model-based approach to the clustering of microarray expression data","volume":"18","author":"McLachlan","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012508060725700_B26","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/A:1023949509487","article-title":"Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data","volume":"52","author":"Monti","year":"2003","journal-title":"Mach. Learn."},{"key":"2023012508060725700_B27","first-page":"849","article-title":"On spectral clustering: analysis and an algorithm","volume":"14","author":"Ng","year":"2002","journal-title":"NIPS"},{"key":"2023012508060725700_B28","first-page":"1602","article-title":"Gene expressionbased classification of malignant gliomas correlates better with survival than histological classification","volume":"63","author":"Nutt","year":"2003","journal-title":"Cancer Res."},{"key":"2023012508060725700_B29","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/415436a","article-title":"Prediction of central nervous system embryonal tumour outcome based on gene expression","volume":"415","author":"Pomeroy","year":"2002","journal-title":"Nature"},{"key":"2023012508060725700_B30","doi-asserted-by":"crossref","first-page":"15149","DOI":"10.1073\/pnas.211566398","article-title":"Multiclass cancer diagnosis using tumor gene expression signatures","volume":"98","author":"Ramaswamy","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508060725700_B31","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1504\/IJMSO.2006.011006","article-title":"Survey on test collections and techniques for personal name matching","volume":"1","author":"Reuther","year":"2006","journal-title":"Int. J. Metadata Semantics Ontologies"},{"key":"2023012508060725700_B32","doi-asserted-by":"crossref","first-page":"888","DOI":"10.1109\/34.868688","article-title":"Normalized cuts and image segmentation","volume":"22","author":"Shi","year":"2000","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023012508060725700_B33","doi-asserted-by":"crossref","first-page":"8418","DOI":"10.1073\/pnas.0932692100","article-title":"Repeated observation of breast tumor subtypes in independent gene expression data sets","volume":"100","author":"Sorlie","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508060725700_B34","first-page":"583","article-title":"Cluster ensembles: a knowledge reuse framework for combining multiple partitions","volume":"3","author":"Strehl","year":"2002","journal-title":"J. Mach. Learn. Res."},{"key":"2023012508060725700_B35","first-page":"7388","article-title":"Molecular classification of human carcinomas by use of gene expression signatures","volume":"61","author":"Su","year":"2001","journal-title":"Cancer Res."},{"key":"2023012508060725700_B36","doi-asserted-by":"crossref","first-page":"R94","DOI":"10.1186\/gb-2004-5-11-r94","article-title":"Consensus clustering and functional interpretation of gene-expression data","volume":"5","author":"Swift","year":"2004","journal-title":"Genome Biol."},{"key":"2023012508060725700_B37","doi-asserted-by":"crossref","first-page":"2888","DOI":"10.1093\/bioinformatics\/btm463","article-title":"Graph-based consensus clustering for class discovery from gene expression data","volume":"23","author":"Yu","year":"2007","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/12\/1513\/48858674\/bioinformatics_26_12_1513.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/12\/1513\/48858674\/bioinformatics_26_12_1513.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:09:52Z","timestamp":1674634192000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/12\/1513\/286962"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,5,5]]},"references-count":37,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2010,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq226","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,6,15]]},"published":{"date-parts":[[2010,5,5]]}}}