{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T18:54:21Z","timestamp":1762541661745},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"21","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Recent advancements in microarray technology allows simultaneous monitoring of the expression levels of a large number of genes over different time points. Clustering is an important tool for analyzing such microarray data, typical properties of which are its inherent uncertainty, noise and imprecision. In this article, a two-stage clustering algorithm, which employs a recently proposed variable string length genetic scheme and a multiobjective genetic clustering algorithm, is proposed. It is based on the novel concept of points having significant membership to multiple classes. An iterated version of the well-known Fuzzy C-Means is also utilized for clustering.<\/jats:p><jats:p>Results: The significant superiority of the proposed two-stage clustering algorithm as compared to the average linkage method, Self Organizing Map (SOM) and a recently developed weighted Chinese restaurant-based clustering method (CRC), widely used methods for clustering gene expression data, is established on a variety of artificial and publicly available real life data sets. The biological relevance of the clustering solutions are also analyzed.<\/jats:p><jats:p>Contact: \u00a0anirbanbuba@yahoo.com<\/jats:p><jats:p>Supplementary information: The processed and normalized data sets, supplementary figures, tables and other related materials are available at http:\/\/d.1asphost.com\/anirbanmukhopadhyay\/simmts.html<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm418","type":"journal-article","created":{"date-parts":[[2007,8,26]],"date-time":"2007-08-26T00:13:47Z","timestamp":1188087227000},"page":"2859-2865","source":"Crossref","is-referenced-by-count":148,"title":["An improved algorithm for clustering gene expression data"],"prefix":"10.1093","volume":"23","author":[{"given":"Sanghamitra","family":"Bandyopadhyay","sequence":"first","affiliation":[{"name":"1 Machine Intelligence Unit, Indian Statistical Institute, Kolkata-700108, 2Department of Computer Science & Engineering, University of Kalyani, Kalyani-741235 and 3Department of Computer Science & Engineering, Jadavpur University, Kolkata 700032, India"}]},{"given":"Anirban","family":"Mukhopadhyay","sequence":"additional","affiliation":[{"name":"1 Machine Intelligence Unit, Indian Statistical Institute, Kolkata-700108, 2Department of Computer Science & Engineering, University of Kalyani, Kalyani-741235 and 3Department of Computer Science & Engineering, Jadavpur University, Kolkata 700032, India"}]},{"given":"Ujjwal","family":"Maulik","sequence":"additional","affiliation":[{"name":"1 Machine Intelligence Unit, Indian Statistical Institute, Kolkata-700108, 2Department of Computer Science & Engineering, University of Kalyani, Kalyani-741235 and 3Department of Computer Science & Engineering, Jadavpur University, Kolkata 700032, India"}]}],"member":"286","published-online":{"date-parts":[[2007,8,25]]},"reference":[{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1093\/bioinformatics\/btg455","article-title":"FatiGO: a web tool for finding significant associations of gene ontology terms with groups of genes","volume":"20","author":"Al-Shahrour","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"1506","DOI":"10.1109\/TGRS.2007.892604","article-title":"Multiobjective genetic clustering for pixel classification in remote sensing imagery","volume":"45","author":"Bandyopadhyay","year":"2007","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-0450-1","volume-title":"Pattern Recognition with Fuzzy Objective Function Algorithms","author":"Bezdek","year":"1981"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"699","DOI":"10.1126\/science.282.5389.699","article-title":"The transcriptional program of sporulation in budding yeast","volume":"282","author":"Chu","year":"1998","journal-title":"Science"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1109\/4235.996017","article-title":"A fast and elitist multiobjective genetic algorithm: NSGA-II","volume":"6","author":"Deb","year":"2002","journal-title":"IEEE Trans. Evol. Comput."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"973","DOI":"10.1093\/bioinformatics\/btg119","article-title":"Fuzzy c-means method for clustering microarray data","volume":"19","author":"Dembele","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","article-title":"Cluster analysis and display og genome-wide expression patterns","volume":"95","author":"Eisen","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"717","DOI":"10.1109\/TFUZZ.2005.856560","article-title":"A new convergence proof of fuzzy c-means","volume":"13","author":"Groll","year":"2005","journal-title":"IEEE Trans. Fuzzy Syst."},{"journal-title":"Nonparametric Statistical Methods","year":"1999","author":"Hollander","key":"2023041107264820300_"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1126\/science.283.5398.83","article-title":"The transcriptional program in the response of the human fibroblasts to serum","volume":"283","author":"Iyer","year":"1999","journal-title":"Science"},{"volume-title":"Algorithms for Clustering Data","year":"1988","author":"Jain","key":"2023041107264820300_"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-7-134","article-title":"Effect of data normalization on fuzzy clustering of DNA microarray data","volume":"7","author":"Kim","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"1455","DOI":"10.1016\/S0031-3203(99)00137-5","article-title":"Genetic algorithm based clustering technique","volume":"33","author":"Maulik","year":"2000","journal-title":"Pattern Recognit."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"1650","DOI":"10.1109\/TPAMI.2002.1114856","article-title":"Performance evaluation of some clustering algorithms and validity indices","volume":"24","author":"Maulik","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"1075","DOI":"10.1109\/TGRS.2003.810924","article-title":"Fuzzy partitioning using a real-coded variable-length genetic algorithm for pixel classification","volume":"41","author":"Maulik","year":"2003","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1016\/j.fss.2005.04.009","article-title":"A study of some fuzzy cluster validity indices, genetic clustering and application to pixel classification","volume":"155","author":"Pakhira","year":"2005","journal-title":"Fuzzy Sets Syst."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"1988","DOI":"10.1093\/bioinformatics\/btl284","article-title":"Clustering microarray gene expression data using weighted Chinese restaurant process","volume":"22","author":"Qin","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: a graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"J. Comput. Appl. Math."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"1787","DOI":"10.1093\/bioinformatics\/btg232","article-title":"CLICK and EXPANDER: a system for clustering and visualizing gene expression data","volume":"19","author":"Sharan","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"2907","DOI":"10.1073\/pnas.96.6.2907","article-title":"Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation","volume":"96","author":"Tamayo","year":"1999","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology","volume":"25","author":"The Gene Ontology Consortium","year":"2000","journal-title":"Nat. Genet."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"1073","DOI":"10.1093\/bioinformatics\/18.8.1073","article-title":"Analysis of expression profile using fuzzy adaptive resonance theory","volume":"18","author":"Tomida","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1073\/pnas.95.1.334","article-title":"Large-scale temporal gene expression mapping of central nervous system development","volume":"95","author":"Wen","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1109\/34.85677","article-title":"A validity measure for fuzzy clustering","volume":"13","author":"Xie","year":"1991","journal-title":"IEEE Trans. Pattern Anal. Mach. Intelli."},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1093\/bioinformatics\/17.9.763","article-title":"An empirical study on principal component analysis for clustering gene expression data","volume":"17","author":"Yeung","year":"2001","journal-title":"Bioinformatics"},{"key":"2023041107264820300_","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1093\/bioinformatics\/17.4.309","article-title":"Validating clustering for gene expression data","volume":"17","author":"Yeung","year":"2001","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/21\/2859\/49822478\/bioinformatics_23_21_2859.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/21\/2859\/49822478\/bioinformatics_23_21_2859.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,13]],"date-time":"2023-05-13T22:40:03Z","timestamp":1684017603000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/21\/2859\/371902"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,8,25]]},"references-count":26,"journal-issue":{"issue":"21","published-print":{"date-parts":[[2007,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm418","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2007,11,1]]},"published":{"date-parts":[[2007,8,25]]}}}