{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T03:34:10Z","timestamp":1773804850357,"version":"3.50.1"},"reference-count":46,"publisher":"Oxford University Press (OUP)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The huge growth in gene expression data calls for the implementation of automatic tools for data processing and interpretation.<\/jats:p><jats:p>Results: We present a new and comprehensive machine learning data mining framework consisting in a non-linear PCA neural network for feature extraction, and probabilistic principal surfaces combined with an agglomerative approach based on Negentropy aimed at clustering gene microarray data. The method, which provides a user-friendly visualization interface, can work on noisy data with missing points and represents an automatic procedure to get, with no a priori assumptions, the number of clusters present in the data. Cell-cycle dataset and a detailed analysis confirm the biological nature of the most significant clusters.<\/jats:p><jats:p>Availability: The software described here is a subpackage part of the ASTRONEURAL package and is available upon request from the corresponding author.<\/jats:p><jats:p>Contact: \u00a0robtag@unisa.it<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btk026","type":"journal-article","created":{"date-parts":[[2006,1,6]],"date-time":"2006-01-06T01:39:12Z","timestamp":1136511552000},"page":"589-596","source":"Crossref","is-referenced-by-count":32,"title":["A multi-step approach to time series analysis and gene expression clustering"],"prefix":"10.1093","volume":"22","author":[{"given":"R.","family":"Amato","sequence":"first","affiliation":[{"name":"Dipartimento di Scienze Fisiche, University of Naples \u2018Federico II\u2019 1 \u00a0 1 \u00a0 \u00a0 Naples, ITALY"}]},{"given":"A.","family":"Ciaramella","sequence":"additional","affiliation":[{"name":"Dipartimento di Matematica e Informatica, University of Salerno 2 \u00a0 2 \u00a0 \u00a0 Fisciano, Salerno, ITALY"}]},{"given":"N.","family":"Deniskina","sequence":"additional","affiliation":[{"name":"Dipartimento di Scienze Fisiche, University of Naples \u2018Federico II\u2019 1 \u00a0 1 \u00a0 \u00a0 Naples, ITALY"},{"name":"Institute of Information Transmission Problems, Russian Academy of Sciences 3 \u00a0 3 \u00a0 \u00a0 Moscow, Russia"}]},{"given":"C. Del","family":"Mondo","sequence":"additional","affiliation":[{"name":"Dipartimento di Scienze Fisiche, University of Naples \u2018Federico II\u2019 1 \u00a0 1 \u00a0 \u00a0 Naples, ITALY"}]},{"given":"D.","family":"di Bernardo","sequence":"additional","affiliation":[{"name":"Telethon Institute of Genetics and Medicine 4 \u00a0 4 \u00a0 \u00a0 Naples, ITALY"}]},{"given":"C.","family":"Donalek","sequence":"additional","affiliation":[{"name":"Dipartimento di Scienze Fisiche, University of Naples \u2018Federico II\u2019 1 \u00a0 1 \u00a0 \u00a0 Naples, ITALY"},{"name":"Department of Astronomy, California Institute of Technology 5 \u00a0 5 \u00a0 \u00a0 Pasadena CA, USA"}]},{"given":"G.","family":"Longo","sequence":"additional","affiliation":[{"name":"Dipartimento di Scienze Fisiche, University of Naples \u2018Federico II\u2019 1 \u00a0 1 \u00a0 \u00a0 Naples, ITALY"},{"name":"INFN\u2014Istituto Nazionale Fisica Nucleare 6 \u00a0 6 \u00a0 \u00a0 Sezione di Napoli, Naples, ITALY"},{"name":"INAF\u2014Istituto Nazionale di Astrofisica 7 \u00a0 7 \u00a0 \u00a0 Sezione di Napoli, Naples, ITALY"}]},{"given":"G.","family":"Mangano","sequence":"additional","affiliation":[{"name":"Dipartimento di Scienze Fisiche, University of Naples \u2018Federico II\u2019 1 \u00a0 1 \u00a0 \u00a0 Naples, ITALY"},{"name":"INFN\u2014Istituto Nazionale Fisica Nucleare 6 \u00a0 6 \u00a0 \u00a0 Sezione di Napoli, Naples, ITALY"},{"name":"Department of Physics, Syracuse University 8 \u00a0 8 \u00a0 \u00a0 Syracuse NY, USA"}]},{"given":"G.","family":"Miele","sequence":"additional","affiliation":[{"name":"Dipartimento di Scienze Fisiche, University of Naples \u2018Federico II\u2019 1 \u00a0 1 \u00a0 \u00a0 Naples, ITALY"},{"name":"INFN\u2014Istituto Nazionale Fisica Nucleare 6 \u00a0 6 \u00a0 \u00a0 Sezione di Napoli, Naples, ITALY"}]},{"given":"G.","family":"Raiconi","sequence":"additional","affiliation":[{"name":"Dipartimento di Matematica e Informatica, University of Salerno 2 \u00a0 2 \u00a0 \u00a0 Fisciano, Salerno, ITALY"},{"name":"INFN\u2014Istituto Nazionale Fisica Nucleare 6 \u00a0 6 \u00a0 \u00a0 Sezione di Napoli, Naples, ITALY"}]},{"given":"A.","family":"Staiano","sequence":"additional","affiliation":[{"name":"Dipartimento di Scienze Fisiche, University of Naples \u2018Federico II\u2019 1 \u00a0 1 \u00a0 \u00a0 Naples, ITALY"},{"name":"Dipartimento di Matematica e Informatica, University of Salerno 2 \u00a0 2 \u00a0 \u00a0 Fisciano, Salerno, ITALY"}]},{"given":"R.","family":"Tagliaferri","sequence":"additional","affiliation":[{"name":"Dipartimento di Matematica e Informatica, University of Salerno 2 \u00a0 2 \u00a0 \u00a0 Fisciano, Salerno, ITALY"},{"name":"INFN\u2014Istituto Nazionale Fisica Nucleare 6 \u00a0 6 \u00a0 \u00a0 Sezione di Napoli, Naples, ITALY"}]}],"member":"286","published-online":{"date-parts":[[2006,1,5]]},"reference":[{"key":"2023012408520276900_b1","doi-asserted-by":"crossref","first-page":"10101","DOI":"10.1073\/pnas.97.18.10101","article-title":"Singular value decomposition for genome-wide expression data processing and modeling","volume":"97","author":"Alter","year":"2000","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408520276900_b2","doi-asserted-by":"crossref","first-page":"1207","DOI":"10.1111\/j.1349-7006.2002.tb01225.x","article-title":"Fuzzy neural network applied to gene expression profilling for predicting the prognosis of diffuse large B-cell lymphoma","volume":"93","author":"Ando","year":"2002","journal-title":"Jpn. J. Cancer Res."},{"key":"2023012408520276900_b3","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198538493.001.0001","volume-title":"Neural Networks for Pattern Recognition","author":"Bishop","year":"1995"},{"key":"2023012408520276900_b4","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1162\/089976698300017953","article-title":"GTM: The generative topographic mapping","volume":"10","author":"Bishop","year":"1998","journal-title":"Neural Comput."},{"key":"2023012408520276900_b5","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1038\/84792","article-title":"Regulatory element detection using correlation with expression","volume":"27","author":"Bussermaker","year":"2001","journal-title":"Nat. Genet."},{"key":"2023012408520276900_b6","first-page":"32","article-title":"Gene expression pattern analysis via latent variable models coupled with topographic clustering","volume":"1","author":"Chang","year":"2003","journal-title":"Genom. Inform."},{"key":"2023012408520276900_b7","unstructured":"Chang K. 2000 Nonlinear Dimensionality Reduction Using Probabilistic Principal Surfaces, PhD Thesis, Department of Electrical and Computer Engineering, University of Texas at Austin, USA"},{"key":"2023012408520276900_b8","first-page":"n1","article-title":"A unified model for probabilistic principal surfaces","volume":"23","author":"Chang","year":"2001","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"2023012408520276900_b9","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1117\/12.281504","article-title":"Ratio-based decision and the quantitative analysis of cDNA microarray images","author":"Chen","year":"1997","journal-title":"J. Biomed. Opt."},{"key":"2023012408520276900_b10","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/S1097-2765(00)80114-8","article-title":"A genom-wide transcriptional analysis of the mitotic cells","volume":"2","author":"Cho","year":"1998","journal-title":"Mol. Cell"},{"issue":"Database issue","key":"2023012408520276900_b11","doi-asserted-by":"crossref","first-page":"D311","DOI":"10.1093\/nar\/gkh033","article-title":"Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms","volume":"32","author":"Christie","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012408520276900_b12","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1051\/0004-6361:20035771","article-title":"A Multifrequency Analysis of Radio Variability of Blazars","volume":"419","author":"Ciaramella","year":"2004","journal-title":"J. Astron. Astrophys."},{"key":"2023012408520276900_b13","doi-asserted-by":"crossref","first-page":"1164","DOI":"10.1093\/bioinformatics\/bti093","article-title":"Comparison of computational methods for the identification of cell cycle-regulated genes. [Erratum (2005) Bioinformatics, 21, 3063.]","volume":"21","author":"de Lichtenberg","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012408520276900_b14","doi-asserted-by":"crossref","first-page":"n1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum-Likelihood from Incomplete Data Via the EM Algorithm","volume":"39","author":"Dempster","year":"1977","journal-title":"J. R. Sta. Soc."},{"key":"2023012408520276900_b15","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1038\/nbt1075","article-title":"Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks","volume":"23","author":"di Bernardo","year":"2005","journal-title":"Nat. Biotechnol."},{"key":"2023012408520276900_b16","volume-title":"Pattern Classification","author":"Duda","year":"2001","edition":"2nd edn."},{"key":"2023012408520276900_b17","doi-asserted-by":"crossref","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","article-title":"Cluster analysis and display of genome-wide expression patterns","volume":"95","author":"Eisen","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408520276900_b18","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1038\/1670","article-title":"Data management and analysis for gene expression arrays","volume":"20","author":"Ermolaeva","year":"1998","journal-title":"Nat. Genet."},{"key":"2023012408520276900_b19","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-21606-5","volume-title":"The Elements of Statistical Learning","author":"Hastie","year":"2001"},{"key":"2023012408520276900_b20","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1016\/S0092-8674(00)00015-5","article-title":"Functional discovery via a compendium of expression profiles","volume":"102","author":"Hughes","year":"2000","journal-title":"Cell"},{"key":"2023012408520276900_b21","doi-asserted-by":"crossref","DOI":"10.1002\/0471221317","volume-title":"Independent Component Analysis","author":"Hyv\u00e4rinen","year":"2001"},{"key":"2023012408520276900_b22","doi-asserted-by":"crossref","first-page":"R76","DOI":"10.1186\/gb-2003-4-11-r76","article-title":"Application of independent component analysis to microarrays","volume":"4","author":"Lee","year":"2003","journal-title":"Genom. Biol."},{"key":"2023012408520276900_b23","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1093\/bioinformatics\/18.1.51","article-title":"Linear modes of gene expression determined by independent component analysis","volume":"18","author":"Liebermeister","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012408520276900_b24","volume-title":"Principal Component Analysis","author":"Jolliffe","year":"2002","edition":"2nd edn."},{"key":"2023012408520276900_b25","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1016\/0893-6080(94)90060-4","article-title":"Representation and separation of signals using non-linear PCA type learing","volume":"7","author":"Karhunen","year":"1994","journal-title":"Neural Netw."},{"key":"2023012408520276900_b26","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1016\/0893-6080(94)00098-7","article-title":"Generalizations of principal component analysys, optimization problems and neural networks","volume":"8","author":"Karhunen","year":"1995","journal-title":"Neural Netw."},{"key":"2023012408520276900_b27","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1017\/S0016672301005055","article-title":"Statistical design and the analysis of gene expression microarray data","volume":"77","author":"Kerr","year":"2001","journal-title":"Genet. Res."},{"key":"2023012408520276900_b28","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-97610-0","volume-title":"Self-Organizing Maps","author":"Kohonen","year":"1995"},{"key":"2023012408520276900_b29","first-page":"1","article-title":"Clustering Using Neural Networks and Kullback-Leibler Divergency","author":"Martins","year":"2004"},{"key":"2023012408520276900_b30","doi-asserted-by":"crossref","first-page":"1112","DOI":"10.1101\/gr.225302","article-title":"Interactive exploration of microarray gene expression patterns in a reduced dimensional space","volume":"12","author":"Misra","year":"2002","journal-title":"Genome Res."},{"key":"2023012408520276900_b31","article-title":"Support vector machine classification of microarray data","author":"Mukherjee","year":"1999"},{"key":"2023012408520276900_b32","first-page":"385","article-title":"Learning in nonlinear constrained Hebbian network","volume-title":"Artificial Neural Networks","author":"Oja","year":"1991"},{"key":"2023012408520276900_b33","first-page":"16","article-title":"Principal and independent components in neural networks\u2014recent developments","author":"Oja","year":"1996"},{"key":"2023012408520276900_b34","doi-asserted-by":"crossref","first-page":"16","DOI":"10.2202\/1544-6115.1070","article-title":"Error distribution for gene expression data","volume":"4","author":"Purdom","year":"2005","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023012408520276900_b35","doi-asserted-by":"crossref","first-page":"3273","DOI":"10.1091\/mbc.9.12.3273","article-title":"Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization","volume":"9","author":"Spellman","year":"1998","journal-title":"Mol. Biol. Cell"},{"key":"2023012408520276900_b36","unstructured":"Staiano A. 2003 Unsupervised neural networks for the extraction of scientific information from astronomical data, PhD Thesis, University of Salerno Italy"},{"key":"2023012408520276900_b37","first-page":"63","article-title":"High-D data visualization methods via probabilistic principal surfaces for data mining applications","author":"Staiano","year":"2005"},{"key":"2023012408520276900_b38","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1051\/aas:1999254","article-title":"Spectral analysis of stellar light curves by means of neural networks","volume":"137","author":"Tagliaferri","year":"1999","journal-title":"Astron. Astrophys. Suppl. Ser."},{"key":"2023012408520276900_b39","doi-asserted-by":"crossref","first-page":"535","DOI":"10.1016\/S0098-3004(00)00166-7","article-title":"Soft computing methodologies for spectral analysis in cyclostratigraphy","volume":"27","author":"Tagliaferri","year":"2001","journal-title":"Comput. Geosci."},{"key":"2023012408520276900_b40","doi-asserted-by":"crossref","first-page":"2907","DOI":"10.1073\/pnas.96.6.2907","article-title":"Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation","volume":"96","author":"Tamayo","year":"1999","journal-title":"Proc. Natl Acad. Sci., USA"},{"key":"2023012408520276900_b41","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1016\/S0014-5793(99)00524-4","article-title":"Analysis of gene expression data using self-organizing maps","volume":"451","author":"T\u00f6r\u00f6nen","year":"1999","journal-title":"FEBS Lett."},{"key":"2023012408520276900_b42","doi-asserted-by":"crossref","first-page":"research0071.1","DOI":"10.1186\/gb-2002-3-12-research0071","article-title":"Bayesian analysis of gene expression levels: statistical quantification of relative mRNA level across multiple treatments or samples","volume":"3","author":"Townsend","year":"2002","journal-title":"Genome Biol."},{"key":"2023012408520276900_b43","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1186\/1471-2105-5-54","article-title":"Resolution of large and small differences in gene expression using models for the Bayesian analysis of gene expression levels and spotted DNA microarrays","volume":"5","author":"Townsend","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023012408520276900_b44","doi-asserted-by":"crossref","first-page":"2549","DOI":"10.1093\/nar\/29.12.2549","article-title":"Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effects","volume":"29","author":"Tseng","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023012408520276900_b45","doi-asserted-by":"crossref","first-page":"625","DOI":"10.1089\/106652701753307520","article-title":"Assessing gene significance from cDNA microarray expression data via mixed models","volume":"8","author":"Wolfinger","year":"2001","journal-title":"J. Comput. Biol."},{"key":"2023012408520276900_b46","doi-asserted-by":"crossref","first-page":"977","DOI":"10.1093\/bioinformatics\/17.10.977","article-title":"Model based clustering and data transformations for gene expression data","volume":"17","author":"Yeung","year":"2001","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/5\/589\/48838414\/bioinformatics_22_5_589.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/5\/589\/48838414\/bioinformatics_22_5_589.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,6]],"date-time":"2025-01-06T23:15:50Z","timestamp":1736205350000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/22\/5\/589\/205917"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,1,5]]},"references-count":46,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2006,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btk026","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2006,3,1]]},"published":{"date-parts":[[2006,1,5]]}}}