{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,19]],"date-time":"2025-03-19T11:15:42Z","timestamp":1742382942668,"version":"3.35.0"},"reference-count":50,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3015,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The regulation of proliferation and differentiation of embryonic and adult stem cells into mature cells is central to developmental biology. Gene expression measured in distinguishable developmental stages helps to elucidate underlying molecular processes. In previous work we showed that functional gene modules, which act distinctly in the course of development, can be represented by a mixture of trees. In general, the similarities in the gene expression programs of cell populations reflect the similarities in the differentiation path.<\/jats:p><jats:p>Results: We propose a novel model for gene expression profiles and an unsupervised learning method to estimate developmental similarity and infer differentiation pathways. We assess the performance of our model on simulated data and compare it with favorable results to related methods. We also infer differentiation pathways and predict functional modules in gene expression data of lymphoid development.<\/jats:p><jats:p>Conclusions: We demonstrate for the first time how, in principal, the incorporation of structural knowledge about the dependence structure helps to reveal differentiation pathways and potentially relevant functional gene modules from microarray datasets. Our method applies in any area of developmental biology where it is possible to obtain cells of distinguishable differentiation stages.<\/jats:p><jats:p>Availability: The implementation of our method (GPL license), data and additional results are available at http:\/\/algorithmics.molgen.mpg.de\/Supplements\/InfDif\/<\/jats:p><jats:p>Contact: \u00a0filho@molgen.mpg.de, schliep@molgen.mpg.de<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data is available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn153","type":"journal-article","created":{"date-parts":[[2008,6,27]],"date-time":"2008-06-27T07:43:13Z","timestamp":1214552593000},"page":"i156-i164","source":"Crossref","is-referenced-by-count":16,"title":["Inferring differentiation pathways from gene expression"],"prefix":"10.1093","volume":"24","author":[{"given":"Ivan G.","family":"Costa","sequence":"first","affiliation":[{"name":"Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stefan","family":"Roepcke","sequence":"additional","affiliation":[{"name":"Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christoph","family":"Hafemeister","sequence":"additional","affiliation":[{"name":"Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"Schliep","sequence":"additional","affiliation":[{"name":"Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,7,1]]},"reference":[{"key":"2023020210371275800_B1","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1182\/blood-2002-06-1780","article-title":"Transcriptional accessibility for genes of multiple tissues and hematopoietic lineages is hierarchically controlled during early hematopoiesis","volume":"101","author":"Akashi","year":"2003","journal-title":"Blood"},{"key":"2023020210371275800_B2","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1186\/1471-2164-8-46","article-title":"\u2018NeuroStem Chip\u2019: a novel highly specialized tool to study neural differentiation pathways in human stem cells","volume":"8","author":"Anisimov","year":"2007","journal-title":"BMC Genomics"},{"key":"2023020210371275800_B3","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023020210371275800_B4","doi-asserted-by":"crossref","first-page":"803","DOI":"10.2307\/2532201","article-title":"Model-based gaussian and non-gaussian clustering","volume":"49","author":"Banfield","year":"1993","journal-title":"Biometrics"},{"key":"2023020210371275800_B5","doi-asserted-by":"crossref","first-page":"2493","DOI":"10.1093\/bioinformatics\/bth283","article-title":"Analyzing time series gene expression data","volume":"20","author":"Bar-Joseph","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020210371275800_B6","doi-asserted-by":"crossref","first-page":"i22","DOI":"10.1093\/bioinformatics\/17.suppl_1.S22","article-title":"Fast optimal leaf ordering for hierarchical clustering","volume":"17","author":"Bar-Joseph","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210371275800_B7","first-page":"36","article-title":"Learning multiple evolutionary pathways from cross-sectional data. In","author":"Beerenwinkel","year":"2004"},{"key":"2023020210371275800_B8","doi-asserted-by":"crossref","first-page":"1464","DOI":"10.1093\/bioinformatics\/bth088","article-title":"GOstat: find statistically overrepresented gene ontologies within a group of genes","volume":"20","author":"Beissbarth","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020210371275800_B9","doi-asserted-by":"crossref","first-page":"1165","DOI":"10.1214\/aos\/1013699998","article-title":"The control of the false discovery rate in multiple testing under dependency","volume":"29","author":"Benjamini","year":"2001","journal-title":"Annal. Stat."},{"key":"2023020210371275800_B10","doi-asserted-by":"crossref","first-page":"4164","DOI":"10.1073\/pnas.0308531101","article-title":"Metagenes and molecular pattern discovery using matrix factorization","volume":"101","author":"Brunet","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210371275800_B11","doi-asserted-by":"crossref","DOI":"10.1201\/9781420057669","volume-title":"Bayes and Empirical Bayes Methods for Data Analysis","author":"Carlin","year":"2000"},{"key":"2023020210371275800_B12","doi-asserted-by":"crossref","first-page":"781","DOI":"10.1016\/0031-3203(94)00125-6","article-title":"Gaussian parsimonious clustering models","volume":"28","author":"Celeux","journal-title":"Pattern Recognition"},{"key":"2023020210371275800_B13","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1093\/biomet\/asm007","article-title":"Estimation of a covariance matrix with zeros","volume":"94","author":"Chaudhuri","year":"2007","journal-title":"Biometrika"},{"key":"2023020210371275800_B14","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1109\/TIT.1968.1054142","article-title":"Approximating discrete probability distributions with dependence trees","volume":"14","author":"Chow","year":"1968","journal-title":"IEEE Trans. Info. Theory"},{"key":"2023020210371275800_B15","volume-title":"Introduction to Algorithms","author":"Cormen","year":"2001","edition":"2n edn"},{"key":"2023020210371275800_B16","doi-asserted-by":"crossref","first-page":"S3","DOI":"10.1186\/1471-2105-8-S10-S3","article-title":"Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data","volume":"8","author":"Costa","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023020210371275800_B17","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1186\/1471-2172-8-25","article-title":"Gene expression tress in blood cell development","volume":"8","author":"Costa","year":"2007","journal-title":"BMC Immunol"},{"key":"2023020210371275800_B18","doi-asserted-by":"crossref","DOI":"10.1002\/0471200611","volume-title":"Elements of Information Theory","author":"Cover","year":"1991"},{"key":"2023020210371275800_B19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","volume":"39","author":"Dempster","year":"1977","journal-title":"JRSSB"},{"key":"2023020210371275800_B20","doi-asserted-by":"crossref","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","article-title":"Cluster analysis and display of genome-wide expression patterns","volume":"95","author":"Eisen","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210371275800_B21","doi-asserted-by":"crossref","first-page":"i159","DOI":"10.1093\/bioinformatics\/bti1022","article-title":"Clustering short time series gene expression data","volume":"21","author":"Ernst","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020210371275800_B22","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1186\/1471-2164-8-264","article-title":"Genomic expression during human myelopoiesis","volume":"8","author":"Ferrari","year":"2007","journal-title":"BMC Genomics"},{"key":"2023020210371275800_B23","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1007\/s00357-007-0004-5","article-title":"Bayesian regularization for normal mixture estimation and model-based clustering","volume":"24","author":"Fraley","year":"2007","journal-title":"J. Classif"},{"key":"2023020210371275800_B24","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1126\/science.1094068","article-title":"Inferring cellular networks using probabilistic graphical models","volume":"303","author":"Friedman","year":"2004","journal-title":"Science"},{"key":"2023020210371275800_B25","unstructured":"Graf T \u00a0TrumppA Haematopoietic stem cells, niches and differentiation pathways Poster. Nat. Rev. Immunol 2007 (last accessed date January 1 2008) URL http:\/\/www.nature.com\/nri\/posters\/hsc\/index.html"},{"key":"2023020210371275800_B26","first-page":"63","article-title":"Comparing partitions","volume":"2","author":"Hubbert","year":"1985","journal-title":"J. Classif"},{"key":"2023020210371275800_B27","doi-asserted-by":"crossref","first-page":"S96","DOI":"10.1093\/bioinformatics\/18.suppl_1.S96","article-title":"Variance stabilization applied to microarray data calibration and to the quantification of differential expression","volume":"18","author":"Huber","year":"2002","journal-title":"Bioinformatics"},{"key":"2023020210371275800_B28","doi-asserted-by":"crossref","first-page":"686","DOI":"10.1038\/ni0706-686","article-title":"Gene expression microarrays: glimpses of the immunological genome","volume":"7","author":"Hyatt","year":"2006","journal-title":"Nat. Immunol"},{"issue":"Database issue","key":"2023020210371275800_B29","doi-asserted-by":"crossref","first-page":"D354","DOI":"10.1093\/nar\/gkj102","article-title":"From genomics to chemical genomics: new developments in KEGG","volume":"34","author":"Kanehisa","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023020210371275800_B30","doi-asserted-by":"crossref","DOI":"10.1002\/9780470316801","volume-title":"Finding Groups in Data: an Introduction to Cluster Analysis","author":"Kaufman","year":"1990"},{"key":"2023020210371275800_B31","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198522195.001.0001","volume-title":"Graphical Models","author":"Lauritzen","year":"1996"},{"key":"2023020210371275800_B32","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1111\/j.2517-6161.1988.tb01721.x","article-title":"Local computations with probabilities on graphical structures and their application to expert systems","volume":"50","author":"Lauritzen","year":"1998","journal-title":"J. Royal Statis. Soc. B"},{"key":"2023020210371275800_B33","doi-asserted-by":"crossref","first-page":"497","DOI":"10.1038\/nri1633","article-title":"Transcriptional networks in developing and mature B cells","volume":"5","author":"Matthias","year":"2005","journal-title":"Nat. Rev. Immunol"},{"key":"2023020210371275800_B34","doi-asserted-by":"crossref","DOI":"10.1002\/0471721182","volume-title":"Finite Mixture Models","author":"McLachlan","year":"2000"},{"key":"2023020210371275800_B35","first-page":"281","article-title":"Some methods of classification and analysis of multivariate observations. In","volume-title":"5th Berkeley Symposium in Mathematics, Statistics and Probability","author":"McQueen","year":"1967"},{"key":"2023020210371275800_B36","first-page":"1","article-title":"Learning with mixtures of trees","volume":"1","author":"Meila","year":"2001","journal-title":"J. Mach. Learn. Res"},{"key":"2023020210371275800_B37","first-page":"849","article-title":"On spectral clustering: analysis and an algorithm. In","volume-title":"Advances in Neural Information Processing Systems 13","author":"Ng","year":"2001"},{"key":"2023020210371275800_B38","first-page":"830","article-title":"Thymocyte stimulation by anti-TCR-beta, but not by anti-TCR-alpha, leads to induction of developmental transcription program","volume-title":"J. Leukoc. Biol","author":"Niederberger","year":"2005"},{"key":"2023020210371275800_B39","doi-asserted-by":"crossref","first-page":"8102","DOI":"10.1073\/pnas.0402065101","article-title":"Natural killer cells distinguish innocuous and destructive forms of pancreatic islet autoimmunity","volume":"101","author":"Poirot","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210371275800_B40","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1007\/s00357-006-0018-y","article-title":"Generation of random clusters with specified degree of separation","volume":"23","author":"Qiu","year":"2006","journal-title":"J. Classif."},{"key":"2023020210371275800_B41","doi-asserted-by":"crossref","first-page":"601","DOI":"10.1146\/annurev.immunol.23.021704.115737","article-title":"Molecular genetics of T cell development","volume":"23","author":"Rothenberg","year":"2005","journal-title":"Annu. Rev. Immunol"},{"key":"2023020210371275800_B42","article-title":"A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics","volume":"4","author":"Schaefer","year":"2005","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2023020210371275800_B43","article-title":"Semi-supervised clustering of yeast gene expression","volume-title":"Japanese-German Workshop on Data Analysis and Classification","author":"Sch\u00f6nhuth","year":"2006"},{"key":"2023020210371275800_B44","first-page":"126","article-title":"Distributions of tree comparison metrics-some new results","volume":"42","author":"Steel","year":"1993","journal-title":"Syst. Biol"},{"key":"2023020210371275800_B45","doi-asserted-by":"crossref","first-page":"S136","DOI":"10.1093\/bioinformatics\/18.suppl_1.S136","article-title":"Discovering statistically significant biclusters in gene expression data","volume":"18","author":"Tanay","year":"2002","journal-title":"Bioinformatics"},{"key":"2023020210371275800_B46","first-page":"504","article-title":"Learning mixtures of dag models. In","author":"Thiesson","year":"1998"},{"key":"2023020210371275800_B47","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-12-research0088","article-title":"Systematic determination of patterns of gene expression during Drosophila embryogenesis","volume":"3","author":"Tomancak","year":"2002","journal-title":"Genome. Biol."},{"key":"2023020210371275800_B48","doi-asserted-by":"crossref","first-page":"e82","DOI":"10.1371\/journal.pbio.0030082","article-title":"Basal immunoglobulin signaling actively maintains developmental stage in immature B cells","volume":"3","author":"Tze","year":"2005","journal-title":"PLoS Biol"},{"key":"2023020210371275800_B49","article-title":"Som toolbox for matlab","volume-title":"Technical report","author":"Vesanto","year":"2000"},{"key":"2023020210371275800_B50","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1111\/j.0105-2896.2006.00371.x","article-title":"A shared gene-expression signature in innate-like lymphocytes","volume":"210","author":"Yamagata","year":"2006","journal-title":"Immunol. Rev"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i156\/49052933\/bioinformatics_24_13_i156.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i156\/49052933\/bioinformatics_24_13_i156.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,30]],"date-time":"2025-01-30T21:40:28Z","timestamp":1738273228000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/13\/i156\/229786"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,7,1]]},"references-count":50,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2008,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn153","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2008,7,1]]},"published":{"date-parts":[[2008,7,1]]}}}