{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T22:46:55Z","timestamp":1759963615123,"version":"3.33.0"},"reference-count":21,"publisher":"Oxford University Press (OUP)","issue":"8","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: A fundamental task in systems biology is the identification of groups of genes that are involved in the cellular response to particular signals. At its simplest level, this often reduces to identifying biological quantities (mRNA abundance, enzyme concentrations, etc.) which are differentially expressed in two different conditions. Popular approaches involve using t-test statistics, based on modelling the data as arising from a mixture distribution. A common assumption of these approaches is that the data are independent and identically distributed; however, biological quantities are usually related through a complex (weighted) network of interactions, and often the more pertinent question is which subnetworks are differentially expressed, rather than which genes. Furthermore, in many interesting cases (such as high-throughput proteomics and metabolomics), only very partial observations are available, resulting in the need for efficient imputation techniques.<\/jats:p><jats:p>Results: We introduce Mixture Model on Graphs (MMG), a novel probabilistic model to identify differentially expressed submodules of biological networks and pathways. The method can easily incorporate information about weights in the network, is robust against missing data and can be easily generalized to directed networks. We propose an efficient sampling strategy to infer posterior probabilities of differential expression, as well as posterior probabilities over the model parameters. 
We assess our method on artificial data demonstrating significant improvements over standard mixture model clustering. Analysis of our model results on quantitative high-throughput proteomic data leads to the identification of biologically significant subnetworks, as well as the prediction of the expression level of a number of enzymes, some of which are then verified experimentally.<\/jats:p><jats:p>Availability: MATLAB code is available from http:\/\/www.dcs.shef.ac.uk\/~guido\/software.html<\/jats:p><jats:p>Contact: \u00a0guido@dcs.shef.ac.uk<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn066","type":"journal-article","created":{"date-parts":[[2008,2,22]],"date-time":"2008-02-22T01:52:37Z","timestamp":1203645157000},"page":"1078-1084","source":"Crossref","is-referenced-by-count":34,"title":["MMG: a probabilistic tool to identify submodules of metabolic pathways"],"prefix":"10.1093","volume":"24","author":[{"given":"Guido","family":"Sanguinetti","sequence":"first","affiliation":[{"name":"1 Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Road, Sheffield, S1 4DP, UK and 2Biological and Environmental Systems Group, Department of Chemical and Process Engineering, University of Sheffield, Mappin Street, Sheffield, S1 3JD, UK"}]},{"given":"Josselin","family":"Noirel","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Road, Sheffield, S1 4DP, UK and 2Biological and Environmental Systems Group, Department of Chemical and Process Engineering, University of Sheffield, Mappin Street, Sheffield, S1 3JD, UK"}]},{"given":"Phillip C.","family":"Wright","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Road, Sheffield, S1 4DP, UK and 2Biological and Environmental Systems 
Group, Department of Chemical and Process Engineering, University of Sheffield, Mappin Street, Sheffield, S1 3JD, UK"}]}],"member":"286","published-online":{"date-parts":[[2008,2,21]]},"reference":[{"key":"2023020210011200400_B1","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1126\/science.286.5439.509","article-title":"Emergence of scaling in random networks","volume":"286","author":"Barabasi","year":"1999","journal-title":"Science"},{"key":"2023020210011200400_B2","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1016\/j.jmb.2005.09.079","article-title":"Inferring meaningful pathways in weighted metabolic networks","volume":"356","author":"Croes","year":"2006","journal-title":"J. Mol. Biol."},{"key":"2023020210011200400_B3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","volume":"39","author":"Dempster","year":"1977","journal-title":"J. R. Stat. Soc. B"},{"key":"2023020210011200400_B4","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1198\/016214501753382129","article-title":"Empirical Bayes analysis of a microarray experiment","volume":"96","author":"Efron","year":"2001","journal-title":"J. Am. Stat. 
Assoc."},{"volume-title":"Bayesian Data Analysis.","year":"2004","author":"Gelman","key":"2023020210011200400_B5"},{"key":"2023020210011200400_B6","doi-asserted-by":"crossref","first-page":"1663","DOI":"10.1093\/bioinformatics\/bth139","article-title":"Mixture models for assessing differential expression in complex tissues using microarray data","volume":"20","author":"Ghosh","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020210011200400_B7","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1038\/35036627","article-title":"The large-scale organization of metabolic networks","volume":"407","author":"Jeong","year":"2000","journal-title":"Nature"},{"key":"2023020210011200400_B8","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: Kyoto Encyclopedia of Genes and Genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023020210011200400_B9","doi-asserted-by":"crossref","first-page":"D546","DOI":"10.1093\/nar\/gkj107","article-title":"TRANSPATH: an information resource for storing and visualizing signalling pathways and their pathological aberrations","volume":"34","author":"Krull","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023020210011200400_B10","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1089\/106652701300099074","article-title":"On differential variability of expression ratios: Improving statistical inference about gene expression changes from microarray data","volume":"8","author":"Newton","year":"2001","journal-title":"J. Comput. Biol."},{"key":"2023020210011200400_B11","doi-asserted-by":"crossref","DOI":"10.1093\/bfgp\/eln011","article-title":"Automated extraction of meaningful pathways from quantitative proteomics data","author":"Noirel","year":"2008","journal-title":"Brief. Funct. 
Genomics Proteomics"},{"key":"2023020210011200400_B12","doi-asserted-by":"crossref","DOI":"10.1021\/pr700604v","article-title":"Quantitative shotgun proteomics of enriched heterocysts from Nostoc sp. PCC 7120 using 8-plex isobaric peptide tags","author":"Ow","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023020210011200400_B13","article-title":"Classification of microarray data using gene networks","volume":"35","author":"Rapaport","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023020210011200400_B14","doi-asserted-by":"crossref","first-page":"1154","DOI":"10.1074\/mcp.M400129-MCP200","article-title":"Multiplexed protein quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents","volume":"3","author":"Ross","year":"2004","journal-title":"Mol. Cell. Proteomics"},{"key":"2023020210011200400_B15","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1007\/s00253-006-0528-x","article-title":"Perspectives and advances of biological H2 production in microorganisms","volume":"72","author":"Rupprecht","year":"2006","journal-title":"Appl. Microbiol. Biotechnol."},{"key":"2023020210011200400_B16","doi-asserted-by":"crossref","first-page":"3748","DOI":"10.1093\/bioinformatics\/bti617","article-title":"Accounting for probe-level noise in principal component analysis of microarray data","volume":"21","author":"Sanguinetti","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020210011200400_B17","doi-asserted-by":"crossref","DOI":"10.1007\/11885191_11","article-title":"Identifying submodules of cellular regulatory networks","author":"Sanguinetti","year":"2006","journal-title":"In Proceedings of Computational Methods in Systems Biology"},{"key":"2023020210011200400_B18","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1021\/pr060517v","article-title":"An iTRAQ-based quantitative analysis to elaborate the proteomic response of Nostoc sp. 
pcc7120 under N2 fixing conditions","volume":"621","author":"Stensj\u00f6","year":"2007","journal-title":"J. Proteome Res."},{"key":"2023020210011200400_B19","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1073\/pnas.091062498","article-title":"Significance analysis of microarrays applied to ionizing radiation response","volume":"98","author":"Tusher","year":"2001","journal-title":"Proc. Natl Acad. Sci"},{"key":"2023020210011200400_B20","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1093\/bioinformatics\/btm612","article-title":"Incorporating gene networks into statistical tests for genomic data via a spatially correlated mixture model","volume":"24","author":"Wei","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020210011200400_B21","doi-asserted-by":"crossref","first-page":"1537","DOI":"10.1093\/bioinformatics\/btm129","article-title":"A Markov random field model for network-based analysis of genomic data","volume":"23","author":"Wei","year":"2007","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/8\/1078\/49046538\/bioinformatics_24_8_1078.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/8\/1078\/49046538\/bioinformatics_24_8_1078.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,28]],"date-time":"2025-01-28T18:01:49Z","timestamp":1738087309000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/8\/1078\/212710"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,2,21]]},"references-count":21,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2008,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn066","relation
":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2008,4,15]]},"published":{"date-parts":[[2008,2,21]]}}}