{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T03:31:10Z","timestamp":1761708670918,"version":"3.38.0"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T00:00:00Z","timestamp":1674604800000},"content-version":"vor","delay-in-days":4243,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.5"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: High-throughput measurement techniques for metabolism and gene expression provide a wealth of information for the identification of metabolic network models. Yet, missing observations scattered over the dataset restrict the number of effectively available datapoints and make classical regression techniques inaccurate or inapplicable. Thorough exploitation of the data by identification techniques that explicitly cope with missing observations is therefore of major importance.<\/jats:p><jats:p>Results: We develop a maximum-likelihood approach for the estimation of unknown parameters of metabolic network models that relies on the integration of statistical priors to compensate for the missing data. In the context of the linlog metabolic modeling framework, we implement the identification method by an Expectation-Maximization (EM) algorithm and by a simpler direct numerical optimization method. We evaluate performance of our methods by comparison to existing approaches, and show that our EM method provides the best results over a variety of simulated scenarios. We then apply the EM algorithm to a real problem, the identification of a model for the Escherichia coli central carbon metabolism, based on challenging experimental data from the literature. This leads to promising results and allows us to highlight critical identification issues.<\/jats:p><jats:p>Contact: \u00a0sara.berthoumieux@inria.fr; eugenio.cinquemani@inria.fr<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr225","type":"journal-article","created":{"date-parts":[[2011,6,17]],"date-time":"2011-06-17T23:32:32Z","timestamp":1308353552000},"page":"i186-i195","source":"Crossref","is-referenced-by-count":21,"title":["Identification of metabolic network models from incomplete high-throughput datasets"],"prefix":"10.1093","volume":"27","author":[{"given":"Sara","family":"Berthoumieux","sequence":"first","affiliation":[{"name":"1 INRIA Grenoble - Rh\u00f4ne-Alpes, Montbonnot and 2Laboratoire de Biom\u00e9trie et Biologie Evolutive, CNRS UMR 5558, Universit\u00e9 Lyon 1, INRA, Villeurbanne, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Matteo","family":"Brilli","sequence":"additional","affiliation":[{"name":"1 INRIA Grenoble - Rh\u00f4ne-Alpes, Montbonnot and 2Laboratoire de Biom\u00e9trie et Biologie Evolutive, CNRS UMR 5558, Universit\u00e9 Lyon 1, INRA, Villeurbanne, France"},{"name":"1 INRIA Grenoble - Rh\u00f4ne-Alpes, Montbonnot and 2Laboratoire de Biom\u00e9trie et Biologie Evolutive, CNRS UMR 5558, Universit\u00e9 Lyon 1, INRA, Villeurbanne, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hidde","family":"de Jong","sequence":"additional","affiliation":[{"name":"1 INRIA Grenoble - Rh\u00f4ne-Alpes, Montbonnot and 2Laboratoire de Biom\u00e9trie et Biologie Evolutive, CNRS UMR 5558, Universit\u00e9 Lyon 1, INRA, Villeurbanne, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Kahn","sequence":"additional","affiliation":[{"name":"1 INRIA Grenoble - Rh\u00f4ne-Alpes, Montbonnot and 2Laboratoire de Biom\u00e9trie et Biologie Evolutive, CNRS UMR 5558, Universit\u00e9 Lyon 1, INRA, Villeurbanne, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eugenio","family":"Cinquemani","sequence":"additional","affiliation":[{"name":"1 INRIA Grenoble - Rh\u00f4ne-Alpes, Montbonnot and 2Laboratoire de Biom\u00e9trie et Biologie Evolutive, CNRS UMR 5558, Universit\u00e9 Lyon 1, INRA, Villeurbanne, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2011,6,14]]},"reference":[{"key":"2023012512185198500_B1","doi-asserted-by":"crossref","first-page":"886","DOI":"10.1111\/j.1742-4658.2008.06844.x","article-title":"Systems biology: Parameter estimation for biochemical models","volume":"276","author":"Ashyraliyev","year":"2009","journal-title":"FEBS J."},{"key":"2023012512185198500_B2","doi-asserted-by":"crossref","first-page":"2578","DOI":"10.1074\/jbc.M508090200","article-title":"A quantitative approach to catabolite repression in Escherichia coli","volume":"281","author":"Bettenbrock","year":"2005","journal-title":"J. Biol. Chem."},{"key":"2023012512185198500_B3","first-page":"707","article-title":"Incremental singular value decomposition of uncertain data with missing values","volume-title":"Proceedings of the 7th European Conference Computer Vision (ECCV 2002)","author":"Brand","year":"2002"},{"key":"2023012512185198500_B4","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1016\/j.biosystems.2010.03.001","article-title":"Hybrid dynamic modeling of Escherichia coli central metabolic network combining Michaelis-Menten and approximate kinetic equations","volume":"100","author":"Costa","year":"2010","journal-title":"Biosystems"},{"key":"2023012512185198500_B5","volume-title":"Elements of Information Theory","author":"Cover","year":"2006","edition":"2"},{"key":"2023012512185198500_B6","first-page":"81","article-title":"System identification challenges from systems biology","volume-title":"Proceedings of the 14th IFAC Symposium System Identification (SYSID 2006)","author":"Crampin","year":"2006"},{"key":"2023012512185198500_B7","doi-asserted-by":"crossref","first-page":"136","DOI":"10.1049\/iet-syb:20070030","article-title":"Challenges in lin-log modelling of glycolysis in Lactococcus lactis","volume":"2","author":"del Rosario","year":"2008","journal-title":"IET Syst. Biol."},{"key":"2023012512185198500_B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","volume":"39","author":"Dempster","year":"1977","journal-title":"J. Roy. Stat. Soc. Ser. B"},{"key":"2023012512185198500_B9","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1146\/annurev.psych.58.110405.085530","article-title":"Missing data analysis: Making it work in the real world","volume":"60","author":"Graham","year":"2009","journal-title":"Annu. Rev. Psychol."},{"key":"2023012512185198500_B10","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1016\/j.ymben.2008.10.002","article-title":"Translating biochemical network models between different kinetic formats","volume":"11","author":"Hadlich","year":"2009","journal-title":"Metab. Eng."},{"key":"2023012512185198500_B11","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1016\/j.jbiotec.2007.08.029","article-title":"Topology of the global regulatory network of carbon limitation in Escherichia coli","volume":"132","author":"Hardiman","year":"2007","journal-title":"J. Biotechnol."},{"key":"2023012512185198500_B12","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1002\/(SICI)1097-0290(19970420)54:2<91::AID-BIT1>3.0.CO;2-Q","article-title":"Effects of spatiotemporal variations on metabolic control: Approximate analysis using (log)linear kinetic models","volume":"54","author":"Hatzimanikatis","year":"1997","journal-title":"Biotechnol. Bioeng."},{"key":"2023012512185198500_B13","doi-asserted-by":"crossref","first-page":"534","DOI":"10.1002\/bit.20558","article-title":"Approximative kinetic formats used in metabolic network modeling","volume":"91","author":"Heijnen","year":"2005","journal-title":"Biotechnol. Bioeng."},{"key":"2023012512185198500_B14","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4613-1161-4","volume-title":"The Regulation of Cellular Systems","author":"Heinrich","year":"1996"},{"key":"2023012512185198500_B15","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1198\/000313007X172556","article-title":"Much ado about nothing: a comparison of missing data methods and software to fit incomplete data regression models","volume":"61","author":"Horton","year":"2007","journal-title":"Am. Stat."},{"key":"2023012512185198500_B16","doi-asserted-by":"crossref","first-page":"593","DOI":"10.1126\/science.1132067","article-title":"Multiple high-throughput analyses monitor the response of E. coli to perturbations","volume":"316","author":"Ishii","year":"2007","journal-title":"Science"},{"key":"2023012512185198500_B17","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-1904-8","volume-title":"Principal Component Analysis","author":"Jolliffe","year":"1986"},{"key":"2023012512185198500_B18","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1038\/msb.2010.10","article-title":"Bacterial adaptation through distributed sensing of metabolic fluxes","volume":"6","author":"Kotte","year":"2010","journal-title":"Mol. Syst. Biol."},{"key":"2023012512185198500_B19","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1186\/1742-4682-3-41","article-title":"Bringing metabolic networks to life: Convenience rate law and thermodynamic constraints","volume":"3","author":"Liebermeister","year":"2006","journal-title":"Theor. Biol. Med. Model."},{"key":"2023012512185198500_B20","doi-asserted-by":"crossref","DOI":"10.1002\/9781119013563","volume-title":"Statistical Analysis with Missing Data","author":"Little","year":"2002"},{"volume-title":"Randomization, Bootstrap and Monte-Carlo Methods in Biology","year":"1997","author":"Manly","key":"2023012512185198500_B21"},{"key":"2023012512185198500_B22","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1007\/s00285-010-0350-z","article-title":"Derivation, identification and validation of a computational model of a novel synthetic regulatory network in yeast","volume":"62","author":"Marucci","year":"2011","journal-title":"J. Math. Biol."},{"key":"2023012512185198500_B23","doi-asserted-by":"crossref","first-page":"540","DOI":"10.1186\/1471-2105-7-540","article-title":"A method for estimation of elasticities in metabolic networks using steady state and dynamic metabolomics data and linlog kinetics","volume":"7","author":"Nikerel","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012512185198500_B24","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.ymben.2008.07.004","article-title":"Model reduction and a priori kinetic parameter identifiability analysis using metabolome time series for metabolic reaction networks with linlog kinetics","volume":"11","author":"Nikerel","year":"2009","journal-title":"Metab. Eng."},{"key":"2023012512185198500_B25","doi-asserted-by":"crossref","first-page":"2088","DOI":"10.1093\/bioinformatics\/btg287","article-title":"A Bayesian missing value estimation method for gene expression profile data","volume":"19","author":"Oba","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012512185198500_B26","doi-asserted-by":"crossref","first-page":"1923","DOI":"10.1093\/bioinformatics\/btp358","article-title":"Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood","volume":"25","author":"Raue","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512185198500_B27","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1093\/biomet\/63.3.581","article-title":"Inference and missing data","volume":"63","author":"Rubin","year":"1976","journal-title":"Biometrika"},{"key":"2023012512185198500_B28","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1080\/01621459.1996.10476908","article-title":"Multiple imputation after 18+ years","volume":"81","author":"Rubin","year":"1996","journal-title":"J. Am. Stat. A."},{"volume-title":"Biochemical Systems Analysis: A Study of Function and Design in Molecular Biology","year":"1976","author":"Savageau","key":"2023012512185198500_B29"},{"key":"2023012512185198500_B30","doi-asserted-by":"crossref","first-page":"3887","DOI":"10.1093\/bioinformatics\/bti634","article-title":"Non-linear PCA: A missing data approach","volume":"21","author":"Scholz","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012512185198500_B31","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1186\/1752-0509-4-6","article-title":"Towards a genome-scale kinetic model of cellular metabolism","volume":"4","author":"Smallbone","year":"2010","journal-title":"BMC Syst. Biol."},{"key":"2023012512185198500_B32","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1007\/978-3-662-03295-4_7","article-title":"System identification with information theoretic criteria","volume-title":"Identification, Adaptation, Learning","author":"Stoorvogel","year":"1996"},{"key":"2023012512185198500_B33","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1016\/S1096-7176(03)00025-9","article-title":"Dynamic simulation and metabolic re-design of a branched pathway using linlog kinetics","volume":"5","author":"Visser","year":"2003","journal-title":"Metab. Eng."},{"key":"2023012512185198500_B34","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1016\/j.ymben.2004.07.001","article-title":"Optimal re-design of primary metabolism in Escherichia coli using linlog kinetics","volume":"6","author":"Visser","year":"2004","journal-title":"Metab. Eng."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/13\/i186\/48870868\/bioinformatics_27_13_i186.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/13\/i186\/48870868\/bioinformatics_27_13_i186.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,6]],"date-time":"2025-03-06T08:37:47Z","timestamp":1741250267000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/13\/i186\/180135"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,6,14]]},"references-count":34,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2011,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr225","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2011,7,1]]},"published":{"date-parts":[[2011,6,14]]}}}