{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,23]],"date-time":"2026-06-23T04:13:07Z","timestamp":1782187987526,"version":"3.54.5"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1009105","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2021,9,17]],"date-time":"2021-09-17T00:00:00Z","timestamp":1631836800000}}],"reference-count":49,"publisher":"Public Library of Science (PLoS)","issue":"9","license":[{"start":{"date-parts":[[2021,9,7]],"date-time":"2021-09-07T00:00:00Z","timestamp":1630972800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100004440","name":"Wellcome Trust","doi-asserted-by":"publisher","award":["222837\/Z\/21\/Z"],"award-info":[{"award-number":["222837\/Z\/21\/Z"]}],"id":[{"id":"10.13039\/100004440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000265","name":"Medical Research Council","doi-asserted-by":"publisher","award":["MR\/R008922\/1"],"award-info":[{"award-number":["MR\/R008922\/1"]}],"id":[{"id":"10.13039\/501100000265","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Minist\u00e8re de l'Enseignement sup\u00e9rieur, de la Recherche et de l'Innovation"},{"DOI":"10.13039\/501100001665","name":"Agence Nationale de la Recherche","doi-asserted-by":"publisher","award":["ANR-INBS-0010"],"award-info":[{"award-number":["ANR-INBS-0010"]}],"id":[{"id":"10.13039\/501100001665","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001665","name":"agence nationale de la recherche","doi-asserted-by":"publisher","award":["ANR-19-CE45-0021"],"award-info":[{"award-number":["ANR-19-CE45-0021"]}],"id":[{"id":"10.13039\/501100001665","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001665","name":"agence nationale de la recherche","doi-asserted-by":"publisher","award":["DFG: 431572533"],"award-info":[{"award-number":["DFG: 431572533"]}],"id":[{"id":"10.13039\/501100001665","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000268","name":"Biotechnology and Biological Sciences Research Council","doi-asserted-by":"publisher","award":["BB\/T007974\/1"],"award-info":[{"award-number":["BB\/T007974\/1"]}],"id":[{"id":"10.13039\/501100000268","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01 HL133932-01"],"award-info":[{"award-number":["R01 HL133932-01"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100013342","name":"NIHR Imperial Biomedical Research Centre","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100013342","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Over-representation analysis (ORA) is one of the commonest pathway analysis approaches used for the functional interpretation of metabolomics datasets. Despite the widespread use of ORA in metabolomics, the community lacks guidelines detailing its best-practice use. Many factors have a pronounced impact on the results, but to date their effects have received little systematic attention. Using five publicly available datasets, we demonstrated that changes in parameters such as the background set, differential metabolite selection methods, and pathway database used can result in profoundly different ORA results. The use of a non-assay-specific background set, for example, resulted in large numbers of false-positive pathways. Pathway database choice, evaluated using three of the most popular metabolic pathway databases (KEGG, Reactome, and BioCyc), led to vastly different results in both the number and function of significantly enriched pathways. Factors that are specific to metabolomics data, such as the reliability of compound identification and the chemical bias of different analytical platforms also impacted ORA results. Simulated metabolite misidentification rates as low as 4% resulted in both gain of false-positive pathways and loss of truly significant pathways across all datasets. Our results have several practical implications for ORA users, as well as those using alternative pathway analysis methods. We offer a set of recommendations for the use of ORA in metabolomics, alongside a set of minimal reporting guidelines, as a first step towards the standardisation of pathway analysis in metabolomics.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1009105","type":"journal-article","created":{"date-parts":[[2021,9,7]],"date-time":"2021-09-07T14:03:42Z","timestamp":1631023422000},"page":"e1009105","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":155,"title":["Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis"],"prefix":"10.1371","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1548-4346","authenticated-orcid":true,"given":"Cecilia","family":"Wieder","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4313-2786","authenticated-orcid":true,"given":"Cl\u00e9ment","family":"Frainay","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3393-1405","authenticated-orcid":true,"given":"Nathalie","family":"Poupin","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4938-4418","authenticated-orcid":true,"given":"Pablo","family":"Rodr\u00edguez-Mier","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8143-8183","authenticated-orcid":true,"given":"Florence","family":"Vinson","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2085-0339","authenticated-orcid":true,"given":"Juliette","family":"Cooke","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3418-850X","authenticated-orcid":true,"given":"Rachel PJ","family":"Lai","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1164-8465","authenticated-orcid":true,"given":"Jacob G.","family":"Bundy","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9401-2894","authenticated-orcid":true,"given":"Fabien","family":"Jourdan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Timothy","family":"Ebbels","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"340","published-online":{"date-parts":[[2021,9,7]]},"reference":[{"key":"pcbi.1009105.ref001","article-title":"Identifying significantly impacted pathways: A comprehensive review and assessment","volume":"20","author":"TM Nguyen","year":"2019","journal-title":"Genome Biol"},{"key":"pcbi.1009105.ref002","first-page":"e1002375","volume-title":"PLoS Computational Biology","author":"P Khatri","year":"2012"},{"key":"pcbi.1009105.ref003","first-page":"387","volume-title":"Methods in Molecular Biology","author":"A Karnovsky","year":"2020"},{"key":"pcbi.1009105.ref004","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-017-2006-0","article-title":"Evaluation and comparison of bioinformatic tools for the enrichment analysis of metabolomics data","volume":"19","author":"A Marco-Ramell","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"pcbi.1009105.ref005","volume-title":"Frontiers in Physiology","author":"MA Garc\u00eda-Campos","year":"2015"},{"key":"pcbi.1009105.ref006","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1038\/10343","article-title":"Systematic determination of genetic network architecture","volume":"22","author":"S Tavazoie","year":"1999","journal-title":"Nat Genet"},{"key":"pcbi.1009105.ref007","first-page":"98","article-title":"Global functional profiling of gene expression","volume":"81","author":"S Dr\u01ceghici","year":"2003","journal-title":"Genomics"},{"key":"pcbi.1009105.ref008","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1186\/s12859-021-04124-5","article-title":"Popularity and performance of bioinformatics software: the case of gene set analysis","volume":"22","author":"C Xie","year":"2021","journal-title":"BMC Bioinformatics"},{"key":"pcbi.1009105.ref009","doi-asserted-by":"crossref","first-page":"6678","DOI":"10.1038\/s41598-018-24978-9","article-title":"Relationships between digestive efficiency and metabolomic profiles of serum and intestinal contents in chickens","volume":"8","author":"S Beauclercq","year":"2018","journal-title":"Sci Rep"},{"key":"pcbi.1009105.ref010","first-page":"1","article-title":"Metabolomics and pathway analyses to characterize metabolic alterations in pregnant dairy cows on D 17 and D 45 after AI","volume":"8","author":"YS Guo","year":"2018","journal-title":"Sci Rep."},{"key":"pcbi.1009105.ref011","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-019-13498-3","article-title":"Metabolomics analysis of human acute graft-versus-host disease reveals changes in host and microbiota-derived metabolites","volume":"10","author":"D Michonneau","year":"2019","journal-title":"Nat Commun"},{"key":"pcbi.1009105.ref012","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1002\/iid3.61","article-title":"The metabolomics of asthma control: A promising link between genetics and disease","volume":"3","author":"MJ McGeachie","year":"2015","journal-title":"Immun Inflamm Dis"},{"key":"pcbi.1009105.ref013","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1016\/j.meegid.2019.01.003","article-title":"1H nuclear magnetic resonance-based metabolic profiling of cerebrospinal fluid to identify metabolic features and markers for tuberculosis meningitis","volume":"68","author":"P Zhang","year":"2019","journal-title":"Infect Genet Evol"},{"key":"pcbi.1009105.ref014","first-page":"37","volume-title":"Metabolomics","author":"A Rosato","year":"2018"},{"key":"pcbi.1009105.ref015","first-page":"27","volume-title":"Nucleic Acids Research","author":"M Kanehisa","year":"2000"},{"key":"pcbi.1009105.ref016","first-page":"D498","article-title":"The reactome pathway knowledgebase","volume":"48","author":"B Jassal","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009105.ref017","doi-asserted-by":"crossref","first-page":"1085","DOI":"10.1093\/bib\/bbx085","article-title":"The BioCyc collection of microbial genomes and metabolic pathways","volume":"20","author":"PD Karp","year":"2018","journal-title":"Brief Bioinform"},{"key":"pcbi.1009105.ref018","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1093\/bioinformatics\/btt703","article-title":"Causal analysis approaches in ingenuity pathway analysis","volume":"30","author":"A Kr\u00e4mer","year":"2014","journal-title":"Bioinformatics"},{"key":"pcbi.1009105.ref019","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41540-019-0082-7","article-title":"ComPath: an ecosystem for exploring, analyzing, and curating mappings across pathway databases","volume":"5","author":"D Domingo-Fern\u00e1ndez","year":"2019","journal-title":"npj Syst Biol Appl"},{"key":"pcbi.1009105.ref020","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11306-007-0082-2","article-title":"Proposed minimum reporting standards for chemical analysis: Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI)","volume":"3","author":"LW Sumner","year":"2007","journal-title":"Metabolomics"},{"key":"pcbi.1009105.ref021","first-page":"277","volume-title":"Annual Review of Biochemistry","author":"W Lu","year":"2017"},{"key":"pcbi.1009105.ref022","doi-asserted-by":"crossref","first-page":"W510","DOI":"10.1093\/nar\/gky299","article-title":"IPath3.0: Interactive pathways explorer v3","volume":"46","author":"Y Darzi","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009105.ref023","first-page":"1","article-title":"InteractiVenn: a web-based tool for the analysis of sets through Venn diagrams","volume":"16","author":"H Heberle","year":"2015","journal-title":"BMC Bioinforma 2015 161"},{"key":"pcbi.1009105.ref024","doi-asserted-by":"crossref","first-page":"891","DOI":"10.1093\/bib\/bbv090","article-title":"Transcriptomic and metabolomic data integration","volume":"17","author":"R Cavill","year":"2016","journal-title":"Brief Bioinform"},{"key":"pcbi.1009105.ref025","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1007\/978-1-4939-2377-9_13","article-title":"The strengths and weaknesses of NMR spectroscopy and mass spectrometry with particular focus on metabolomics research","volume":"1277","author":"AHM Emwas","year":"2015","journal-title":"Methods Mol Biol"},{"key":"pcbi.1009105.ref026","doi-asserted-by":"crossref","first-page":"350","DOI":"10.1007\/s11306-014-0656-8","article-title":"Metabolite identification: are you sure? And how do your peers gauge your confidence?","volume":"10","author":"DJ Creek","year":"2014","journal-title":"Metabolomics"},{"key":"pcbi.1009105.ref027","first-page":"44","volume-title":"Metabolomics","author":"WB Dunn","year":"2013"},{"key":"pcbi.1009105.ref028","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1186\/1752-0509-5-165","article-title":"Critical assessment of human metabolic pathway databases: A stepping stone for future integration","volume":"5","author":"MD Stobbe","year":"2011","journal-title":"BMC Syst Biol"},{"key":"pcbi.1009105.ref029","first-page":"1","article-title":"Pathway size matters: the influence of pathway granularity on over-representation","volume":"22","author":"PD Karp","year":"2021","journal-title":"BMC Genomics"},{"key":"pcbi.1009105.ref030","doi-asserted-by":"crossref","first-page":"28","DOI":"10.3390\/metabo9020028","article-title":"Consistency, inconsistency, and ambiguity of metabolite names in biochemical databases used for genome-scale metabolic modelling","volume":"9","author":"N Pham","year":"2019","journal-title":"Metabolites"},{"key":"pcbi.1009105.ref031","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1007\/s11306-020-01663-5","article-title":"Improving lipid mapping in Genome Scale Metabolic Networks using ontologies","volume":"16","author":"N Poupin","year":"2020","journal-title":"Metabolomics"},{"key":"pcbi.1009105.ref032","first-page":"705","volume-title":"Nature Methods","author":"L Wadi","year":"2016"},{"key":"pcbi.1009105.ref033","doi-asserted-by":"crossref","DOI":"10.3390\/metabo8030051","article-title":"Mind the gap: Mapping mass spectral databases in genome-scale metabolic networks reveals poorly covered areas","volume":"8","author":"C Frainay","year":"2018","journal-title":"Metabolites"},{"key":"pcbi.1009105.ref034","first-page":"30","volume-title":"Quantitative Biology","author":"AA Labena","year":"2018"},{"key":"pcbi.1009105.ref035","doi-asserted-by":"crossref","first-page":"1203","DOI":"10.3389\/fgene.2019.01203","article-title":"The Impact of Pathway Database Choice on Statistical Enrichment Analysis and Predictive Modeling","volume":"10","author":"S Mubeen","year":"2019","journal-title":"Front Genet"},{"key":"pcbi.1009105.ref036","first-page":"37","article-title":"ConsensusPathDB\u2014A database for integrating human functional interaction networks","author":"A Kamburov","year":"2009","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009105.ref037","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-019-2863-9","article-title":"PathMe: merging and exploring mechanistic pathway knowledge","volume":"20","author":"D Domingo-Fern\u00e1ndez","year":"2019","journal-title":"BMC Bioinforma"},{"key":"pcbi.1009105.ref038","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/giaa162","article-title":"Lilikoi V2.0: a deep learning\u2013enabled, personalized pathway-based R package for diagnosis and prognosis predictions using metabolomics data.","volume":"10","author":"X Fang","year":"2021","journal-title":"Gigascience"},{"key":"pcbi.1009105.ref039","doi-asserted-by":"crossref","first-page":"103","DOI":"10.3390\/metabo11020103","article-title":"Ranking Metabolite Sets by Their Activity Levels","volume":"11","author":"K McLuskey","year":"2021","journal-title":"Metabolites"},{"key":"pcbi.1009105.ref040","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-019-12298-z","article-title":"High-fat diet fuels prostate cancer progression by rewiring the metabolome and amplifying the MYC program","volume":"10","author":"DP Labb\u00e9","year":"2019","journal-title":"Nat Commun"},{"key":"pcbi.1009105.ref041","first-page":"968","volume-title":"Nature Medicine","author":"S Yachida","year":"2019"},{"key":"pcbi.1009105.ref042","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1007\/s11306-018-1393-1","article-title":"Serum metabolomic profiles associated with postmenopausal hormone use","volume":"14","author":"VL Stevens","year":"2018","journal-title":"Metabolomics"},{"key":"pcbi.1009105.ref043","doi-asserted-by":"crossref","first-page":"2027","DOI":"10.1083\/jcb.201702058","article-title":"Multi-omics analysis identifies ATF4 as a key regulator of the mitochondrial stress response in mammals","volume":"216","author":"PM Quir\u00f3s","year":"2017","journal-title":"J Cell Biol"},{"key":"pcbi.1009105.ref044","doi-asserted-by":"crossref","first-page":"907","DOI":"10.15252\/msb.20167150","article-title":"Genomewide landscape of gene\u2013metabolome associations in Escherichia coli","volume":"13","author":"T Fuhrer","year":"2017","journal-title":"Mol Syst Biol"},{"key":"pcbi.1009105.ref045","first-page":"D440","article-title":"MetaboLights: A resource evolving in response to the needs of its scientific community","volume":"48","author":"K Haug","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009105.ref046","doi-asserted-by":"crossref","first-page":"D1266","DOI":"10.1093\/nar\/gkx965","article-title":"The BioStudies database-one stop shop for all data supporting a life sciences study","volume":"46","author":"U Sarkans","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009105.ref047","doi-asserted-by":"crossref","first-page":"W486","DOI":"10.1093\/nar\/gky310","article-title":"MetaboAnalyst 4.0: Towards more transparent and integrative metabolomics analysis","volume":"46","author":"J Chong","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009105.ref048","doi-asserted-by":"crossref","first-page":"3241","DOI":"10.1093\/bioinformatics\/btt547","article-title":"BioServices: a common Python package to access biological Web Services programmatically","volume":"29","author":"T Cokelaer","year":"2013","journal-title":"Bioinformatics"},{"key":"pcbi.1009105.ref049","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing","volume":"57","author":"Y Benjamini","year":"1995","journal-title":"J R Stat Soc Ser B."}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1009105","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2021,9,17]],"date-time":"2021-09-17T00:00:00Z","timestamp":1631836800000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009105","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,7]],"date-time":"2024-09-07T19:48:18Z","timestamp":1725738498000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009105"}},"subtitle":[],"editor":[{"given":"Kiran Raosaheb","family":"Patil","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2021,9,7]]},"references-count":49,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2021,9,7]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1009105","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.05.24.445406","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,9,7]]}}}