{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,22]],"date-time":"2025-11-22T11:27:47Z","timestamp":1763810867533,"version":"3.41.2"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2022,9,1]],"date-time":"2022-09-01T00:00:00Z","timestamp":1661990400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Biological pathways are a broadly used formalism for representing and interpreting the cascade of biochemical reactions underlying cellular and biological mechanisms. Pathway representation provides an ontological link among biomolecules such as RNA, DNA, small molecules, proteins, protein complexes, hormones and genes. Frequently, pathway annotations are used to identify mechanisms linked to genes within affected biological contexts. This important role and the simplicity and elegance in representing complex interactions led to an explosion of pathway representations and databases. Unfortunately, the lack of overlap across databases results in inconsistent enrichment analysis results, unless databases are integrated. However, due to absence of consensus, guidelines or gold standards in pathway definition and representation, integration of data across pathway databases is not straightforward. Despite multiple attempts to provide consolidated pathways, highly related, redundant, poorly overlapping or ambiguous pathways continue to render pathways analysis inconsistent and hard to interpret. Ontology-based integration will promote unbiased, comprehensive yet streamlined analysis of experiments, and will reduce the number of enriched pathways when performing pathway enrichment analysis. Moreover, appropriate and consolidated pathways provide better training data for pathway prediction algorithms. In this manuscript, we describe the current methods for pathway consolidation, their strengths and pitfalls, and highlight directions for future improvements to this research area.<\/jats:p>","DOI":"10.1093\/bib\/bbac368","type":"journal-article","created":{"date-parts":[[2022,9,5]],"date-time":"2022-09-05T21:29:06Z","timestamp":1662413346000},"source":"Crossref","is-referenced-by-count":3,"title":["Pathway integration and annotation: building a puzzle with non-matching pieces and no reference picture"],"prefix":"10.1093","volume":"23","author":[{"given":"Giuseppe","family":"Agapito","sequence":"first","affiliation":[{"name":"Department of Law, Economics and Social Sciences, University Magna Gr\u00e6cia of Catanzaro , Italy"},{"name":"Data Analytic Research Center, University Magna Gr\u00e6cia of Catanzaro , Italy"}]},{"given":"Chiara","family":"Pastrello","sequence":"additional","affiliation":[{"name":"Osteoarthritis Research Program, Division of Orthopaedics, Schroeder Arthritis Institute, University Health Network , Toronto, Canada"}]},{"given":"Yun","family":"Niu","sequence":"additional","affiliation":[{"name":"Osteoarthritis Research Program, Division of Orthopaedics, Schroeder Arthritis Institute, University Health Network , Toronto, Canada"}]},{"given":"Igor","family":"Jurisica","sequence":"additional","affiliation":[{"name":"Osteoarthritis Research Program, Division of Orthopaedics, Schroeder Arthritis Institute, University Health Network , Toronto, Canada"},{"name":"Departments of Medical Biophysics and Computer Science Canada, University of Toronto , Toronto, Canada"},{"name":"Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute , Toronto, Canada"},{"name":"Institute of Neuroimmunology, Slovak Academy of Sciences , Bratislava, Slovakia"}]}],"member":"286","published-online":{"date-parts":[[2022,9,5]]},"reference":[{"key":"2022092013234393000_ref1","doi-asserted-by":"crossref","DOI":"10.3389\/fgene.2019.01203","article-title":"The impact of pathway database choice on statistical enrichment analysis and predictive modeling","volume":"10","author":"Mubeen","year":"2019","journal-title":"Front Genet"},{"issue":"3","key":"2022092013234393000_ref2","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1089\/152791601750294344","article-title":"BioCarta","volume":"2","author":"Nishimura","year":"2001","journal-title":"Biotechnology software & Internet journal"},{"issue":"suppl_1","key":"2022092013234393000_ref3","doi-asserted-by":"crossref","first-page":"D623","DOI":"10.1093\/nar\/gkm900","article-title":"The metacyc database of metabolic pathways and enzymes and the biocyc collection of pathway\/genome databases","volume":"36","author":"Caspi","year":"2007","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2022092013234393000_ref4","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1093\/nar\/30.1.56","article-title":"The ecocyc database","volume":"30","author":"Karp","year":"2002","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2022092013234393000_ref5","doi-asserted-by":"crossref","first-page":"D445","DOI":"10.1093\/nar\/gkz862","article-title":"The metacyc database of metabolic pathways and enzymes-a 2019 update","volume":"48","author":"Caspi","year":"2020","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2022092013234393000_ref6","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"Kegg: Kyoto encyclopedia of genes and genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2022092013234393000_ref7","doi-asserted-by":"crossref","first-page":"D419","DOI":"10.1093\/nar\/gky1038","article-title":"Panther version 14: more genomes, a new panther go-slim and improvements in enrichment analysis tools","volume":"47","author":"Mi","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2022092013234393000_ref8","doi-asserted-by":"crossref","first-page":"D649","DOI":"10.1093\/nar\/gkx1132","article-title":"The reactome pathway knowledgebase","volume":"46","author":"Fabregat","year":"2017","journal-title":"Nucleic Acids Res"},{"issue":"15","key":"2022092013234393000_ref9","doi-asserted-by":"crossref","first-page":"4377","DOI":"10.1093\/bioinformatics\/btaa529","article-title":"Biopax-parser: parsing and enrichment analysis of biopax pathways","volume":"36","author":"Agapito","year":"2020","journal-title":"Bioinformatics"},{"issue":"23","key":"2022092013234393000_ref10","doi-asserted-by":"crossref","first-page":"17561","DOI":"10.1007\/s00500-020-05243-6","article-title":"Cpea: a parallel method to perform pathway enrichment analysis using multiple pathways databases","volume":"24","author":"Agapito","year":"2020","journal-title":"Soft Computing"},{"issue":"43","key":"2022092013234393000_ref11","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc Natl Acad Sci"},{"issue":"D1","key":"2022092013234393000_ref12","first-page":"D479","article-title":"Pathdip 4: an extended pathway annotations and enrichment analysis resource for human, model organisms and domesticated species","volume":"48","author":"Rahmati","year":"2020","journal-title":"Nucleic Acids Res"},{"issue":"19","key":"2022092013234393000_ref13","doi-asserted-by":"crossref","first-page":"3779","DOI":"10.1093\/bioinformatics\/btz132","article-title":"Causaltab: the psi-mitab 2.8 updated format for signalling data representation and dissemination","volume":"35","author":"Perfetto","year":"2019","journal-title":"Bioinformatics"},{"issue":"7","key":"2022092013234393000_ref14","doi-asserted-by":"crossref","first-page":"e184","DOI":"10.1371\/journal.pbio.0060184","article-title":"Wikipathways: pathway editing for the people","volume":"6","author":"Pico","year":"2008","journal-title":"PLoS Biol"},{"key":"2022092013234393000_ref15","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1007\/978-3-540-24750-0_4","volume-title":"Handbook on ontologies","author":"Antoniou","year":"2004"},{"key":"2022092013234393000_ref16","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1007\/978-3-540-24750-0_3","volume-title":"Handbook on ontologies","author":"McBride","year":"2004"},{"issue":"9","key":"2022092013234393000_ref17","doi-asserted-by":"crossref","first-page":"935","DOI":"10.1038\/nbt.1666","article-title":"The biopax community standard for pathway data sharing","volume":"28","author":"Demir","year":"2010","journal-title":"Nat Biotechnol"},{"key":"2022092013234393000_ref18","doi-asserted-by":"crossref","DOI":"10.1093\/database\/baq037","article-title":"The biomolecular interaction network database in psi-mi 2.5","volume":"2011","author":"Isserlin","year":"2011","journal-title":"Database"},{"issue":"8","key":"2022092013234393000_ref19","doi-asserted-by":"crossref","first-page":"e9110","DOI":"10.15252\/msb.20199110","article-title":"Sbml level 3: an extensible format for the exchange and reuse of biological models","volume":"16","author":"Keating","year":"2020","journal-title":"Mol Syst Biol"},{"issue":"2\u20133","key":"2022092013234393000_ref20","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1016\/j.pbiomolbio.2004.01.004","article-title":"Cellml: its future, present and past","volume":"85","author":"Lloyd","year":"2004","journal-title":"Prog Biophys Mol Biol"},{"issue":"2","key":"2022092013234393000_ref21","doi-asserted-by":"crossref","first-page":"676","DOI":"10.1093\/bib\/bbaa377","article-title":"Comprehensive pathway enrichment analysis workflows: Covid-19 case study","volume":"22","author":"Agapito","year":"2021","journal-title":"Brief Bioinform"},{"issue":"1","key":"2022092013234393000_ref22","doi-asserted-by":"crossref","first-page":"15","DOI":"10.11613\/BM.2010.004","article-title":"Comparing groups for statistical differences: how to choose the right statistical test?","volume":"20","author":"Marusteri","year":"2010","journal-title":"Biochem Med"},{"issue":"1","key":"2022092013234393000_ref23","first-page":"1","article-title":"Identifying significantly impacted pathways: a comprehensive review and assessment","volume":"20","author":"Nguyen","year":"2019","journal-title":"Genome Biol"},{"key":"2022092013234393000_ref24","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1109\/PDP55904.2022.00041","volume-title":"2022 30th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP)","author":"Agapito","year":"2022"},{"key":"2022092013234393000_ref25","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1109\/GENSiPS.2011.6169447","volume-title":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","author":"Doderer","year":"2011"},{"issue":"1","key":"2022092013234393000_ref26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-019-3146-1","article-title":"A comparative study of topology-based pathway enrichment analysis methods","volume":"20","author":"Ma","year":"2019","journal-title":"BMC bioinformatics"},{"key":"2022092013234393000_ref27","doi-asserted-by":"crossref","first-page":"D685","DOI":"10.1093\/nar\/gkq1039","article-title":"Pathway commons, a web resource for biological pathway data","volume":"39","author":"Ethan Cerami","year":"2010","journal-title":"Nucleic Acids Res"},{"issue":"10","key":"2022092013234393000_ref28","doi-asserted-by":"crossref","first-page":"1889","DOI":"10.1038\/nprot.2016.117","article-title":"Analyzing and interpreting genome data at the network level with consensuspathdb","volume":"11","author":"Herwig","year":"2016","journal-title":"Nat Protoc"},{"key":"2022092013234393000_ref29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/database\/bav006","article-title":"Pathcards: multi-source consolidation of human biological pathways","volume":"2015","author":"Belinky","year":"2015","journal-title":"Database"},{"issue":"6","key":"2022092013234393000_ref30","article-title":"Integrated pathway clusters with coherent biological themes for target prioritisation","volume":"9","author":"Chen","year":"2014","journal-title":"PLoS One"},{"issue":"Suppl 6","key":"2022092013234393000_ref31","article-title":"Pathway distiller - multisource biological pathway consolidation","volume":"13","author":"Doderer","year":"2012","journal-title":"BMC Genomics"},{"key":"2022092013234393000_ref32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/database\/bau126","article-title":"Comparison of human cell signaling pathway databases-evolution, drawbacks and challenges","volume":"2015","author":"Chowdhury","year":"2015","journal-title":"Database"},{"issue":"165","key":"2022092013234393000_ref33","article-title":"Critical assessment of human metabolic pathway databases: a stepping stone for future integration","volume":"5","author":"Stobbe","year":"2011","journal-title":"BMC Syst Biol"},{"issue":"290","key":"2022092013234393000_ref34","article-title":"Pathway databases and tools for their exploitation: benefits, current limitations and challenges","volume":"5","author":"Bauer-Mehren","year":"2009","journal-title":"Mol Syst Biol"},{"issue":"3","key":"2022092013234393000_ref35","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1038\/s41540-019-0082-7","article-title":"Compath: an ecosystem for exploring, analyzing, and curating mappings across pathway databases","volume":"5","author":"Domingo-Fernandez","year":"2019","journal-title":"npj Systems Biology and Applications"},{"issue":"1","key":"2022092013234393000_ref36","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1186\/2041-1480-5-7","article-title":"The pathway ontology - updates and applications","volume":"5","author":"Petri","year":"2014","journal-title":"Journal of Biomedical Semantics"},{"issue":"D1","key":"2022092013234393000_ref37","doi-asserted-by":"crossref","first-page":"D419","DOI":"10.1093\/nar\/gkw1082","article-title":"pathDIP: an annotated resource for known and predicted human gene-pathway associations and pathway enrichment analysis","volume":"45","author":"Rahmati","year":"2016","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"2022092013234393000_ref38","doi-asserted-by":"crossref","first-page":"e1007090","DOI":"10.1371\/journal.pcbi.1007090","article-title":"Connecting signaling and metabolic pathways in egf receptor-mediated oncogenesis of glioblastoma","volume":"15","author":"Bag","year":"2019","journal-title":"PLoS Comput Biol"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/5\/bbac368\/45937320\/bbac368.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/5\/bbac368\/45937320\/bbac368.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,20]],"date-time":"2022-09-20T18:12:41Z","timestamp":1663697561000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac368\/6691914"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9]]},"references-count":38,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,9,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac368","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"type":"print","value":"1467-5463"},{"type":"electronic","value":"1477-4054"}],"subject":[],"published-other":{"date-parts":[[2022,9]]},"published":{"date-parts":[[2022,9]]},"article-number":"bbac368"}}