{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T06:33:13Z","timestamp":1780641193510,"version":"3.54.1"},"reference-count":57,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2024,7,23]],"date-time":"2024-07-23T00:00:00Z","timestamp":1721692800000},"content-version":"vor","delay-in-days":22,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["2343019 and 2203236"],"award-info":[{"award-number":["2343019 and 2203236"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000104","name":"National Aeronautics and Space Administration","doi-asserted-by":"publisher","award":["80NSSC22M0255"],"award-info":[{"award-number":["80NSSC22M0255"]}],"id":[{"id":"10.13039\/100000104","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000057","name":"National Institute of General Medical Sciences","doi-asserted-by":"publisher","award":["GM103440 and 1R44GM152152-01"],"award-info":[{"award-number":["GM103440 and 1R44GM152152-01"]}],"id":[{"id":"10.13039\/100000057","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"National Cancer Institute","doi-asserted-by":"publisher","award":["1U01CA274573-01A1"],"award-info":[{"award-number":["1U01CA274573-01A1"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,7,23]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>This manuscript describes the development of a resource module that is part of a learning platform named \u2018NIGMS Sandbox for Cloud-based Learning\u2019 (https:\/\/github.com\/NIGMS\/NIGMS-Sandbox). The module delivers learning materials on Cloud-based Consensus Pathway Analysis in an interactive format that uses appropriate cloud resources for data access and analyses. Pathway analysis is important because it allows us to gain insights into biological mechanisms underlying conditions. But the availability of many pathway analysis methods, the requirement of coding skills, and the focus of current tools on only a few species all make it very difficult for biomedical researchers to self-learn and perform pathway analysis efficiently. Furthermore, there is a lack of tools that allow researchers to compare analysis results obtained from different experiments and different analysis methods to find consensus results. To address these challenges, we have designed a cloud-based, self-learning module that provides consensus results among established, state-of-the-art pathway analysis techniques to provide students and researchers with necessary training and example materials. The training module consists of five Jupyter Notebooks that provide complete tutorials for the following tasks: (i) process expression data, (ii) perform differential analysis, visualize and compare the results obtained from four differential analysis methods (limma, t-test, edgeR, DESeq2), (iii) process three pathway databases (GO, KEGG and Reactome), (iv) perform pathway analysis using eight methods (ORA, CAMERA, KS test, Wilcoxon test, FGSEA, GSA, SAFE and PADOG) and (v) combine results of multiple analyses. We also provide examples, source code, explanations and instructional videos for trainees to complete each Jupyter Notebook. The module supports the analysis for many model (e.g. human, mouse, fruit fly, zebra fish) and non-model species. The module is publicly available at https:\/\/github.com\/NIGMS\/Consensus-Pathway-Analysis-in-the-Cloud.<\/jats:p>\n               <jats:p>This manuscript describes the development of a resource module that is part of a learning platform named ``NIGMS Sandbox for Cloud-based Learning'' https:\/\/github.com\/NIGMS\/NIGMS-Sandbox. The overall genesis of the Sandbox is described in the editorial NIGMS Sandbox [1] at the beginning of this Supplement. This module delivers learning materials on the analysis of bulk and single-cell ATAC-seq data in an interactive format that uses appropriate cloud resources for data access and analyses.<\/jats:p>","DOI":"10.1093\/bib\/bbae222","type":"journal-article","created":{"date-parts":[[2024,7,23]],"date-time":"2024-07-23T13:40:40Z","timestamp":1721742040000},"source":"Crossref","is-referenced-by-count":22,"title":["CCPA: cloud-based, self-learning modules for consensus pathway analysis using GO, KEGG and Reactome"],"prefix":"10.1093","volume":"25","author":[{"given":"Ha","family":"Nguyen","sequence":"first","affiliation":[{"name":"Auburn University Department of Computer Science and Software Engineering, , AL 36849, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Van-Dung","family":"Pham","sequence":"additional","affiliation":[{"name":"Auburn University Department of Computer Science and Software Engineering, , AL 36849, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hung","family":"Nguyen","sequence":"additional","affiliation":[{"name":"Auburn University Department of Computer Science and Software Engineering, , AL 36849, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Bang","family":"Tran","sequence":"additional","affiliation":[{"name":"California State University, Sacramento Department of Computer Science, , CA 95819, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Juli","family":"Petereit","sequence":"additional","affiliation":[{"name":"University of Nevada, Reno Nevada Bioinformatics Center, , NV 89557, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tin","family":"Nguyen","sequence":"additional","affiliation":[{"name":"Auburn University Department of Computer Science and Software Engineering, , AL 36849, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2024,7,23]]},"reference":[{"key":"2024072312405387800_ref1","article-title":"NIGMS Sandbox: A Learning Platform toward Democratizing Cloud Computing for Biomedical Research","author":"Lei","journal-title":"Brief Bioinform"},{"issue":"6","key":"2024072312405387800_ref2","doi-asserted-by":"crossref","first-page":"1107","DOI":"10.1038\/bjc.2011.584","article-title":"A network-based, integrative study to identify core biological pathways that drive breast cancer clinical subtypes","volume":"106","author":"Dutta","year":"2012","journal-title":"Br J Cancer"},{"key":"2024072312405387800_ref3","first-page":"12","article-title":"Identification of a subtype of hepatocellular carcinoma with poor prognosis based on expression of genes within the glucose metabolic pathway","volume":"11","author":"Zhang","year":"2019","journal-title":"Cancer"},{"issue":"10","key":"2024072312405387800_ref4","doi-asserted-by":"crossref","first-page":"1447","DOI":"10.1373\/clinchem.2012.200477","article-title":"Companion biomarkers: paving the pathway to personalized treatment for cancer","volume":"59","author":"Duffy","year":"2013","journal-title":"Clin Chem"},{"key":"2024072312405387800_ref5","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1007\/978-1-4614-8778-4_5","article-title":"A personalized treatment for lung cancer: molecular pathways, targeted therapies, and genomic characterization","volume":"799","author":"Hensing","year":"2013","journal-title":"Systems Analysis of Human Multigene Disorders"},{"issue":"6","key":"2024072312405387800_ref6","doi-asserted-by":"crossref","first-page":"e0131183","DOI":"10.1371\/journal.pone.0131183","article-title":"Identification of personalized chemoresistance genes in subtypes of basal-like breast cancer based on functional differences using pathway analysis","volume":"10","author":"Tong","year":"2015","journal-title":"PloS One"},{"issue":"3","key":"2024072312405387800_ref7","doi-asserted-by":"crossref","first-page":"409","DOI":"10.1093\/bioinformatics\/btv588","article-title":"A novel bi-level meta-analysis approach-applied to biological pathway analysis","volume":"32","author":"Nguyen","year":"2016","journal-title":"Bioinformatics"},{"key":"2024072312405387800_ref8","doi-asserted-by":"crossref","first-page":"18571","DOI":"10.1038\/s41598-023-41374-0","article-title":"A novel approach for predicting upstream regulators (PURE) that affect gene expression","volume":"13","author":"Nguyen","year":"2023","journal-title":"Sci Rep"},{"issue":"9","key":"2024072312405387800_ref9","doi-asserted-by":"crossref","first-page":"1950","DOI":"10.1093\/bioinformatics\/bti267","article-title":"Testing association of a pathway with survival using gene expression data","volume":"21","author":"Goeman","year":"2005","journal-title":"Bioinformatics"},{"key":"2024072312405387800_ref10","doi-asserted-by":"crossref","first-page":"6047","DOI":"10.1038\/s41598-021-84787-5","article-title":"Pancancer survival analysis of cancer hallmark genes","volume":"11","author":"Nagy","year":"2021","journal-title":"Sci Rep"},{"key":"2024072312405387800_ref11","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/j.lssr.2022.07.006","article-title":"Mouse genomic associations with in vitro sensitivity to simulated space radiation","volume":"36","author":"Cekanaviciute","year":"2022","journal-title":"Life Sciences in Space Research"},{"key":"2024072312405387800_ref12","doi-asserted-by":"crossref","first-page":"971282","DOI":"10.3389\/fphys.2022.971282","article-title":"Quantitative proteomic analytic approaches to identify metabolic changes in the medial prefrontal cortex of rats exposed to space radiation","volume":"13","author":"Laiakis","year":"2022","journal-title":"Front Physiol"},{"issue":"2","key":"2024072312405387800_ref13","doi-asserted-by":"crossref","first-page":"e1002375","DOI":"10.1371\/journal.pcbi.1002375","article-title":"Ten years of pathway analysis: current approaches and outstanding challenges","volume":"8","author":"Khatri","year":"2012","journal-title":"PLoS Comput Biol"},{"key":"2024072312405387800_ref14","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1186\/s13059-019-1790-4","article-title":"Identifying significantly impacted pathways: a comprehensive review and assessment","volume":"20","author":"Nguyen","year":"2019","journal-title":"Genome Biol"},{"issue":"6","key":"2024072312405387800_ref15","doi-asserted-by":"crossref","first-page":"bbac435","DOI":"10.1093\/bib\/bbac435","article-title":"A comprehensive survey of the approaches for pathway analysis using multi-omics data integration","volume":"23","author":"Maghsoudi","year":"2022","journal-title":"Brief Bioinform"},{"issue":"1","key":"2024072312405387800_ref16","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1093\/nar\/30.1.207","article-title":"Gene expression omnibus: NCBI gene expression and hybridization array data repository","volume":"30","author":"Edgar","year":"2002","journal-title":"Nucleic Acids Res"},{"issue":"Database Issue","key":"2024072312405387800_ref17","doi-asserted-by":"crossref","first-page":"D562","DOI":"10.1093\/nar\/gki022","article-title":"NCBI GEO: mining millions of expression profiles\u2013database and tools","volume":"33","author":"Barrett","year":"2005","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"2024072312405387800_ref18","doi-asserted-by":"crossref","first-page":"1109","DOI":"10.1056\/NEJMp1607591","article-title":"Toward a shared vision for cancer genomic data","volume":"375","author":"Grossman","year":"2016","journal-title":"New England Journal of Medicine"},{"issue":"1","key":"2024072312405387800_ref19","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1093\/nar\/gkg091","article-title":"ArrayExpress\u2013a public repository for microarray gene expression data at the EBI","volume":"31","author":"Brazma","year":"2003","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2024072312405387800_ref20","doi-asserted-by":"crossref","first-page":"D987","DOI":"10.1093\/nar\/gks1174","article-title":"ArrayExpress update\u2013trends in database growth and links to data analysis tools","volume":"41","author":"Rustici","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2024072312405387800_ref21","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1038\/nrg.2017.113","article-title":"Cloud computing for genomic data analysis and collaboration","volume":"19","author":"Langmead","year":"2018","journal-title":"Nat Rev Genet"},{"key":"2024072312405387800_ref22","doi-asserted-by":"crossref","first-page":"288","DOI":"10.1038\/s41587-019-0360-3","article-title":"Butler enables rapid cloud-based analysis of thousands of human genomes","volume":"38","author":"Yakneen","year":"2020","journal-title":"Nat Biotechnol"},{"issue":"3","key":"2024072312405387800_ref23","doi-asserted-by":"crossref","first-page":"486","DOI":"10.1002\/spe.2544","article-title":"Resource provisioning in science clouds: requirements and challenges","volume":"48","author":"Garc\u00eda","year":"2018","journal-title":"Softw Pract Exp"},{"issue":"1","key":"2024072312405387800_ref24","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The gene ontology consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat Genet"},{"issue":"1","key":"2024072312405387800_ref25","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: Kyoto Encyclopedia of genes and genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res"},{"issue":"suppl_1","key":"2024072312405387800_ref26","doi-asserted-by":"crossref","first-page":"D619","DOI":"10.1093\/nar\/gkn863","article-title":"Reactome knowledgebase of human biological pathways and processes","volume":"37","author":"Matthews","year":"2009","journal-title":"Nucleic Acids Res"},{"issue":"W1","key":"2024072312405387800_ref27","doi-asserted-by":"crossref","first-page":"W114","DOI":"10.1093\/nar\/gkab421","article-title":"CPA: a web-based platform for consensus pathway analysis and interactive visualization","volume":"49","author":"Nguyen","year":"2021","journal-title":"Nucleic Acids Res"},{"issue":"14","key":"2024072312405387800_ref28","doi-asserted-by":"crossref","first-page":"1846","DOI":"10.1093\/bioinformatics\/btm254","article-title":"GEOquery: a bridge between the gene expression omnibus (GEO) and BioConductor","volume":"23","author":"Davis","year":"2007","journal-title":"Bioinformatics"},{"key":"2024072312405387800_ref29","volume-title":"AnnotationDbi: Manipulation of SQLite-based annotations in Bioconductor","author":"Pag\u00e8s","year":"2023"},{"key":"2024072312405387800_ref30","volume-title":"hgu133plus2.db: Affymetrix Human Genome U133 Plus 2.0 Array annotation data (chip hgu133plus2)","author":"Carlson","year":"2016"},{"issue":"1","key":"2024072312405387800_ref31","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1186\/s12859-022-04606-0","article-title":"DrGA: cancer driver gene analysis in a simpler manner","volume":"23","author":"Nguyen","year":"2022","journal-title":"BMC Bioinformatics"},{"key":"2024072312405387800_ref32","first-page":"gkae267","article-title":"Fourteen years of cellular deconvolution: methodology, applications, technical evaluation and outstanding challenges","author":"Nguyen","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2024072312405387800_ref33","first-page":"397","volume-title":"Limma: Linear models for microarray data. In: Bioinformatics and computational biology solutions using R and Bioconductor","author":"Smyth","year":"2005"},{"key":"2024072312405387800_ref34","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"issue":"1","key":"2024072312405387800_ref35","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2024072312405387800_ref36","first-page":"1438","article-title":"From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline","volume":"5","author":"Chen","year":"2016","journal-title":"F1000Research"},{"key":"2024072312405387800_ref37","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1038\/s41576-019-0150-2","article-title":"RNA sequencing: the teenage years","volume":"20","author":"Stark","year":"2019","journal-title":"Nat Rev Genet"},{"issue":"D1","key":"2024072312405387800_ref38","doi-asserted-by":"crossref","first-page":"D325","DOI":"10.1093\/nar\/gkaa1113","article-title":"The gene ontology resource: enriching a GOld mine","volume":"49","author":"The Gene Ontology Consortium","year":"2021","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2024072312405387800_ref39","doi-asserted-by":"crossref","first-page":"D353","DOI":"10.1093\/nar\/gkw1092","article-title":"KEGG: new perspectives on genomes, pathways, diseases and drugs","volume":"45","author":"Kanehisa","year":"2017","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2024072312405387800_ref40","doi-asserted-by":"crossref","first-page":"D672","DOI":"10.1093\/nar\/gkad1025","article-title":"The Reactome pathway knowledgebase 2024","volume":"52","author":"Milacic","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2024072312405387800_ref41","volume-title":"topGO: Enrichment Analysis for Gene Ontology","author":"Alexa","year":"2023"},{"key":"2024072312405387800_ref42","volume-title":"KEGGREST: Client-side REST access to the Kyoto Encyclopedia of Genes and Genomes (KEGG)","author":"Tenenbaum","year":"2023"},{"key":"2024072312405387800_ref43","volume-title":"R interface to the Reactome graph database","author":"Poon","year":"2023"},{"issue":"3","key":"2024072312405387800_ref44","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1093\/bib\/bbr049","article-title":"Gene set enrichment analysis: performance evaluation and usage guidelines","volume":"13","author":"Hung","year":"2012","journal-title":"Brief Bioinform"},{"key":"2024072312405387800_ref45","doi-asserted-by":"crossref","first-page":"1265","DOI":"10.3389\/fgene.2020.574661","article-title":"Multi-omics analysis detects novel prognostic subgroups of breast cancer","volume":"11","author":"Nguyen","year":"2020","journal-title":"Front Genet"},{"issue":"6","key":"2024072312405387800_ref46","doi-asserted-by":"crossref","first-page":"80","DOI":"10.2307\/3001968","article-title":"Individual comparisons by ranking methods","volume":"1","author":"Wilcoxon","year":"1945","journal-title":"Biometrics"},{"issue":"253","key":"2024072312405387800_ref47","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1080\/01621459.1951.10500769","article-title":"The Kolmogorov-Smirnov test for goodness of fit","volume":"46","author":"Massey Jr","year":"1951","journal-title":"J Am Stat Assoc"},{"issue":"2","key":"2024072312405387800_ref48","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1006\/geno.2002.6698","article-title":"Profiling gene expression using onto-express","volume":"79","author":"Khatri","year":"2002","journal-title":"Genomics"},{"issue":"W1","key":"2024072312405387800_ref49","doi-asserted-by":"crossref","first-page":"W199","DOI":"10.1093\/nar\/gkz401","article-title":"WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs","volume":"47","author":"Liao","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2024072312405387800_ref50","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1214\/07-AOAS101","article-title":"On testing the significance of sets of genes","volume":"1","author":"Efron","year":"2007","journal-title":"The Annals of Applied Statistics"},{"key":"2024072312405387800_ref51","doi-asserted-by":"crossref","first-page":"136","DOI":"10.1186\/1471-2105-13-136","article-title":"Down-weighting overlapping genes improves gene set analysis","volume":"13","author":"Tarca","year":"2012","journal-title":"BMC Bioinformatics"},{"key":"2024072312405387800_ref52","first-page":"060012","article-title":"Fast gene set enrichment analysis","author":"Korotkevich","year":"2021","journal-title":"BioRxiv"},{"issue":"17","key":"2024072312405387800_ref53","doi-asserted-by":"crossref","first-page":"e133","DOI":"10.1093\/nar\/gks461","article-title":"CAMERA: a competitive gene set test accounting for inter-gene correlation","volume":"40","author":"Di","year":"2012","journal-title":"Nucleic Acids Res"},{"issue":"9","key":"2024072312405387800_ref54","doi-asserted-by":"crossref","first-page":"1943","DOI":"10.1093\/bioinformatics\/bti260","article-title":"Significance analysis of functional categories in gene expression studies: a structured permutation approach","volume":"21","author":"Barry","year":"2005","journal-title":"Bioinformatics"},{"issue":"4","key":"2024072312405387800_ref55","first-page":"153","article-title":"How to perform a meta-analysis with R: a practical tutorial","volume":"22","author":"Balduzzi","year":"2019","journal-title":"BMJ Mental Health"},{"issue":"3","key":"2024072312405387800_ref56","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1152\/physiolgenomics.00208.2006","article-title":"Gene expression profiles in anatomically and functionally distinct regions of the normal aged human brain","volume":"28","author":"Liang","year":"2007","journal-title":"Physiol Genomics"},{"issue":"10","key":"2024072312405387800_ref57","doi-asserted-by":"crossref","first-page":"1024","DOI":"10.1038\/s41588-020-0696-0","article-title":"An integrated multi-omics approach identifies epigenetic alterations associated with Alzheimer\u2019s disease","volume":"52","author":"Nativio","year":"2020","journal-title":"Nat Genet"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/Supplement_1\/bbae222\/58618835\/bbae222.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/Supplement_1\/bbae222\/58618835\/bbae222.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,23]],"date-time":"2024-07-23T13:41:34Z","timestamp":1721742094000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbae222\/7718483"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,7]]},"references-count":57,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2024,7,23]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbae222","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,7]]},"published":{"date-parts":[[2024,7]]},"article-number":"bbae222"}}