{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,25]],"date-time":"2025-10-25T12:22:43Z","timestamp":1761394963597},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2017,1,31]],"date-time":"2017-01-31T00:00:00Z","timestamp":1485820800000},"content-version":"vor","delay-in-days":26,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,5,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>RNA-seq has become the technology of choice for interrogating the transcriptome. However, most methods for RNA-seq differential expression (DE) analysis do not utilize prior knowledge of biological networks to detect DE genes. With the increased availability and quality of biological network databases, methods that can utilize this prior knowledge are needed and will offer biologists with a viable, more powerful alternative when analyzing RNA-seq data.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We propose a three-state Markov Random Field (MRF) method that utilizes known biological pathways and interaction to improve sensitivity and specificity and therefore reducing false discovery rates (FDRs) when detecting differentially expressed genes from RNA-seq data. The method requires normalized count data (e.g. in Fragments or Reads Per Kilobase of transcript per Million mapped reads (FPKM\/RPKM) format) as its input and it is implemented in an R package pathDESeq available from Github. Simulation studies demonstrate that our method outperforms the two-state MRF model for various sample sizes. Furthermore, for a comparable FDR, it has better sensitivity than DESeq, EBSeq, edgeR and NOISeq. The proposed method also picks more top Gene Ontology terms and KEGG pathways terms when applied to real dataset from colorectal cancer and hepatocellular carcinoma studies, respectively. Overall, these findings clearly highlight the power of our method relative to the existing methods that do not utilize prior knowledge of biological network.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and Implementation<\/jats:title><jats:p>As an R package at https:\/\/github.com\/MalathiSIDona\/pathDESeq<\/jats:p><\/jats:sec><jats:sec><jats:title>To install the package type<\/jats:title><jats:p>install_github(\"MalathiSIDona\/pathDESeq\",build_vignettes\u2009=\u2009TRUE). After installation, type vignette(\"pathDESeq\") to access the vignette.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btw833","type":"journal-article","created":{"date-parts":[[2017,2,8]],"date-time":"2017-02-08T02:50:44Z","timestamp":1486522244000},"page":"1505-1513","source":"Crossref","is-referenced-by-count":21,"title":["Powerful differential expression analysis incorporating network topology for next-generation sequencing data"],"prefix":"10.1093","volume":"33","author":[{"given":"Malathi S.I","family":"Dona","sequence":"first","affiliation":[{"name":"Department of Mathematics and Statistics, La Trobe University, Melbourne, VIC, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luke A","family":"Prendergast","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Statistics, La Trobe University, Melbourne, VIC, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Suresh","family":"Mathivanan","sequence":"additional","affiliation":[{"name":"Department of Biochemistry and Genetics, La Trobe Institute of Molecular Science, La Trobe University, Melbourne, VIC, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shivakumar","family":"Keerthikumar","sequence":"additional","affiliation":[{"name":"Department of Biochemistry and Genetics, La Trobe Institute of Molecular Science, La Trobe University, Melbourne, VIC, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Agus","family":"Salim","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Statistics, La Trobe University, Melbourne, VIC, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2017,1,5]]},"reference":[{"key":"2023020205103654300_btw833-B1","author":"Allen","year":"2014"},{"key":"2023020205103654300_btw833-B2","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1093\/bioinformatics\/btu638","article-title":"HTSeq\u2014a Python framework to work with high-throughput sequencing data","volume":"31","author":"Anders","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020205103654300_btw833-B3","doi-asserted-by":"crossref","first-page":"R106.","DOI":"10.1186\/gb-2010-11-10-r106","article-title":"Differential expression analysis for sequence count data","volume":"11","author":"Anders","year":"2010","journal-title":"Genome Biol"},{"key":"2023020205103654300_btw833-B4","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. Roy. Stat. Soc. Ser. B"},{"key":"2023020205103654300_btw833-B5","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1111\/j.2517-6161.1986.tb01412.x","article-title":"On the statistical analysis of dirty pictures","volume":"48","author":"Besag","year":"1986","journal-title":"J. Roy. Stat. Soc. Ser. B"},{"key":"2023020205103654300_btw833-B6","doi-asserted-by":"crossref","first-page":"D470","DOI":"10.1093\/nar\/gku1204","article-title":"The BioGRID interaction database: 2015 update","volume":"43","author":"Chatr-Aryamontri","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020205103654300_btw833-B7","doi-asserted-by":"crossref","first-page":"D472","DOI":"10.1093\/nar\/gkt1102","article-title":"The Reactome pathway knowledgebase","volume":"42","author":"Croft","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023020205103654300_btw833-B8","doi-asserted-by":"crossref","first-page":"D876","DOI":"10.1093\/nar\/gkq963","article-title":"The UCSC Genome Browser database: update 2011","volume":"39","author":"Fujita","year":"2011","journal-title":"Nucleic Acids Res"},{"key":"2023020205103654300_btw833-B9","doi-asserted-by":"crossref","first-page":"5079","DOI":"10.1038\/sj.onc.1208696","article-title":"Meta-analysis of microarray data on pancreatic cancer defines a set of commonly dysregulated genes","volume":"24","author":"Gr\u00fctzmann","year":"2005","journal-title":"Oncogene"},{"key":"2023020205103654300_btw833-B10","doi-asserted-by":"crossref","first-page":"20130950","DOI":"10.1098\/rsif.2013.0950","article-title":"Separate enrichment analysis of pathways for up- and downregulated genes","volume":"11","author":"Hong","year":"2014","journal-title":"J. R. Soc. Interface"},{"key":"2023020205103654300_btw833-B11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/nar\/gkn923","article-title":"Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists","volume":"37","author":"Huang","year":"2009","journal-title":"Nucleic Acids Res"},{"key":"2023020205103654300_btw833-B12","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1038\/nprot.2008.211","article-title":"Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources","volume":"4","author":"Huang","year":"2009","journal-title":"Nat. Protocols"},{"key":"2023020205103654300_btw833-B13","doi-asserted-by":"crossref","first-page":"1653","DOI":"10.1016\/j.molonc.2014.06.016","article-title":"A nineteen gene-based risk score classifier predicts prognosis of colorectal cancer patients","volume":"8","author":"Kim","year":"2014","journal-title":"Mol. Oncol"},{"key":"2023020205103654300_btw833-B14","doi-asserted-by":"crossref","first-page":"144.","DOI":"10.1186\/1471-2105-6-144","article-title":"PAGE: parametric analysis of gene set enrichment","volume":"6","author":"Kim","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023020205103654300_btw833-B15","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.1923","article-title":"Fast gapped-read alignment with bowtie 2","volume":"9","author":"Langmead","year":"2012","journal-title":"Nat. Methods"},{"key":"2023020205103654300_btw833-B16","doi-asserted-by":"crossref","first-page":"7.","DOI":"10.4103\/1477-3163.78268","article-title":"Systematic enrichment of gene expression profiling studies identifies consensus pathways implicated in colorectal development","volume":"10","author":"Lascorz","year":"2011","journal-title":"J. Carcinog"},{"key":"2023020205103654300_btw833-B17","doi-asserted-by":"crossref","first-page":"D19","DOI":"10.1093\/nar\/gkq1019","article-title":"The sequence read archive","volume":"39","author":"Leinonen","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023020205103654300_btw833-B18","doi-asserted-by":"crossref","first-page":"1035","DOI":"10.1093\/bioinformatics\/btt087","article-title":"EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments","volume":"29","author":"Leng","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020205103654300_btw833-B19","doi-asserted-by":"crossref","first-page":"12755","DOI":"10.1007\/s13277-016-5186-8","article-title":"Meta-analysis of gene expression profiles identifies differential biomarkers for hepatocellular carcinoma and cholangiocarcinoma","volume":"37","author":"Likhitrattanapisal","year":"2016","journal-title":"Tumour Biol"},{"key":"2023020205103654300_btw833-B20","doi-asserted-by":"crossref","first-page":"32607","DOI":"10.18632\/oncotarget.8927","article-title":"Potential diagnostic and prognostic marker dimethylglycine dehydrogenase (dmgdh) suppresses hepatocellular carcinoma metastasis in vitro and in vivo","volume":"7","author":"Liu","year":"2016","journal-title":"Oncotarget"},{"key":"2023020205103654300_btw833-B21","doi-asserted-by":"crossref","first-page":"161.","DOI":"10.1186\/1471-2105-10-161","article-title":"GAGE: generally applicable gene set enrichment for pathway analysis","volume":"10","author":"Luo","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023020205103654300_btw833-B22","doi-asserted-by":"crossref","first-page":"3448","DOI":"10.1093\/bioinformatics\/bti551","article-title":"BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks","volume":"21","author":"Maere","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020205103654300_btw833-B23","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1089\/106652701300099074","article-title":"On differential variability of expression ratios: improving statistical inference about gene expression changes from microarray data","volume":"8","author":"Newton","year":"2001","journal-title":"J. Comp. Biol"},{"key":"2023020205103654300_btw833-B24","doi-asserted-by":"crossref","first-page":"gkv007","DOI":"10.1093\/nar\/gkv007","article-title":"limma powers differential expression analyses for RNA-sequencing and microarray studies","volume":"43","author":"Ritchie","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020205103654300_btw833-B25","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020205103654300_btw833-B26","doi-asserted-by":"crossref","first-page":"e151","DOI":"10.1093\/nar\/gkl766","article-title":"Absolute enrichment: gene set enrichment analysis for homeostatic systems","volume":"34","author":"Saxena","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023020205103654300_btw833-B27","doi-asserted-by":"crossref","first-page":"2498","DOI":"10.1101\/gr.1239303","article-title":"Cytoscape: a software environment for integrated models of biomolecular interaction networks","volume":"13","author":"Shannon","year":"2003","journal-title":"Genome Res"},{"key":"2023020205103654300_btw833-B28","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1027","article-title":"Linear models and empirical Bayes methods for assessing differential expression in microarray experiments","volume":"3","author":"Smyth","year":"2004","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2023020205103654300_btw833-B29","doi-asserted-by":"crossref","first-page":"91.","DOI":"10.1186\/1471-2105-14-91","article-title":"A comparison of methods for differential expression analysis of RNA-seq data","volume":"14","author":"Soneson","year":"2013","journal-title":"BMC Bioinformatics"},{"key":"2023020205103654300_btw833-B30","doi-asserted-by":"crossref","first-page":"D535","DOI":"10.1093\/nar\/gkj109","article-title":"BioGRID: a general repository for interaction datasets","volume":"34","author":"Stark","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023020205103654300_btw833-B31","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020205103654300_btw833-B32","doi-asserted-by":"crossref","first-page":"2213","DOI":"10.1101\/gr.124321.111","article-title":"Differential expression in RNA-seq: a matter of depth","volume":"21","author":"Tarazona","year":"2011","journal-title":"Genome Res"},{"key":"2023020205103654300_btw833-B33","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nrg2484","article-title":"RNA-Seq: a revolutionary tool for transcriptomics","volume":"10","author":"Wang","year":"2009","journal-title":"Nat. Rev. Genet"},{"key":"2023020205103654300_btw833-B34","doi-asserted-by":"crossref","first-page":"818","DOI":"10.1039\/c2mb05466c","article-title":"Extensive up-regulation of gene expression in cancer: the normalised use of microarray data","volume":"8","author":"Wang","year":"2012","journal-title":"Mol. Biosyst"},{"key":"2023020205103654300_btw833-B35","doi-asserted-by":"crossref","first-page":"1777","DOI":"10.1093\/bioinformatics\/btu090","article-title":"SeqGSEA: a Bioconductor package for gene set enrichment analysis of RNA-Seq data integrating differential expression and splicing","volume":"30","author":"Wang","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020205103654300_btw833-B36","doi-asserted-by":"crossref","first-page":"1537","DOI":"10.1093\/bioinformatics\/btm129","article-title":"A Markov random field model for network-based analysis of genomic data","volume":"23","author":"Wei","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020205103654300_btw833-B37","first-page":"1","article-title":"The NBP negative binomial model for assessing differential eene expression from RNA-Seq","volume":"10","author":"Yanming","year":"2011","journal-title":"Stat. Appl. Genet. Mol. Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/10\/1505\/49038743\/bioinformatics_33_10_1505.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/10\/1505\/49038743\/bioinformatics_33_10_1505.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,22]],"date-time":"2024-06-22T10:28:26Z","timestamp":1719052106000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/10\/1505\/2832781"}},"subtitle":[],"editor":[{"given":"Ziv","family":"Bar-Joseph","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2017,1,5]]},"references-count":37,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2017,5,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw833","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2017,5,15]]},"published":{"date-parts":[[2017,1,5]]}}}