{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:02Z","timestamp":1740185102459,"version":"3.37.3"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2017,9,4]],"date-time":"2017-09-04T00:00:00Z","timestamp":1504483200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["CA149653, CA164384, CA149147, CA184902, CA148826 and CA187512"],"award-info":[{"award-number":["CA149653, CA164384, CA149147, CA184902, CA148826 and CA187512"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Recent advances in high-throughput RNA sequencing (RNA-seq) technologies have made it possible to reconstruct the full transcriptome of various types of cells. It is important to accurately assemble transcripts or identify isoforms for an improved understanding of molecular mechanisms in biological systems.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We have developed a novel Bayesian method, SparseIso, to reliably identify spliced isoforms from RNA-seq data. A spike-and-slab prior is incorporated into the Bayesian model to enforce the sparsity for isoform identification, effectively alleviating the problem of overfitting. A Gibbs sampling procedure is further developed to simultaneously identify and quantify transcripts from RNA-seq data. With the sampling approach, SparseIso estimates the joint distribution of all candidate transcripts, resulting in a significantly improved performance in detecting lowly expressed transcripts and multiple expressed isoforms of genes. Both simulation study and real data analysis have demonstrated that the proposed SparseIso method significantly outperforms existing methods for improved transcript assembly and isoform identification.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The SparseIso package is available at http:\/\/github.com\/henryxushi\/SparseIso.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx557","type":"journal-article","created":{"date-parts":[[2017,9,2]],"date-time":"2017-09-02T11:10:20Z","timestamp":1504350620000},"page":"56-63","source":"Crossref","is-referenced-by-count":7,"title":["SparseIso: a novel Bayesian approach to identify alternatively spliced isoforms from RNA-seq data"],"prefix":"10.1093","volume":"34","author":[{"given":"Xu","family":"Shi","sequence":"first","affiliation":[{"name":"Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA, USA"}]},{"given":"Xiao","family":"Wang","sequence":"additional","affiliation":[{"name":"Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA, USA"}]},{"given":"Tian-Li","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Pathology, Johns Hopkins Medical Institutions, Baltimore, MD, USA"}]},{"given":"Leena","family":"Hilakivi-Clarke","sequence":"additional","affiliation":[{"name":"Department of Oncology and Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC, USA"}]},{"given":"Robert","family":"Clarke","sequence":"additional","affiliation":[{"name":"Department of Oncology and Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7256-1374","authenticated-orcid":false,"given":"Jianhua","family":"Xuan","sequence":"additional","affiliation":[{"name":"Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA, USA"}]}],"member":"286","published-online":{"date-parts":[[2017,9,4]]},"reference":[{"key":"2023020208402310400_btx557-B1","doi-asserted-by":"crossref","first-page":"E4821","DOI":"10.1073\/pnas.1320101110","article-title":"Characterization of the human ESC transcriptome by hybrid sequencing","volume":"110","author":"Au","year":"2013","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020208402310400_btx557-B2","first-page":"btu317","article-title":"Efficient RNA isoform identification and quantification from RNA-Seq data with network flows","author":"Bernard","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020208402310400_btx557-B3","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1198\/10618600152627906","article-title":"Sampling truncated normal, beta, and gamma densities","volume":"10","author":"Damien","year":"2001","journal-title":"J. Comput. Graph. Stat"},{"key":"2023020208402310400_btx557-B4","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1038\/nature11233","article-title":"Landscape of transcription in human cells","volume":"489","author":"Djebali","year":"2012","journal-title":"Nature"},{"key":"2023020208402310400_btx557-B5","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1093\/bioinformatics\/bts635","article-title":"STAR: ultrafast universal RNA-seq aligner","volume":"29","author":"Dobin","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020208402310400_btx557-B6","doi-asserted-by":"crossref","first-page":"10073","DOI":"10.1093\/nar\/gks666","article-title":"Modelling and simulating generic RNA-Seq experiments with the flux simulator","volume":"40","author":"Griebel","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023020208402310400_btx557-B7","doi-asserted-by":"crossref","first-page":"503","DOI":"10.1038\/nbt.1633","article-title":"Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs","volume":"28","author":"Guttman","year":"2010","journal-title":"Nat. Biotechnol"},{"key":"2023020208402310400_btx557-B8","doi-asserted-by":"crossref","first-page":"e131.","DOI":"10.1093\/nar\/gkq224","article-title":"Biases in Illumina transcriptome sequencing caused by random hexamer priming","volume":"38","author":"Hansen","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023020208402310400_btx557-B9","doi-asserted-by":"crossref","first-page":"13115","DOI":"10.1073\/pnas.1420404112","article-title":"Genome-wide modeling of transcription kinetics reveals patterns of RNA production delays","volume":"112","author":"Honkela","year":"2015","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020208402310400_btx557-B10","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1038\/nprot.2008.211","article-title":"Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources","volume":"4","author":"Huang","year":"2008","journal-title":"Nat. Protoc"},{"key":"2023020208402310400_btx557-B11","doi-asserted-by":"crossref","first-page":"730","DOI":"10.1214\/009053604000001147","article-title":"Spike and slab variable selection: frequentist and Bayesian strategies","volume":"33","author":"Ishwaran","year":"2005","journal-title":"Ann. Stat"},{"key":"2023020208402310400_btx557-B12","doi-asserted-by":"crossref","first-page":"1026","DOI":"10.1093\/bioinformatics\/btp113","article-title":"Statistical inferences for isoform expression in RNA-Seq","volume":"25","author":"Jiang","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020208402310400_btx557-B13","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/1749-8104-5-31","article-title":"Two novel human NUMB isoforms provide a potential link between development and cancer","volume":"5","author":"Karaczyn","year":"2010","journal-title":"Neural Dev"},{"key":"2023020208402310400_btx557-B14","doi-asserted-by":"crossref","first-page":"D764","DOI":"10.1093\/nar\/gkt1168","article-title":"The UCSC genome browser database: 2014 update","volume":"42","author":"Karolchik","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023020208402310400_btx557-B15","doi-asserted-by":"crossref","first-page":"R36","DOI":"10.1186\/gb-2013-14-4-r36","article-title":"TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions","volume":"14","author":"Kim","year":"2013","journal-title":"Genome Biol"},{"key":"2023020208402310400_btx557-B16","doi-asserted-by":"crossref","first-page":"323.","DOI":"10.1186\/1471-2105-12-323","article-title":"RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome","volume":"12","author":"Li","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023020208402310400_btx557-B17","doi-asserted-by":"crossref","first-page":"1693","DOI":"10.1089\/cmb.2011.0171","article-title":"IsoLasso: a LASSO regression approach to RNA-Seq based transcriptome assembly","volume":"18","author":"Li","year":"2011","journal-title":"J. Comput. Biol. J. Comput. Mol. Cell Biol"},{"key":"2023020208402310400_btx557-B18","doi-asserted-by":"crossref","first-page":"2914","DOI":"10.1093\/bioinformatics\/bts559","article-title":"Transcriptome assembly and isoform expression level estimation from biased RNA-Seq reads","volume":"28","author":"Li","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020208402310400_btx557-B19","doi-asserted-by":"crossref","first-page":"e00264","DOI":"10.1042\/BSR20150074","article-title":"Oestrogen receptors interact with the \u03b1-catalytic subunit of AMP-activated protein kinase","volume":"35","author":"Lipovka","year":"2015","journal-title":"Biosci. Rep"},{"key":"2023020208402310400_btx557-B20","doi-asserted-by":"crossref","first-page":"501.","DOI":"10.1186\/s13059-014-0501-4","article-title":"Bayesian transcriptome assembly","volume":"15","author":"Maretty","year":"2014","journal-title":"Genome Biol"},{"key":"2023020208402310400_btx557-B21","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1101\/gr.142232.112","article-title":"iReckon: simultaneous isoform discovery and abundance estimation from RNA-seq data","volume":"23","author":"Mezlini","year":"2013","journal-title":"Genome Res"},{"key":"2023020208402310400_btx557-B22","doi-asserted-by":"crossref","first-page":"1023","DOI":"10.1080\/01621459.1988.10478694","article-title":"Bayesian variable selection in linear regression","volume":"83","author":"Mitchell","year":"1988","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020208402310400_btx557-B23","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1038\/nmeth.1226","article-title":"Mapping and quantifying mammalian transcriptomes by RNA-Seq","volume":"5","author":"Mortazavi","year":"2008","journal-title":"Nat. Methods"},{"key":"2023020208402310400_btx557-B25","doi-asserted-by":"crossref","first-page":"863","DOI":"10.2307\/2532104","article-title":"Maximum likelihood estimation for the negative binomial dispersion parameter","volume":"46","author":"Piegorsch","year":"1990","journal-title":"Biometrics"},{"key":"2023020208402310400_btx557-B26","doi-asserted-by":"crossref","first-page":"D756","DOI":"10.1093\/nar\/gkt1114","article-title":"RefSeq: an update on mammalian reference sequences","volume":"42","author":"Pruitt","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023020208402310400_btx557-B27","first-page":"297","article-title":"Detecting and tracking political abuse in social media","volume":"11","author":"Ratkiewicz","year":"2011","journal-title":"ICWSM"},{"key":"2023020208402310400_btx557-B28","doi-asserted-by":"crossref","first-page":"R22","DOI":"10.1186\/gb-2011-12-3-r22","article-title":"Improving RNA-Seq expression estimates by correcting for fragment bias","volume":"12","author":"Roberts","year":"2011","journal-title":"Genome Biol"},{"key":"2023020208402310400_btx557-B29","doi-asserted-by":"crossref","first-page":"D670","DOI":"10.1093\/nar\/gku1177","article-title":"The UCSC genome browser database: 2015 update","volume":"43","author":"Rosenbloom","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020208402310400_btx557-B30","doi-asserted-by":"crossref","first-page":"R63","DOI":"10.1186\/bcr1769","article-title":"Autocrine WNT signaling contributes to breast cancer cell proliferation via the canonical WNT pathway and EGFR transactivation","volume":"9","author":"Schlange","year":"2007","journal-title":"Breast Cancer Res"},{"key":"2023020208402310400_btx557-B31","doi-asserted-by":"crossref","first-page":"511","DOI":"10.1038\/nbt.1621","article-title":"Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation","volume":"28","author":"Trapnell","year":"2010","journal-title":"Nat. Biotechnol"},{"key":"2023020208402310400_btx557-B32","doi-asserted-by":"crossref","first-page":"470","DOI":"10.1038\/nature07509","article-title":"Alternative isoform regulation in human tissue transcriptomes","volume":"456","author":"Wang","year":"2008","journal-title":"Nature"},{"key":"2023020208402310400_btx557-B33","doi-asserted-by":"crossref","first-page":"502","DOI":"10.1093\/bioinformatics\/btq696","article-title":"Using non-uniform read distribution models to improve isoform expression inference in RNA-Seq","volume":"27","author":"Wu","year":"2011","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/1\/56\/49043411\/bioinformatics_34_1_56.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/1\/56\/49043411\/bioinformatics_34_1_56.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T08:48:03Z","timestamp":1675327683000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/1\/56\/4103400"}},"subtitle":[],"editor":[{"given":"Ziv","family":"Bar-Joseph","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,9,4]]},"references-count":32,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2018,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx557","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2018,1,1]]},"published":{"date-parts":[[2017,9,4]]}}}