{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:15Z","timestamp":1740185115445,"version":"3.37.3"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2019,12,13]],"date-time":"2019-12-13T00:00:00Z","timestamp":1576195200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["IIS 1755761"],"award-info":[{"award-number":["IIS 1755761"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["1R01GM113952-01A1"],"award-info":[{"award-number":["1R01GM113952-01A1"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Accurate estimation of transcript isoform abundance is critical for downstream transcriptome analyses and can lead to precise molecular mechanisms for understanding complex human diseases, like cancer. Simplex mRNA Sequencing (RNA-Seq) based isoform quantification approaches are facing the challenges of inherent sampling bias and unidentifiable read origins. A large-scale experiment shows that the consistency between RNA-Seq and other mRNA quantification platforms is relatively low at the isoform level compared to the gene level. In this project, we developed a platform-integrated model for transcript quantification (IntMTQ) to improve the performance of RNA-Seq on isoform expression estimation. IntMTQ, which benefits from the mRNA expressions reported by the other platforms, provides more precise RNA-Seq-based isoform quantification and leads to more accurate molecular signatures for disease phenotype prediction.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In the experiments to assess the quality of isoform expression estimated by IntMTQ, we designed three tasks for clustering and classification of 46 cancer cell lines with four different mRNA quantification platforms, including newly developed NanoString\u2019s nCounter technology. The results demonstrate that the isoform expressions learned by IntMTQ consistently provide more and better molecular features for downstream analyses compared with five baseline algorithms which consider RNA-Seq data only. An independent RT-qPCR experiment on seven genes in twelve cancer cell lines showed that the IntMTQ improved overall transcript quantification. The platform-integrated algorithms could be applied to large-scale cancer studies, such as The Cancer Genome Atlas (TCGA), with both RNA-Seq and array-based platforms available.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Source code is available at: https:\/\/github.com\/CompbioLabUcf\/IntMTQ.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz932","type":"journal-article","created":{"date-parts":[[2019,12,10]],"date-time":"2019-12-10T12:12:09Z","timestamp":1575979929000},"page":"2466-2473","source":"Crossref","is-referenced-by-count":5,"title":["Platform-integrated mRNA isoform quantification"],"prefix":"10.1093","volume":"36","author":[{"given":"Jiao","family":"Sun","sequence":"first","affiliation":[{"name":"Department of Computer Science"},{"name":"Genomics and Bioinformatics Cluster, University of Central Florida , Orlando, FL 32816, USA"}]},{"given":"Jae-Woong","family":"Chang","sequence":"additional","affiliation":[{"name":"Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota Twin Cities , Minneapolis, MN 55455, USA"}]},{"given":"Teng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Mathematics, University of Central Florida , Orlando, FL 32816, USA"}]},{"given":"Jeongsik","family":"Yong","sequence":"additional","affiliation":[{"name":"Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota Twin Cities , Minneapolis, MN 55455, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8378-1073","authenticated-orcid":false,"given":"Rui","family":"Kuang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, University of Minnesota Twin Cities , Minneapolis, MN 55455, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3605-9373","authenticated-orcid":false,"given":"Wei","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Computer Science"},{"name":"Genomics and Bioinformatics Cluster, University of Central Florida , Orlando, FL 32816, USA"}]}],"member":"286","published-online":{"date-parts":[[2019,12,13]]},"reference":[{"key":"2023020108352104300_btz932-B1","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1038\/nature11003","article-title":"The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity","volume":"483","author":"Barretina","year":"2012","journal-title":"Nature"},{"key":"2023020108352104300_btz932-B2","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1038\/nbt.3519","article-title":"Near-optimal probabilistic RNA-seq quantification","volume":"34","author":"Bray","year":"2016","journal-title":"Nat. Biotechnol"},{"key":"2023020108352104300_btz932-B3","doi-asserted-by":"crossref","first-page":"506","DOI":"10.1186\/s12859-017-1925-0","article-title":"Integration of RNA-seq data with heterogeneous microarray data for breast cancer profiling","volume":"18","author":"Castillo","year":"2017","journal-title":"BMC Bioinf"},{"key":"2023020108352104300_btz932-B4","doi-asserted-by":"crossref","first-page":"5996","DOI":"10.1093\/nar\/gky340","article-title":"An integrative model for alternative polyadenylation, IntMAP, delineates mTOR-modulated endoplasmic reticulum stress response","volume":"46","author":"Chang","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023020108352104300_btz932-B5","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1186\/s13059-016-0881-8","article-title":"A survey of best practices for RNA-seq data analysis","volume":"17","author":"Conesa","year":"2016","journal-title":"Genome Biol"},{"key":"2023020108352104300_btz932-B6","first-page":"260","article-title":"Comparative evaluation of isoform-level gene expression estimation algorithms for RNA-seq and exon-array platforms","volume":"18","author":"Dapas","year":"2016","journal-title":"Brief. Bioinf"},{"key":"2023020108352104300_btz932-B7","doi-asserted-by":"crossref","first-page":"2343","DOI":"10.1101\/gad.1973010","article-title":"Alternative pre-mRNA splicing regulation in cancer: pathways and programs unhinged","volume":"24","author":"David","year":"2010","journal-title":"Genes Dev"},{"key":"2023020108352104300_btz932-B8","first-page":"1","article-title":"CVXPY: a Python-embedded modeling language for convex optimization","volume":"17","author":"Diamond","year":"2016","journal-title":"J.\u00a0Mach. Learn. Res"},{"key":"2023020108352104300_btz932-B9","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1038\/nrc1299","article-title":"A census of human cancer genes","volume":"4","author":"Futreal","year":"2004","journal-title":"Nat. Rev. Cancer"},{"key":"2023020108352104300_btz932-B10","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1038\/nbt1385","article-title":"Direct multiplexed measurement of gene expression with color-coded probe pairs","volume":"26","author":"Geiss","year":"2008","journal-title":"Nat. Biotechnol"},{"key":"2023020108352104300_btz932-B11","doi-asserted-by":"crossref","first-page":"1109","DOI":"10.1056\/NEJMp1607591","article-title":"Toward a shared vision for cancer genomic data","volume":"375","author":"Grossman","year":"2016","journal-title":"N. Engl. J. Med"},{"key":"2023020108352104300_btz932-B12","doi-asserted-by":"crossref","first-page":"e39","DOI":"10.1093\/nar\/gks1026","article-title":"DiffSplice: the genome-wide detection of differential splicing events with RNA-seq","volume":"41","author":"Hu","year":"2013","journal-title":"Nucleic Acids Res"},{"first-page":"127","year":"2012","author":"Huang","key":"2023020108352104300_btz932-B13"},{"key":"2023020108352104300_btz932-B14","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1007\/BF01908075","article-title":"Comparing partitions","volume":"2","author":"Hubert","year":"1985","journal-title":"J.\u00a0Class"},{"key":"2023020108352104300_btz932-B15","doi-asserted-by":"crossref","first-page":"R36","DOI":"10.1186\/gb-2013-14-4-r36","article-title":"Tophat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions","volume":"14","author":"Kim","year":"2013","journal-title":"Genome Biol"},{"key":"2023020108352104300_btz932-B16","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1186\/1471-2105-12-323","article-title":"RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome","volume":"12","author":"Li","year":"2011","journal-title":"BMC Bioinf"},{"key":"2023020108352104300_btz932-B17","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020108352104300_btz932-B18","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1093\/bioinformatics\/btp692","article-title":"RNA-seq gene expression estimation with read mapping uncertainty","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020108352104300_btz932-B19","doi-asserted-by":"crossref","first-page":"647","DOI":"10.1089\/cmb.2017.0056","article-title":"A joint Bayesian model for integrating microarray and RNA sequencing transcriptomic data","volume":"24","author":"Ma","year":"2017","journal-title":"J.\u00a0Comput. Biol"},{"year":"2011","author":"Pachter","key":"2023020108352104300_btz932-B20"},{"key":"2023020108352104300_btz932-B21","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1038\/nmeth.4197","article-title":"Salmon provides fast and bias-aware quantification of transcript expression","volume":"14","author":"Patro","year":"2017","journal-title":"Nat. Methods"},{"key":"2023020108352104300_btz932-B22","first-page":"2825","article-title":"Scikit-Learn: machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J.\u00a0Mach. Learn. Res"},{"key":"2023020108352104300_btz932-B23","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1038\/nmeth.2251","article-title":"Streaming fragment assignment for real-time analysis of sequencing experiments","volume":"10","author":"Roberts","year":"2013","journal-title":"Nat. Methods"},{"key":"2023020108352104300_btz932-B24","doi-asserted-by":"crossref","first-page":"1126","DOI":"10.1038\/s41467-017-01153-8","article-title":"Gene isoforms as expression-based biomarkers predictive of drug response in vitro","volume":"8","author":"Safikhani","year":"2017","journal-title":"Nat. Commun"},{"key":"2023020108352104300_btz932-B25","doi-asserted-by":"crossref","first-page":"E5593","DOI":"10.1073\/pnas.1419161111","article-title":"rMATS: robust and flexible detection of differential alternative splicing from replicate RNA-seq data","volume":"111","author":"Shen","year":"2014","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020108352104300_btz932-B26","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1016\/S0031-3203(98)00091-0","article-title":"An overlap invariant entropy measure of 3D medical image alignment","volume":"32","author":"Studholme","year":"1999","journal-title":"Pattern Recogn"},{"key":"2023020108352104300_btz932-B27","doi-asserted-by":"crossref","first-page":"e4","DOI":"10.1093\/nar\/gkp853","article-title":"MMBGX: a method for estimating expression at the isoform level and detecting differential splicing using whole-transcript Affymetrix arrays","volume":"38","author":"Turro","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023020108352104300_btz932-B28","doi-asserted-by":"crossref","first-page":"1206","DOI":"10.1158\/1541-7786.MCR-16-0459","article-title":"The landscape of isoform switches in human cancers","volume":"15","author":"Vitting-Seerup","year":"2017","journal-title":"Mol. Cancer Res"},{"key":"2023020108352104300_btz932-B29","doi-asserted-by":"crossref","first-page":"152","DOI":"10.3892\/br.2014.407","article-title":"Mechanism of alternative splicing and its regulation","volume":"3","author":"Wang","year":"2015","journal-title":"Biomed. Rep"},{"key":"2023020108352104300_btz932-B30","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nrg2484","article-title":"RNA-seq: a revolutionary tool for transcriptomics","volume":"10","author":"Wang","year":"2009","journal-title":"Nat. Rev. Genet"},{"key":"2023020108352104300_btz932-B31","doi-asserted-by":"crossref","first-page":"3150","DOI":"10.1093\/nar\/gkl396","article-title":"An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs","volume":"34","author":"Xing","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023020108352104300_btz932-B32","doi-asserted-by":"crossref","first-page":"e1002975","DOI":"10.1371\/journal.pcbi.1002975","article-title":"Network-based survival analysis reveals subnetwork signatures for predicting outcomes of ovarian cancer treatment","volume":"9","author":"Zhang","year":"2013","journal-title":"PLoS Comput. Biol"},{"key":"2023020108352104300_btz932-B33","doi-asserted-by":"crossref","first-page":"e1004465","DOI":"10.1371\/journal.pcbi.1004465","article-title":"Network-based isoform quantification with RNA-seq data for cancer transcriptome analysis","volume":"11","author":"Zhang","year":"2015","journal-title":"PLoS Comput. Biol"},{"key":"2023020108352104300_btz932-B34","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/s41698-017-0029-7","article-title":"Network-based machine learning and graph theory algorithms for precision oncology","volume":"1","author":"Zhang","year":"2017","journal-title":"NPJ Precision Oncol"},{"key":"2023020108352104300_btz932-B35","doi-asserted-by":"crossref","first-page":"e78644","DOI":"10.1371\/journal.pone.0078644","article-title":"Comparison of RNA-seq and microarray in transcriptome profiling of activated T cells","volume":"9","author":"Zhao","year":"2014","journal-title":"PLoS One"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz932\/32290100\/btz932.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/8\/2466\/48983848\/bioinformatics_36_8_2466.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/8\/2466\/48983848\/bioinformatics_36_8_2466.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T19:47:41Z","timestamp":1675280861000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/8\/2466\/5675495"}},"subtitle":[],"editor":[{"given":"Jan","family":"Gorodkin","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,12,13]]},"references-count":35,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2020,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz932","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2020,4,15]]},"published":{"date-parts":[[2019,12,13]]}}}