{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:44Z","timestamp":1772138084877,"version":"3.50.1"},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"9","license":[{"start":{"date-parts":[[2017,12,11]],"date-time":"2017-12-11T00:00:00Z","timestamp":1512950400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"name":"Penn Institute for Biomedical Informatics Pilot","award":["R01 AG046544"],"award-info":[{"award-number":["R01 AG046544"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>A key component in many RNA-Seq-based studies is contrasting multiple replicates from different experimental conditions. In this setup, replicates play a key role as they allow to capture underlying biological variability inherent to the compared conditions, as well as experimental variability. However, what constitutes a \u2018bad\u2019 replicate is not necessarily well defined. Consequently, researchers might discard valuable data or downstream analysis may be hampered by failed experiments.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here we develop a probability model to weigh a given RNA-Seq sample as a representative of an experimental condition when performing alternative splicing analysis. We demonstrate that this model detects outlier samples which are consistently and significantly different compared with other samples from the same condition. Moreover, we show that instead of discarding such samples the proposed weighting scheme can be used to downweight samples and specific splicing variations suspected as outliers, gaining statistical power. These weights can then be used for differential splicing (DS) analysis, where the resulting algorithm offers a generalization of the MAJIQ algorithm. Using both synthetic and real-life data, we perform an extensive evaluation of the improved MAJIQ algorithm in different scenarios involving perturbed samples, mislabeled samples, same condition groups, and different levels of coverage, showing it compares favorably to other tools. Overall, this work offers an outlier detection algorithm that can be combined with any splicing pipeline, a generalized and improved version of MAJIQ for DS detection, and evaluation metrics with matching code and data for DS algorithms.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>Software and data are accessible via majiq.biociphers.org\/norton_et_al_2017\/.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx790","type":"journal-article","created":{"date-parts":[[2017,12,7]],"date-time":"2017-12-07T23:15:12Z","timestamp":1512688512000},"page":"1488-1497","source":"Crossref","is-referenced-by-count":34,"title":["Outlier detection for improved differential splicing quantification from RNA-Seq experiments with replicates"],"prefix":"10.1093","volume":"34","author":[{"given":"Scott S","family":"Norton","sequence":"first","affiliation":[{"name":"Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA"}]},{"given":"Jorge","family":"Vaquero-Garcia","sequence":"additional","affiliation":[{"name":"Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA"},{"name":"Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, PA, USA"}]},{"given":"Nicholas F","family":"Lahens","sequence":"additional","affiliation":[{"name":"Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA"}]},{"given":"Gregory R","family":"Grant","sequence":"additional","affiliation":[{"name":"Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA"},{"name":"Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3005-5048","authenticated-orcid":false,"given":"Yoseph","family":"Barash","sequence":"additional","affiliation":[{"name":"Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA"},{"name":"Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, PA, USA"}]}],"member":"286","published-online":{"date-parts":[[2017,12,11]]},"reference":[{"key":"2023012713022873600_btx790-B1","first-page":"357","volume-title":"Methods to Study Splicing from High-Throughput RNA Sequencing Data","author":"Alamancos","year":"2014"},{"key":"2023012713022873600_btx790-B2","doi-asserted-by":"crossref","first-page":"2008","DOI":"10.1101\/gr.133744.111","article-title":"Detecting differential usage of exons from rna-seq data","volume":"22","author":"Anders","year":"2012","journal-title":"Genome Res"},{"key":"2023012713022873600_btx790-B3","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1038\/nature09000","article-title":"Deciphering the splicing code","volume":"465","author":"Barash","year":"2010","journal-title":"Nature"},{"key":"2023012713022873600_btx790-B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/gb-2013-14-10-r114","article-title":"AVISPA: a web tool for the prediction and analysis of alternative splicing","volume":"14","author":"Barash","year":"2013","journal-title":"Genome Biol"},{"key":"2023012713022873600_btx790-B5","doi-asserted-by":"crossref","first-page":"13.","DOI":"10.1186\/s13059-016-0881-8","article-title":"A survey of best practices for RNA-seq data analysis","volume":"17","author":"Conesa","year":"2016","journal-title":"Genome Biol"},{"key":"2023012713022873600_btx790-B6","doi-asserted-by":"crossref","first-page":"5189","DOI":"10.1093\/nar\/gkt211","article-title":"Accurate detection of differential RNA processing","volume":"41","author":"Drewe","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023012713022873600_btx790-B7","article-title":"Fast and accurate differential splicing analysis across multiple conditions with replicates","author":"Entizne","year":"2017","journal-title":"bioRxiv"},{"key":"2023012713022873600_btx790-B8","doi-asserted-by":"crossref","first-page":"2518","DOI":"10.1093\/bioinformatics\/btr427","article-title":"Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM)","volume":"27","author":"Grant","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012713022873600_btx790-B25","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1038\/ng.2653","article-title":"The Genotype-Tissue Expression (GTEx) Project","volume":"45","author":"GTEx Consortium","year":"2013","journal-title":"Nat. Genet"},{"key":"2023012713022873600_btx790-B9","doi-asserted-by":"crossref","first-page":"e39","DOI":"10.1093\/nar\/gks1026","article-title":"DiffSplice: the genome-wide detection of differential splicing events with RNA-seq","volume":"41","author":"Hu","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023012713022873600_btx790-B10","doi-asserted-by":"crossref","first-page":"1511","DOI":"10.1016\/j.cell.2014.11.035","article-title":"A highly conserved program of neuronal microexons is misregulated in autistic brains","volume":"159","author":"Irimia","year":"2014","journal-title":"Cell"},{"key":"2023012713022873600_btx790-B11","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1038\/nature10413","article-title":"Mouse genomic variation and its effect on phenotypes and gene regulation","volume":"477","author":"Keane","year":"2011","journal-title":"Nature"},{"key":"2023012713022873600_btx790-B12","doi-asserted-by":"crossref","first-page":"323.","DOI":"10.1186\/1471-2105-12-323","article-title":"Rsem: accurate transcript quantification from rna-seq data with or without a reference genome","volume":"12","author":"Li","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023012713022873600_btx790-B13","doi-asserted-by":"crossref","first-page":"1752","DOI":"10.1214\/11-AOAS466","article-title":"Measuring reproducibility of high-throughput experiments","volume":"5","author":"Li","year":"2011","journal-title":"Ann. Appl. Stat"},{"key":"2023012713022873600_btx790-B14","doi-asserted-by":"crossref","first-page":"364.","DOI":"10.1186\/s12859-014-0364-4","article-title":"Comparisons of computational methods for differential alternative splicing detection using rna-seq in plant systems","volume":"15","author":"Liu","year":"2014","journal-title":"BMC Bioinform"},{"key":"2023012713022873600_btx790-B15","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1038\/ng.2653","article-title":"The genotype-tissue expression (GTEx) project","volume":"45","author":"Lonsdale","year":"2013","journal-title":"Nat. Genet"},{"key":"2023012713022873600_btx790-B16","doi-asserted-by":"crossref","first-page":"1413","DOI":"10.1038\/ng.259","article-title":"Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing","volume":"40","author":"Pan","year":"2008","journal-title":"Nat. Genet"},{"key":"2023012713022873600_btx790-B17","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1038\/nmeth.4197","article-title":"Salmon provides fast and bias-aware quantification of transcript expression","volume":"14","author":"Patro","year":"2017","journal-title":"Nat. Methods"},{"key":"2023012713022873600_btx790-B18","doi-asserted-by":"crossref","first-page":"E5593","DOI":"10.1073\/pnas.1419161111","article-title":"rmats: Robust and flexible detection of differential alternative splicing from replicate rna-seq data","volume":"111","author":"Shen","year":"2014","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"2023012713022873600_btx790-B19","doi-asserted-by":"crossref","first-page":"14330","DOI":"10.1073\/pnas.1616440113","article-title":"Evaluating the evaluation of cancer driver genes","volume":"113","author":"Tokheim","year":"2016","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023012713022873600_btx790-B20","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1038\/nbt.2450","article-title":"Differential analysis of gene regulation at transcript resolution with RNA-seq","volume":"31","author":"Trapnell","year":"2012","journal-title":"Nat. Biotech"},{"key":"2023012713022873600_btx790-B21","doi-asserted-by":"crossref","first-page":"e11752.","DOI":"10.7554\/eLife.11752","article-title":"A new view of transcriptome complexity and regulation through the lens of local splicing variations","volume":"5","author":"Vaquero-Garcia","year":"2016","journal-title":"eLife"},{"key":"2023012713022873600_btx790-B22","first-page":"749","article-title":"Splicing in disease: disruption of the splicing code and the decoding machinery","volume":"8","author":"Wang","year":"2007","journal-title":"Nature"},{"key":"2023012713022873600_btx790-B23","doi-asserted-by":"crossref","first-page":"1254806.","DOI":"10.1126\/science.1254806","article-title":"The human splicing code reveals new insights into the genetic determinants of disease","volume":"347","author":"Xiong","year":"2015","journal-title":"Science"},{"key":"2023012713022873600_btx790-B24","doi-asserted-by":"crossref","first-page":"16219","DOI":"10.1073\/pnas.1408886111","article-title":"A circadian gene expression atlas in mammals: Implications for biology and medicine","volume":"111","author":"Zhang","year":"2014","journal-title":"Proc. Natl. Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/9\/1488\/48915608\/bioinformatics_34_9_1488.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/9\/1488\/48915608\/bioinformatics_34_9_1488.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T08:55:17Z","timestamp":1674809717000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/9\/1488\/4721782"}},"subtitle":[],"editor":[{"given":"Bonnie","family":"Berger","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,12,11]]},"references-count":25,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2018,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx790","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/104059","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,5,1]]},"published":{"date-parts":[[2017,12,11]]}}}