{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T00:05:19Z","timestamp":1775261119794,"version":"3.50.1"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2018,2,12]],"date-time":"2018-02-12T00:00:00Z","timestamp":1518393600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"DOI":"10.13039\/100000057","name":"National Institute of General Medical Sciences","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000057","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000057","name":"NIGMS","doi-asserted-by":"publisher","award":["P50-GM076468"],"award-info":[{"award-number":["P50-GM076468"]}],"id":[{"id":"10.13039\/100000057","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Allele-specific expression (ASE) refers to the differential abundance of the allelic copies of a transcript. RNA sequencing (RNA-seq) can provide quantitative estimates of ASE for genes with transcribed polymorphisms. When short-read sequences are aligned to a diploid transcriptome, read-mapping ambiguities confound our ability to directly count reads. Multi-mapping reads aligning equally well to multiple genomic locations, isoforms or alleles can comprise the majority (&amp;gt;85%) of reads. Discarding them can result in biases and substantial loss of information. Methods have been developed that use weighted allocation of read counts but these methods treat the different types of multi-reads equivalently. We propose a hierarchical approach to allocation of read counts that first resolves ambiguities among genes, then among isoforms, and lastly between alleles. We have implemented our model in EMASE software (Expectation-Maximization for Allele Specific Expression) to estimate total gene expression, isoform usage and ASE based on this hierarchical allocation.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Methods that align RNA-seq reads to a diploid transcriptome incorporating known genetic variants improve estimates of ASE and total gene expression compared to methods that use reference genome alignments. Weighted allocation methods outperform methods that discard multi-reads. Hierarchical allocation of reads improves estimation of ASE even when data are simulated from a non-hierarchical model. Analysis of RNA-seq data from F1 hybrid mice using EMASE reveals widespread ASE associated with cis-acting polymorphisms and a small number of parent-of-origin effects.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>EMASE software is available at https:\/\/github.com\/churchill-lab\/emase.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty078","type":"journal-article","created":{"date-parts":[[2018,2,9]],"date-time":"2018-02-09T15:13:04Z","timestamp":1518189184000},"page":"2177-2184","source":"Crossref","is-referenced-by-count":96,"title":["Hierarchical analysis of RNA-seq reads improves the accuracy of allele-specific expression"],"prefix":"10.1093","volume":"34","author":[{"given":"Narayanan","family":"Raghupathy","sequence":"first","affiliation":[{"name":"The Jackson Laboratory, Bar Harbor, USA"}]},{"given":"Kwangbom","family":"Choi","sequence":"additional","affiliation":[{"name":"The Jackson Laboratory, Bar Harbor, USA"}]},{"given":"Matthew J","family":"Vincent","sequence":"additional","affiliation":[{"name":"The Jackson Laboratory, Bar Harbor, USA"}]},{"given":"Glen L","family":"Beane","sequence":"additional","affiliation":[{"name":"The Jackson Laboratory, Bar Harbor, USA"}]},{"given":"Keith S","family":"Sheppard","sequence":"additional","affiliation":[{"name":"The Jackson Laboratory, Bar Harbor, USA"}]},{"given":"Steven C","family":"Munger","sequence":"additional","affiliation":[{"name":"The Jackson Laboratory, Bar Harbor, USA"}]},{"given":"Ron","family":"Korstanje","sequence":"additional","affiliation":[{"name":"The Jackson Laboratory, Bar Harbor, USA"}]},{"given":"Fernando","family":"Pardo-Manual de Villena","sequence":"additional","affiliation":[{"name":"Department of Genetics, The University of North Carolina, Chapel Hill, USA"}]},{"given":"Gary A","family":"Churchill","sequence":"additional","affiliation":[{"name":"The Jackson Laboratory, Bar Harbor, USA"}]}],"member":"286","published-online":{"date-parts":[[2018,2,12]]},"reference":[{"key":"2023051604095090800_bty078-B1","author":"Agresti","year":"2002"},{"key":"2023051604095090800_bty078-B2","doi-asserted-by":"crossref","first-page":"e1004916.","DOI":"10.1371\/journal.pgen.1004916","article-title":"PRDM9 drives evolutionary erosion of hotspots in Mus musculus through haplotype-specific initiation of meiotic recombination","volume":"11","author":"Baker","year":"2015","journal-title":"PLoS Genet"},{"key":"2023051604095090800_bty078-B3","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1038\/nbt.3519","article-title":"Near-optimal probabilistic RNA-seq quantification","volume":"34","author":"Bray","year":"2016","journal-title":"Nat. Biotechnol"},{"key":"2023051604095090800_bty078-B4","doi-asserted-by":"crossref","first-page":"195.","DOI":"10.1186\/s13059-015-0762-6","article-title":"Tools and best practices for data processing in allelic expression analysis","volume":"16","author":"Castel","year":"2015","journal-title":"Genome Biol"},{"key":"2023051604095090800_bty078-B5","doi-asserted-by":"crossref","first-page":"500","DOI":"10.1038\/nature18270","article-title":"Defining the consequences of genetic variation on a proteome-wide scale","volume":"534","author":"Chick","year":"2016","journal-title":"Nature"},{"key":"2023051604095090800_bty078-B6","first-page":"1.","article-title":"A survey of best practices for RNA-seq data analysis","volume":"17","author":"Conesa","year":"2016","journal-title":"Genome Biol"},{"key":"2023051604095090800_bty078-B7","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1016\/j.celrep.2012.06.013","article-title":"Genomic imprinting absent in Drosophila melanogaster adult females","volume":"2","author":"Coolon","year":"2012","journal-title":"Cell Rep"},{"key":"2023051604095090800_bty078-B8","doi-asserted-by":"crossref","first-page":"3207","DOI":"10.1093\/bioinformatics\/btp579","article-title":"Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data","volume":"25","author":"Degner","year":"2009","journal-title":"Bioinformatics"},{"key":"2023051604095090800_bty078-B9","doi-asserted-by":"crossref","first-page":"2778","DOI":"10.1093\/bioinformatics\/btv272","article-title":"Polyester: simulating RNA-seq datasets with differential transcript expression","volume":"31","author":"Frazee","year":"2015","journal-title":"Bioinformatics"},{"key":"2023051604095090800_bty078-B10","doi-asserted-by":"crossref","first-page":"10073","DOI":"10.1093\/nar\/gks666","article-title":"Modelling and simulating generic RNA-seq experiments with the flux simulator","volume":"40","author":"Griebel","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023051604095090800_bty078-B11","doi-asserted-by":"crossref","first-page":"150.","DOI":"10.1186\/s13059-015-0702-5","article-title":"Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data","volume":"16","author":"Kanitz","year":"2015","journal-title":"Genome Biol"},{"key":"2023051604095090800_bty078-B12","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.3317","article-title":"HISAT: a fast spliced aligner with low memory requirements","volume":"12","author":"Kim","year":"2015","journal-title":"Nat. Methods"},{"key":"2023051604095090800_bty078-B13","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1101\/gr.111211.110","article-title":"RNA sequencing reveals the role of splicing polymorphisms in regulating human gene expression","volume":"21","author":"Lalonde","year":"2011","journal-title":"Genome Res"},{"key":"2023051604095090800_bty078-B14","doi-asserted-by":"crossref","first-page":"R25.","DOI":"10.1186\/gb-2009-10-3-r25","article-title":"Ultrafast and memory-efficient alignment of short DNA sequences to the human genome","volume":"10","author":"Langmead","year":"2009","journal-title":"Genome Biol"},{"key":"2023051604095090800_bty078-B15","doi-asserted-by":"crossref","first-page":"R29.","DOI":"10.1186\/gb-2014-15-2-r29","article-title":"Voom: precision weights unlock linear model analysis tools for RNA-seq read counts","volume":"15","author":"Law","year":"2014","journal-title":"Genome Biol"},{"key":"2023051604095090800_bty078-B16","doi-asserted-by":"crossref","first-page":"323.","DOI":"10.1186\/1471-2105-12-323","article-title":"RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome","volume":"12","author":"Li","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023051604095090800_bty078-B17","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1093\/bioinformatics\/btp692","article-title":"RNA-seq gene expression estimation with read mapping uncertainty","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051604095090800_bty078-B18","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1016\/j.cell.2008.03.029","article-title":"Highly integrated single-base resolution maps of the epigenome in Arabidopsis","volume":"133","author":"Lister","year":"2008","journal-title":"Cell"},{"key":"2023051604095090800_bty078-B19","doi-asserted-by":"crossref","first-page":"550.","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with deseq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2023051604095090800_bty078-B20","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1534\/genetics.114.165886","article-title":"RNA-seq alignment to individualized genomes improves transcript abundance estimates in multiparent populations","volume":"198","author":"Munger","year":"2014","journal-title":"Genetics"},{"key":"2023051604095090800_bty078-B21","doi-asserted-by":"crossref","first-page":"1344","DOI":"10.1126\/science.1158441","article-title":"The transcriptional landscape of the yeast genome defined by RNA sequencing","volume":"320","author":"Nagalakshmi","year":"2008","journal-title":"Science"},{"key":"2023051604095090800_bty078-B22","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1186\/1748-7188-6-9","article-title":"Estimation of alternative splicing isoform frequencies from RNA-seq data","volume":"6","author":"Nicolae","year":"2011","journal-title":"Algorithms Mol. Biol"},{"key":"2023051604095090800_bty078-B23","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1038\/nbt.2862","article-title":"Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms","volume":"32","author":"Patro","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023051604095090800_bty078-B24","doi-asserted-by":"crossref","first-page":"768","DOI":"10.1038\/nature08872","article-title":"Understanding mechanisms underlying human gene expression variation with RNA sequencing","volume":"464","author":"Pickrell","year":"2010","journal-title":"Nature"},{"key":"2023051604095090800_bty078-B25","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edger: a bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051604095090800_bty078-B26","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1038\/msb.2011.54","article-title":"AlleleSeq: analysis of allele-specific expression and binding in a network framework","volume":"7","author":"Rozowsky","year":"2011","journal-title":"Mol. Syst. Biol"},{"key":"2023051604095090800_bty078-B27","doi-asserted-by":"crossref","first-page":"536.","DOI":"10.1186\/1471-2164-14-536","article-title":"Sources of bias in measures of allele-specific expression derived from RNA-sequence data aligned to a single reference genome","volume":"14","author":"Stevenson","year":"2013","journal-title":"BMC Genomics"},{"key":"2023051604095090800_bty078-B28","doi-asserted-by":"crossref","first-page":"R13.","DOI":"10.1186\/gb-2011-12-2-r13","article-title":"Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads","volume":"12","author":"Turro","year":"2011","journal-title":"Genome Biol"},{"key":"2023051604095090800_bty078-B29","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nmeth.3582","article-title":"WASP: allele-specific software for robust molecular quantitative trait locus discovery","volume":"12","author":"van de Geijn","year":"2015","journal-title":"Nat. Methods"},{"key":"2023051604095090800_bty078-B30","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1038\/nature02698","article-title":"Evolutionary changes in cis and trans gene regulation","volume":"430","author":"Wittkopp","year":"2004","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/13\/2177\/50315692\/bioinformatics_34_13_2177.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/13\/2177\/50315692\/bioinformatics_34_13_2177.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T00:11:23Z","timestamp":1684195883000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/13\/2177\/4850941"}},"subtitle":[],"editor":[{"given":"Alfonso","family":"Valencia","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,2,12]]},"references-count":30,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2018,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty078","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/166900","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,7,1]]},"published":{"date-parts":[[2018,2,12]]}}}