{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,4]],"date-time":"2025-07-04T12:46:47Z","timestamp":1751633207463},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,9,13]],"date-time":"2016-09-13T00:00:00Z","timestamp":1473724800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,9,13]],"date-time":"2016-09-13T00:00:00Z","timestamp":1473724800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Allele specific expression (ASE) has become an important phenotype, being utilized for the detection of <jats:italic>cis<\/jats:italic>-regulatory variation, nonsense mediated decay and imprinting in the personal genome, and has been used to both identify disease loci and consider the penetrance of damaging alleles. The detection of ASE using high throughput technologies relies on aligning short-read sequencing data, a process that has inherent biases, and there is still a need to develop fast and accurate methods to detect ASE given the unprecedented growth of sequencing information in big data projects.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>Here, we present a new approach to normalize RNA sequencing data in order to call ASE events with high precision in a short time-frame. Using simulated datasets we find that our approach dramatically improves reference allele quantification at heterozygous sites versus default mapping methods and also performs well compared to existing techniques for ASE detection, such as filtering methods and mapping to parental genomes, without the need for complex and time consuming manipulation. Finally, by sequencing the exomes and transcriptomes of 96 well-phenotyped individuals of the CARTaGENE cohort, we characterise the levels of ASE across individuals and find a significant association between the proportion of sites undergoing ASE within the genome and smoking.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>The correct treatment and analysis of RNA sequencing data is vital to control for mapping biases and detect genuine ASE signals. By normalising RNA sequencing information after mapping, we show that this approach can be used to identify biologically relevant signals in personal genomes.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-016-1238-8","type":"journal-article","created":{"date-parts":[[2016,9,13]],"date-time":"2016-09-13T01:00:32Z","timestamp":1473728432000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["A haplotype-based normalization technique for the analysis and detection of allele specific expression"],"prefix":"10.1186","volume":"17","author":[{"given":"Alan","family":"Hodgkinson","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jean-Christophe","family":"Grenier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elias","family":"Gbeha","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Philip","family":"Awadalla","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2016,9,13]]},"reference":[{"issue":"1","key":"1238_CR1","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1093\/hmg\/ddp473","volume":"19","author":"GA Heap","year":"2010","unstructured":"Heap GA, Yang JH, Downes K, Healy BC, Hunt KA, Bockett N, et al. Genome-wide analysis of allelic expression imbalance in human primary cells by high-throughput transcriptome resequencing. Hum Mol Genet. 2010;19(1):122\u201334.","journal-title":"Hum Mol Genet"},{"issue":"8","key":"1238_CR2","doi-asserted-by":"publisher","first-page":"533","DOI":"10.1038\/nrg2815","volume":"11","author":"T Pastinen","year":"2010","unstructured":"Pastinen T. Genome-wide allele-specific analysis: insights into regulatory variation. Nat Rev Genet. 2010;11(8):533\u20138.","journal-title":"Nat Rev Genet"},{"issue":"7289","key":"1238_CR3","doi-asserted-by":"publisher","first-page":"773","DOI":"10.1038\/nature08903","volume":"464","author":"SB Montgomery","year":"2010","unstructured":"Montgomery SB, Sammeth M, Gutierrez-Arcelus M, Lach RP, Ingle C, Nisbett J, et al. Transcriptome genetics using second generation sequencing in a Caucasian population. Nature. 2010;464(7289):773\u20137.","journal-title":"Nature"},{"key":"1238_CR4","doi-asserted-by":"crossref","unstructured":"van de Geijn B, McVicker G, Gilad Y, Pritchard JK. WASP: allele-specific software for robust molecular quantitative trait locus discovery. Nature Methods. 2015:12;1061\u201363.","DOI":"10.1038\/nmeth.3582"},{"issue":"1","key":"1238_CR5","doi-asserted-by":"publisher","first-page":"88","DOI":"10.1038\/ng.3162","volume":"47","author":"A Buil","year":"2015","unstructured":"Buil A, Brown AA, Lappalainen T, Vinuela A, Davies MN, Zheng HF, et al. Gene-gene and gene-environment interactions detected by transcriptome sequence analysis in twins. Nat Genet. 2015;47(1):88\u201391.","journal-title":"Nat Genet"},{"issue":"11","key":"1238_CR6","doi-asserted-by":"publisher","first-page":"1216","DOI":"10.1038\/ng.473","volume":"41","author":"B Ge","year":"2009","unstructured":"Ge B, Pokholok DK, Kwan T, Grundberg E, Morcos L, Verlaan DJ, et al. Global patterns of cis variation in human cells revealed by high-density allelic expression analysis. Nat Genet. 2009;41(11):1216\u201322.","journal-title":"Nat Genet"},{"issue":"5","key":"1238_CR7","doi-asserted-by":"publisher","first-page":"e1004304","DOI":"10.1371\/journal.pgen.1004304","volume":"10","author":"KR Kukurba","year":"2014","unstructured":"Kukurba KR, Zhang R, Li X, Smith KS, Knowles DA, How Tan M, et al. Allelic expression of deleterious protein-coding variants across human tissues. PLoS Genet. 2014;10(5):e1004304.","journal-title":"PLoS Genet"},{"issue":"6070","key":"1238_CR8","doi-asserted-by":"publisher","first-page":"823","DOI":"10.1126\/science.1215040","volume":"335","author":"DG MacArthur","year":"2012","unstructured":"MacArthur DG, Balasubramanian S, Frankish A, Huang N, Morris J, Walter K, et al. A systematic survey of loss-of-function variants in human protein-coding genes. Science. 2012;335(6070):823\u20138.","journal-title":"Science"},{"issue":"7468","key":"1238_CR9","doi-asserted-by":"publisher","first-page":"506","DOI":"10.1038\/nature12531","volume":"501","author":"T Lappalainen","year":"2013","unstructured":"Lappalainen T, Sammeth M, Friedlander MR, t Hoen PA, Monlong J, Rivas MA, et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 2013;501(7468):506\u201311.","journal-title":"Nature"},{"key":"1238_CR10","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1186\/1471-2156-11-25","volume":"11","author":"C Daelemans","year":"2010","unstructured":"Daelemans C, Ritchie ME, Smits G, Abu-Amero S, Sudbery IM, Forrest MS, et al. High-throughput analysis of candidate imprinted genes and allele-specific gene expression in the human term placenta. BMC Genet. 2010;11:25.","journal-title":"BMC Genet"},{"issue":"7512","key":"1238_CR11","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1038\/nature13602","volume":"512","author":"H Ongen","year":"2014","unstructured":"Ongen H, Andersen CL, Bramsen JB, Oster B, Rasmussen MH, Ferreira PG, et al. Putative cis-regulatory drivers in colorectal cancer. Nature. 2014;512(7512):87\u201390.","journal-title":"Nature"},{"issue":"24","key":"1238_CR12","doi-asserted-by":"publisher","first-page":"3207","DOI":"10.1093\/bioinformatics\/btp579","volume":"25","author":"JF Degner","year":"2009","unstructured":"Degner JF, Marioni JC, Pai AA, Pickrell JK, Nkadori E, Gilad Y, et al. Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data. Bioinformatics. 2009;25(24):3207\u201312.","journal-title":"Bioinformatics"},{"key":"1238_CR13","doi-asserted-by":"publisher","first-page":"536","DOI":"10.1186\/1471-2164-14-536","volume":"14","author":"KR Stevenson","year":"2013","unstructured":"Stevenson KR, Coolon JD, Wittkopp PJ. Sources of bias in measures of allele-specific expression derived from RNA-sequence data aligned to a single reference genome. BMC Genomics. 2013;14:536.","journal-title":"BMC Genomics"},{"issue":"Suppl 1","key":"1238_CR14","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1186\/s12864-015-2295-5","volume":"17","author":"N Nariai","year":"2016","unstructured":"Nariai N, Kojima K, Mimori T, Kawai Y, Nagasaki M. A Bayesian approach for estimating allele-specific expression from RNA-Seq data with diploid genomes. BMC Genomics. 2016;17 Suppl 1:2.","journal-title":"BMC Genomics"},{"issue":"15","key":"1238_CR15","doi-asserted-by":"publisher","first-page":"2497","DOI":"10.1093\/bioinformatics\/btv074","volume":"31","author":"M Pirinen","year":"2015","unstructured":"Pirinen M, Lappalainen T, Zaitlen NA, Consortium GT, Dermitzakis ET, Donnelly P, et al. Assessing allele-specific expression across multiple tissues from RNA-seq read data. Bioinformatics. 2015;31(15):2497\u2013504.","journal-title":"Bioinformatics"},{"issue":"1","key":"1238_CR16","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1534\/genetics.114.165886","volume":"198","author":"SC Munger","year":"2014","unstructured":"Munger SC, Raghupathy N, Choi K, Simons AK, Gatti DM, Hinerfeld DA, et al. RNA-Seq alignment to individualized genomes improves transcript abundance estimates in multiparent populations. Genetics. 2014;198(1):59\u201373.","journal-title":"Genetics"},{"issue":"24","key":"1238_CR17","doi-asserted-by":"publisher","first-page":"3427","DOI":"10.1093\/bioinformatics\/btr578","volume":"27","author":"M Rivas-Astroza","year":"2011","unstructured":"Rivas-Astroza M, Xie D, Cao X, Zhong S. Mapping personal functional data to personal genomes. Bioinformatics. 2011;27(24):3427\u20139.","journal-title":"Bioinformatics"},{"key":"1238_CR18","doi-asserted-by":"publisher","first-page":"522","DOI":"10.1038\/msb.2011.54","volume":"7","author":"J Rozowsky","year":"2011","unstructured":"Rozowsky J, Abyzov A, Wang J, Alves P, Raha D, Harmanci A, et al. AlleleSeq: analysis of allele-specific expression and binding in a network framework. Mol Syst Biol. 2011;7:522.","journal-title":"Mol Syst Biol"},{"issue":"4","key":"1238_CR19","doi-asserted-by":"publisher","first-page":"R36","DOI":"10.1186\/gb-2013-14-4-r36","volume":"14","author":"D Kim","year":"2013","unstructured":"Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14(4):R36.","journal-title":"Genome Biol"},{"issue":"1","key":"1238_CR20","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1093\/bioinformatics\/bts635","volume":"29","author":"A Dobin","year":"2013","unstructured":"Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15\u201321.","journal-title":"Bioinformatics"},{"issue":"9","key":"1238_CR21","doi-asserted-by":"publisher","first-page":"1297","DOI":"10.1101\/gr.107524.110","volume":"20","author":"A McKenna","year":"2010","unstructured":"McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297\u2013303.","journal-title":"Genome Res"},{"issue":"10","key":"1238_CR22","doi-asserted-by":"publisher","first-page":"1728","DOI":"10.1101\/gr.119784.110","volume":"21","author":"DA Skelly","year":"2011","unstructured":"Skelly DA, Johansson M, Madeoy J, Wakefield J, Akey JM. A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data. Genome Res. 2011;21(10):1728\u201337.","journal-title":"Genome Res"},{"issue":"15","key":"1238_CR23","doi-asserted-by":"publisher","first-page":"3753","DOI":"10.1158\/0008-5472.CAN-12-1160","volume":"72","author":"Y Bosse","year":"2012","unstructured":"Bosse Y, Postma DS, Sin DD, Lamontagne M, Couture C, Gaudreault N, et al. Molecular signature of smoking in human lung tissues. Cancer Res. 2012;72(15):3753\u201363.","journal-title":"Cancer Res"},{"key":"1238_CR24","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1186\/1755-8794-3-29","volume":"3","author":"JC Charlesworth","year":"2010","unstructured":"Charlesworth JC, Curran JE, Johnson MP, Goring HH, Dyer TD, Diego VP, et al. Transcriptomic epidemiology of smoking: the effect of smoking on gene expression in lymphocytes. BMC Med Genomics. 2010;3:29.","journal-title":"BMC Med Genomics"},{"issue":"3","key":"1238_CR25","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1158\/1055-9965.445.13.3","volume":"13","author":"JW Lampe","year":"2004","unstructured":"Lampe JW, Stepaniants SB, Mao M, Radich JP, Dai H, Linsley PS, et al. Signatures of environmental exposures using peripheral leukocyte gene expression: tobacco smoke. Cancer Epidemiol Biomarkers Prev. 2004;13(3):445\u201353.","journal-title":"Cancer Epidemiol Biomarkers Prev"},{"issue":"3","key":"1238_CR26","doi-asserted-by":"publisher","first-page":"691","DOI":"10.1093\/carcin\/bgl199","volume":"28","author":"DM van Leeuwen","year":"2007","unstructured":"van Leeuwen DM, van Agen E, Gottschalk RW, Vlietinck R, Gielen M, van Herwijnen MH, et al. Cigarette smoke-induced differential gene expression in blood cells from monozygotic twin pairs. Carcinogenesis. 2007;28(3):691\u20137.","journal-title":"Carcinogenesis"},{"issue":"5","key":"1238_CR27","doi-asserted-by":"publisher","first-page":"1285","DOI":"10.1093\/ije\/dys160","volume":"42","author":"P Awadalla","year":"2013","unstructured":"Awadalla P, Boileau C, Payette Y, Idaghdour Y, Goulet JP, Knoppers B, et al. Cohort profile of the CARTaGENE study: Quebec\u2019s population-based biobank for public health and personalized genomics. Int J Epidemiol. 2013;42(5):1285\u201399.","journal-title":"Int J Epidemiol"},{"issue":"6182","key":"1238_CR28","doi-asserted-by":"publisher","first-page":"413","DOI":"10.1126\/science.1251110","volume":"344","author":"A Hodgkinson","year":"2014","unstructured":"Hodgkinson A, Idaghdour Y, Gbeha E, Grenier JC, Hip-Ki E, Bruat V, et al. High-resolution genomic analysis of human mitochondrial RNA sequence variation. Science. 2014;344(6182):413\u20135.","journal-title":"Science"},{"issue":"4","key":"1238_CR29","doi-asserted-by":"publisher","first-page":"400","DOI":"10.1038\/ng.3216","volume":"47","author":"JG Hussin","year":"2015","unstructured":"Hussin JG, Hodgkinson A, Idaghdour Y, Grenier JC, Goulet JP, Gbeha E, et al. Recombination affects accumulation of damaging and disease-associated mutations in human populations. Nat Genet. 2015;47(4):400\u20134.","journal-title":"Nat Genet"},{"issue":"14","key":"1238_CR30","doi-asserted-by":"publisher","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","volume":"25","author":"H Li","year":"2009","unstructured":"Li H, Durbin R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. 2009;25(14):1754\u201360.","journal-title":"Bioinformatics"},{"issue":"16","key":"1238_CR31","doi-asserted-by":"publisher","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","volume":"25","author":"H Li","year":"2009","unstructured":"Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment\/Map format and SAMtools. Bioinformatics. 2009;25(16):2078\u20139.","journal-title":"Bioinformatics"},{"issue":"2","key":"1238_CR32","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1038\/nmeth.1785","volume":"9","author":"O Delaneau","year":"2012","unstructured":"Delaneau O, Marchini J, Zagury JF. A linear complexity phasing method for thousands of genomes. Nat Methods. 2012;9(2):179\u201381.","journal-title":"Nat Methods"},{"key":"1238_CR33","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1186\/1471-2105-12-323","volume":"12","author":"B Li","year":"2011","unstructured":"Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC bioinform. 2011;12:323.","journal-title":"BMC bioinform"},{"issue":"9","key":"1238_CR34","doi-asserted-by":"publisher","first-page":"467","DOI":"10.1186\/s13059-014-0467-2","volume":"15","author":"NI Panousis","year":"2014","unstructured":"Panousis NI, Gutierrez-Arcelus M, Dermitzakis ET, Lappalainen T. Allelic mapping bias in RNA-sequencing is not a major confounder in eQTL studies. Genome Biol. 2014;15(9):467.","journal-title":"Genome Biol"},{"issue":"15","key":"1238_CR35","doi-asserted-by":"publisher","first-page":"2144","DOI":"10.1093\/bioinformatics\/btr354","volume":"27","author":"JK Pickrell","year":"2011","unstructured":"Pickrell JK, Gaffney DJ, Gilad Y, Pritchard JK. False positive peaks in ChIP-seq and other sequencing-based functional assays caused by unannotated high copy number regions. Bioinformatics. 2011;27(15):2144\u20136.","journal-title":"Bioinformatics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1238-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-1238-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1238-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T18:05:00Z","timestamp":1706810700000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-1238-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,9,13]]},"references-count":35,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["1238"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-1238-8","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,9,13]]},"assertion":[{"value":"14 April 2016","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 September 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 September 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"364"}}