{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:48Z","timestamp":1772138088544,"version":"3.50.1"},"reference-count":14,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2018,4,17]],"date-time":"2018-04-17T00:00:00Z","timestamp":1523923200000},"content-version":"vor","delay-in-days":1,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100002428","name":"Austrian Science Fund","doi-asserted-by":"publisher","award":["W1207-B09"],"award-info":[{"award-number":["W1207-B09"]}],"id":[{"id":"10.13039\/501100002428","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Counting molecules using next-generation sequencing (NGS) suffers from PCR amplification bias, which reduces the accuracy of many quantitative NGS-based experimental methods such as RNA-Seq. This is true even if molecules are made distinguishable using unique molecular identifiers (UMIs) before PCR amplification, and distinct UMIs are counted instead of reads: Molecules that are lost entirely during the sequencing process will still cause underestimation of the molecule count, and amplification artifacts like PCR chimeras create phantom UMIs and thus cause over-estimation.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We introduce the TRUmiCount algorithm to correct for both types of errors. The TRUmiCount algorithm is based on a mechanistic model of PCR amplification and sequencing, whose two parameters have an immediate physical interpretation as PCR efficiency and sequencing depth and can be estimated from experimental data without requiring calibration experiments or spike-ins. We show that our model captures the main stochastic properties of amplification and sequencing, and that it allows us to filter out phantom UMIs and to estimate the number of molecules lost during the sequencing process. Finally, we demonstrate that the phantom-filtered and loss-corrected molecule counts computed by TRUmiCount measure the true number of molecules with considerably higher accuracy than the raw number of distinct UMIs, even if most UMIs are sequenced only once as is typical for single-cell RNA-Seq.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>TRUmiCount is available at http:\/\/www.cibiv.at\/software\/trumicount and through Bioconda (http:\/\/bioconda.github.io).<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary information is available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty283","type":"journal-article","created":{"date-parts":[[2018,4,10]],"date-time":"2018-04-10T15:11:35Z","timestamp":1523373095000},"page":"3137-3144","source":"Crossref","is-referenced-by-count":31,"title":["TRUmiCount: correctly counting absolute numbers of molecules using unique molecular identifiers"],"prefix":"10.1093","volume":"34","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5344-5359","authenticated-orcid":false,"given":"Florian G","family":"Pflug","sequence":"first","affiliation":[{"name":"Center for Integrative Bioinformatics Vienna (CIBIV), Joint Institute of the University of Vienna and Medicial University of Vienna, Max F. Perutz Laboratories (MFPL), Vienna, Austria"}]},{"given":"Arndt","family":"von Haeseler","sequence":"additional","affiliation":[{"name":"Center for Integrative Bioinformatics Vienna (CIBIV), Joint Institute of the University of Vienna and Medicial University of Vienna, Max F. Perutz Laboratories (MFPL), Vienna, Austria"},{"name":"Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, Austria"}]}],"member":"286","published-online":{"date-parts":[[2018,4,16]]},"reference":[{"key":"2023012704175306300_bty283-B1","doi-asserted-by":"crossref","first-page":"R18.","DOI":"10.1186\/gb-2011-12-2-r18","article-title":"Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries","volume":"12","author":"Aird","year":"2011","journal-title":"Genome Biol"},{"key":"2023012704175306300_bty283-B2","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1145\/232826.232854","article-title":"Algorithm 760; rectangular-grid-data surface fitting that has the accuracy of a bicubic polynomial","volume":"22","author":"Akima","year":"1996","journal-title":"ACM Trans. Math Softw"},{"key":"2023012704175306300_bty283-B3","doi-asserted-by":"crossref","first-page":"880.","DOI":"10.1080\/01621459.1974.10480222","article-title":"Empirical bayes methods applied to estimating fire alarm probabilities","volume":"69","author":"Carter","year":"1974","journal-title":"J. Am. Stat. Assoc"},{"key":"2023012704175306300_bty283-B4","volume-title":"The Theory of Branching Processes","author":"Harris","year":"1989"},{"key":"2023012704175306300_bty283-B5","doi-asserted-by":"crossref","first-page":"615","DOI":"10.1006\/jtbi.2003.3211","article-title":"Measurement of the number of molecules of a single mRNA species in a complex mRNA preparation","volume":"221","author":"Hug","year":"2003","journal-title":"J. Theor. Biol"},{"key":"2023012704175306300_bty283-B6","first-page":"361","article-title":"Estimation with quadratic loss","volume":"1","author":"James","year":"1961","journal-title":"Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability"},{"key":"2023012704175306300_bty283-B7","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1038\/nmeth.1778","article-title":"Counting absolute numbers of molecules using unique molecular identifiers","volume":"9","author":"Kivioja","year":"2012","journal-title":"Nat. Methods"},{"key":"2023012704175306300_bty283-B8","doi-asserted-by":"crossref","first-page":"2197","DOI":"10.1093\/nar\/17.6.2197","article-title":"Polymerase chain reaction: replication errors and reliability of gene diagnosis","volume":"17","author":"Krawczak","year":"1989","journal-title":"Nucleic Acids Res"},{"key":"2023012704175306300_bty283-B9","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1101\/gr.079558.108","article-title":"RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays","volume":"18","author":"Marioni","year":"2008","journal-title":"Genome Res"},{"key":"2023012704175306300_bty283-B10","doi-asserted-by":"crossref","first-page":"14508","DOI":"10.1073\/pnas.1208715109","article-title":"Detection of ultra-rare mutations by next-generation sequencing","volume":"109","author":"Schmitt","year":"2012","journal-title":"Proc. Natl. Acad. Sci. U.S.A"},{"key":"2023012704175306300_bty283-B11","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1073\/pnas.1118018109","article-title":"Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes","volume":"109","author":"Shiroguchi","year":"2012","journal-title":"Proc. Natl. Acad. Sci. U.S.A"},{"key":"2023012704175306300_bty283-B12","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1101\/gr.209601.116","article-title":"UMI-tools: modeling sequencing errors in unique molecular identifiers to improve quantification accuracy","volume":"27","author":"Smith","year":"2017","journal-title":"Genome Res"},{"key":"2023012704175306300_bty283-B13","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1089\/cmb.1995.2.49","article-title":"Modeling the polymerase chain reaction","volume":"2","author":"Weiss","year":"1995","journal-title":"J. Comput. Biol"},{"key":"2023012704175306300_bty283-B14","doi-asserted-by":"crossref","first-page":"3082","DOI":"10.1093\/nar\/25.15.3082","article-title":"A coalescent approach to the polymerase chain reaction","volume":"25","author":"Weiss","year":"1997","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/18\/3137\/48919094\/bioinformatics_34_18_3137.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/18\/3137\/48919094\/bioinformatics_34_18_3137.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,26]],"date-time":"2023-01-26T23:39:00Z","timestamp":1674776340000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/18\/3137\/4972993"}},"subtitle":[],"editor":[{"given":"Bonnie","family":"Berger","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,4,16]]},"references-count":14,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2018,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty283","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/217778","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,9,15]]},"published":{"date-parts":[[2018,4,16]]}}}