{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:33:55Z","timestamp":1772138035754,"version":"3.50.1"},"reference-count":16,"publisher":"Oxford University Press (OUP)","issue":"9","license":[{"start":{"date-parts":[[2020,1,27]],"date-time":"2020-01-27T00:00:00Z","timestamp":1580083200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Universit\u00e9 de Rouen Normandie and Vincent Sater"},{"DOI":"10.13039\/501100018696","name":"R\u00e9gion Normandie","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100018696","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Next-generation sequencing has become the go-to standard method for the detection of single-nucleotide variants in tumor cells. The use of such technologies requires a PCR amplification step and a sequencing step, steps in which artifacts are introduced at very low frequencies. These artifacts are often confused with true low-frequency variants that can be found in tumor cells and cell-free DNA. The recent use of unique molecular identifiers (UMI) in targeted sequencing protocols has offered a trustworthy approach to filter out artefactual variants and accurately call low-frequency variants. However, the integration of UMI analysis in the variant calling process led to developing tools that are significantly slower and more memory consuming than raw-reads-based variant callers.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We present UMI-VarCal, a UMI-based variant caller for targeted sequencing data with better sensitivity compared to other variant callers. Being developed with performance in mind, UMI-VarCal stands out from the crowd by being one of the few variant callers that do not rely on SAMtools to do their pileup. Instead, at its core runs an innovative homemade pileup algorithm specifically designed to treat the UMI tags in the reads. After the pileup, a Poisson statistical test is applied at every position to determine if the frequency of the variant is significantly higher than the background error noise. Finally, an analysis of UMI tags is performed, a strand bias and a homopolymer length filter are applied to achieve better accuracy. We illustrate the results obtained using UMI-VarCal through the sequencing of tumor samples and we show how UMI-VarCal is both faster and more sensitive than other publicly available solutions.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The entire pipeline is available at https:\/\/gitlab.com\/vincent-sater\/umi-varcal-master under MIT license.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa053","type":"journal-article","created":{"date-parts":[[2020,1,20]],"date-time":"2020-01-20T07:08:54Z","timestamp":1579504134000},"page":"2718-2724","source":"Crossref","is-referenced-by-count":24,"title":["UMI-VarCal: a new UMI-based variant caller that efficiently improves low-frequency variant detection in paired-end sequencing NGS libraries"],"prefix":"10.1093","volume":"36","author":[{"given":"Vincent","family":"Sater","sequence":"first","affiliation":[{"name":"University of Normandie UNIROUEN , LITIS EA 4108"}]},{"given":"Pierre-Julien","family":"Viailly","sequence":"additional","affiliation":[{"name":"Department of Pathology , Centre Henri Becquerel"},{"name":"INSERM U1245 , University of Normandie UNIROUEN, Rouen 76000, France"}]},{"given":"Thierry","family":"Lecroq","sequence":"additional","affiliation":[{"name":"University of Normandie UNIROUEN , LITIS EA 4108"}]},{"given":"\u00c9lise","family":"Prieur-Gaston","sequence":"additional","affiliation":[{"name":"University of Normandie UNIROUEN , LITIS EA 4108"}]},{"given":"\u00c9lodie","family":"Bohers","sequence":"additional","affiliation":[{"name":"Department of Pathology , Centre Henri Becquerel"},{"name":"INSERM U1245 , University of Normandie UNIROUEN, Rouen 76000, France"}]},{"given":"Mathieu","family":"Viennot","sequence":"additional","affiliation":[{"name":"Department of Pathology , Centre Henri Becquerel"},{"name":"INSERM U1245 , University of Normandie UNIROUEN, Rouen 76000, France"}]},{"given":"Philippe","family":"Ruminy","sequence":"additional","affiliation":[{"name":"Department of Pathology , Centre Henri Becquerel"},{"name":"INSERM U1245 , University of Normandie UNIROUEN, Rouen 76000, France"}]},{"given":"H\u00e9l\u00e8ne","family":"Dauchel","sequence":"additional","affiliation":[{"name":"Department of Pathology , Centre Henri Becquerel"},{"name":"INSERM U1245 , University of Normandie UNIROUEN, Rouen 76000, France"}]},{"given":"Pierre","family":"Vera","sequence":"additional","affiliation":[{"name":"University of Normandie UNIROUEN , LITIS EA 4108"},{"name":"Department of Pathology , Centre Henri Becquerel"}]},{"given":"Fabrice","family":"Jardin","sequence":"additional","affiliation":[{"name":"Department of Pathology , Centre Henri Becquerel"},{"name":"INSERM U1245 , University of Normandie UNIROUEN, Rouen 76000, France"}]}],"member":"286","published-online":{"date-parts":[[2020,1,27]]},"reference":[{"key":"2023013110300873900_btaa053-B1","doi-asserted-by":"crossref","first-page":"e2074","DOI":"10.7717\/peerj.2074","article-title":"DeepSNVMiner: a sequence analysis tool to detect emergent, rare mutations in subsets of cell populations","volume":"4","author":"Andrews","year":"2016","journal-title":"PeerJ"},{"key":"2023013110300873900_btaa053-B2","doi-asserted-by":"crossref","first-page":"212","DOI":"10.1136\/jmedgenet-2016-104295","article-title":"A novel somatic mutation achieves partial rescue in a child with Hutchinson-Gilford progeria syndrome","volume":"54","author":"Bar","year":"2017","journal-title":"J. Med. Genet"},{"key":"2023013110300873900_btaa053-B3","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Statist. Soc. Ser. B Methodol"},{"key":"2023013110300873900_btaa053-B4","doi-asserted-by":"crossref","first-page":"666","DOI":"10.1186\/1471-2164-13-666","article-title":"The effect of strand bias in Illumina short-read sequencing data","volume":"13","author":"Guo","year":"2012","journal-title":"BMC Genomics"},{"key":"2023013110300873900_btaa053-B5","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1016\/j.mrgentox.2012.02.006","article-title":"The use of next generation sequencing technology to study the effect of radiation therapy on mitochondrial DNA mutation","volume":"744","author":"Guo","year":"2012","journal-title":"Mutat. Res"},{"key":"2023013110300873900_btaa053-B6","doi-asserted-by":"crossref","DOI":"10.1186\/s12864-018-4544-x","article-title":"Analytical parameters and validation of homopolymer detection in a pyrosequencing-based next generation sequencing system","volume":"19","author":"Iv\u00e1dy","year":"2018","journal-title":"BMC Genomics"},{"key":"2023013110300873900_btaa053-B7","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1093\/bioinformatics\/btw536","article-title":"SiNVICT: ultra-sensitive detection of single nucleotide variants and indels in circulating tumour DNA","volume":"33","author":"Kockan","year":"2017","journal-title":"Bioinformatics"},{"key":"2023013110300873900_btaa053-B8","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1093\/dnares\/dsv010","article-title":"High-fidelity target sequencing of individual molecules identified using barcode sequences: de novo detection and absolute quantitation of mutations in plasma cell-free DNA from cancer patients","volume":"22","author":"Kukita","year":"2015","journal-title":"DNA Res"},{"key":"2023013110300873900_btaa053-B9","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1093\/bioinformatics\/btr665","article-title":"SomaticSniper: identification of somatic point mutations in whole genome sequencing data","volume":"28","author":"Larson","year":"2012","journal-title":"Bioinformatics"},{"key":"2023013110300873900_btaa053-B10","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023013110300873900_btaa053-B11","doi-asserted-by":"crossref","first-page":"79485","DOI":"10.18632\/oncotarget.13103","article-title":"OutLyzer: software for extracting low-allele-frequency tumor mutations from sequencing background noise in clinical practice","volume":"7","author":"Muller","year":"2016","journal-title":"Oncotarget"},{"key":"2023013110300873900_btaa053-B12","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1038\/nbt.3520","article-title":"Integrated digital error suppression for improved detection of circulating tumor DNA","volume":"34","author":"Newman","year":"2016","journal-title":"Nat. Biotechnol"},{"key":"2023013110300873900_btaa053-B13","doi-asserted-by":"crossref","first-page":"14508","DOI":"10.1073\/pnas.1208715109","article-title":"Detection of ultra-rare mutations by next-generation sequencing","volume":"109","author":"Schmitt","year":"2012","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013110300873900_btaa053-B14","doi-asserted-by":"crossref","first-page":"e1005480","DOI":"10.1371\/journal.pcbi.1005480","article-title":"MAGERI: Computational pipeline for molecular-barcoded targeted resequencing","author":"Shugay,","year":"2017","journal-title":"PLoS Comput. Biol."},{"key":"2023013110300873900_btaa053-B15","doi-asserted-by":"crossref","first-page":"1299","DOI":"10.1093\/bioinformatics\/bty790","article-title":"smCounter2: an accurate low-frequency variant caller for targeted sequencing data with unique molecular identifiers","volume":"35","author":"Xu","year":"2019","journal-title":"Bioinformatics"},{"key":"2023013110300873900_btaa053-B16","doi-asserted-by":"crossref","first-page":"12484","DOI":"10.1038\/ncomms12484","article-title":"Clonal haematopoiesis harbouring AML-associated mutations is ubiquitous in healthy adults","author":"Young,","year":"2016","journal-title":"Nat. Commun."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa053\/32517914\/btaa053.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/9\/2718\/48985410\/bioinformatics_36_9_2718.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/9\/2718\/48985410\/bioinformatics_36_9_2718.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,30]],"date-time":"2024-07-30T06:57:30Z","timestamp":1722322650000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/9\/2718\/5716330"}},"subtitle":[],"editor":[{"given":"Alfonso","family":"Valencia","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,1,27]]},"references-count":16,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2020,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa053","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/775817","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,5,1]]},"published":{"date-parts":[[2020,1,27]]}}}