{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,12]],"date-time":"2026-05-12T21:45:47Z","timestamp":1778622347439,"version":"3.51.4"},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2020,7,23]],"date-time":"2020-07-23T00:00:00Z","timestamp":1595462400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["31829002"],"award-info":[{"award-number":["31829002"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2019YFA0904400"],"award-info":[{"award-number":["2019YFA0904400"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2016YFC0901604"],"award-info":[{"award-number":["2016YFC0901604"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,5,20]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>DNA variants represent an important source of genetic variations among individuals. Next-generation sequencing (NGS) is the most popular technology for genome-wide variant calling. Third-generation sequencing (TGS) has also recently been used in genetic studies. Although many variant callers are available, no single caller can call both types of variants on NGS or TGS data with high sensitivity and specificity. 
In this study, we systematically evaluated 11 variant callers on 12 NGS and TGS datasets. For germline variant calling, we tested DNAseq and DNAscope modes from Sentieon, HaplotypeCaller mode from GATK and WGS mode from DeepVariant. All four callers had comparable performance on NGS data, and 30\u00d7 coverage of WGS data was recommended. For germline variant calling on TGS data, we tested DNAseq mode from Sentieon, HaplotypeCaller mode from GATK and PACBIO mode from DeepVariant. All three callers had similar performance in SNP calling, while DeepVariant outperformed the others in InDel calling. TGS detected more variants than NGS, particularly in complex and repetitive regions. For somatic variant calling on NGS, we tested TNscope and TNseq modes from Sentieon, Mutect2 mode from GATK, NeuSomatic, VarScan2, and Strelka2. TNscope and Mutect2 outperformed the other callers. Higher tumor sample purity (from 10 to 20%) significantly increased the recall of calling. Finally, the computational costs of the callers were compared, and Sentieon required the least. 
These results suggest that careful selection of a tool and parameters is needed for accurate SNP or InDel calling under different scenarios.<\/jats:p>","DOI":"10.1093\/bib\/bbaa148","type":"journal-article","created":{"date-parts":[[2020,6,16]],"date-time":"2020-06-16T11:08:20Z","timestamp":1592305700000},"source":"Crossref","is-referenced-by-count":91,"title":["Benchmarking variant callers in next-generation and third-generation sequencing analysis"],"prefix":"10.1093","volume":"22","author":[{"given":"Surui","family":"Pei","sequence":"first","affiliation":[{"name":"Zhongshan Ophthalmic Center at Sun Yat-sen University and Annoroad Gene Technology (Beijing) Co., Ltd"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tao","family":"Liu","sequence":"additional","affiliation":[{"name":"Annoroad Gene Technology (Beijing) Co., Ltd"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xue","family":"Ren","sequence":"additional","affiliation":[{"name":"Annoroad Gene Technology (Beijing) Co., Ltd"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Weizhong","family":"Li","sequence":"additional","affiliation":[{"name":"Zhongshan School of Medicine at Sun Yat-sen University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chongjian","family":"Chen","sequence":"additional","affiliation":[{"name":"Annoroad Gene Technology (Beijing) Co., Ltd"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5589-4836","authenticated-orcid":false,"given":"Zhi","family":"Xie","sequence":"additional","affiliation":[{"name":"Zhongshan Ophthalmic Center at Sun Yat-sen University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,7,23]]},"reference":[{"key":"2021052110185223700_ref1","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1186\/s12859-016-1417-7","article-title":"Detailed simulation of cancer exome sequencing data reveals 
differences and common limitations of variant callers","volume":"18","author":"Hofmann","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2021052110185223700_ref2","volume-title":"An Introduction to Genetic Analysis","author":"Griffiths","year":"2000","edition":"7th"},{"key":"2021052110185223700_ref3","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1016\/j.jpedp.2015.08.001","article-title":"Whole-exome sequencing as a diagnostic tool for distal renal tubular acidosis","volume":"91","author":"Pereira","year":"2015","journal-title":"J Pediatr (Versao em Portugues)"},{"key":"2021052110185223700_ref4","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1038\/nrneph.2014.95","article-title":"Next-generation sequencing for research and diagnostics in kidney disease","volume":"10","author":"Renkema","year":"2014","journal-title":"Nat Rev Nephrol"},{"key":"2021052110185223700_ref5","doi-asserted-by":"crossref","first-page":"e0151664","DOI":"10.1371\/journal.pone.0151664","article-title":"Evaluation of nine somatic variant callers for detection of somatic mutations in exome and targeted deep sequencing data","volume":"11","author":"Kroigard","year":"2016","journal-title":"PLoS One"},{"key":"2021052110185223700_ref6","doi-asserted-by":"crossref","first-page":"e600","DOI":"10.7717\/peerj.600","article-title":"Detailed comparison of two popular variant calling packages for exome and targeted exon studies","volume":"2","author":"Warden","year":"2014","journal-title":"PeerJ"},{"key":"2021052110185223700_ref7","doi-asserted-by":"crossref","first-page":"11 10 11","DOI":"10.1002\/0471250953.bi1110s43","article-title":"From FastQ data to high confidence variant calls: the genome analysis toolkit best practices pipeline","volume":"43","author":"Van der Auwera","year":"2013","journal-title":"Curr Protoc Bioinformatics"},{"key":"2021052110185223700_ref8","doi-asserted-by":"crossref","first-page":"983","DOI":"10.1038\/nbt.4235","article-title":"A universal SNP 
and small-indel variant caller using deep neural networks","volume":"36","author":"Poplin","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2021052110185223700_ref9","first-page":"115717","article-title":"The Sentieon genomics tools\u2014a fast and accurate solution to variant calling from next-generation sequence data","author":"Freed","year":"2017","journal-title":"BioRxiv"},{"key":"2021052110185223700_ref10","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1038\/nbt.2514","article-title":"Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples","volume":"31","author":"Cibulskis","year":"2013","journal-title":"Nat Biotechnol"},{"key":"2021052110185223700_ref11","first-page":"250647","article-title":"TNscope: accurate detection of somatic mutations with haplotype-based variant candidate detection and machine learning filtering","author":"Freed","year":"2018","journal-title":"bioRxiv"},{"key":"2021052110185223700_ref12","doi-asserted-by":"crossref","first-page":"1041","DOI":"10.1038\/s41467-019-09027-x","article-title":"Deep convolutional neural networks for accurate somatic mutation detection","volume":"10","author":"Sahraeian","year":"2019","journal-title":"Nat Commun"},{"key":"2021052110185223700_ref13","doi-asserted-by":"crossref","first-page":"591","DOI":"10.1038\/s41592-018-0051-x","article-title":"Strelka2: fast and accurate calling of germline and somatic variants","volume":"15","author":"Kim","year":"2018","journal-title":"Nat Methods"},{"key":"2021052110185223700_ref14","first-page":"1","article-title":"Long-read sequencing for rare human genetic diseases","volume":"65","author":"Mitsuhashi","year":"2019","journal-title":"J Hum Genet"},{"key":"2021052110185223700_ref15","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-019-12493-y","article-title":"Longshot enables accurate variant calling in diploid genomes from single-molecule long read 
sequencing","volume":"10","author":"Edge","year":"2019","journal-title":"Nat Commun"},{"key":"2021052110185223700_ref16","first-page":"519025","article-title":"Highly-accurate long-read sequencing improves variant detection and assembly of a human genome","author":"Wenger","year":"2019","journal-title":"BioRxiv"},{"key":"2021052110185223700_ref17","doi-asserted-by":"crossref","first-page":"17875","DOI":"10.1038\/srep17875","article-title":"Systematic comparison of variant calling pipelines using gold standard personal exome variants","volume":"5","author":"Hwang","year":"2015","journal-title":"Sci Rep"},{"key":"2021052110185223700_ref18","doi-asserted-by":"crossref","first-page":"9345","DOI":"10.1038\/s41598-019-45835-3","article-title":"Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers","volume":"9","author":"Chen","year":"2019","journal-title":"Sci Rep"},{"key":"2021052110185223700_ref19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-018-2440-7","article-title":"Comparing the performance of selected variant callers using synthetic data and genome segmentation","volume":"19","author":"Bian","year":"2018","journal-title":"BMC bioinformatics"},{"key":"2021052110185223700_ref20","doi-asserted-by":"crossref","first-page":"160025","DOI":"10.1038\/sdata.2016.25","article-title":"Extensive sequencing of seven human genomes to characterize benchmark reference materials","volume":"3","author":"Zook","year":"2016","journal-title":"Sci Data"},{"key":"2021052110185223700_ref21","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","author":"Auton","year":"2015","journal-title":"Nature"},{"key":"2021052110185223700_ref22","doi-asserted-by":"crossref","first-page":"314","DOI":"10.1109\/IPDPS.2019.00041","volume-title":"IEEE International Parallel and Distributed Processing Symposium (IPDPS): 2019 
IEEE","author":"Vasimuddin","year":"2019"},{"key":"2021052110185223700_ref23","first-page":"023754","article-title":"Comparing variant call files for performance benchmarking of next-generation sequencing variant calling pipelines","author":"Cleary","year":"2015","journal-title":"BioRxiv"},{"key":"2021052110185223700_ref24","doi-asserted-by":"crossref","first-page":"555","DOI":"10.1038\/s41587-019-0054-x","article-title":"Best practices for benchmarking germline small-variant calls in human genomes","volume":"37","author":"Krusche","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2021052110185223700_ref25","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1038\/s41587-019-0217-9","article-title":"Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome","volume":"37","author":"Wenger","year":"2019","journal-title":"Nat Biotechnol"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa148\/37963802\/bbaa148.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa148\/37963802\/bbaa148.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,2]],"date-time":"2023-10-02T12:11:39Z","timestamp":1696248699000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaa148\/5875142"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,23]]},"references-count":25,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2021,5,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaa148","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"da
te-parts":[[2021,5]]},"published":{"date-parts":[[2020,7,23]]},"article-number":"bbaa148"}}