{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:11:07Z","timestamp":1772165467562,"version":"3.50.1"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,8,3]],"date-time":"2023-08-03T00:00:00Z","timestamp":1691020800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,8,3]],"date-time":"2023-08-03T00:00:00Z","timestamp":1691020800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Hong Kong Research Grants Council grants GRF","award":["17113721"],"award-info":[{"award-number":["17113721"]}]},{"name":"TRS","award":["T21-705\/20-N"],"award-info":[{"award-number":["T21-705\/20-N"]}]},{"name":"Shenzhen Municipal Government General Program","award":["JCYJ20210324134405015"],"award-info":[{"award-number":["JCYJ20210324134405015"]}]},{"name":"the URC fund at HKU"},{"DOI":"10.13039\/100010890","name":"Oxford Nanopore Technologies","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100010890","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>With the continuous advances in third-generation sequencing technology and the increasing affordability of next-generation sequencing technology, sequencing data from different sequencing technology platforms is becoming more common. While numerous benchmarking studies have been conducted to compare variant-calling performance across different platforms and approaches, little attention has been paid to the potential of leveraging the strengths of different platforms to optimize overall performance, especially integrating Oxford Nanopore and Illumina sequencing data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>\n                      We investigated the impact of multi-platform data on the performance of variant calling through carefully designed experiments with a deep learning-based variant caller named Clair3-MP (Multi-Platform). Through our research, we not only demonstrated the capability of ONT-Illumina data for improved variant calling, but also identified the optimal scenarios for utilizing ONT-Illumina data. In addition, we revealed that the improvement in variant calling using ONT-Illumina data comes from an improvement in difficult genomic regions, such as the large low-complexity regions and segmental and collapse duplication regions. Moreover, Clair3-MP can incorporate reference genome stratification information to achieve a small but measurable improvement in variant calling. Clair3-MP is accessible as an open-source project at:\n                      <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/HKU-BAL\/Clair3-MP\">https:\/\/github.com\/HKU-BAL\/Clair3-MP<\/jats:ext-link>\n                      .\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>These insights have important implications for researchers and practitioners alike, providing valuable guidance for improving the reliability and efficiency of genomic analysis in diverse applications.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12859-023-05434-6","type":"journal-article","created":{"date-parts":[[2023,8,3]],"date-time":"2023-08-03T12:01:52Z","timestamp":1691064112000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Boosting variant-calling performance with multi-platform sequencing data using Clair3-MP"],"prefix":"10.1186","volume":"24","author":[{"given":"Huijing","family":"Yu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhenxian","family":"Zheng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Junhao","family":"Su","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tak-Wah","family":"Lam","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruibang","family":"Luo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,8,3]]},"reference":[{"key":"5434_CR1","doi-asserted-by":"publisher","first-page":"464","DOI":"10.1038\/s41576-023-00590-0","volume":"24","author":"ND Olson","year":"2023","unstructured":"Olson ND, Wagner J, Dwarshuis N, Miga KH, Sedlazeck FJ, Salit M, et al. Variant calling and benchmarking in an era of complete human genome sequences. Nat Rev Genet. 2023;24:464\u201383.","journal-title":"Nat Rev Genet"},{"issue":"3","key":"5434_CR2","doi-asserted-by":"publisher","first-page":"373","DOI":"10.3390\/diagnostics13030373","volume":"13","author":"S Hassan","year":"2023","unstructured":"Hassan S, Bahar R, Johan MF, Mohamed Hashim EK, Abdullah WZ, Esa E, et al. Next-generation sequencing (NGS) and third-generation sequencing (TGS) for the diagnosis of thalassemia. Diagnostics (Basel). 2023;13(3):373.","journal-title":"Diagnostics (Basel)"},{"issue":"7906","key":"5434_CR3","doi-asserted-by":"publisher","first-page":"437","DOI":"10.1038\/s41586-022-04601-8","volume":"604","author":"T Wang","year":"2022","unstructured":"Wang T, Antonacci-Fulton L, Howe K, Lawson HA, Lucas JK, Phillippy AM, et al. The Human Pangenome Project: a global resource to map genomic diversity. Nature. 2022;604(7906):437\u201346.","journal-title":"Nature"},{"key":"5434_CR4","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1146\/annurev-genom-120120-081921","volume":"22","author":"KH Miga","year":"2021","unstructured":"Miga KH, Wang T. The need for a human pangenome reference sequence. Annu Rev Genom Hum Genet. 2021;22:81\u2013102.","journal-title":"Annu Rev Genom Hum Genet"},{"issue":"7968","key":"5434_CR5","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1038\/s41586-023-06173-7","volume":"619","author":"Y Gao","year":"2023","unstructured":"Gao Y, Yang X, Chen H, Tan X, Yang Z, Deng L, et al. A pangenome reference of 36 Chinese populations. Nature. 2023;619(7968):112\u201321.","journal-title":"Nature"},{"issue":"1","key":"5434_CR6","doi-asserted-by":"publisher","first-page":"1784","DOI":"10.1038\/s41467-018-08148-z","volume":"10","author":"MJP Chaisson","year":"2019","unstructured":"Chaisson MJP, Sanders AD, Zhao X, Malhotra A, Porubsky D, Rausch T, et al. Multi-platform discovery of haplotype-resolved structural variation in human genomes. Nat Commun. 2019;10(1):1784.","journal-title":"Nat Commun"},{"issue":"7809","key":"5434_CR7","doi-asserted-by":"publisher","first-page":"444","DOI":"10.1038\/s41586-020-2287-8","volume":"581","author":"RL Collins","year":"2020","unstructured":"Collins RL, Brand H, Karczewski KJ, Zhao X, Alfoldi J, Francioli LC, et al. A structural variation reference for medical and population genetics. Nature. 2020;581(7809):444\u201351.","journal-title":"Nature"},{"issue":"5","key":"5434_CR8","doi-asserted-by":"publisher","first-page":"100129","DOI":"10.1016\/j.xgen.2022.100129","volume":"2","author":"ND Olson","year":"2022","unstructured":"Olson ND, Wagner J, McDaniel J, Stephens SH, Westreich ST, Prasanna AG, et al. PrecisionFDA truth challenge V2: calling variants from short and long reads in difficult-to-map regions. Cell Genom. 2022;2(5):100129.","journal-title":"Cell Genom"},{"issue":"5","key":"5434_CR9","doi-asserted-by":"publisher","first-page":"491","DOI":"10.1038\/ng.806","volume":"43","author":"MA DePristo","year":"2011","unstructured":"DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43(5):491\u20138.","journal-title":"Nat Genet"},{"issue":"1","key":"5434_CR10","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1186\/s13059-018-1462-9","volume":"19","author":"FJ Rang","year":"2018","unstructured":"Rang FJ, Kloosterman WP, de Ridder J. From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy. Genome Biol. 2018;19(1):90.","journal-title":"Genome Biol"},{"issue":"10","key":"5434_CR11","doi-asserted-by":"publisher","first-page":"983","DOI":"10.1038\/nbt.4235","volume":"36","author":"R Poplin","year":"2018","unstructured":"Poplin R, Chang PC, Alexander D, Schwartz S, Colthurst T, Ku A, et al. A universal SNP and small-indel variant caller using deep neural networks. Nat Biotechnol. 2018;36(10):983\u20137.","journal-title":"Nat Biotechnol"},{"issue":"12","key":"5434_CR12","doi-asserted-by":"publisher","first-page":"797","DOI":"10.1038\/s43588-022-00387-x","volume":"2","author":"Z Zheng","year":"2022","unstructured":"Zheng Z, Li S, Su J, Leung AW-S, Lam T-W, Luo R. Symphonizing pileup and full-alignment for deep learning-based long-read variant calling. Nat Comput Sci. 2022;2(12):797\u2013803.","journal-title":"Nat Comput Sci"},{"issue":"1","key":"5434_CR13","doi-asserted-by":"publisher","first-page":"404","DOI":"10.1186\/s12859-021-04311-4","volume":"22","author":"A Ramachandran","year":"2021","unstructured":"Ramachandran A, Lumetta SS, Klee EW, Chen D. HELLO: improved neural network architectures and methodologies for small variant calling. BMC Bioinform. 2021;22(1):404.","journal-title":"BMC Bioinform"},{"issue":"1","key":"5434_CR14","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1186\/s13059-020-02244-4","volume":"22","author":"G Holley","year":"2021","unstructured":"Holley G, Beyter D, Ingimundardottir H, Moller PL, Kristmundsdottir S, Eggertsson HP, et al. Ratatosk: hybrid error correction of long reads enables accurate variant calling and assembly. Genome Biol. 2021;22(1):28.","journal-title":"Genome Biol"},{"issue":"5","key":"5434_CR15","doi-asserted-by":"publisher","first-page":"100128","DOI":"10.1016\/j.xgen.2022.100128","volume":"2","author":"J Wagner","year":"2022","unstructured":"Wagner J, Olson ND, Harris L, Khan Z, Farek J, Mahmoud M, et al. Benchmarking challenging small variants with linked and long reads. Cell Genom. 2022;2(5):100128.","journal-title":"Cell Genom"},{"issue":"9","key":"5434_CR16","doi-asserted-by":"publisher","first-page":"1044","DOI":"10.1038\/s41587-020-0503-6","volume":"38","author":"K Shafin","year":"2020","unstructured":"Shafin K, Pesout T, Lorig-Roach R, Haukness M, Olsen HE, Bosworth C, et al. Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes. Nat Biotechnol. 2020;38(9):1044\u201353.","journal-title":"Nat Biotechnol"},{"issue":"18","key":"5434_CR17","doi-asserted-by":"publisher","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","volume":"34","author":"H Li","year":"2018","unstructured":"Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34(18):3094\u2013100.","journal-title":"Bioinformatics"},{"issue":"14","key":"5434_CR18","doi-asserted-by":"publisher","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","volume":"25","author":"H Li","year":"2009","unstructured":"Li H, Durbin R. Fast and accurate short read alignment with Burrows\u2013Wheeler transform. Bioinformatics. 2009;25(14):1754\u201360.","journal-title":"Bioinformatics"},{"issue":"2","key":"5434_CR19","doi-asserted-by":"publisher","first-page":"giab008","DOI":"10.1093\/gigascience\/giab008","volume":"10","author":"P Danecek","year":"2021","unstructured":"Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10(2):giab008.","journal-title":"Gigascience."},{"issue":"5","key":"5434_CR20","doi-asserted-by":"publisher","first-page":"555","DOI":"10.1038\/s41587-019-0054-x","volume":"37","author":"P Krusche","year":"2019","unstructured":"Krusche P, Trigg L, Boutros PC, Mason CE, De La Vega FM, Moore BL, et al. Best practices for benchmarking germline small-variant calls in human genomes. Nat Biotechnol. 2019;37(5):555\u201360.","journal-title":"Nat Biotechnol"},{"issue":"1","key":"5434_CR21","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1038\/nbt.1754","volume":"29","author":"JT Robinson","year":"2011","unstructured":"Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29(1):24\u20136.","journal-title":"Nat Biotechnol"},{"issue":"11","key":"5434_CR22","doi-asserted-by":"publisher","first-page":"1322","DOI":"10.1038\/s41592-021-01299-w","volume":"18","author":"K Shafin","year":"2021","unstructured":"Shafin K, Pesout T, Chang PC, Nattestad M, Kolesnikov A, Goel S, et al. Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads. Nat Methods. 2021;18(11):1322\u201332.","journal-title":"Nat Methods"},{"issue":"6588","key":"5434_CR23","doi-asserted-by":"publisher","first-page":"eabl3533","DOI":"10.1126\/science.abl3533","volume":"376","author":"S Aganezov","year":"2022","unstructured":"Aganezov S, Yan SM, Soto DC, Kirsche M, Zarate S, Avdeyev P, et al. A complete reference genome improves analysis of human genetic variation. Science. 2022;376(6588):eabl3533.","journal-title":"Science"},{"issue":"19","key":"5434_CR24","doi-asserted-by":"publisher","first-page":"e114","DOI":"10.1093\/nar\/gkaa829","volume":"48","author":"T Prodanov","year":"2020","unstructured":"Prodanov T, Bansal V. Sensitive alignment using paralogous sequence variants improves long-read mapping and variant calling in segmental duplications. Nucleic Acids Res. 2020;48(19):e114.","journal-title":"Nucleic Acids Res"},{"issue":"6","key":"5434_CR25","doi-asserted-by":"publisher","first-page":"498","DOI":"10.1089\/cmb.2014.0157","volume":"22","author":"M Patterson","year":"2015","unstructured":"Patterson M, Marschall T, Pisanti N, van Iersel L, Stougie L, Klau GW, et al. WhatsHap: weighted haplotype assembly for future-generation sequencing reads. J Comput Biol. 2015;22(6):498\u2013509.","journal-title":"J Comput Biol"},{"issue":"5","key":"5434_CR26","doi-asserted-by":"publisher","first-page":"bbac301","DOI":"10.1093\/bib\/bbac301","volume":"23","author":"J Su","year":"2022","unstructured":"Su J, Zheng Z, Ahmed SS, Lam TW, Luo R. Clair3-trio: high-performance nanopore long-read variant calling in family trios with trio-to-trio deep neural networks. Brief Bioinform. 2022;23(5):bbac301.","journal-title":"Brief Bioinform."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05434-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-023-05434-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05434-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,17]],"date-time":"2023-11-17T11:06:54Z","timestamp":1700219214000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-023-05434-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,3]]},"references-count":26,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["5434"],"URL":"https:\/\/doi.org\/10.1186\/s12859-023-05434-6","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.05.31.543184","asserted-by":"object"}]},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,3]]},"assertion":[{"value":"7 June 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 July 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 August 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"R.L. receives research funding from ONT. The remaining authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"308"}}