{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T01:24:22Z","timestamp":1774229062569,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2024,10,8]],"date-time":"2024-10-08T00:00:00Z","timestamp":1728345600000},"content-version":"vor","delay-in-days":7,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,12,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Whole-exome sequencing (WES) data are frequently used for cancer diagnosis and genome-wide association studies (GWAS), based on high-coverage read mapping, informative variant calling, and high-quality reference genomes. The center position of the currently used genome assembly, GRCh38, is now challenged by two newly published telomere-to-telomere (T2T) genomes, T2T-CHM13 and T2T-YAO, and it becomes urgent to have a comparative study to test population specificity using the three reference genomes based on real case WES data. Here, we report our analysis along this line for 19 tumor samples collected from Chinese patients. The primary comparison of the exon regions among the three references reveals that the sequences in up to \u223c 1% of target regions in T2T-YAO are widely diversified from GRCh38 and may lead to off-target in sequence capture. However, T2T-YAO still outperforms GRCh38 by obtaining 7.41% of more mapped reads. Due to more reliable read-mapping and closer phylogenetic relationship with the samples than GRCh38, T2T-YAO reduces half of variant calls of clinical significance which are mostly benign, while maintaining sensitivity in identifying pathogenic variants. T2T-YAO also outperforms T2T-CHM13 in reducing calls of Chinese-specific variants. Our findings highlight the critical need for employing population-specific reference genomes in genomic analysis to ensure accurate variant analysis and the significant benefits of tailoring these approaches to the unique genetic background of each ethnic group.<\/jats:p>","DOI":"10.1093\/gpbjnl\/qzae070","type":"journal-article","created":{"date-parts":[[2024,10,8]],"date-time":"2024-10-08T12:45:56Z","timestamp":1728391556000},"source":"Crossref","is-referenced-by-count":2,"title":["Enhancing Variant Calling in Whole-exome Sequencing Data Using Population-matched Reference Genomes"],"prefix":"10.1093","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-7931-3725","authenticated-orcid":false,"given":"Shuming","family":"Guo","sequence":"first","affiliation":[{"name":"Linfen Clinical Medicine Research Center, LinFen Central Hospital , LinFen 041000,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-1023-1560","authenticated-orcid":false,"given":"Zhuo","family":"Huang","sequence":"additional","affiliation":[{"name":"China National Center for Bioinformation , Beijing 100101,","place":["China"]},{"name":"Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101,","place":["China"]},{"name":"University of Chinese Academy of Sciences , Beijing 100049,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-1480-1756","authenticated-orcid":false,"given":"Yanming","family":"Zhang","sequence":"additional","affiliation":[{"name":"Linfen Clinical Medicine Research Center, LinFen Central Hospital , LinFen 041000,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4164-2478","authenticated-orcid":false,"given":"Yukun","family":"He","sequence":"additional","affiliation":[{"name":"Department of Respiratory and Critical Care Medicine, Peking University People\u2019s Hospital , Beijing 100044,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-7059-6350","authenticated-orcid":false,"given":"Xiangju","family":"Chen","sequence":"additional","affiliation":[{"name":"Linfen Clinical Medicine Research Center, LinFen Central Hospital , LinFen 041000,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-0444-379X","authenticated-orcid":false,"given":"Wenjuan","family":"Wang","sequence":"additional","affiliation":[{"name":"Linfen Clinical Medicine Research Center, LinFen Central Hospital , LinFen 041000,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-0689-8752","authenticated-orcid":false,"given":"Lansheng","family":"Li","sequence":"additional","affiliation":[{"name":"Linfen Clinical Medicine Research Center, LinFen Central Hospital , LinFen 041000,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5196-0376","authenticated-orcid":false,"given":"Yu","family":"Kang","sequence":"additional","affiliation":[{"name":"China National Center for Bioinformation , Beijing 100101,","place":["China"]},{"name":"Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101,","place":["China"]},{"name":"University of Chinese Academy of Sciences , Beijing 100049,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7415-1416","authenticated-orcid":false,"given":"Zhancheng","family":"Gao","sequence":"additional","affiliation":[{"name":"Department of Respiratory and Critical Care Medicine, Peking University People\u2019s Hospital , Beijing 100044,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7599-2700","authenticated-orcid":false,"given":"Jun","family":"Yu","sequence":"additional","affiliation":[{"name":"University of Chinese Academy of Sciences , Beijing 100049,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2147-3475","authenticated-orcid":false,"given":"Zhenglin","family":"Du","sequence":"additional","affiliation":[{"name":"China National Center for Bioinformation , Beijing 100101,","place":["China"]},{"name":"Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101,","place":["China"]},{"name":"Institute of PSI Genomics , Wenzhou 325024,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9349-4307","authenticated-orcid":false,"given":"Yanan","family":"Chu","sequence":"additional","affiliation":[{"name":"China National Center for Bioinformation , Beijing 100101,","place":["China"]},{"name":"Beijing Institute of Genomics, Chinese Academy of Sciences , Beijing 100101,","place":["China"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2024,10,8]]},"reference":[{"key":"2024123122061411700_qzae070-B1","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1038\/nrg.2016.49","article-title":"Coming of age: ten years of next-generation sequencing technologies","volume":"17","author":"Goodwin","year":"2016","journal-title":"Nat Rev Genet"},{"key":"2024123122061411700_qzae070-B2","doi-asserted-by":"crossref","first-page":"1229","DOI":"10.1016\/j.ajhg.2023.06.009","article-title":"Beyond the exome: what\u2019s next in diagnostic testing for Mendelian conditions","volume":"110","author":"Wojcik","year":"2023","journal-title":"Am J Hum Genet"},{"key":"2024123122061411700_qzae070-B3","doi-asserted-by":"crossref","first-page":"1544","DOI":"10.5858\/arpa.2016-0501-RA","article-title":"Review of clinical next-generation sequencing","volume":"141","author":"Yohe","year":"2017","journal-title":"Arch Pathol Lab Med"},{"key":"2024123122061411700_qzae070-B4","doi-asserted-by":"crossref","first-page":"418","DOI":"10.1016\/j.tig.2014.07.001","article-title":"Ten years of next-generation sequencing technology","volume":"30","author":"van Dijk","year":"2014","journal-title":"Trends Genet"},{"key":"2024123122061411700_qzae070-B5","doi-asserted-by":"crossref","first-page":"616","DOI":"10.1001\/jamaoncol.2015.5699","article-title":"Diagnostic yield of clinical tumor and germline whole-exome sequencing for children with solid tumors","volume":"2","author":"Parsons","year":"2016","journal-title":"JAMA Oncol"},{"key":"2024123122061411700_qzae070-B6","doi-asserted-by":"crossref","first-page":"2533","DOI":"10.1182\/blood.2022015790","article-title":"Germ line predisposition variants occur in myelodysplastic syndrome patients of all ages","volume":"140","author":"Feurstein","year":"2022","journal-title":"Blood"},{"key":"2024123122061411700_qzae070-B7","doi-asserted-by":"crossref","first-page":"162","DOI":"10.1038\/s41586-021-04312-6","article-title":"Life histories of myeloproliferative neoplasms inferred from phylogenies","volume":"602","author":"Williams","year":"2022","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B8","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1038\/jhg.2014.7","article-title":"A commentary on the promise of whole-exome sequencing in medical genetics","volume":"59","author":"Kaname","year":"2014","journal-title":"J Hum Genet"},{"key":"2024123122061411700_qzae070-B9","doi-asserted-by":"crossref","first-page":"860","DOI":"10.1038\/35057062","article-title":"Initial sequencing and analysis of the human genome","volume":"409","author":"Lander","year":"2001","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B10","doi-asserted-by":"crossref","first-page":"1304","DOI":"10.1126\/science.1058040","article-title":"The sequence of the human genome","volume":"291","author":"Venter","year":"2001","journal-title":"Science"},{"key":"2024123122061411700_qzae070-B11","doi-asserted-by":"crossref","first-page":"D670","DOI":"10.1093\/nar\/gku1177","article-title":"The UCSC Genome Browser database: 2015 update","volume":"43","author":"Rosenbloom","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2024123122061411700_qzae070-B12","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1126\/science.abj6987","article-title":"The complete sequence of a human genome","volume":"376","author":"Nurk","year":"2022","journal-title":"Science"},{"key":"2024123122061411700_qzae070-B13","doi-asserted-by":"crossref","first-page":"344","DOI":"10.1038\/s41586-023-06457-y","article-title":"The complete sequence of a human Y chromosome","volume":"621","author":"Rhie","year":"2023","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B14","doi-asserted-by":"crossref","first-page":"1085","DOI":"10.1016\/j.gpb.2023.08.001","article-title":"T2T-YAO: a telomere-to-telomere assembled diploid reference genome for Han Chinese","volume":"21","author":"He","year":"2023","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"2024123122061411700_qzae070-B15","doi-asserted-by":"crossref","DOI":"10.1093\/g3journal\/jkac321","article-title":"The first gapless, reference-quality, fully annotated genome from a Southern Han Chinese individual","volume":"13","author":"Chao","year":"2023","journal-title":"G3 (Bethesda)"},{"key":"2024123122061411700_qzae070-B16","doi-asserted-by":"crossref","first-page":"745","DOI":"10.1038\/s41422-023-00849-5","article-title":"The complete and fully-phased diploid genome of a male Han Chinese","volume":"33","author":"Yang","year":"2023","journal-title":"Cell Res"},{"key":"2024123122061411700_qzae070-B17","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1038\/s41586-023-05896-x","article-title":"A draft human pangenome reference","volume":"617","author":"Liao","year":"2023","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B18","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1038\/s41586-023-06173-7","article-title":"A pangenome reference of 36 Chinese populations","volume":"619","author":"Gao","year":"2023","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B19","doi-asserted-by":"crossref","DOI":"10.1126\/science.abl3533","article-title":"A complete reference genome improves analysis of human genetic variation","volume":"376","author":"Aganezov","year":"2022","journal-title":"Science"},{"key":"2024123122061411700_qzae070-B20","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1038\/s41586-022-05325-5","article-title":"Semi-automated assembly of high-quality diploid human reference genomes","volume":"611","author":"Jarvis","year":"2022","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B21","doi-asserted-by":"crossref","first-page":"434","DOI":"10.1038\/s41586-020-2308-7","article-title":"The mutational constraint spectrum quantified from variation in 141,456 humans","volume":"581","author":"Karczewski","year":"2020","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B22","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1001\/jama.2023.9498","article-title":"A more diverse and complete reference human genome is poised to change medicine","volume":"330","author":"Madhusoodanan","year":"2023","journal-title":"JAMA"},{"key":"2024123122061411700_qzae070-B23","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1038\/s41586-023-05895-y","article-title":"Increased mutation and gene conversion within human segmental duplications","volume":"617","author":"Vollger","year":"2023","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B24","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1038\/s41586-023-06425-6","article-title":"Assembly of 43 human Y chromosomes reveals extensive complexity and variation","volume":"621","author":"Hallast","year":"2023","journal-title":"Nature"},{"key":"2024123122061411700_qzae070-B25","author":"Freed","year":"2022"},{"key":"2024123122061411700_qzae070-B26","doi-asserted-by":"crossref","first-page":"D835","DOI":"10.1093\/nar\/gkz972","article-title":"ClinVar: improvements to accessing data","volume":"48","author":"Landrum","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2024123122061411700_qzae070-B27","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1001\/jamaoncol.2020.6371","article-title":"Tumor mutation burden and cancer treatment","volume":"7","author":"Fusco","year":"2021","journal-title":"JAMA Oncol"},{"key":"2024123122061411700_qzae070-B28","first-page":"250647","article-title":"TNscope: accurate detection of somatic mutations with haplotype-based variant candidate detection and machine learning filtering","author":"Freed","year":"2018","journal-title":"bioRxiv"},{"key":"2024123122061411700_qzae070-B29","article-title":"FPfilter: a false-positive-specific filter for whole-genome sequencing variant calling from GATK","volume":"003525","author":"Tan","year":"2020","journal-title":"bioRxiv"},{"key":"2024123122061411700_qzae070-B30","doi-asserted-by":"crossref","first-page":"110986","DOI":"10.1016\/j.intimp.2023.110986","article-title":"The predictive value of tumor mutation burden on survival of gastric cancer patients treated with immune checkpoint inhibitors: a systematic review and meta-analysis","volume":"124","author":"Ke","year":"2023","journal-title":"Int Immunopharmacol"},{"key":"2024123122061411700_qzae070-B31","doi-asserted-by":"crossref","first-page":"1342","DOI":"10.1007\/s10120-021-01207-3","article-title":"Tumor mutation burden is correlated with response and prognosis in microsatellite-stable (MSS) gastric cancer patients undergoing neoadjuvant chemotherapy","volume":"24","author":"Li","year":"2021","journal-title":"Gastric Cancer"},{"key":"2024123122061411700_qzae070-B32","doi-asserted-by":"crossref","first-page":"3047","DOI":"10.1093\/bioinformatics\/btw354","article-title":"MultiQC: summarize analysis results for multiple tools and samples in a single report","volume":"32","author":"Ewels","year":"2016","journal-title":"Bioinformatics"},{"key":"2024123122061411700_qzae070-B33","doi-asserted-by":"crossref","DOI":"10.1093\/gigascience\/giab008","article-title":"Twelve years of SAMtools and BCFtools","volume":"10","author":"Danecek","year":"2021","journal-title":"Gigascience"},{"key":"2024123122061411700_qzae070-B34","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1016\/S0168-9525(00)02024-2","article-title":"EMBOSS: the European molecular biology open software suite","volume":"16","author":"Rice","year":"2000","journal-title":"Trends Genet"},{"key":"2024123122061411700_qzae070-B35","doi-asserted-by":"crossref","first-page":"2202","DOI":"10.1093\/bioinformatics\/btv112","article-title":"Unified representation of genetic variants","volume":"31","author":"Tan","year":"2015","journal-title":"Bioinformatics"},{"key":"2024123122061411700_qzae070-B36","doi-asserted-by":"crossref","first-page":"e164","DOI":"10.1093\/nar\/gkq603","article-title":"ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data","volume":"38","author":"Wang","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2024123122061411700_qzae070-B37","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1016\/j.gpb.2021.08.001","article-title":"The Genome Sequence Archive Family: toward explosive data growth and diverse data types","volume":"19","author":"Chen","year":"2021","journal-title":"Genomics Proteomics Bioinformatics"}],"container-title":["Genomics, Proteomics &amp; Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/gpb\/advance-article-pdf\/doi\/10.1093\/gpbjnl\/qzae070\/59639736\/qzae070.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/gpb\/article-pdf\/22\/5\/qzae070\/59639736\/qzae070.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/gpb\/article-pdf\/22\/5\/qzae070\/59639736\/qzae070.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,31]],"date-time":"2024-12-31T17:06:35Z","timestamp":1735664795000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/gpb\/article\/doi\/10.1093\/gpbjnl\/qzae070\/7815730"}},"subtitle":[],"editor":[{"given":"Ting","family":"Wang","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2024,10]]},"references-count":37,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,12,3]]}},"URL":"https:\/\/doi.org\/10.1093\/gpbjnl\/qzae070","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2024.08.19.608554","asserted-by":"object"}]},"ISSN":["1672-0229","2210-3244"],"issn-type":[{"value":"1672-0229","type":"print"},{"value":"2210-3244","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,10]]},"published":{"date-parts":[[2024,10]]},"article-number":"qzae070"}}