{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T19:49:24Z","timestamp":1758311364706,"version":"3.44.0"},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"12","license":[{"start":{"date-parts":[[2025,7,16]],"date-time":"2025-07-16T00:00:00Z","timestamp":1752624000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0"},{"start":{"date-parts":[[2025,7,16]],"date-time":"2025-07-16T00:00:00Z","timestamp":1752624000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0"}],"funder":[{"DOI":"10.13039\/100014718","name":"Innovative Research Group Project of the National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["32070678"],"award-info":[{"award-number":["32070678"]}],"id":[{"id":"10.13039\/100014718","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Appl Intell"],"published-print":{"date-parts":[[2025,8]]},"DOI":"10.1007\/s10489-025-06607-x","type":"journal-article","created":{"date-parts":[[2025,7,16]],"date-time":"2025-07-16T12:03:01Z","timestamp":1752667381000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["TransSSVs: a Transformer-based deep learning model for accurate detection of somatic small variants in paired tumor and normal sequencing data"],"prefix":"10.1007","volume":"55","author":[{"given":"Jing","family":"Meng","sequence":"first","affiliation":[]},{"given":"Jiangyuan","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Jingze","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Wenkai","family":"Song","sequence":"additional","affiliation":[]},{"given":"Ming","family":"Li","sequence":"additional","affiliation":[]},{"given":"Aiping","family":"Wu","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6280-6347","authenticated-orcid":false,"given":"Taijiao","family":"Jiang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,7,16]]},"reference":[{"key":"6607_CR1","doi-asserted-by":"publisher","first-page":"842","DOI":"10.1056\/NEJMra1204892","volume":"368","author":"S Aparicio","year":"2013","unstructured":"Aparicio S, Caldas C (2013) The implications of clonal genome evolution for cancer medicine. N Engl J Med 368:842\u2013851","journal-title":"N Engl J Med"},{"key":"6607_CR2","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1038\/nature10762","volume":"481","author":"M Greaves","year":"2012","unstructured":"Greaves M (2012) Maley. Clonal evolution in cancer. Nature 481:306\u2013313","journal-title":"Nature"},{"key":"6607_CR3","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1016\/j.csbj.2018.01.003","volume":"16","author":"C Xu","year":"2018","unstructured":"Xu C (2018) A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data. Comput Struct Biotechnol J 16:15\u201324","journal-title":"Comput Struct Biotechnol J"},{"key":"6607_CR4","unstructured":"J. K. Teer (2014) An improved Understanding of cancer genomics through massively parallel sequencing. Transl Cancer Res. 3:243\u2013259"},{"key":"6607_CR5","doi-asserted-by":"publisher","first-page":"vbad001","DOI":"10.1093\/bioadv\/vbad001","volume":"3","author":"S Zhang","year":"2023","unstructured":"Zhang S, Fan R, Liu Y, Chen S, Liu Q (2023) Zeng. Applications of transformer-based Language models in bioinformatics: a survey. Bioinform Adv 3:vbad001","journal-title":"Bioinform Adv"},{"key":"6607_CR6","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1038\/nrg3445","volume":"14","author":"JC Mwenifumbo","year":"2013","unstructured":"Mwenifumbo JC (2013) Marra. Cancer genome-sequencing study design. Nat Rev Genet 14:321\u2013332","journal-title":"Nat Rev Genet"},{"key":"6607_CR7","doi-asserted-by":"crossref","unstructured":"Alioto TS, Buchhalter I, Derdak S, Hutter B, Eldridge MD, Hovig E, Heisler LE, Beck TA, Simpson JT, Tonon L, Sertier AS, Patch AM, Jager N, Ginsbach P, Drews R, Paramasivam N, Kabbe R, Chotewutmontri S, Diessl N, Previti C, Schmidt S, Brors B, Feuerbach L, Heinold M, Grobner S, Korshunov A, Tarpey PS, Butler AP, Hinton J, Jones D, Menzies A, Raine K, Shepherd R, Stebbings L, Teague JW, Ribeca P, Giner FC, Beltran S, Raineri E, Dabad M, Heath SC, Gut M, Denroche RE, Harding NJ, Yamaguchi TN, Fujimoto A, Nakagawa H, Quesada V, Valdes-Mas R, Nakken S, Vodak D, Bower L, Lynch AG, Anderson CL, Waddell N, Pearson JV, Grimmond SM, Peto M, Spellman P, He M, Kandoth C, Lee S, Zhang J, Letourneau L, Ma S, Seth S, Torrents D, Xi L, Wheeler DA, Lopez-Otin C, Campo E, Campbell PJ, Boutros PC, Puente XS, Gerhard DS, Pfister SM, J. D., McPherson TJ, Hudson M, Schlesner P, Lichter R, Eils (2015) D. T. Jones, I. G. Gut. A comprehensive assessment of somatic mutation detection in cancer using whole-genome sequencing, Nat Commun. 6:10001","DOI":"10.1038\/ncomms10001"},{"key":"6607_CR8","doi-asserted-by":"publisher","first-page":"213","DOI":"10.1038\/nbt.2514","volume":"31","author":"K Cibulskis","year":"2013","unstructured":"Cibulskis K, Lawrence MS, Carter SL, Sivachenko A, Jaffe D, Sougnez C, Gabriel S, Meyerson M, Lander ES (2013) Getz. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat Biotechnol 31:213\u2013219","journal-title":"Nat Biotechnol"},{"key":"6607_CR9","doi-asserted-by":"publisher","first-page":"e89","DOI":"10.1093\/nar\/gkt126","volume":"41","author":"Y Shiraishi","year":"2013","unstructured":"Shiraishi Y, Sato Y, Chiba K, Okuno Y, Nagata Y, Yoshida K, Shiba N, Hayashi Y, Kume H, Homma Y, Sanada M, Ogawa S (2013) Miyano. An empirical bayesian framework for somatic mutation detection from cancer genome sequencing data. Nucleic Acids Res 41:e89","journal-title":"Nucleic Acids Res"},{"key":"6607_CR10","doi-asserted-by":"publisher","first-page":"568","DOI":"10.1101\/gr.129684.111","volume":"22","author":"DC Koboldt","year":"2012","unstructured":"Koboldt DC, Zhang Q, Larson DE, Shen D, McLellan MD, Lin L, Miller CA, Mardis ER, Ding L (2012) Wilson. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res 22:568\u2013576","journal-title":"Genome Res"},{"key":"6607_CR11","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1186\/s13059-015-0758-2","volume":"16","author":"LT Fang","year":"2015","unstructured":"Fang LT, Afshar PT, Chhibber A, Mohiyuddin M, Fan Y, Mu JC, Gibeling G, Barr S, Asadi NB, Gerstein MB, Koboldt DC, Wang W, Wong WH (2015) Lam. An ensemble approach to accurately detect somatic mutations using somaticseq. Genome Biol 16:197","journal-title":"Genome Biol"},{"key":"6607_CR12","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1093\/bioinformatics\/btr629","volume":"28","author":"J Ding","year":"2012","unstructured":"Ding J, Bashashati A, Roth A, Oloumi A, Tse K, Zeng T, Haffari G, Hirst M, Marra MA, Condon A, Aparicio S, Shah SP (2012) Feature-based classifiers for somatic mutation detection in tumour-normal paired sequencing data. Bioinformatics 28:167\u2013175","journal-title":"Bioinformatics"},{"key":"6607_CR13","doi-asserted-by":"crossref","unstructured":"D. J. Wilkinson (2007) Bayesian methods in bioinformatics and computational systems biology. Brief Bioinform. 8:109\u2013116","DOI":"10.1093\/bib\/bbm007"},{"key":"6607_CR14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-12-1","volume":"12","author":"D Frazer Meacham","year":"2011","unstructured":"Frazer Meacham D, Boffelli J, Dhahbi DIK, Martin M, Singer (2011) Lior Pachter. Identification and correction of systematic error in high-throughput sequence data. BMC Bioinformatics 12:1\u201311","journal-title":"BMC Bioinformatics"},{"key":"6607_CR15","doi-asserted-by":"publisher","first-page":"464","DOI":"10.1038\/s41576-023-00590-0","volume":"24","author":"D Nathan","year":"2023","unstructured":"Nathan D, Olson J, Wagner N, Dwarshuis KH, Miga FJ, Sedlazeck M, Salit, Justin M, Zook (2023) Variant calling and benchmarking in an era of complete human genome sequences. Nat Rev Genet 24:464\u2013483","journal-title":"Nat Rev Genet"},{"key":"6607_CR16","doi-asserted-by":"publisher","first-page":"1798","DOI":"10.1109\/TPAMI.2013.50","volume":"35","author":"Y Bengio","year":"2013","unstructured":"Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35:1798\u20131828","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6607_CR17","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436\u2013444","journal-title":"Nature"},{"key":"6607_CR18","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","volume":"61","author":"J Schmidhuber","year":"2015","unstructured":"Schmidhuber J (2015) Deep learning in neural networks: an overview. Neural Netw 61:85\u2013117","journal-title":"Neural Netw"},{"key":"6607_CR19","doi-asserted-by":"publisher","first-page":"bbae033","DOI":"10.1093\/bib\/bbae033","volume":"25","author":"J Jing Meng","year":"2024","unstructured":"Jing Meng J, Liu W, Song H, Li J, Wang L, Zhang Y, Peng A, Wu T, Jiang (2024) PREDAC-CNN: predicting antigenic clusters of seasonal influenza A viruses with convolutional neural network. Brief Bioinform 25:bbae033","journal-title":"Brief Bioinform"},{"key":"6607_CR20","doi-asserted-by":"publisher","first-page":"184119","DOI":"10.1109\/ACCESS.2024.3503413","volume":"12","author":"N Mansoor Hayat","year":"2024","unstructured":"Mansoor Hayat N, Ahmad A, Nasir (2024) Zeeshan Ahmad Tariq. Hybrid deep learning EfficientNetV2 and vision transformer (EffNetV2-ViT) model for breast Cancer histopathological image classification. IEEE Access 12:184119\u2013184131","journal-title":"IEEE Access"},{"key":"6607_CR21","doi-asserted-by":"crossref","unstructured":"Nouman Ahmad R, Strand Bj\u00f6rn, Sparres\u00e4ter S, Tarai E, Lundstr\u00f6m (2023) G\u00f6ran Bergstr\u00f6m, H\u00e5kan Ahlstr\u00f6m, Joel Kullberg. Automatic segmentation of large-scale CT image datasets for detailed body composition analysis. BMC Bioinformatics, 24","DOI":"10.1186\/s12859-023-05462-2"},{"key":"6607_CR22","doi-asserted-by":"publisher","first-page":"2751","DOI":"10.1007\/s00371-021-02153-y","volume":"38","author":"S Nouman Ahmad","year":"2021","unstructured":"Nouman Ahmad S, Asghar, Saira Andleeb Gillani (2021) Transfer learning-assisted multi-resolution breast cancer histopathological images classification. Visual Comput 38:2751\u20132770","journal-title":"Visual Comput"},{"key":"6607_CR23","doi-asserted-by":"publisher","first-page":"1041","DOI":"10.1038\/s41467-019-09027-x","volume":"10","author":"SME Sahraeian","year":"2019","unstructured":"Sahraeian SME, Liu R, Lau B, Podesta K, Mohiyuddin M (2019) Y. K. Lam. Deep convolutional neural networks for accurate somatic mutation detection. Nat Commun 10:1041","journal-title":"Nat Commun"},{"key":"6607_CR24","doi-asserted-by":"crossref","unstructured":"Meng J, Victor B, He Z, Liu H, Jiang T (2021) DeepSSV: detecting somatic small variants in paired tumor and normal sequencing data with convolutional neural network. Brief Bioinform, 22","DOI":"10.1093\/bib\/bbaa272"},{"key":"6607_CR25","doi-asserted-by":"publisher","first-page":"1512","DOI":"10.1038\/s41588-023-01465-0","volume":"55","author":"N Brandes","year":"2023","unstructured":"Brandes N, Goldman G, Wang CH, Ye CJ, Ntranos V (2023) Genome-wide prediction of disease variant effects with a deep protein Language model. Nat Genet 55:1512\u20131522","journal-title":"Nat Genet"},{"key":"6607_CR26","doi-asserted-by":"crossref","unstructured":"Chen J, Xu H, Tao W, Chen Z, Zhao Y, Jing-Dong J (2023) Han. Transformer for one stop interpretable cell type annotation. Nat Commun, 14","DOI":"10.1038\/s41467-023-35923-4"},{"key":"6607_CR27","doi-asserted-by":"publisher","first-page":"300","DOI":"10.1038\/s42256-022-00459-7","volume":"4","author":"Y Chu","year":"2022","unstructured":"Chu Y, Zhang Y, Wang Q, Zhang L, Wang X, Wang Y, Salahub DR, Xu Q, Wang J, Jiang X, Xiong Y (2022) Dong-Qing Wei. A transformer-based model to predict peptide\u2013HLA class I binding and optimize mutated peptides for vaccine design. Nat Mach Intell 4:300\u2013311","journal-title":"Nat Mach Intell"},{"key":"6607_CR28","doi-asserted-by":"crossref","unstructured":"Zhang S, Liu RFY, Chen S, Liu Q, Zeng W (2023) Alex Bateman. Applications of transformer-based Language models in bioinformatics: a survey. Bioinf Adv, 3","DOI":"10.1093\/bioadv\/vbad001"},{"key":"6607_CR29","doi-asserted-by":"publisher","first-page":"1196","DOI":"10.1038\/s41592-021-01252-x","volume":"18","author":"V \u017diga Avsec","year":"2021","unstructured":"\u017diga Avsec V, Agarwal D, Visentin JR, Ledsam A, Grabska-Barwinska KR, Taylor Y, Assael J, Jumper P, Kohli, David R (2021) Kelley. Effective gene expression prediction from sequence by integrating long-range interactions. Nat Methods 18:1196\u20131203","journal-title":"Nat Methods"},{"key":"6607_CR30","doi-asserted-by":"crossref","unstructured":"Artur Sza\u0142ata K, Hrovatin H, Cui B, Wang, Fabian J (2024) Theis. Transformers in single-cell omics: a review and new perspectives, Nature Methods. 21:1430\u20131443","DOI":"10.1038\/s41592-024-02353-z"},{"key":"6607_CR31","doi-asserted-by":"publisher","first-page":"24607","DOI":"10.1038\/srep24607","volume":"6","author":"DW Craig","year":"2016","unstructured":"Craig DW, Nasser S, Corbett R, Chan SK, Murray L, Legendre C, Tembe W, Adkins J, Kim N, Wong S, Baker A, Enriquez D, Pond S, Pleasance E, Mungall AJ, Moore RA, McDaniel T, Ma Y, Jones SJ, Marra MA, Carpten JD (2016) Liang. A somatic reference standard for cancer genome sequencing. Sci Rep 6:24607","journal-title":"Sci Rep"},{"key":"6607_CR32","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1016\/j.cels.2015.08.015","volume":"1","author":"M Griffith","year":"2015","unstructured":"Griffith M, Miller CA, Griffith OL, Krysiak K, Skidmore ZL, Ramu A, Walker JR, Dang HX, Trani L, Larson DE, Demeter RT, Wendl MC, McMichael JF, Austin RE, Magrini V, McGrath SD, Ly A, Kulkarni S, Cordes MG, Fronick CC, Fulton RS, Maher CA, Ding L, Klco JM, Mardis ER (2015) Ley, R. K. Wilson. Optimizing cancer genome sequencing and analysis. Cell Syst 1:210\u2013223","journal-title":"Cell Syst"},{"key":"6607_CR33","unstructured":"Li H (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM., arXiv:1303.3997v2 [q-bio.GN]"},{"key":"6607_CR34","doi-asserted-by":"publisher","first-page":"1297","DOI":"10.1101\/gr.107524.110","volume":"20","author":"A McKenna","year":"2010","unstructured":"McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M (2010) DePristo. The genome analysis toolkit: a mapreduce framework for analyzing next-generation DNA sequencing data. Genome Res 20:1297\u20131303","journal-title":"Genome Res"},{"key":"6607_CR35","doi-asserted-by":"publisher","first-page":"e0202982","DOI":"10.1371\/journal.pone.0202982","volume":"13","author":"J Meng","year":"2018","unstructured":"Meng J, Chen YP (2018) A database of simulated tumor genomes towards accurate detection of somatic small variants in cancer. PLoS ONE 13:e0202982","journal-title":"PLoS ONE"},{"key":"6607_CR36","doi-asserted-by":"publisher","first-page":"246","DOI":"10.1038\/nbt.2835","volume":"32","author":"JM Zook","year":"2014","unstructured":"Zook JM, Chapman B, Wang J, Mittelman D, Hofmann O, Hide W (2014) Salit. Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls. Nat Biotechnol 32:246\u2013251","journal-title":"Nat Biotechnol"},{"key":"6607_CR37","doi-asserted-by":"crossref","unstructured":"Lin T-Y, Goyal P, Girshick R, He K (2017) Piotr Dollar. Focal Loss for Dense Object Detection. IEEE International Conference on Computer Vision (ICCV). 2999\u20133007","DOI":"10.1109\/ICCV.2017.324"},{"key":"6607_CR38","doi-asserted-by":"publisher","first-page":"2705","DOI":"10.1093\/bioinformatics\/btac188","volume":"38","author":"L Liangpeng Nie","year":"2022","unstructured":"Liangpeng Nie L, Quan T, Wu R, He Q, Lyu (2022) Pier Luigi Martelli. TransPPMP: predicting pathogenicity of frameshift and non-sense mutations by a transformer based on protein features. Bioinformatics 38:2705\u20132711","journal-title":"Bioinformatics"},{"key":"6607_CR39","unstructured":"Jimmy Lei Ba Diederik P Kingma. Adam: A method for stochastic optimization, arXiv:1412.6980 2015"},{"key":"6607_CR40","doi-asserted-by":"publisher","first-page":"748","DOI":"10.1186\/s12864-017-4134-3","volume":"18","author":"V Vijayan","year":"2017","unstructured":"Vijayan V, Yiu SM, Zhang L (2017) Improving somatic variant identification through integration of genome and exome data. BMC Genomics 18:748","journal-title":"BMC Genomics"},{"key":"6607_CR41","unstructured":"Vaswani NSA, Parmar N, Uszkoreit J, Jones L, Gomez AN (2017) \u0141ukasz Kaiser, Illia Polosukhin. Attention is all you need, 31st Conference on Neural Information Processing Systems (NIPS 2017)"},{"key":"6607_CR42","doi-asserted-by":"crossref","unstructured":"Kiran Krishnamachari D, Lu A, Swift-Scott A, Yeraliyev K, Lee W, Huang (2022) Sim Ngak Leng, Anders Jacobsen Skanderup. Accurate somatic variant detection using weakly supervised deep learning. Nat Commun, 13","DOI":"10.1038\/s41467-022-31765-8"},{"key":"6607_CR43","doi-asserted-by":"publisher","first-page":"885","DOI":"10.1038\/s41587-021-00861-3","volume":"39","author":"P Daniel","year":"2021","unstructured":"Daniel P, Cooke DC, Wedge (2021) Gerton lunter. A unified haplotype-based method for accurate and comprehensive variant calling. Nat Biotechnol 39:885\u2013892","journal-title":"Nat Biotechnol"}],"container-title":["Applied Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-025-06607-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10489-025-06607-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-025-06607-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T15:57:44Z","timestamp":1758297464000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10489-025-06607-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,16]]},"references-count":43,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2025,8]]}},"alternative-id":["6607"],"URL":"https:\/\/doi.org\/10.1007\/s10489-025-06607-x","relation":{},"ISSN":["0924-669X","1573-7497"],"issn-type":[{"type":"print","value":"0924-669X"},{"type":"electronic","value":"1573-7497"}],"subject":[],"published":{"date-parts":[[2025,7,16]]},"assertion":[{"value":"30 April 2025","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 July 2025","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have declared that no competing interests exist.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"874"}}