{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T11:29:23Z","timestamp":1767871763080,"version":"3.49.0"},"reference-count":60,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T00:00:00Z","timestamp":1767830400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>As a driving force of the Fourth Industrial Revolution, deep learning methods have achieved significant success across various fields, including genetic and genomic studies. While individual-level genetic data is ideal for deep learning models, privacy concerns and data-sharing restrictions often limit its availability to researchers.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>In this paper, we investigated the potential applications of deep learning models\u2014including deep neural networks, convolutional neural networks, recurrent neural networks, and transformers\u2014when only genetic summary data, such as linkage disequilibrium matrices, is available. The bootstrap method was used to approximate the test error. Simulation studies and real data analyses were conducted to compare the performance of deep learning methods in genetic risk prediction using individual-level genetic data versus genetic summary data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>The test mean squared errors (MSEs) of most applied deep learning models are comparable when using individual-level data versus summary data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>Our results suggest that suitable deep learning methods could also serve as an alternative approach to predict disease related traits when only linkage disequilibrium matrices are available as input.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.3389\/fbinf.2025.1657021","type":"journal-article","created":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T06:37:08Z","timestamp":1767854228000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Genetic risk predictions using deep learning models with summary data"],"prefix":"10.3389","volume":"5","author":[{"given":"Angela","family":"Wang","sequence":"first","affiliation":[]},{"given":"Elena","family":"Xiao","sequence":"additional","affiliation":[]},{"given":"Jason","family":"Cheng","sequence":"additional","affiliation":[]},{"given":"Xiaoxi","family":"Shen","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2026,1,8]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"107776","DOI":"10.1016\/j.ijpe.2020.107776","article-title":"Industry 4.0 technologies assessment: a sustainability perspective","volume":"229","author":"Bai","year":"2020","journal-title":"Int. J. Prod. Econ."},{"key":"B2","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1109\/72.279181","article-title":"Learning long-term dependencies with gradient descent is difficult","volume":"5","author":"Bengio","year":"1994","journal-title":"IEEE Trans. Neural Netw."},{"key":"B3","doi-asserted-by":"publisher","first-page":"e1003608","DOI":"10.1371\/journal.pgen.1003608","article-title":"Prediction of complex human traits using the genomic best linear unbiased predictor","volume":"9","author":"Campos","year":"2013","journal-title":"PLOS Genet."},{"key":"B4","doi-asserted-by":"publisher","first-page":"793","DOI":"10.1056\/NEJMp1500523","article-title":"A new initiative on precision medicine","volume":"372","author":"Collins","year":"2015","journal-title":"N. Engl. J. Med."},{"key":"B5","doi-asserted-by":"publisher","first-page":"661","DOI":"10.1038\/nature05911","article-title":"Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls","volume":"447","author":"Consortium","year":"2007","journal-title":"Nature"},{"key":"B6","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1007\/BF02551274","article-title":"Approximation by superpositions of a sigmoidal function","volume":"2","author":"Cybenko","year":"1989","journal-title":"Math. Control Signal Syst."},{"key":"B7","doi-asserted-by":"publisher","first-page":"1898","DOI":"10.1109\/TNNLS.2022.3185742","article-title":"Significance tests of feature relevance for a black-box learner","volume":"35","author":"Dai","year":"2024","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"B8","doi-asserted-by":"crossref","DOI":"10.1201\/9780429246593","volume-title":"An introduction to the bootstrap","author":"Efron","year":"1994"},{"key":"B9","doi-asserted-by":"publisher","first-page":"856","DOI":"10.1038\/s42003-022-03812-z","article-title":"Non-linear machine learning models incorporating SNPs and PRS improve polygenic prediction in diverse human populations","volume":"5","author":"Elgart","year":"2022","journal-title":"Commun. Biol."},{"key":"B10","doi-asserted-by":"publisher","first-page":"2225","DOI":"10.1126\/science.1069424","article-title":"The structure of haplotype blocks in the human genome","volume":"296","author":"Gabriel","year":"2002","journal-title":"Science"},{"key":"B11","doi-asserted-by":"publisher","first-page":"14579","DOI":"10.1038\/s41598-024-65317-5","article-title":"Distributed transformer for high order epistasis detection in large-scale datasets","volume":"14","author":"Gra\u00e7a","year":"2024","journal-title":"Sci. Rep."},{"key":"B12","doi-asserted-by":"publisher","first-page":"602","DOI":"10.1016\/j.neunet.2005.06.042","article-title":"Framewise phoneme classification with bidirectional LSTM and other neural network architectures","volume":"18","author":"Graves","year":"2005","journal-title":"Neural Netw. IJCNN"},{"key":"B13","doi-asserted-by":"publisher","first-page":"1366","DOI":"10.1093\/bioinformatics\/bty811","article-title":"Powerful and efficient SNP-Set association tests across multiple phenotypes using GWAS summary data","volume":"35","author":"Guo","year":"2019","journal-title":"Bioinformatics"},{"key":"B14","doi-asserted-by":"publisher","first-page":"S151","DOI":"10.1016\/j.neurobiolaging.2014.02.033","article-title":"Genome-wide interaction analysis reveals replicated epistatic effects on brain structure","volume":"36","author":"Hibar","year":"2015","journal-title":"Neurobiol. Aging"},{"key":"B15","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"B16","doi-asserted-by":"publisher","first-page":"1171","DOI":"10.1214\/009053607000000677","article-title":"Kernel methods in machine learning","volume":"36","author":"Hofmann","year":"2008","journal-title":"Ann. Statistics"},{"key":"B17","first-page":"1","article-title":"Significance tests for neural networks","volume":"21","author":"Horel","year":"2020","journal-title":"J. Mach. Learn. Res."},{"key":"B18","doi-asserted-by":"publisher","first-page":"359","DOI":"10.1016\/0893-6080(89)90020-8","article-title":"Multilayer feedforward networks are universal approximators","volume":"2","author":"Hornik","year":"1989","journal-title":"Neural Networks"},{"key":"B19","doi-asserted-by":"publisher","first-page":"e54483","DOI":"10.1371\/journal.pone.0054483","article-title":"Dissecting the gene dose-effects of the APOE \u03b54 and \u03b52 alleles on hippocampal volumes in aging and alzheimer\u2019s disease","volume":"8","author":"Hostage","year":"2013","journal-title":"PLOS ONE"},{"key":"B20","volume-title":"An introduction to statistical learning, springer texts in statistics","author":"James","year":"2013"},{"key":"B21","doi-asserted-by":"publisher","first-page":"R182","DOI":"10.1093\/hmg\/ddr378","article-title":"Genetic risk prediction in complex disease","volume":"20","author":"Jostins","year":"2011","journal-title":"Hum. Mol. Genet."},{"key":"B22","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1016\/j.neuron.2014.05.041","article-title":"Alzheimer\u2019s disease genetics: from the bench to the clinic","volume":"83","author":"Karch","year":"2014","journal-title":"Neuron"},{"key":"B23","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1412.6980","article-title":"Adam: a method for stochastic optimization","author":"Kingma","year":"2017"},{"key":"B24","doi-asserted-by":"publisher","DOI":"10.5555\/3618408.3619133","article-title":"TabDDPM: modelling tabular data with diffusion models","author":"Kotelnikov","year":"2024"},{"key":"B25","doi-asserted-by":"publisher","first-page":"1178","DOI":"10.1093\/bioinformatics\/btv719","article-title":"Adaptive gene- and pathway-trait association testing with GWAS summary statistics","volume":"32","author":"Kwak","year":"2016","journal-title":"Bioinformatics"},{"key":"B26","volume-title":"Generalization and network design strategies","author":"LeCun","year":"1989"},{"key":"B27","doi-asserted-by":"publisher","first-page":"242","DOI":"10.1002\/gepi.21790","article-title":"A generalized genetic random field method for the genetic association analysis of sequencing data","volume":"38","author":"Li","year":"2014","journal-title":"Genet. Epidemiol."},{"key":"B28","doi-asserted-by":"publisher","first-page":"6999","DOI":"10.1109\/TNNLS.2021.3084827","article-title":"A survey of convolutional neural networks: analysis, applications, and prospects","volume":"33","author":"Li","year":"2022","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"B29","doi-asserted-by":"publisher","first-page":"7954","DOI":"10.1038\/s41467-023-43565-9","article-title":"Accurate and efficient estimation of local heritability using summary statistics and the linkage disequilibrium matrix","volume":"14","author":"Li","year":"2023","journal-title":"Nat. Commun."},{"key":"B30","doi-asserted-by":"publisher","first-page":"24317486","DOI":"10.1101\/2024.11.18.24317486","article-title":"Leveraging hierarchical structures for genetic block interaction studies using the hierarchical transformer","volume":"11","author":"Li","year":"2025","journal-title":"medRxiv."},{"key":"B31","doi-asserted-by":"publisher","first-page":"e1010328","DOI":"10.1371\/journal.pcbi.1010328","article-title":"Explainable deep transfer learning model for disease risk prediction using high-dimensional genomic data","volume":"18","author":"Liu","year":"2022","journal-title":"PLOS Comput. Biol."},{"key":"B32","doi-asserted-by":"publisher","first-page":"e0116920","DOI":"10.1371\/journal.pone.0116920","article-title":"Investigating the genetics of hippocampal volume in older adults without dementia","volume":"10","author":"Mather","year":"2015","journal-title":"PLOS ONE"},{"key":"B33","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1186\/1750-1326-6-85","article-title":"Adult hippocampal neurogenesis and its role in Alzheimer\u2019s disease","volume":"6","author":"Mu","year":"2011","journal-title":"Mol. Neurodegener."},{"key":"B34","doi-asserted-by":"publisher","first-page":"1666","DOI":"10.3390\/genes14091666","article-title":"Interaction analysis reveals complex genetic associations with alzheimer\u2019s disease in the CLU and ABCA7 gene regions","volume":"14","author":"Nazarian","year":"2023","journal-title":"Genes"},{"key":"B35","first-page":"464","article-title":"Toward transparent AI: a survey on interpreting the inner structures of deep neural networks","author":"R\u00e4uker","year":"2023"},{"key":"B36","doi-asserted-by":"publisher","first-page":"386","DOI":"10.1037\/h0042519","article-title":"The perceptron: a probabilistic model for information storage and organization in the brain","volume":"65","author":"Rosenblatt","year":"1958","journal-title":"Psychol. Review"},{"key":"B37","doi-asserted-by":"publisher","first-page":"1067","DOI":"10.1093\/brain\/awp007","article-title":"MRI of hippocampal volume loss in early Alzheimer\u2019s disease in relation to ApoE genotype and biomarkers","volume":"132","author":"Schuff","year":"2009","journal-title":"Brain"},{"key":"B38","doi-asserted-by":"publisher","first-page":"1341","DOI":"10.1126\/science.1142382","article-title":"A genome-wide association study of type 2 diabetes in finns detects multiple susceptibility variants","volume":"316","author":"Scott","year":"2007","journal-title":"Science"},{"key":"B39","doi-asserted-by":"publisher","first-page":"1460369","DOI":"10.3389\/fsysb.2024.1460369","article-title":"An exploration of testing genetic associations using goodness-of-fit statistics based on deep ReLU neural networks","volume":"4","author":"Shen","year":"2024","journal-title":"Front. Syst. Biol."},{"key":"B40","doi-asserted-by":"publisher","first-page":"109100","DOI":"10.1016\/j.spl.2021.109100","article-title":"A goodness-of-fit test based on neural network sieve estimators","volume":"174","author":"Shen","year":"2021","journal-title":"Statistics and Probability Letters"},{"key":"B41","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2212.08255","article-title":"A sieve quasi-likelihood ratio test for neural networks with applications to genetic association studies","author":"Shen","year":""},{"key":"B42","doi-asserted-by":"publisher","first-page":"877717","DOI":"10.3389\/fsysb.2022.877717","article-title":"A brief review on deep learning applications in genomic studies","volume":"2","author":"Shen","year":"","journal-title":"Front. Syst. Biol."},{"key":"B43","doi-asserted-by":"publisher","first-page":"517","DOI":"10.1002\/sim.9257","article-title":"A conditional autoregressive model for genetic association analysis accounting for genetic heterogeneity","volume":"41","author":"Shen","year":"","journal-title":"Stat. Med."},{"key":"B44","first-page":"3285","article-title":"The performance of LSTM and BiLSTM in forecasting time series","volume-title":"2019 IEEE international conference on big data (big data). Presented at the 2019 IEEE international conference on big data (big data)","author":"Siami-Namini","year":"2019"},{"key":"B45","doi-asserted-by":"publisher","first-page":"311","DOI":"10.1038\/s41593-020-0599-5","article-title":"The multiplex model of the genetics of Alzheimer\u2019s disease","volume":"23","author":"Sims","year":"2020","journal-title":"Nat. Neurosci."},{"key":"B46","doi-asserted-by":"publisher","first-page":"881","DOI":"10.1038\/nature05616","article-title":"A genome-wide association study identifies novel risk loci for type 2 diabetes","volume":"445","author":"Sladek","year":"2007","journal-title":"Nature"},{"key":"B47","doi-asserted-by":"publisher","first-page":"1550","DOI":"10.1101\/gr.169375.113","article-title":"MultiBLUP: improved SNP-Based prediction for complex traits","volume":"24","author":"Speed","year":"2014","journal-title":"Genome Res."},{"key":"B48","doi-asserted-by":"publisher","first-page":"277","DOI":"10.1038\/s41588-018-0279-5","article-title":"SumHer better estimates the SNP heritability of complex traits from summary statistics","volume":"51","author":"Speed","year":"2019","journal-title":"Nat. Genet."},{"key":"B49","doi-asserted-by":"publisher","first-page":"458","DOI":"10.1038\/s41588-020-0600-y","article-title":"Evaluating and improving heritability models using summary statistics","volume":"52","author":"Speed","year":"2020","journal-title":"Nat. Genet."},{"key":"B50","doi-asserted-by":"publisher","first-page":"754","DOI":"10.1002\/hipo.1091","article-title":"Heritability of hippocampal size in elderly twin men: equivalent influence from genes and environment","volume":"11","author":"Sullivan","year":"2001","journal-title":"Hippocampus"},{"key":"B51","doi-asserted-by":"publisher","first-page":"3701","DOI":"10.1093\/bioinformatics\/btz172","article-title":"Gene-based association tests using GWAS summary statistics","volume":"35","author":"Svishcheva","year":"2019","journal-title":"Bioinformatics"},{"key":"B52","doi-asserted-by":"publisher","first-page":"1061","DOI":"10.1038\/nature09534","article-title":"A map of human genome variation from population scale sequencing","volume":"467","author":"Abecasis","year":"2010","journal-title":"Nature"},{"key":"B53","first-page":"5998","article-title":"Attention is all you need","author":"Vaswani","year":"2017"},{"key":"B54","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1016\/j.ajhg.2011.05.029","article-title":"Rare-variant association testing for sequencing data with the sequence kernel association test","volume":"89","author":"Wu","year":"2011","journal-title":"Am. J. Hum. Genet."},{"key":"B55","doi-asserted-by":"publisher","first-page":"e1009105","DOI":"10.1371\/journal.pgen.1009105","article-title":"Inferring causal direction between two traits in the presence of horizontal pleiotropy with GWAS summary data","volume":"16","author":"Xue","year":"2020","journal-title":"PLOS Genet."},{"key":"B56","doi-asserted-by":"publisher","first-page":"565","DOI":"10.1038\/ng.608","article-title":"Common SNPs explain a large proportion of the heritability for human height","volume":"42","author":"Yang","year":"2010","journal-title":"Nat. Genet."},{"key":"B57","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1016\/j.neunet.2017.07.002","article-title":"Error bounds for approximations with deep ReLU networks","volume":"94","author":"Yarotsky","year":"2017","journal-title":"Neural Netw."},{"key":"B58","first-page":"13005","article-title":"The phase diagram of approximation rates for deep neural networks,\u201d in Proceedings of the 34th International Conference on Neural Information Processing Systems (NIPS '20)","author":"Yarotsky","year":"2020"},{"key":"B59","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s43856-023-00269-x","article-title":"Deep learning-based polygenic risk analysis for Alzheimer\u2019s disease prediction","volume":"3","author":"Zhou","year":"2023","journal-title":"Commun. Med."},{"key":"B60","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1038\/s41467-017-02317-2","article-title":"Causal associations between risk factors and common diseases inferred from GWAS summary data","volume":"9","author":"Zhu","year":"2018","journal-title":"Nat. Commun."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1657021\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T06:37:10Z","timestamp":1767854230000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1657021\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,1,8]]},"references-count":60,"alternative-id":["10.3389\/fbinf.2025.1657021"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2025.1657021","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,1,8]]},"article-number":"1657021"}}