{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T14:52:44Z","timestamp":1754146364453,"version":"3.41.2"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T00:00:00Z","timestamp":1752796800000},"content-version":"vor","delay-in-days":17,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Australian Research Council Linkage Program","award":["LP210100414"],"award-info":[{"award-number":["LP210100414"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,7,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Current genome-wide association studies provide valuable insights into the genetic basis of ischaemic stroke (IS) risk. However, polygenic risk scores, the most widely used method for genetic risk prediction, have notable limitations due to their linear nature and inability to capture complex, nonlinear interactions among genetic variants. While deep neural networks offer advantages in modeling these complex relationships, the multifactorial nature of IS and the influence of modifiable risk factors present additional challenges for genetic risk prediction. To address these challenges, we propose a Chromosome-wise Multi-task Genomic (MetaGeno) framework that utilizes genetic data from IS and five related diseases. The framework includes a chromosome-based embedding layer to model local and global interactions among adjacent variants, enabling a biologically informed approach. Incorporating multi-disease learning further enhances predictive accuracy by leveraging shared genetic information. Among various sequential models tested, the Transformer demonstrated superior performance, and outperformed other machine learning models and PRS baselines, achieving an AUROC of 0.809 on the UK Biobank dataset. Risk stratification identified a two-fold increased stroke risk (HR, 2.14; 95% CI: 1.81\u20132.46) in the top 1% risk group, with a nearly five-fold increase in those with modifiable risk factors such as atrial fibrillation and hypertension. Finally, the model was validated on the diverse All of Us dataset (AUROC = 0.764), highlighting ancestry and population differences while demonstrating effective generalization. This study introduces a predictive framework that identifies high-risk individuals and informs targeted prevention strategies, offering potential as a clinical decision-support tool.<\/jats:p>","DOI":"10.1093\/bib\/bbaf348","type":"journal-article","created":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T05:36:52Z","timestamp":1752817012000},"source":"Crossref","is-referenced-by-count":0,"title":["MetaGeno: a chromosome-wise multi-task genomic framework for ischaemic stroke risk prediction"],"prefix":"10.1093","volume":"26","author":[{"given":"Yue","family":"Yang","sequence":"first","affiliation":[{"name":"Australian Artificial Intelligence Institute , Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo 2007, New South Wales,","place":["Australia"]}]},{"given":"Kairui","family":"Guo","sequence":"additional","affiliation":[{"name":"Australian Artificial Intelligence Institute , Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo 2007, New South Wales,","place":["Australia"]}]},{"given":"Yonggang","family":"Zhang","sequence":"additional","affiliation":[{"name":"Australian Artificial Intelligence Institute , Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo 2007, New South Wales,","place":["Australia"]}]},{"given":"Zhen","family":"Fang","sequence":"additional","affiliation":[{"name":"Australian Artificial Intelligence Institute , Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo 2007, New South Wales,","place":["Australia"]}]},{"given":"Hua","family":"Lin","sequence":"additional","affiliation":[{"name":"23Strands , 26-32 Pirrama Road, Pyrmont 2009, New South Wales,","place":["Australia"]}]},{"given":"Mark","family":"Grosser","sequence":"additional","affiliation":[{"name":"23Strands , 26-32 Pirrama Road, Pyrmont 2009, New South Wales,","place":["Australia"]}]},{"given":"Deon","family":"Venter","sequence":"additional","affiliation":[{"name":"23Strands , 26-32 Pirrama Road, Pyrmont 2009, New South Wales,","place":["Australia"]}]},{"given":"Weihai","family":"Lu","sequence":"additional","affiliation":[{"name":"23Strands , 26-32 Pirrama Road, Pyrmont 2009, New South Wales,","place":["Australia"]}]},{"given":"Mengjia","family":"Wu","sequence":"additional","affiliation":[{"name":"Australian Artificial Intelligence Institute , Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo 2007, New South Wales,","place":["Australia"]}]},{"given":"Dennis","family":"Cordato","sequence":"additional","affiliation":[{"name":"Department of Neurology and Neurophysiology , Liverpool Hospital, South Western Sydney Local Health District, Liverpool 2170, New South Wales,","place":["Australia"]}]},{"given":"Guangquan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Australian Artificial Intelligence Institute , Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo 2007, New South Wales,","place":["Australia"]}]},{"given":"Jie","family":"Lu","sequence":"additional","affiliation":[{"name":"Australian Artificial Intelligence Institute , Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo 2007, New South Wales,","place":["Australia"]}]}],"member":"286","published-online":{"date-parts":[[2025,7,18]]},"reference":[{"key":"2025071801364707700_ref1","doi-asserted-by":"publisher","first-page":"3161","DOI":"10.1161\/STROKEAHA.112.665760","article-title":"Genetic heritability of ischemic stroke and the contribution of previously reported candidate gene and genomewide associations","volume":"43","author":"Bevan","year":"2012","journal-title":"Stroke"},{"key":"2025071801364707700_ref2","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1016\/S1474-4422(15)00338-5","article-title":"Loci associated with ischaemic stroke and its subtypes (sign): a genome-wide association study","volume":"15","author":"Pulit","year":"2016","journal-title":"Lancet Neurol"},{"key":"2025071801364707700_ref3","doi-asserted-by":"publisher","first-page":"951","DOI":"10.1016\/S1474-4422(12)70234-X","article-title":"Genetic risk factors for ischaemic stroke and its subtypes (the metastroke collaboration): a meta-analysis of genome-wide association studies","volume":"11","author":"Traylor","year":"2012","journal-title":"Lancet Neurol"},{"key":"2025071801364707700_ref4","doi-asserted-by":"publisher","first-page":"870","DOI":"10.1161\/STR.0b013e318284056a","article-title":"Guidelines for the early management of patients with acute ischemic stroke: a guideline for healthcare professionals from the American Heart Association\/American Stroke Association","volume":"44","author":"Jauch","year":"2013","journal-title":"Stroke"},{"key":"2025071801364707700_ref5","doi-asserted-by":"publisher","first-page":"472","DOI":"10.1161\/CIRCRESAHA.116.308398","article-title":"Stroke risk factors, genetics, and prevention","volume":"120","author":"Boehme","year":"2017","journal-title":"Circ Res"},{"key":"2025071801364707700_ref6","doi-asserted-by":"publisher","first-page":"392","DOI":"10.1038\/nrg.2016.27","article-title":"Developing and evaluating polygenic risk prediction models for stratified disease prevention","volume":"17","author":"Chatterjee","year":"2016","journal-title":"Nat Rev Genet"},{"key":"2025071801364707700_ref7","doi-asserted-by":"publisher","first-page":"2759","DOI":"10.1038\/s41596-020-0353-1","article-title":"Tutorial: a guide to performing polygenic risk score analyses","volume":"15","author":"Choi","year":"2020","journal-title":"Nat Protoc"},{"key":"2025071801364707700_ref8","doi-asserted-by":"publisher","first-page":"524","DOI":"10.1038\/s41588-018-0058-3","article-title":"Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes","volume":"50","author":"Malik","year":"2018","journal-title":"Nat Genet"},{"key":"2025071801364707700_ref9","doi-asserted-by":"publisher","first-page":"e560","DOI":"10.1212\/NXG.0000000000000560","article-title":"Polygenic risk scores augment stroke subtyping","volume":"7","author":"Li","year":"2021","journal-title":"Neurol Genet"},{"key":"2025071801364707700_ref10","doi-asserted-by":"publisher","first-page":"2882","DOI":"10.1161\/STROKEAHA.120.033670","article-title":"Predictive performance of a polygenic risk score for incident ischemic stroke in a healthy older population","volume":"52","author":"Neumann","year":"2021","journal-title":"Stroke"},{"key":"2025071801364707700_ref11","doi-asserted-by":"publisher","first-page":"e003168","DOI":"10.1161\/CIRCGEN.120.003168","article-title":"Combining clinical and polygenic risk improves stroke prediction among individuals with atrial fibrillation","volume":"14","author":"O\u2019Sullivan","year":"2021","journal-title":"Circ: Genomic Precis Med"},{"key":"2025071801364707700_ref12","doi-asserted-by":"publisher","DOI":"10.1038\/s42003-024-05874-7","article-title":"Integration of risk factor polygenic risk score with disease polygenic risk score for disease prediction","volume":"7","author":"Jung","year":"2024","journal-title":"Commun Biol"},{"key":"2025071801364707700_ref13","doi-asserted-by":"publisher","first-page":"2349","DOI":"10.1056\/NEJMoa1605086","article-title":"Genetic risk, adherence to a healthy lifestyle, and coronary disease","volume":"375","author":"Khera","year":"2016","journal-title":"N Engl J Med"},{"key":"2025071801364707700_ref14","doi-asserted-by":"publisher","first-page":"636","DOI":"10.1001\/jama.2019.22241","article-title":"Predictive accuracy of a polygenic risk score\u2013enhanced prediction model vs a clinical risk score for coronary artery disease","volume":"323","author":"Elliott","year":"2020","journal-title":"JAMA"},{"key":"2025071801364707700_ref15","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1038\/nrg.2018.4","article-title":"Integrative omics for health and disease","volume":"19","author":"Karczewski","year":"2018","journal-title":"Nat Rev Genet"},{"key":"2025071801364707700_ref16","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1038\/s41582-020-0350-6","article-title":"Multilevel omics for the discovery of biomarkers and therapeutic targets for stroke","volume":"16","author":"Montaner","year":"2020","journal-title":"Nat Rev Neurol"},{"key":"2025071801364707700_ref17","doi-asserted-by":"publisher","first-page":"547","DOI":"10.1007\/978-3-031-41777-1_19","article-title":"Multi-omics approaches to discovering acute stroke injury and recovery mechanisms","volume-title":"Stroke Genetics","author":"Giles","year":"2024"},{"key":"2025071801364707700_ref18","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","author":"LeCun","year":"2015"},{"key":"2025071801364707700_ref19","doi-asserted-by":"publisher","first-page":"878","DOI":"10.15252\/msb.20156651","article-title":"Deep learning for computational biology","volume":"12","author":"Angermueller","year":"2016","journal-title":"Mol Syst Biol"},{"key":"2025071801364707700_ref20","doi-asserted-by":"publisher","first-page":"110937","DOI":"10.1016\/j.knosys.2023.110937","article-title":"Artificial intelligence-driven biomedical genomics","volume":"279","author":"Guo","year":"2023","journal-title":"Knowledge-Based Syst"},{"key":"2025071801364707700_ref21","doi-asserted-by":"publisher","first-page":"931","DOI":"10.1038\/nmeth.3547","article-title":"Predicting effects of noncoding variants with deep learning\u2013based sequence model","volume":"12","author":"Zhou","year":"2015","journal-title":"Nat Methods"},{"key":"2025071801364707700_ref22","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1007\/978-981-99-7108-4_16","article-title":"BiblioEngine: an AI-empowered platform for disease genetic knowledge mining","volume-title":"International Conference on Health Information Science","author":"Wu","year":"2023"},{"key":"2025071801364707700_ref23","doi-asserted-by":"crossref","first-page":"bbad095","DOI":"10.1093\/bib\/bbad095","article-title":"DeepFormer: a hybrid network based on convolutional neural network and flow-attention mechanism for identifying the function of DNA sequences","volume":"24","author":"Yao","year":"2023","journal-title":"Brief Bioinform"},{"key":"2025071801364707700_ref24","doi-asserted-by":"crossref","first-page":"bbac022","DOI":"10.1093\/bib\/bbac022","article-title":"Deep learning-based identification of genetic variants: application to Alzheimer\u2019s disease classification","volume":"23","author":"Jo","year":"2022","journal-title":"Brief Bioinform"},{"key":"2025071801364707700_ref25","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-019-13848-1","article-title":"Genomic risk score offers predictive performance comparable to clinical risk factors for ischaemic stroke","volume":"10","author":"Abraham","year":"2019","journal-title":"Nat Commun"},{"key":"2025071801364707700_ref26","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1038\/35075590","article-title":"Linkage disequilibrium in the human genome","volume":"411","author":"Reich","year":"2001","journal-title":"Nature"},{"key":"2025071801364707700_ref27","doi-asserted-by":"publisher","first-page":"477","DOI":"10.1038\/nrg2361","article-title":"Linkage disequilibrium-understanding the evolutionary past and mapping the medical future","volume":"9","author":"Slatkin","year":"2008","journal-title":"Nat Rev Genet"},{"key":"2025071801364707700_ref28","doi-asserted-by":"publisher","first-page":"983","DOI":"10.1038\/nbt.4235","article-title":"A universal SNP and small-indel variant caller using deep neural networks","volume":"36","author":"Poplin","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2025071801364707700_ref29","doi-asserted-by":"publisher","first-page":"2032","DOI":"10.1093\/bib\/bbaa022","article-title":"SG-LSTM-FRAME: a computational frame using sequence and geometrical information via LSTM to predict miRNA\u2013gene associations","volume":"22","author":"Xie","year":"2021","journal-title":"Brief Bioinform"},{"key":"2025071801364707700_ref30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-019-2972-5","article-title":"ET-GRU: using multi-layer gated recurrent units to identify electron transport proteins","volume":"20","author":"Le","year":"2019","journal-title":"BMC Bioinformatics"},{"key":"2025071801364707700_ref31","doi-asserted-by":"publisher","first-page":"2112","DOI":"10.1093\/bioinformatics\/btab083","article-title":"DNABERT: pre-trained bidirectional encoder representations from transformers model for DNA-language in genome","volume":"37","author":"Ji","year":"2021","journal-title":"Bioinformatics"},{"key":"2025071801364707700_ref32","doi-asserted-by":"publisher","first-page":"1332","DOI":"10.3389\/fgene.2019.01332","article-title":"Causalcall: nanopore basecalling using a temporal convolutional network","volume":"10","author":"Zeng","year":"2020","journal-title":"Front Genet"},{"key":"2025071801364707700_ref33","doi-asserted-by":"publisher","first-page":"e1001779","DOI":"10.1371\/journal.pmed.1001779","article-title":"UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age","volume":"12","author":"Sudlow","year":"2015","journal-title":"PLoS Med"},{"key":"2025071801364707700_ref34","doi-asserted-by":"publisher","first-page":"668","DOI":"10.1056\/NEJMsr1809937","article-title":"The \u2018All of Us\u2019 research program","volume":"381","author":"All of Us Research Program Investigators","year":"2019","journal-title":"New Eng J Med"},{"key":"2025071801364707700_ref35","volume-title":"Nat Genet"},{"key":"2025071801364707700_ref36","doi-asserted-by":"crossref","first-page":"5424","DOI":"10.1093\/bioinformatics\/btaa1029","article-title":"LDpred2: better, faster, stronger","volume":"36","author":"Priv\u00e9","year":"2020","journal-title":"Bioinformatics"},{"key":"2025071801364707700_ref37","doi-asserted-by":"publisher","first-page":"469","DOI":"10.1002\/gepi.22050","article-title":"Polygenic scores via penalized regression on summary statistics","volume":"41","author":"Mak","year":"2017","journal-title":"Genet Epidemiol"},{"key":"2025071801364707700_ref38","doi-asserted-by":"publisher","first-page":"752","DOI":"10.1016\/j.fmre.2024.02.015","article-title":"DeepRisk: a deep learning approach for genome-wide assessment of common disease risk","volume":"4","author":"Peng","year":"2024","journal-title":"Fundam Res"},{"key":"2025071801364707700_ref39","doi-asserted-by":"crossref","first-page":"1094","DOI":"10.1038\/s42003-021-02622-z","article-title":"GenNet framework: interpretable deep learning for predicting phenotypes from genetic data","volume":"4","author":"van Hilten","year":"2021","journal-title":"Commun Biol"},{"key":"2025071801364707700_ref40","doi-asserted-by":"publisher","first-page":"459","DOI":"10.1038\/nrg2813","article-title":"New approaches to population stratification in genome-wide association studies","volume":"11","author":"Price","year":"2010","journal-title":"Nat Rev Genet"},{"key":"2025071801364707700_ref41","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1016\/j.ajhg.2017.06.005","article-title":"10 years of GWAS discovery: biology, function, and translation","volume":"101","author":"Visscher","year":"2017","journal-title":"Am J Hum Genet"},{"key":"2025071801364707700_ref42","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1471-2164-6-140","article-title":"The distribution of SNPs in human gene regulatory regions","volume":"6","author":"Guo","year":"2005","journal-title":"BMC Genomics"},{"key":"2025071801364707700_ref43","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1016\/j.patcog.2017.11.004","article-title":"Structural property-aware multilayer network embedding for latent factor analysis","volume":"76","author":"Jie","year":"2018","journal-title":"Pattern Recognit"},{"key":"2025071801364707700_ref44","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput"},{"key":"2025071801364707700_ref45","doi-asserted-by":"publisher","volume-title":"Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 1724\u201334","DOI":"10.3115\/v1\/D14-1179"},{"key":"2025071801364707700_ref46","article-title":"Attention is all you need","volume-title":"Advances in neural information processing systems 30, 5998\u20136008","author":"Vaswani"},{"key":"2025071801364707700_ref47","first-page":"156","article-title":"Temporal convolutional networks for action segmentation and detection","volume-title":"proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Lea","year":"2017"},{"key":"2025071801364707700_ref48","doi-asserted-by":"publisher","first-page":"470","DOI":"10.1161\/CIRCULATIONAHA.120.051927","article-title":"Clinical application of a novel genetic risk score for ischemic stroke in patients with cardiometabolic disease","volume":"143","author":"Marston","year":"2021","journal-title":"Circulation"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/4\/bbaf348\/63791593\/bbaf348.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/4\/bbaf348\/63791593\/bbaf348.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T05:36:54Z","timestamp":1752817014000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf348\/8205773"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7]]},"references-count":48,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,7,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf348","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,7]]},"published":{"date-parts":[[2025,7]]},"article-number":"bbaf348"}}