{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:42:53Z","timestamp":1753875773432,"version":"3.41.2"},"reference-count":8,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2025,2,14]],"date-time":"2025-02-14T00:00:00Z","timestamp":1739491200000},"content-version":"vor","delay-in-days":13,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100020595","name":"National Science and Technology Council","doi-asserted-by":"publisher","award":["112-2314-B-001-010","111-2314-B-001-008"],"award-info":[{"award-number":["112-2314-B-001-010","111-2314-B-001-008"]}],"id":[{"id":"10.13039\/100020595","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,2,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Summary<\/jats:title>\n                  <jats:p>Polygenic risk scores (PRSs) hold promise for early disease diagnosis and personalized treatment, but their overall discriminative power remains limited for many diseases in the general population. As a result, numerous novel PRS modeling techniques have been developed to improve predictive performance, but determining the most effective method for a specific application remains uncertain until tested. Hence, we introduce a novel, versatile tool for building an optimized PRS model by integrating candidate models from multiple existing PRS building methods that use target population data and\/or incorporating information from other populations through a trans-ethnic approach. Our tool, PNL is based on PairNet algorithm, a Convolutional Neural Network with low computation complexity through simple paring operation. In the case studies for asthma, type 2 diabetes, and vertigo, the optimal PRS model generated with PNL using only Taiwan biobank (TWB) data achieved Area Under the Curves (AUCs) that matched or improved the best results using other methods individually. Incorporating the UK Biobank data (UKBB) data further improved performance of PNL for asthma and type 2 diabetes. For vertigo, unlike the other diseases, individual method analysis showed that UKBB data alone generally produced lower AUCs compared to TWB data alone. As a result, incorporating UKBB data did not improve AUC with PNL, suggesting that increasing the number of candidate models does not necessarily result in higher AUC values, alleviating concerns about overfitting.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The python code for PairNet algorithm incorporated in PNL is freely available on: https:\/\/github.com\/FannLab\/pairnet. An archived, citable version is stored on: https:\/\/doi.org\/10.5281\/zenodo.14838227.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf071","type":"journal-article","created":{"date-parts":[[2025,2,14]],"date-time":"2025-02-14T16:23:36Z","timestamp":1739550216000},"source":"Crossref","is-referenced-by-count":0,"title":["<i>PNL<\/i>: a software to build polygenic risk scores using a super learner approach based on PairNet, a Convolutional Neural Network"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7731-2374","authenticated-orcid":false,"given":"Ting-Huei","family":"Chen","sequence":"first","affiliation":[{"name":"Department of Mathematics and Statistics, Universit\u00e9 Laval , Quebec, QC, G1V 0A6,","place":["Canada"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8394-7683","authenticated-orcid":false,"given":"Chia-Jung","family":"Lee","sequence":"additional","affiliation":[{"name":"Department of Pathology and Immunology, Washington University School of Medicine , St. Louis, MO, 63110,","place":["United States"]}]},{"given":"Syue-Pu","family":"Chen","sequence":"additional","affiliation":[{"name":"Institute of Biomedical Sciences, Academia Sinica , Taipei, 115,","place":["Taiwan"]}]},{"given":"Shang-Jung","family":"Wu","sequence":"additional","affiliation":[{"name":"Institute of Biomedical Sciences, Academia Sinica , Taipei, 115,","place":["Taiwan"]}]},{"given":"Cathy S J","family":"Fann","sequence":"additional","affiliation":[{"name":"Institute of Biomedical Sciences, Academia Sinica , Taipei, 115,","place":["Taiwan"]}]}],"member":"286","published-online":{"date-parts":[[2025,2,14]]},"reference":[{"key":"2025030423070794200_btaf071-B1","doi-asserted-by":"crossref","first-page":"1776","DOI":"10.1038\/s41467-019-09718-5","article-title":"Polygenic prediction via Bayesian regression and continuous shrinkage priors","volume":"10","author":"Ge","year":"2019","journal-title":"Nat Commun"},{"author":"Jhang","key":"2025030423070794200_btaf071-B2","first-page":"994"},{"key":"2025030423070794200_btaf071-B3","doi-asserted-by":"crossref","first-page":"1175","DOI":"10.1038\/s42003-022-04168-0","article-title":"Phenome-wide analysis of Taiwan biobank reveals novel glycemia-related loci and genetic risks for diabetes","volume":"5","author":"Lee","year":"2022","journal-title":"Commun Biol"},{"key":"2025030423070794200_btaf071-B4","doi-asserted-by":"crossref","first-page":"995","DOI":"10.1016\/j.tig.2021.06.004","article-title":"Genetic prediction of complex traits with polygenic scores: a statistical review","volume":"37","author":"Ma","year":"2021","journal-title":"Trends Genet"},{"key":"2025030423070794200_btaf071-B5","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1086\/519795","article-title":"Plink: a tool set for whole-genome association and population-based linkage analyses","volume":"81","author":"Purcell","year":"2007","journal-title":"Am J Hum Genet"},{"key":"2025030423070794200_btaf071-B6","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1038\/s41588-022-01054-7","article-title":"Improving polygenic prediction in ancestrally diverse populations","volume":"54","author":"Ruan","year":"2022","journal-title":"Nat Genet"},{"key":"2025030423070794200_btaf071-B7","doi-asserted-by":"crossref","first-page":"Article25","DOI":"10.2202\/1544-6115.1309","article-title":"Super learner","volume":"6","author":"Van der Laan","year":"2007","journal-title":"Stat Appl Genet Mol Biol"},{"key":"2025030423070794200_btaf071-B8","doi-asserted-by":"crossref","first-page":"576","DOI":"10.1016\/j.ajhg.2015.09.001","article-title":"Modeling linkage disequilibrium increases accuracy of polygenic risk scores","volume":"97","author":"Vilhj\u00e1lmsson","year":"2015","journal-title":"Am J Hum Genet"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf071\/61899508\/btaf071.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/2\/btaf071\/61899508\/btaf071.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/2\/btaf071\/61899508\/btaf071.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,4]],"date-time":"2025-03-04T23:07:33Z","timestamp":1741129653000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf071\/8015614"}},"subtitle":[],"editor":[{"given":"Russell","family":"Schwartz","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,2]]},"references-count":8,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,2,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf071","relation":{},"ISSN":["1367-4811"],"issn-type":[{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2025,2]]},"published":{"date-parts":[[2025,2]]},"article-number":"btaf071"}}