{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T00:18:15Z","timestamp":1760660295421,"version":"build-2065373602"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T00:00:00Z","timestamp":1755820800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"German Federal Ministry of Education and Research","award":["01ZZ2010","01ZZ2316D"],"award-info":[{"award-number":["01ZZ2010","01ZZ2316D"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,10,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Generalizing machine learning models across small, high-dimensional, and heterogeneous biological datasets remains a critical challenge due to domain shifts caused by variations in data collection, population differences, and privacy constraints that restrict data sharing. Existing federated domain adaptation (FDA) approaches primarily rely on deep learning and focus on classification tasks, making them unsuitable for privacy-sensitive, small-scale regression problems in biomedical research. We introduce a privacy-preserving federated method for unsupervised domain adaptation in regression, enabling robust learning across distributed, high-dimensional datasets while maintaining full data privacy.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Our method is the first to enable distributed training of Gaussian processes for domain adaptation, ensuring complete privacy through randomized encoding and secure aggregation. Unlike deep learning-based FDA approaches, our method is specifically designed for small-scale, high-dimensional biological data, overcoming prior limitations in scalability and generalization. We evaluate our approach on age prediction from DNA methylation data, demonstrating that it achieves performance comparable to non-private state-of-the-art methods while fully preserving data privacy. This work enables secure and effective cross-institutional collaboration in biomedical research without requiring raw data sharing.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code for our method is available at https:\/\/github.com\/mdppml\/FREDA.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf465","type":"journal-article","created":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T18:07:59Z","timestamp":1755886079000},"source":"Crossref","is-referenced-by-count":0,"title":["Privacy-preserving federated unsupervised domain adaptation with application to age prediction from DNA methylation data"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3414-5297","authenticated-orcid":false,"given":"Cem Ata","family":"Baykara","sequence":"first","affiliation":[{"name":"Medical Data Privacy and Privacy-Preserving Machine Learning, University of T\u00fcbingen, 72076 T\u00fcbingen,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7279-620X","authenticated-orcid":false,"given":"Ali Burak","family":"\u00dcnal","sequence":"additional","affiliation":[{"name":"Medical Data Privacy and Privacy-Preserving Machine Learning, University of T\u00fcbingen, 72076 T\u00fcbingen,","place":["Germany"]},{"name":"Institute for Bioinformatics and Medical Informatics, University of T\u00fcbingen, 72076 T\u00fcbingen,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nico","family":"Pfeifer","sequence":"additional","affiliation":[{"name":"Institute for Bioinformatics and Medical Informatics, University of T\u00fcbingen, 72076 T\u00fcbingen,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4088-2784","authenticated-orcid":false,"given":"Mete","family":"Akg\u00fcn","sequence":"additional","affiliation":[{"name":"Medical Data Privacy and Privacy-Preserving Machine Learning, University of T\u00fcbingen, 72076 T\u00fcbingen,","place":["Germany"]},{"name":"Institute for Bioinformatics and Medical Informatics, University of T\u00fcbingen, 72076 T\u00fcbingen,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2025,8,22]]},"reference":[{"key":"2025101607371779800_btaf465-B1","doi-asserted-by":"crossref","first-page":"878","DOI":"10.15252\/msb.20156651","article-title":"Deep learning for computational biology","volume":"12","author":"Angermueller","year":"2016","journal-title":"Mol Syst Biol"},{"year":"2016","author":"Bonawitz","key":"2025101607371779800_btaf465-B2"},{"key":"2025101607371779800_btaf465-B3","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1093\/nar\/30.1.207","article-title":"Gene expression omnibus: NCBI gene expression and hybridization array data repository","volume":"30","author":"Edgar","year":"2002","journal-title":"Nucleic Acids Res"},{"volume-title":"Advances in Data Science and Information Engineering. Transactions on Computational Science and Computational Intelligence","author":"Farahani","key":"2025101607371779800_btaf465-B4","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-71704-9_65"},{"first-page":"3274","year":"2021","author":"Feng","key":"2025101607371779800_btaf465-B5"},{"key":"2025101607371779800_btaf465-B6","doi-asserted-by":"crossref","first-page":"e274","DOI":"10.1371\/journal.pbio.0030274","article-title":"Aging and gene expression in the primate brain","volume":"3","author":"Fraser","year":"2005","journal-title":"PLoS Biol"},{"first-page":"1180","volume-title":"Proceedings of the 32nd International Conference on Machine Learning","author":"Ganin","key":"2025101607371779800_btaf465-B7"},{"key":"2025101607371779800_btaf465-B8","first-page":"1","article-title":"Domain-adversarial training of neural networks","volume":"17","author":"Ganin","year":"2016","journal-title":"J Mach Learn Res"},{"year":"2012","author":"gpy","key":"2025101607371779800_btaf465-B9"},{"key":"2025101607371779800_btaf465-B10","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1038\/s41580-021-00407-0","article-title":"A guide to machine learning for biologists","volume":"23","author":"Greener","year":"2022","journal-title":"Nat Rev Mol Cell Biol"},{"key":"2025101607371779800_btaf465-B11","doi-asserted-by":"crossref","first-page":"204","DOI":"10.1038\/nature24277","article-title":"Genetic effects on gene expression across human tissues","volume":"550","author":"GTEx Consortium","year":"2017","journal-title":"Nature"},{"key":"2025101607371779800_btaf465-B12","doi-asserted-by":"crossref","first-page":"i154","DOI":"10.1093\/bioinformatics\/btz338","article-title":"Weighted elastic net for unsupervised domain adaptation with application to age prediction from DNA methylation data","volume":"35","author":"Handl","year":"2019","journal-title":"Bioinformatics"},{"key":"2025101607371779800_btaf465-B13","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1109\/TPS-ISA58951.2023.00020","volume-title":"2023 5th IEEE International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (TPS-ISA)","author":"Hannemann","year":"2023"},{"key":"2025101607371779800_btaf465-B14","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1007\/978-3-031-89704-7_7","volume-title":"International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics","author":"Hannemann","year":"2025"},{"key":"2025101607371779800_btaf465-B15","doi-asserted-by":"crossref","first-page":"R115","DOI":"10.1186\/gb-2013-14-10-r115","article-title":"DNA methylation age of human tissues and cell types","volume":"14","author":"Horvath","year":"2013","journal-title":"Genome Biol"},{"key":"2025101607371779800_btaf465-B16","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1186\/s12864-016-2647-9","article-title":"Interpretable per case weighted ensemble method for cancer associations","volume":"17","author":"Jalali","year":"2016","journal-title":"BMC Genomics"},{"volume-title":"The Twelfth International Conference on Learning Representations (ICLR 2024)","author":"Jiang","key":"2025101607371779800_btaf465-B17"},{"key":"2025101607371779800_btaf465-B18","doi-asserted-by":"crossref","first-page":"1353","DOI":"10.1038\/s41551-022-00942-x","article-title":"Graph representation learning in biomedicine and healthcare","volume":"6","author":"Li","year":"2022","journal-title":"Nat Biomed Eng"},{"key":"2025101607371779800_btaf465-B19","first-page":"8602","article-title":"Source data-absent unsupervised domain adaptation through hypothesis transfer and labeling transfer","volume":"44","author":"Liang","year":"2022","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2025101607371779800_btaf465-B20","first-page":"14026","volume-title":"Proceedings of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24); 2024 Feb 20\u201327","author":"Liu","year":"2024"},{"first-page":"97","volume-title":"Proceedings of the 32nd International Conference on Machine Learning. Lille","author":"Long","key":"2025101607371779800_btaf465-B21"},{"journal-title":"Proceedings of the 30th International Conference on Neural Information Processing Systems","article-title":"Unsupervised domain adaptation with residual transfer networks","author":"Long","key":"2025101607371779800_btaf465-B22"},{"journal-title":"Proceedings of the 32nd International Conference on Neural Information Processing Systems. Montr\u00e9al","article-title":"Conditional adversarial domain adaptation","author":"Long","key":"2025101607371779800_btaf465-B23"},{"key":"2025101607371779800_btaf465-B24","doi-asserted-by":"crossref","first-page":"749","DOI":"10.1038\/s41551-018-0304-0","article-title":"Explainable machine-learning predictions for the prevention of hypoxaemia during surgery","volume":"2","author":"Lundberg","year":"2018","journal-title":"Nat Biomed Eng"},{"author":"McMahan","key":"2025101607371779800_btaf465-B25","first-page":"1273"},{"key":"2025101607371779800_btaf465-B26","doi-asserted-by":"publisher","first-page":"eadp6040","DOI":"10.1126\/sciadv.adp6040","article-title":"Domain adaptation in small-scale and heterogeneous biological datasets","volume":"10","author":"Orouji","year":"2024","journal-title":"Sci Adv"},{"volume-title":"International Conference on Learning Representations","author":"Peng","key":"2025101607371779800_btaf465-B27"},{"key":"2025101607371779800_btaf465-B28","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1038\/s41551-018-0195-0","article-title":"Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning","volume":"2","author":"Poplin","year":"2018","journal-title":"Nat Biomed Eng"},{"key":"2025101607371779800_btaf465-B29","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1142\/S0129065704001899","article-title":"Gaussian processes for machine learning","volume":"14","author":"Seeger","year":"2004","journal-title":"Int J Neural Syst"},{"key":"2025101607371779800_btaf465-B30","article-title":"Learning transferrable representations for unsupervised domain adaptation","volume":"29","author":"Sener","year":"2016","journal-title":"Adv Neural Inf Process Syst"},{"year":"2016","author":"Sun","key":"2025101607371779800_btaf465-B31"},{"key":"2025101607371779800_btaf465-B32","first-page":"23309","article-title":"PartialFed: cross-domain personalized federated learning via partial initialization","volume":"34","author":"Sun","year":"2021","journal-title":"Adv Neural Inf Process Syst"},{"key":"2025101607371779800_btaf465-B33","doi-asserted-by":"crossref","first-page":"3281","DOI":"10.1016\/j.csbj.2024.08.024","article-title":"Privacy-preserving decentralized learning methods for biomedical applications","volume":"23","author":"Tajabadi","year":"2024","journal-title":"Comput Struct Biotechnol J"},{"key":"2025101607371779800_btaf465-B34","doi-asserted-by":"publisher","first-page":"W228","DOI":"10.1093\/nar\/gkac278","article-title":"DeepLoc 2.0: multi-label subcellular localization prediction using protein language models","volume":"50","author":"Thumuluri","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2025101607371779800_btaf465-B35","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1016\/j.artmed.2008.08.014","article-title":"Computational intelligence and machine learning in bioinformatics","volume":"45","author":"Valentini","year":"2009","journal-title":"Artif Intell Med"},{"key":"2025101607371779800_btaf465-B36","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1016\/j.neucom.2018.05.083","article-title":"Deep visual domain adaptation: a survey","volume":"312","author":"Wang","year":"2018","journal-title":"Neurocomputing (Amst)"},{"key":"2025101607371779800_btaf465-B37","doi-asserted-by":"crossref","first-page":"1113","DOI":"10.1038\/ng.2764","article-title":"The cancer genome atlas pan-cancer analysis project","volume":"45","author":"Weinstein","year":"2013","journal-title":"Nat Genet"},{"first-page":"36978","year":"2023","author":"Weng","key":"2025101607371779800_btaf465-B38"},{"volume-title":"Gaussian Processes for Machine Learning","year":"2006","author":"Williams","key":"2025101607371779800_btaf465-B39"},{"key":"2025101607371779800_btaf465-B40","first-page":"10040","article-title":"Distribution-informed neural networks for domain adaptation regression","volume":"35","author":"Wu","year":"2022","journal-title":"Adv Neural Inf Process Syst"},{"first-page":"8788","year":"2022","author":"Yan","key":"2025101607371779800_btaf465-B41"},{"key":"2025101607371779800_btaf465-B42","first-page":"26991","article-title":"Make the U in UDA matter: invariant consistency learning for unsupervised domain adaptation","volume":"36","author":"Yue","year":"2023","journal-title":"Adv Neural Inf Process Syst"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf465\/64112007\/btaf465.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/10\/btaf465\/64112007\/btaf465.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/10\/btaf465\/64112007\/btaf465.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T11:37:42Z","timestamp":1760614662000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf465\/8239950"}},"subtitle":[],"editor":[{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2025,8,22]]},"references-count":42,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2025,10,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf465","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2025,10]]},"published":{"date-parts":[[2025,8,22]]},"article-number":"btaf465"}}