{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T22:55:02Z","timestamp":1773356102161,"version":"3.50.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2025,4,4]],"date-time":"2025-04-04T00:00:00Z","timestamp":1743724800000},"content-version":"vor","delay-in-days":6,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"FAIR"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,3,29]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Precision medicine leverages patient-specific multimodal data to improve prevention, diagnosis, prognosis, and treatment of diseases. Advancing precision medicine requires the non-trivial integration of complex, heterogeneous, and potentially high-dimensional data sources, such as multi-omics and clinical data. In the literature, several approaches have been proposed to manage missing data, but are usually limited to the recovery of subsets of features for a subset of patients. A largely overlooked problem is the integration of multiple sources of data when one or more of them are completely missing for a subset of patients, a relatively common condition in clinical practice.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We propose miss-Similarity Network Fusion (miss-SNF), a novel general-purpose data integration approach designed to manage completely missing data in the context of patient similarity networks. miss-SNF integrates incomplete unimodal patient similarity networks by leveraging a non-linear message-passing strategy borrowed from the SNF algorithm. miss-SNF is able to recover missing patient similarities and is \u201ctask agnostic\u201d, in the sense that can integrate partial data for both unsupervised and supervised prediction tasks. Experimental analyses on nine cancer datasets from The Cancer Genome Atlas (TCGA) demonstrate that miss-SNF achieves state-of-the-art results in recovering similarities and in identifying patients subgroups enriched in clinically relevant variables and having differential survival. Moreover, amputation experiments show that miss-SNF supervised prediction of cancer clinical outcomes and Alzheimer\u2019s disease diagnosis with completely missing data achieves results comparable to those obtained when all the data are available.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>miss-SNF code, implemented in R, is available at https:\/\/github.com\/AnacletoLAB\/missSNF.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf150","type":"journal-article","created":{"date-parts":[[2025,4,3]],"date-time":"2025-04-03T07:56:48Z","timestamp":1743667008000},"source":"Crossref","is-referenced-by-count":2,"title":["miss-SNF: a multimodal patient similarity network integration approach to handle completely missing data sources"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7629-8112","authenticated-orcid":false,"given":"Jessica","family":"Gliozzo","sequence":"first","affiliation":[{"name":"AnacletoLab, Dipartimento di Informatica \u201cGiovanni Degli Antoni\u201d, Universit\u00e0 degli Studi di Milano , Via Giovanni Celoria 18 , Milan, 20133,","place":["Italy"]},{"name":"European Commission, Joint Research Centre (JRC) , Ispra, 21027,","place":["Italy"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5977-9467","authenticated-orcid":false,"given":"Mauricio A","family":"Soto Gomez","sequence":"additional","affiliation":[{"name":"AnacletoLab, Dipartimento di Informatica \u201cGiovanni Degli Antoni\u201d, Universit\u00e0 degli Studi di Milano , Via Giovanni Celoria 18 , Milan, 20133,","place":["Italy"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3258-0727","authenticated-orcid":false,"given":"Arturo","family":"Bonometti","sequence":"additional","affiliation":[{"name":"Department of Biomedical Sciences, Humanitas University , Via Rita Levi Montalcini 4 , Pieve Emanuele (MI), 20072,","place":["Italy"]},{"name":"Department of Pathology, IRCCS Humanitas Clinical and Research Hospital , Via Alessandro Manzoni 56 , Rozzano (MI), 20089,","place":["Italy"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9282-188X","authenticated-orcid":false,"given":"Alex","family":"Patak","sequence":"additional","affiliation":[{"name":"European Commission, Joint Research Centre (JRC) , Ispra, 21027,","place":["Italy"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2024-7572","authenticated-orcid":false,"given":"Elena","family":"Casiraghi","sequence":"additional","affiliation":[{"name":"AnacletoLab, Dipartimento di Informatica \u201cGiovanni Degli Antoni\u201d, Universit\u00e0 degli Studi di Milano , Via Giovanni Celoria 18 , Milan, 20133,","place":["Italy"]},{"name":"Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory , Berkeley, CA, 94720,","place":["United States"]},{"name":"Milan Unit, ELLIS\u2014European Laboratory for Learning and Intelligent Systems ,","place":["Italy"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5694-3919","authenticated-orcid":false,"given":"Giorgio","family":"Valentini","sequence":"additional","affiliation":[{"name":"AnacletoLab, Dipartimento di Informatica \u201cGiovanni Degli Antoni\u201d, Universit\u00e0 degli Studi di Milano , Via Giovanni Celoria 18 , Milan, 20133,","place":["Italy"]},{"name":"Milan Unit, ELLIS\u2014European Laboratory for Learning and Intelligent Systems ,","place":["Italy"]}]}],"member":"286","published-online":{"date-parts":[[2025,4,4]]},"reference":[{"key":"2025042117315367000_btaf150-B1","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1186\/s13059-020-02015-1","article-title":"Mofa+: a statistical framework for comprehensive integration of multi-modal single-cell data","volume":"21","author":"Argelaguet","year":"2020","journal-title":"Genome Biol"},{"key":"2025042117315367000_btaf150-B2","doi-asserted-by":"publisher","first-page":"1222","DOI":"10.1038\/s41592-023-01909-9","article-title":"Multivi: deep generative model for the integration of multimodal data","volume":"20","author":"Ashuach","year":"2023","journal-title":"Nat Methods"},{"key":"2025042117315367000_btaf150-B3","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1016\/j.csbj.2022.11.050","article-title":"A guide to multi-omics data collection and integration for translational medicine","volume":"21","author":"Athieniti","year":"2023","journal-title":"Comput Struct Biotechnol J"},{"key":"2025042117315367000_btaf150-B4","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach Learn"},{"key":"2025042117315367000_btaf150-B5","doi-asserted-by":"crossref","first-page":"552","DOI":"10.1038\/s43588-023-00465-8","article-title":"GRAPE for fast and scalable graph processing and random walk-based embedding","volume":"3","author":"Cappelletti","year":"2023","journal-title":"Nat Comput Sci"},{"key":"2025042117315367000_btaf150-B6","author":"Dai","year":"2020"},{"key":"2025042117315367000_btaf150-B7","doi-asserted-by":"crossref","first-page":"e1009224","DOI":"10.1371\/journal.pcbi.1009224","article-title":"Evaluation and comparison of multi-omics data integration methods for cancer subtyping","volume":"17","author":"Duan","year":"2021","journal-title":"PLoS Comput Biol"},{"key":"2025042117315367000_btaf150-B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v089.i11","article-title":"Randomized matrix decompositions using r","volume":"89","author":"Erichson","year":"2019","journal-title":"J Stat Soft"},{"key":"2025042117315367000_btaf150-B9","doi-asserted-by":"crossref","first-page":"12140","DOI":"10.1038\/s41598-017-11873-y","article-title":"Estimating the intrinsic dimension of datasets by a minimal neighborhood information","volume":"7","author":"Facco","year":"2017","journal-title":"Sci Rep"},{"key":"2025042117315367000_btaf150-B10","doi-asserted-by":"crossref","first-page":"1098308","DOI":"10.3389\/frai.2023.1098308","article-title":"Missing data in multi-omics integration: recent advances through artificial intelligence","volume":"6","author":"Flores","year":"2023","journal-title":"Front Artif Intell"},{"key":"2025042117315367000_btaf150-B11","first-page":"228","volume-title":"BIOSTEC 2023 \u2013 Bioinformatics","author":"Gliozzo","year":"2023"},{"key":"2025042117315367000_btaf150-B12","doi-asserted-by":"crossref","first-page":"bbac207","DOI":"10.1093\/bib\/bbac207","article-title":"Heterogeneous data integration methods for patient similarity networks","volume":"23","author":"Gliozzo","year":"2022","journal-title":"Briefings Bioinf"},{"key":"2025042117315367000_btaf150-B13","doi-asserted-by":"crossref","first-page":"3612","DOI":"10.1038\/s41598-020-60235-8","article-title":"Network modeling of patients\u2019 biomolecular profiles for clinical phenotype\/outcome prediction","volume":"10","author":"Gliozzo","year":"2020","journal-title":"Sci Rep"},{"key":"2025042117315367000_btaf150-B14","doi-asserted-by":"crossref","first-page":"103049","DOI":"10.1016\/j.artmed.2024.103049","article-title":"Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing","volume":"160","author":"Gliozzo","year":"2025","journal-title":"Artif Intell Med"},{"key":"2025042117315367000_btaf150-B15","doi-asserted-by":"publisher","first-page":"603","DOI":"10.1145\/3511808.3557339","volume-title":"Proceedings of the 31st ACM International Conference on Information & Knowledge Management, CIKM\u201922","author":"Gong","year":"2022"},{"key":"2025042117315367000_btaf150-B16","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-84858-7","volume-title":"The Elements of Statistical Learning: Data Mining, Inference, and Prediction","author":"Hastie","year":"2009","edition":"2nd edn"},{"key":"2025042117315367000_btaf150-B17","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1016\/j.cell.2018.03.042","article-title":"The cancer genome atlas: creating lasting value beyond its data","volume":"173","author":"Hutter","year":"2018","journal-title":"Cell"},{"key":"2025042117315367000_btaf150-B18","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1111\/joim.13640","article-title":"Precision medicine in complex diseases\u2014molecular subgrouping for improved prediction and treatment stratification","volume":"294","author":"Johansson","year":"2023","journal-title":"J Intern Med"},{"key":"2025042117315367000_btaf150-B19","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1038\/nrg.2018.4","article-title":"Integrative omics for health and disease","volume":"19","author":"Karczewski","year":"2018","journal-title":"Nat Rev Genet"},{"key":"2025042117315367000_btaf150-B20","first-page":"68","volume-title":"Partitioning Around Medoids (Program PAM), Chapter 2","author":"Kaufman","year":"1990"},{"key":"2025042117315367000_btaf150-B21","first-page":"1513","author":"Lee","year":"2021"},{"key":"2025042117315367000_btaf150-B22","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1016\/j.cell.2018.02.052","article-title":"An integrated tcga pan-cancer clinical data resource to drive high-quality survival outcome analytics","volume":"173","author":"Liu","year":"2018","journal-title":"Cell"},{"key":"2025042117315367000_btaf150-B23","doi-asserted-by":"crossref","first-page":"102587","DOI":"10.1016\/j.artmed.2023.102587","article-title":"Handling missing values in healthcare data: a systematic review of deep learning-based imputation techniques","volume":"142","author":"Liu","year":"2023","journal-title":"Artif Intell Med"},{"key":"2025042117315367000_btaf150-B24","volume-title":"Introduction to the Practice of Statistics","author":"Moore","year":"2021","edition":"10th edn"},{"key":"2025042117315367000_btaf150-B25","first-page":"849","article-title":"On spectral clustering: analysis and an algorithm","volume":"14","author":"Ng","year":"2001","journal-title":"Adv Neural Inf Process Syst"},{"key":"2025042117315367000_btaf150-B26","doi-asserted-by":"crossref","first-page":"4781","DOI":"10.3390\/ijms20194781","article-title":"The need for multi-omics biomarker signatures in precision medicine","volume":"20","author":"Olivier","year":"2019","journal-title":"Int J Mol Sci"},{"key":"2025042117315367000_btaf150-B27","doi-asserted-by":"crossref","first-page":"2924","DOI":"10.1016\/j.jmb.2018.05.037","article-title":"Patient similarity networks for precision medicine","volume":"430","author":"Pai","year":"2018","journal-title":"J Mol Biol"},{"key":"2025042117315367000_btaf150-B28","doi-asserted-by":"crossref","first-page":"3348","DOI":"10.1093\/bioinformatics\/btz058","article-title":"Nemo: cancer subtyping by integration of partial multi-omic data","volume":"35","author":"Rappoport","year":"2019","journal-title":"Bioinformatics"},{"key":"2025042117315367000_btaf150-B29","doi-asserted-by":"crossref","first-page":"0176","DOI":"10.34133\/hds.0176","article-title":"Moving beyond medical statistics: a systematic review on missing data handling in electronic health records","volume":"4","author":"Ren","year":"2024","journal-title":"Health Data Sci"},{"key":"2025042117315367000_btaf150-B30","doi-asserted-by":"crossref","first-page":"btae523","DOI":"10.1093\/bioinformatics\/btae523","article-title":"Multi-omic graph diagnosis (mogdx): a data integration tool to perform classification tasks for heterogeneous diseases","volume":"40","author":"Ryan","year":"2024","journal-title":"Bioinformatics"},{"key":"2025042117315367000_btaf150-B31","doi-asserted-by":"crossref","first-page":"100152","DOI":"10.1016\/j.crmeth.2021.100152","article-title":"Detecting molecular subtypes from multi-omics datasets using sumo","volume":"2","author":"Sienkiewicz","year":"2022","journal-title":"Cell Rep Methods"},{"key":"2025042117315367000_btaf150-B32","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1016\/j.artmed.2014.03.003","article-title":"An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods","volume":"61","author":"Valentini","year":"2014","journal-title":"Artif Intell Med"},{"key":"2025042117315367000_btaf150-B33","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1093\/jamiaopen\/ooy008","article-title":"Patient similarity by joint matrix trifactorization to identify subgroups in acute myeloid leukemia","volume":"1","author":"Vitali","year":"2018","journal-title":"JAMIA Open"},{"key":"2025042117315367000_btaf150-B34","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1038\/nmeth.2810","article-title":"Similarity network fusion for aggregating data types on a genomic scale","volume":"11","author":"Wang","year":"2014","journal-title":"Nat Methods"},{"key":"2025042117315367000_btaf150-B35","doi-asserted-by":"publisher","first-page":"102155","DOI":"10.1016\/j.inffus.2023.102155","article-title":"Joint learning of data recovering and graph contrastive denoising for incomplete multi-view clustering","volume":"104","author":"Wang","year":"2024","journal-title":"Inf Fusion"},{"key":"2025042117315367000_btaf150-B36","doi-asserted-by":"publisher","first-page":"3445","DOI":"10.1038\/s41467-021-23774-w","article-title":"Mogonet integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification","volume":"12","author":"Wang","year":"2021","journal-title":"Nat Commun"},{"key":"2025042117315367000_btaf150-B37","doi-asserted-by":"publisher","first-page":"1055","DOI":"10.1109\/TPAMI.2022.3155499","article-title":"Robust multi-view clustering with incomplete information","volume":"45","author":"Yang","year":"2023","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2025042117315367000_btaf150-B38","doi-asserted-by":"publisher","first-page":"83","DOI":"10.26599\/BDMA.2018.9020003","article-title":"Multi-view clustering: a survey","volume":"1","author":"Yang","year":"2018","journal-title":"Big Data Min Anal"},{"key":"2025042117315367000_btaf150-B39","author":"You","year":"2021"},{"key":"2025042117315367000_btaf150-B40","doi-asserted-by":"publisher","first-page":"2139","DOI":"10.1109\/TPAMI.2023.3332967","article-title":"Semantic invariant multi-view clustering with fully incomplete information","volume":"46","author":"Zeng","year":"2024","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2025042117315367000_btaf150-B41","author":"Zhu","year":"2002"},{"key":"2025042117315367000_btaf150-B42","first-page":"912","author":"Zhu","year":"2003"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf150\/62871738\/btaf150.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/4\/btaf150\/62871738\/btaf150.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/4\/btaf150\/62871738\/btaf150.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,21]],"date-time":"2025-04-21T17:32:01Z","timestamp":1745256721000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf150\/8106484"}},"subtitle":[],"editor":[{"given":"Inanc","family":"Birol","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,3,29]]},"references-count":42,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,3,29]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf150","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2025.02.24.639805","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,4]]},"published":{"date-parts":[[2025,3,29]]},"article-number":"btaf150"}}