{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,15]],"date-time":"2025-11-15T10:27:41Z","timestamp":1763202461635,"version":"3.37.3"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"16","license":[{"start":{"date-parts":[[2021,2,9]],"date-time":"2021-02-09T00:00:00Z","timestamp":1612828800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Research Foundation \u2013 Flanders"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,8,25]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Modern bioinformatics is facing increasingly complex problems to solve, and we are indeed rapidly approaching an era in which the ability to seamlessly integrate heterogeneous sources of information will be crucial for the scientific progress. Here, we present a novel non-linear data fusion framework that generalizes the conventional matrix factorization paradigm allowing inference over arbitrary entity-relation graphs, and we applied it to the prediction of protein\u2013protein interactions (PPIs). Improving our knowledge of PPI networks at the proteome scale is indeed crucial to understand protein function, physiological and disease states and cell life in general.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We devised three data fusion-based models for the proteome-level prediction of PPIs, and we show that our method outperforms state of the art approaches on common benchmarks. Moreover, we investigate its predictions on newly published PPIs, showing that this new data has a clear shift in its underlying distributions and we thus train and test our models on this extended dataset.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab092","type":"journal-article","created":{"date-parts":[[2021,2,5]],"date-time":"2021-02-05T08:08:42Z","timestamp":1612512522000},"page":"2275-2281","source":"Crossref","is-referenced-by-count":16,"title":["A novel method for data fusion over entity-relation graphs and its application to protein\u2013protein interaction prediction"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1157-1899","authenticated-orcid":false,"given":"Daniele","family":"Raimondi","sequence":"first","affiliation":[{"name":"ESAT-STADIUS, KU Leuven , 3001 Leuven, Belgium"}]},{"given":"Jaak","family":"Simm","sequence":"additional","affiliation":[{"name":"ESAT-STADIUS, KU Leuven , 3001 Leuven, Belgium"}]},{"given":"Adam","family":"Arany","sequence":"additional","affiliation":[{"name":"ESAT-STADIUS, KU Leuven , 3001 Leuven, Belgium"}]},{"given":"Yves","family":"Moreau","sequence":"additional","affiliation":[{"name":"ESAT-STADIUS, KU Leuven , 3001 Leuven, Belgium"}]}],"member":"286","published-online":{"date-parts":[[2021,2,9]]},"reference":[{"year":"2015","author":"Arany","key":"2023051609125662700_btab092-B1"},{"first-page":"41","year":"2009","author":"Bengio","key":"2023051609125662700_btab092-B2"},{"key":"2023051609125662700_btab092-B3","doi-asserted-by":"crossref","first-page":"12180","DOI":"10.1073\/pnas.1606762113","article-title":"Inferring interaction partners from protein sequences","volume":"113","author":"Bitbol","year":"2016","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051609125662700_btab092-B4","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1038\/nmeth.1281","article-title":"An experimentally derived confidence score for binary protein-protein interactions","volume":"6","author":"Braun","year":"2009","journal-title":"Nat. Methods"},{"key":"2023051609125662700_btab092-B5","doi-asserted-by":"crossref","first-page":"i305","DOI":"10.1093\/bioinformatics\/btz328","article-title":"Multifaceted protein\u2013protein interaction prediction based on siamese residual rcnn","volume":"35","author":"Chen","year":"2019","journal-title":"Bioinformatics"},{"first-page":"794","year":"2018","author":"Chen","key":"2023051609125662700_btab092-B6"},{"key":"2023051609125662700_btab092-B7","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1126\/science.aaw6718","article-title":"Protein interaction networks revealed by proteome coevolution","volume":"365","author":"Cong","year":"2019","journal-title":"Science"},{"key":"2023051609125662700_btab092-B8","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1016\/j.neunet.2017.10.007","article-title":"Matrix completion by deep matrix factorization","volume":"98","author":"Fan","year":"2018","journal-title":"Neural Netw"},{"key":"2023051609125662700_btab092-B9","doi-asserted-by":"crossref","first-page":"D222","DOI":"10.1093\/nar\/gkt1223","article-title":"Pfam: the protein families database","volume":"42","author":"Finn","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023051609125662700_btab092-B10","doi-asserted-by":"crossref","first-page":"8685","DOI":"10.1073\/pnas.0701361104","article-title":"The human disease network","volume":"104","author":"Goh","year":"2007","journal-title":"Proc. Natl. Acad. Sci. USA"},{"first-page":"270","year":"2018","author":"Guo","key":"2023051609125662700_btab092-B11"},{"key":"2023051609125662700_btab092-B12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12920-016-0189-2","article-title":"Inferring Crohn\u2019s disease association from exome sequences by integrating biological knowledge","volume":"9","author":"Jeong","year":"2016","journal-title":"BMC Med. Genomics"},{"first-page":"7482","year":"2018","author":"Kendall","key":"2023051609125662700_btab092-B13"},{"key":"2023051609125662700_btab092-B14","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1038\/nmeth.3178","article-title":"In silico prediction of physical protein interactions and characterization of interactome orphans","volume":"12","author":"Kotlyar","year":"2015","journal-title":"Nat. Methods"},{"year":"2016","author":"Li","key":"2023051609125662700_btab092-B15"},{"first-page":"2980","year":"2017","author":"Lin","key":"2023051609125662700_btab092-B16"},{"key":"2023051609125662700_btab092-B17","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1002\/humu.22932","article-title":"dbnsfp v3. 0: a one-stop database of functional predictions and annotations for human nonsynonymous and splice-site snvs","volume":"37","author":"Liu","year":"2016","journal-title":"Hum. Mutat"},{"key":"2023051609125662700_btab092-B18","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1038\/s41586-020-2188-x","article-title":"A reference map of the human binary protein interactome","volume":"580","author":"Luck","year":"2020","journal-title":"Nature"},{"key":"2023051609125662700_btab092-B19","doi-asserted-by":"crossref","first-page":"1577","DOI":"10.1093\/bioinformatics\/btt181","article-title":"Instruct: a database of high-quality 3d structurally resolved protein interactome networks","volume":"29","author":"Meyer","year":"2013","journal-title":"Bioinformatics"},{"first-page":"1257","year":"2008","author":"Mnih","key":"2023051609125662700_btab092-B20"},{"key":"2023051609125662700_btab092-B21","doi-asserted-by":"crossref","first-page":"36679","DOI":"10.1038\/srep36679","article-title":"Observation selection bias in contact prediction and its implications for structural bioinformatics","volume":"6","author":"Orlando","year":"2016","journal-title":"Sci. Rep"},{"year":"2019","author":"Paszke","key":"2023051609125662700_btab092-B22"},{"key":"2023051609125662700_btab092-B23","doi-asserted-by":"crossref","first-page":"D833","DOI":"10.1093\/nar\/gkw943","article-title":"Disgenet: a comprehensive platform integrating information on human disease-associated genes and variants","volume":"45","author":"Pi\u00f1ero","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023051609125662700_btab092-B24","doi-asserted-by":"crossref","first-page":"1212","DOI":"10.1016\/j.cell.2014.10.050","article-title":"A proteome-scale map of the human interactome network","volume":"159","author":"Rolland","year":"2014","journal-title":"Cell"},{"key":"2023051609125662700_btab092-B25","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nmeth.1283","article-title":"Cost-effective strategies for completing the interactome","volume":"6","author":"Schwartz","year":"2009","journal-title":"Nat. Methods"},{"key":"2023051609125662700_btab092-B26","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1186\/1471-2105-8-239","article-title":"Probabilistic prediction and ranking of human protein-protein interactions","volume":"8","author":"Scott","year":"2007","journal-title":"BMC Bioinformatics"},{"year":"2015","author":"Simm","key":"2023051609125662700_btab092-B27"},{"key":"2023051609125662700_btab092-B28","doi-asserted-by":"crossref","first-page":"848","DOI":"10.15252\/msb.20156351","article-title":"Fundamentals of protein interaction network mapping","volume":"11","author":"Snider","year":"2015","journal-title":"Mol. Syst. Biol"},{"key":"2023051609125662700_btab092-B29","doi-asserted-by":"crossref","first-page":"681","DOI":"10.1006\/jmbi.2001.4920","article-title":"Correlated sequence-signatures as markers of protein-protein interaction","volume":"311","author":"Sprinzak","year":"2001","journal-title":"J. Mol. Biol"},{"first-page":"720","year":"2003","author":"Srebro","key":"2023051609125662700_btab092-B30"},{"key":"2023051609125662700_btab092-B31","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1109\/TPAMI.2016.2554555","article-title":"A deep matrix factorization method for learning attribute representations","volume":"39","author":"Trigeorgis","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023051609125662700_btab092-B32","doi-asserted-by":"crossref","first-page":"D204","DOI":"10.1093\/nar\/gku989","article-title":"Uniprot: a hub for protein information","volume":"43","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023051609125662700_btab092-B33","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1186\/1471-2105-10-150","article-title":"Protein-protein interaction based on pairwise similarity","volume":"10","author":"Zaki","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023051609125662700_btab092-B34","doi-asserted-by":"crossref","first-page":"W432","DOI":"10.1093\/nar\/gky420","article-title":"Complexcontact: a web server for inter-protein contact prediction using deep learning","volume":"46","author":"Zeng","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023051609125662700_btab092-B35","doi-asserted-by":"crossref","first-page":"556","DOI":"10.1038\/nature11503","article-title":"Structure-based prediction of protein\u2013protein interactions on a genome-wide scale","volume":"490","author":"Zhang","year":"2012","journal-title":"Nature"},{"key":"2023051609125662700_btab092-B36","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1109\/TPAMI.2014.2343973","article-title":"Data fusion by matrix factorization","volume":"37","author":"\u017ditnik","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"year":"2014","author":"\u017ditnik","key":"2023051609125662700_btab092-B37"},{"key":"2023051609125662700_btab092-B38","doi-asserted-by":"crossref","first-page":"e1004552","DOI":"10.1371\/journal.pcbi.1004552","article-title":"Gene prioritization by compressive data fusion and chaining","volume":"11","author":"\u017ditnik","year":"2015","journal-title":"PLoS Comput. Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab092\/36337860\/btab092.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/16\/2275\/50339388\/btab092.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/16\/2275\/50339388\/btab092.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T09:16:14Z","timestamp":1684228574000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/16\/2275\/6131674"}},"subtitle":[],"editor":[{"given":"Jinbo","family":"Xu","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,2,9]]},"references-count":38,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2021,8,25]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab092","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2021,8,15]]},"published":{"date-parts":[[2021,2,9]]}}}