{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:15Z","timestamp":1772138055901,"version":"3.50.1"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2022,8,5]],"date-time":"2022-08-05T00:00:00Z","timestamp":1659657600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Singapore Ministry of Education Academic Research Fund","award":["R-253-000-159-114"],"award-info":[{"award-number":["R-253-000-159-114"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>In many biomedical studies, there arises the need to integrate data from multiple directly or indirectly related sources. Collective matrix factorization (CMF) and its variants are models designed to collectively learn from arbitrary collections of matrices. The latent factors learnt are rich integrative representations that can be used in downstream tasks, such as clustering or relation prediction with standard machine-learning models. Previous CMF-based methods have numerous modeling limitations. They do not adequately capture complex non-linear interactions and do not explicitly model varying sparsity and noise levels in the inputs, and some cannot model inputs with multiple datatypes. These inadequacies limit their use on many biomedical datasets.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>To address these limitations, we develop Neural Collective Matrix Factorization (NCMF), the first fully neural approach to CMF. We evaluate NCMF on relation prediction tasks of gene\u2013disease association prediction and adverse drug event prediction, using multiple datasets. In each case, data are obtained from heterogeneous publicly available databases and used to learn representations to build predictive models. NCMF is found to outperform previous CMF-based methods and several state-of-the-art graph embedding methods for representation learning in our experiments. Our experiments illustrate the versatility and efficacy of NCMF in representation learning for seamless integration of heterogeneous data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>https:\/\/github.com\/ajayago\/NCMF_bioinformatics.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac543","type":"journal-article","created":{"date-parts":[[2022,8,5]],"date-time":"2022-08-05T09:43:35Z","timestamp":1659692615000},"page":"4554-4561","source":"Crossref","is-referenced-by-count":5,"title":["Neural Collective Matrix Factorization for integrated analysis of heterogeneous biomedical data"],"prefix":"10.1093","volume":"38","author":[{"given":"Ragunathan","family":"Mariappan","sequence":"first","affiliation":[{"name":"Department of Information Systems and Analytics, School of Computing, National University of Singapore , Singapore 117417, Singapore"}]},{"given":"Aishwarya","family":"Jayagopal","sequence":"additional","affiliation":[{"name":"Department of Information Systems and Analytics, School of Computing, National University of Singapore , Singapore 117417, Singapore"}]},{"given":"Ho Zong","family":"Sien","sequence":"additional","affiliation":[{"name":"Department of Information Systems and Analytics, School of Computing, National University of Singapore , Singapore 117417, Singapore"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6748-6864","authenticated-orcid":false,"given":"Vaibhav","family":"Rajan","sequence":"additional","affiliation":[{"name":"Department of Information Systems and Analytics, School of Computing, National University of Singapore , Singapore 117417, Singapore"}]}],"member":"286","published-online":{"date-parts":[[2022,8,5]]},"reference":[{"key":"2023041408224104400_","author":"Bordes","year":"2013"},{"key":"2023041408224104400_","first-page":"992","author":"Burkhardt","year":"2019"},{"key":"2023041408224104400_","first-page":"1","volume-title":"The NCBI Handbook","author":"Canese","year":"2013"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1007\/s40264-013-0018-x","article-title":"Postmarketing safety surveillance","volume":"36","author":"Coloma","year":"2013","journal-title":"Drug Saf"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"e32730","DOI":"10.2196\/32730","article-title":"Adverse drug event prediction using noisy literature-derived knowledge graphs: algorithm development and validation","volume":"9","author":"Dasgupta","year":"2021","journal-title":"JMIR Med. Inform"},{"key":"2023041408224104400_","first-page":"1","article-title":"Statistical comparisons of classifiers over multiple data sets","volume":"7","author":"Dem\u0161ar","year":"2006","journal-title":"J. Mach. Learn. Res"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"4316","DOI":"10.1093\/bioinformatics\/btaa501","article-title":"A multimodal deep learning framework for predicting drug\u2013drug interaction events","volume":"36","author":"Deng","year":"2020","journal-title":"Bioinformatics"},{"key":"2023041408224104400_","author":"Dettmers","year":"2018"},{"key":"2023041408224104400_","author":"Dong","year":"2017"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-018-07931-2","article-title":"Single-cell RNA-seq denoising using a deep count autoencoder","volume":"10","author":"Eraslan","year":"2019","journal-title":"Nat. Commun"},{"key":"2023041408224104400_","author":"Fu","year":"2017"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1093\/bib\/bbz153","article-title":"Improving drug response prediction by integrating multiple data sources: matrix factorization, kernel and network-based approaches","volume":"22","author":"G\u00fcven\u00e7 Paltun","year":"2021","journal-title":"Brief. Bioinform"},{"key":"2023041408224104400_","first-page":"173","author":"He","year":"2017"},{"key":"2023041408224104400_","first-page":"2704","author":"Hu","year":"2020"},{"key":"2023041408224104400_","author":"Kingma","year":"2014"},{"key":"2023041408224104400_","author":"Klami","year":"2014"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"D1075","DOI":"10.1093\/nar\/gkv1075","article-title":"The SIDER database of drugs and side effects","volume":"44","author":"Kuhn","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"2538","DOI":"10.1093\/bioinformatics\/btz965","article-title":"Neural inductive matrix completion with graph convolutional networks for miRNA-disease association prediction","volume":"36","author":"Li","year":"2020","journal-title":"Bioinformatics"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"655","DOI":"10.1109\/TCBB.2021.3088614","article-title":"IMCHGAN: inductive matrix completion with heterogeneous graph attention networks for drug-target interactions prediction","volume":"19","author":"Li","year":"2022","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"2209","DOI":"10.1093\/bioinformatics\/btz893","article-title":"Predicting synthetic lethal interactions using heterogeneous data sources","volume":"36","author":"Liany","year":"2020","journal-title":"Bioinformatics"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"1395","DOI":"10.1007\/s10994-019-05801-6","article-title":"Deep collective matrix factorization for augmented multi-view learning","volume":"108","author":"Mariappan","year":"2019","journal-title":"Mach. Learn"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"38022","DOI":"10.18632\/oncotarget.16723","article-title":"Combination therapy in combating cancer","volume":"8","author":"Mokhtari","year":"2017","journal-title":"Oncotarget"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"i60","DOI":"10.1093\/bioinformatics\/btu269","article-title":"Inductive matrix completion for predicting gene\u2013disease associations","volume":"30","author":"Natarajan","year":"2014","journal-title":"Bioinformatics"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"bbab479","DOI":"10.1093\/bib\/bbab479","article-title":"Matrix factorization for biomedical link prediction and scRNA-seq data imputation: an empirical survey","volume":"23","author":"Ou-Yang","year":"2022","journal-title":"Brief. Bioinform"},{"key":"2023041408224104400_","first-page":"701","author":"Perozzi","year":"2014"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"678","DOI":"10.1111\/j.1742-4658.2012.08471.x","article-title":"Computational approaches to disease-gene prediction: rationale, classification and successes","volume":"279","author":"Piro","year":"2012","journal-title":"FEBS J"},{"key":"2023041408224104400_","first-page":"593","author":"Schlichtkrull","year":"2018"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"874","DOI":"10.1016\/j.molmed.2017.08.003","article-title":"Precision oncology: the road ahead","volume":"23","author":"Senft","year":"2017","journal-title":"Trends Mol. Med"},{"key":"2023041408224104400_","first-page":"650","author":"Singh","year":"2008"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"790","DOI":"10.1016\/j.tig.2018.07.003","article-title":"Enter the matrix: factorization uncovers knowledge from omics","volume":"34","author":"Stein-O\u2019Brien","year":"2018","journal-title":"Trends Genet"},{"key":"2023041408224104400_","first-page":"1067","author":"Tang","year":"2015"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"125ra31","DOI":"10.1126\/scitranslmed.3003377","article-title":"Data-driven prediction of drug effects and interactions","volume":"4","author":"Tatonetti","year":"2012","journal-title":"Sci. Transl. Med"},{"key":"2023041408224104400_","first-page":"2071","author":"Trouillon","year":"2016"},{"key":"2023041408224104400_","first-page":"340","article-title":"Big data and pharmacovigilance: data mining for adverse drug events and interactions","volume":"43","author":"Ventola","year":"2018","journal-title":"P T"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1177\/1060028018765159","article-title":"Cost of prescription drug\u2013related morbidity and mortality","volume":"52","author":"Watanabe","year":"2018","journal-title":"Ann. Pharmacother"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"1113","DOI":"10.1038\/ng.2764","article-title":"The cancer genome atlas pan-cancer analysis project","volume":"45","author":"Weinstein","year":"2013","journal-title":"Nat. Genet"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-020-03799-6","article-title":"MCCMF: collaborative matrix factorization based on matrix completion for predicting miRNA-disease associations","volume":"21","author":"Wu","year":"2020","journal-title":"BMC Bioinformatics"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1007\/s00439-019-01970-5","article-title":"Translating cancer genomics into precision medicine with artificial intelligence: applications, challenges and future perspectives","volume":"138","author":"Xu","year":"2019","journal-title":"Hum. Genet"},{"key":"2023041408224104400_","article-title":"Heterogeneous network representation learning: a unified framework with survey and benchmark","volume":"1","author":"Yang","year":"2020","journal-title":"IEEE Trans. Knowl. Data Eng"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"3474","DOI":"10.1093\/bioinformatics\/btaa157","article-title":"A graph regularized generalized matrix factorization model for predicting links in biomedical bipartite networks","volume":"36","author":"Zhang","year":"2020","journal-title":"Bioinformatics"},{"key":"2023041408224104400_","author":"Zhao","year":"2021"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1109\/TPAMI.2014.2343973","article-title":"Data fusion by matrix factorization","volume":"37","author":"\u017ditnik","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"i457","DOI":"10.1093\/bioinformatics\/bty294","article-title":"Modeling polypharmacy side effects with graph convolutional networks","volume":"34","author":"\u017ditnik","year":"2018","journal-title":"Bioinformatics"},{"key":"2023041408224104400_","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1016\/j.inffus.2018.09.012","article-title":"Machine learning for integrating data in biology and medicine: principles, practice, and opportunities","volume":"50","author":"\u017ditnik","year":"2019","journal-title":"Inf. Fusion"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac543\/45334107\/btac543.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/19\/4554\/49885387\/btac543.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/19\/4554\/49885387\/btac543.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,25]],"date-time":"2023-11-25T05:10:47Z","timestamp":1700889047000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/19\/4554\/6656347"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,8,5]]},"references-count":44,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2022,9,30]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac543","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2022.01.20.477057","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,10,1]]},"published":{"date-parts":[[2022,8,5]]}}}