{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T15:18:56Z","timestamp":1768922336879,"version":"3.49.0"},"reference-count":55,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2021,3,8]],"date-time":"2021-03-08T00:00:00Z","timestamp":1615161600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,9]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Investigating the relationships between two sets of variables helps to understand their interactions and can be done with canonical correlation analysis (CCA). However, the correlation between the two sets can sometimes depend on a third set of covariates, often subject-related ones such as age, gender or other clinical measures. In this case, applying CCA to the whole population is not optimal and methods to estimate conditional CCA, given the covariates, can be useful.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We propose a new method called Random Forest with Canonical Correlation Analysis (RFCCA) to estimate the conditional canonical correlations between two sets of variables given subject-related covariates. The individual trees in the forest are built with a splitting rule specifically designed to partition the data to maximize the canonical correlation heterogeneity between child nodes. We also propose a significance test to detect the global effect of the covariates on the relationship between two sets of variables. The performance of the proposed method and the global significance test is evaluated through simulation studies that show it provides accurate canonical correlation estimations and well-controlled Type-1 error. We also show an application of the proposed method with EEG data.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>RFCCA is implemented in a freely available R package on CRAN (https:\/\/CRAN.R-project.org\/package=RFCCA).<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab158","type":"journal-article","created":{"date-parts":[[2021,3,3]],"date-time":"2021-03-03T20:31:34Z","timestamp":1614803494000},"page":"2714-2721","source":"Crossref","is-referenced-by-count":11,"title":["Conditional canonical correlation estimation based on covariates with random forests"],"prefix":"10.1093","volume":"37","author":[{"given":"Cansu","family":"Alaku\u015f","sequence":"first","affiliation":[{"name":"Department of Decision Sciences, HEC Montr\u00e9al , Montr\u00e9al, QC H3T 2A7, Canada"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7372-7943","authenticated-orcid":false,"given":"Denis","family":"Larocque","sequence":"additional","affiliation":[{"name":"Department of Decision Sciences, HEC Montr\u00e9al , Montr\u00e9al, QC H3T 2A7, Canada"}]},{"given":"S\u00e9bastien","family":"Jacquemont","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Universit\u00e9 de Montr\u00e9al , Montr\u00e9al, QC H3T 1C5, Canada"},{"name":"CHU Sainte-Justine Research Center, Montr\u00e9al, QC, H3T 1C5, Canada"}]},{"given":"Fanny","family":"Barlaam","sequence":"additional","affiliation":[{"name":"CHU Sainte-Justine Research Center, Montr\u00e9al, QC, H3T 1C5, Canada"}]},{"given":"Charles-Olivier","family":"Martin","sequence":"additional","affiliation":[{"name":"CHU Sainte-Justine Research Center, Montr\u00e9al, QC, H3T 1C5, Canada"}]},{"given":"Kristian","family":"Agbogba","sequence":"additional","affiliation":[{"name":"CHU Sainte-Justine Research Center, Montr\u00e9al, QC, H3T 1C5, Canada"}]},{"given":"Sarah","family":"Lipp\u00e9","sequence":"additional","affiliation":[{"name":"Department of Psychology, Universit\u00e9 de Montr\u00e9al , Montr\u00e9al, QC H3T 1J4, Canada"},{"name":"CHU Sainte-Justine Research Center, Montr\u00e9al, QC, H3T 1C5, Canada"}]},{"given":"Aur\u00e9lie","family":"Labbe","sequence":"additional","affiliation":[{"name":"Department of Decision Sciences, HEC Montr\u00e9al , Montr\u00e9al, QC H3T 2A7, Canada"}]}],"member":"286","published-online":{"date-parts":[[2021,3,8]]},"reference":[{"key":"2023051609202664000_btab158-B1","volume-title":"Proceedings of the International Meeting of the Psychometric Society (IMPS2001)","author":"Akaho","year":"2001"},{"key":"2023051609202664000_btab158-B2","doi-asserted-by":"crossref","first-page":"2268","DOI":"10.1016\/S1388-2457(00)00464-8","article-title":"The development of auditory evoked dipole source activity from childhood to adulthood","volume":"111","author":"Albrecht","year":"2000","journal-title":"Clin. Neurophysiol"},{"key":"2023051609202664000_btab158-B3","doi-asserted-by":"crossref","first-page":"1513","DOI":"10.1016\/j.cub.2016.04.035","article-title":"Spatial working memory in humans depends on theta and high gamma synchronization in the prefrontal cortex","volume":"26","author":"Alekseichuk","year":"2016","journal-title":"Curr. Biol"},{"key":"2023051609202664000_btab158-B4","first-page":"1247","volume-title":"Proceedings\u00a0of\u00a0the\u00a030th\u00a0International Conference on Machine Learning - Volume 28, ICML'13","author":"Andrew","year":"2013"},{"key":"2023051609202664000_btab158-B5","doi-asserted-by":"crossref","first-page":"1148","DOI":"10.1214\/18-AOS1709","article-title":"Generalized random forests","volume":"47","author":"Athey","year":"2019","journal-title":"Ann. Stat"},{"key":"2023051609202664000_btab158-B6","first-page":"1","article-title":"Kernel independent component analysis","volume":"3","author":"Bach","year":"2002","journal-title":"J. Mach. Learn. Res"},{"key":"2023051609202664000_btab158-B7","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1007\/BF02789700","article-title":"Robust canonical correlations: a comparative study","volume":"20","author":"Branco","year":"2005","journal-title":"Comput. Stat"},{"key":"2023051609202664000_btab158-B8","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn"},{"key":"2023051609202664000_btab158-B9","volume-title":"Classification and Regression Trees","author":"Breiman","year":"1984"},{"key":"2023051609202664000_btab158-B10","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1038\/nature11412","article-title":"Comprehensive molecular portraits of human breast tumours","volume":"490","year":"2012","journal-title":"Nature"},{"key":"2023051609202664000_btab158-B11","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/nrn1925","article-title":"Gene\u2013environment interactions in psychiatry: joining forces with neuroscience","volume":"7","author":"Caspi","year":"2006","journal-title":"Nat. Rev. Neurosci"},{"key":"2023051609202664000_btab158-B12","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1093\/cercor\/bht341","article-title":"Development of sensory gamma oscillations and cross-frequency coupling from childhood to early adulthood","volume":"25","author":"Cho","year":"2015","journal-title":"Cerebral Cortex"},{"key":"2023051609202664000_btab158-B13","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1007\/s10044-018-00775-y","article-title":"A recursive partitioning approach for subgroup identification in brain\u2013behaviour correlation analysis","volume":"23","author":"Choi","year":"2020","journal-title":"Pattern Analysis Appl"},{"key":"2023051609202664000_btab158-B14","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1016\/j.csda.2013.09.020","article-title":"Fast regularized canonical correlation analysis","volume":"70","author":"Cruz-Cano","year":"2014","journal-title":"Comput. Stat. Data Anal"},{"key":"2023051609202664000_btab158-B15","doi-asserted-by":"crossref","first-page":"1201","DOI":"10.1093\/cercor\/bhm155","article-title":"Que pasa? the posterior\u2013anterior shift in aging","volume":"18","author":"Davis","year":"2008","journal-title":"Cerebral Cortex"},{"key":"2023051609202664000_btab158-B16","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature11247","article-title":"An integrated encyclopedia of DNA elements in the human genome","volume":"489","year":"2012","journal-title":"Nature"},{"key":"2023051609202664000_btab158-B313047812","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1016\/0377-0427(89)90360-9","article-title":"Canonical correlations and generalized SVD: Applications and new algorithms","volume":"27","author":"Ewerbring","year":"1989","journal-title":"Journal of Computational and Applied Mathematics"},{"key":"2023051609202664000_btab158-B18","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1007\/s12021-017-9324-2","article-title":"Multi-view ensemble classification of brain connectivity images for neurodegeneration type discrimination","volume":"15","author":"Fratello","year":"2017","journal-title":"Neuroinformatics"},{"key":"2023051609202664000_btab158-B19","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1007\/s00455-009-9229-9","article-title":"Anthropometric and demographic correlates of dual-axis swallowing accelerometry signal characteristics: a canonical correlation analysis","volume":"25","author":"Hanna","year":"2010","journal-title":"Dysphagia"},{"key":"2023051609202664000_btab158-B20","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1007\/s10994-010-5222-7","article-title":"Sparse canonical correlation analysis","volume":"83","author":"Hardoon","year":"2011","journal-title":"Mach. Learn"},{"key":"2023051609202664000_btab158-B21","doi-asserted-by":"crossref","first-page":"2639","DOI":"10.1162\/0899766042321814","article-title":"Canonical correlation analysis: an overview with application to learning methods","volume":"16","author":"Hardoon","year":"2004","journal-title":"Neural Comput"},{"key":"2023051609202664000_btab158-B22","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1090\/S0025-5718-1957-0085600-6","article-title":"A rotation method for computing canonical correlations","volume":"11","author":"Healy","year":"1957","journal-title":"Math. Comput"},{"key":"2023051609202664000_btab158-B23","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1093\/biomet\/28.3-4.321","article-title":"Relations between two sets of variates","volume":"28","author":"Hotelling","year":"1936","journal-title":"Biometrika"},{"key":"2023051609202664000_btab158-B24","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1002\/sim.1593","article-title":"Bagging survival trees","volume":"23","author":"Hothorn","year":"2004","journal-title":"Stat. Med"},{"key":"2023051609202664000_btab158-B25","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1198\/106186006X133933","article-title":"Unbiased recursive partitioning: a conditional inference framework","volume":"15","author":"Hothorn","year":"2006","journal-title":"J. Comput. Graph. Stat"},{"key":"2023051609202664000_btab158-B26","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1038\/nrg1578","article-title":"Gene\u2013environment interactions in human diseases","volume":"6","author":"Hunter","year":"2005","journal-title":"Nat. Rev. Genet"},{"key":"2023051609202664000_btab158-B27","volume-title":"Fast Unified Random Forests for Survival, Regression, and Classification (RF-SRC)","author":"Ishwaran","year":"2020"},{"key":"2023051609202664000_btab158-B28","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1093\/biomet\/58.3.433","article-title":"Canonical analysis of several sets of variables","volume":"58","author":"Kettenring","year":"1971","journal-title":"Biometrika"},{"key":"2023051609202664000_btab158-B29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v036.i11","article-title":"Feature selection with the Boruta package","volume":"36","author":"Kursa","year":"2010","journal-title":"J. Stat. Softw"},{"key":"2023051609202664000_btab158-B30","doi-asserted-by":"crossref","first-page":"725","DOI":"10.1111\/j.2517-6161.1993.tb01936.x","article-title":"Canonical correlation analysis when the data are curves","volume":"55","author":"Leurgans","year":"1993","journal-title":"J. R. Stat. Soc. Ser. B (Methodological)"},{"key":"2023051609202664000_btab158-B31","doi-asserted-by":"crossref","first-page":"1433","DOI":"10.1111\/biom.12698","article-title":"Incorporating covariates into integrated factor analysis of multi-view data","volume":"73","author":"Li","year":"2017","journal-title":"Biometrics"},{"key":"2023051609202664000_btab158-B32","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1016\/j.bbr.2010.01.033","article-title":"Gender-specific hemodynamics in prefrontal cortex during a verbal working memory task by near-infrared spectroscopy","volume":"209","author":"Li","year":"2010","journal-title":"Behav. Brain Res"},{"key":"2023051609202664000_btab158-B33","first-page":"325","article-title":"A review on machine learning principles for multi-view biological data integration","volume":"19","author":"Li","year":"2018","journal-title":"Brief. Bioinf"},{"key":"2023051609202664000_btab158-B34","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1198\/016214505000001230","article-title":"Random forests and adaptive nearest neighbors","volume":"101","author":"Lin","year":"2006","journal-title":"J. Am. Stat. Assoc"},{"key":"2023051609202664000_btab158-B35","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1016\/j.neuroscience.2009.07.066","article-title":"Electrophysiological study of auditory development","volume":"164","author":"Lipp\u00e9","year":"2009","journal-title":"Neuroscience"},{"key":"2023051609202664000_btab158-B36","first-page":"4768","volume-title":"Advances in Neural Information Processing Systems, NIPS'17","author":"Lundberg","year":"2017"},{"key":"2023051609202664000_btab158-B37","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/s42256-019-0138-9","article-title":"From local explanations to global understanding with explainable AI for trees","volume":"2","author":"Lundberg","year":"2020","journal-title":"Nat. Mach. Intell"},{"key":"2023051609202664000_btab158-B38","doi-asserted-by":"crossref","first-page":"2119","DOI":"10.1093\/bioinformatics\/btr318","article-title":"Varying coefficient model for gene\u2013environment interaction: a non-linear look","volume":"27","author":"Ma","year":"2011","journal-title":"Bioinformatics"},{"key":"2023051609202664000_btab158-B39","first-page":"353","volume-title":"International Conference on Artificial Neural Networks","author":"Melzer","year":"2001"},{"key":"2023051609202664000_btab158-B40","doi-asserted-by":"crossref","first-page":"628","DOI":"10.1093\/bib\/bbv108","article-title":"Dimension reduction techniques for the integrative analysis of multi-omics data","volume":"17","author":"Meng","year":"2016","journal-title":"Brief. Bioinf"},{"key":"2023051609202664000_btab158-B41","first-page":"1967","volume-title":"Proceedings of the","author":"Michaeli","year":"2016"},{"key":"2023051609202664000_btab158-B42","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1016\/j.biopsych.2019.12.001","article-title":"Multiple holdouts with stability: improving the generalizability of machine learning analyses of brain\u2013behavior relationships","volume":"87","author":"Mihalik","year":"2020","journal-title":"Biol. Psychiatry"},{"key":"2023051609202664000_btab158-B43","first-page":"851","article-title":"Deep learning in bioinformatics","volume":"18","author":"Min","year":"2017","journal-title":"Brief. Bioinf"},{"key":"2023051609202664000_btab158-B44","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1007\/s10985-016-9372-1","article-title":"L1 splitting rules in survival forests","volume":"23","author":"Moradian","year":"2017","journal-title":"Lifetime Data Anal"},{"key":"2023051609202664000_btab158-B45","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1177\/0962280217727314","article-title":"Survival forests for data with dependent censoring","volume":"28","author":"Moradian","year":"2019","journal-title":"Stat. Methods Med. Res"},{"key":"2023051609202664000_btab158-B46","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1001\/jamapsychiatry.2017.4741","article-title":"Multivariate associations among behavioral, clinical, and multimodal imaging phenotypes in patients with psychosis","volume":"75","author":"Moser","year":"2018","journal-title":"JAMA Psychiatry"},{"key":"2023051609202664000_btab158-B47","first-page":"994","author":"Pezeshki","year":"2004"},{"key":"2023051609202664000_btab158-B48","doi-asserted-by":"crossref","first-page":"425","DOI":"10.1089\/brain.2018.0653","article-title":"Children with autism spectrum disorder demonstrate regionally specific altered resting-state phase\u2013amplitude coupling","volume":"9","author":"Port","year":"2019","journal-title":"Brain Connectivity"},{"key":"2023051609202664000_btab158-B49","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1177\/0962280219829885","article-title":"Prediction intervals with random forests","volume":"29","author":"Roy","year":"2020","journal-title":"Stat. Methods Med. Res"},{"key":"2023051609202664000_btab158-B50","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1016\/j.neuroimage.2017.07.051","article-title":"Time-resolved phase-amplitude coupling in neural oscillations","volume":"159","author":"Samiee","year":"2017","journal-title":"NeuroImage"},{"key":"2023051609202664000_btab158-B51","doi-asserted-by":"crossref","first-page":"2031","DOI":"10.1007\/s00521-013-1362-6","article-title":"A survey of multi-view machine learning","volume":"23","author":"Sun","year":"2013","journal-title":"Neural Comput. Appl"},{"key":"2023051609202664000_btab158-B52","doi-asserted-by":"crossref","first-page":"629","DOI":"10.1093\/bioinformatics\/btz602","article-title":"Non-parametric individual treatment effect estimation for survival data with random forests","volume":"36","author":"Tabib","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051609202664000_btab158-B53","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1016\/0304-4076(76)90010-5","article-title":"Canonical ridge and econometrics of joint production","volume":"4","author":"Vinod","year":"1976","journal-title":"J. Econometrics"},{"key":"2023051609202664000_btab158-B54","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1002\/bimj.201400226","article-title":"Sparse canonical correlation analysis from a predictive point of view","volume":"57","author":"Wilms","year":"2015","journal-title":"Biometrical J"},{"key":"2023051609202664000_btab158-B55","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1093\/biostatistics\/kxp008","article-title":"A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis","volume":"10","author":"Witten","year":"2009","journal-title":"Biostatistics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab158\/36666582\/btab158.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/17\/2714\/50338953\/btab158.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/17\/2714\/50338953\/btab158.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,25]],"date-time":"2024-08-25T08:14:53Z","timestamp":1724573693000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/17\/2714\/6162156"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,3,8]]},"references-count":55,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2021,9,9]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab158","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,9,1]]},"published":{"date-parts":[[2021,3,8]]}}}