{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T11:35:35Z","timestamp":1780486535061,"version":"3.54.1"},"reference-count":39,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2022,5,1]],"date-time":"2022-05-01T00:00:00Z","timestamp":1651363200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,5,6]],"date-time":"2022-05-06T00:00:00Z","timestamp":1651795200000},"content-version":"vor","delay-in-days":5,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[2022,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>A novel multi-task Gaussian process (GP) framework is proposed, by using a common mean process for sharing information across tasks. In particular, we investigate the problem of time series forecasting, with the objective to improve multiple-step-ahead predictions. The common mean process is defined as a GP for which the hyper-posterior distribution is tractable. Therefore an EM algorithm is derived for handling both hyper-parameters optimisation and hyper-posterior computation. Unlike previous approaches in the literature, the model fully accounts for uncertainty and can handle irregular grids of observations while maintaining explicit formulations, by modelling the mean process in a unified GP framework. Predictive analytical equations are provided, integrating information shared across tasks through a relevant prior mean. This approach greatly improves the predictive performances, even far from observations, and may reduce significantly the computational complexity compared to traditional multi-task GP models. Our overall algorithm is called <jats:sc>Magma<\/jats:sc> (standing for Multi tAsk GPs with common MeAn). The quality of the mean process estimation, predictive performances, and comparisons to alternatives are assessed in various simulated scenarios and on real datasets.<\/jats:p>","DOI":"10.1007\/s10994-022-06172-1","type":"journal-article","created":{"date-parts":[[2022,5,6]],"date-time":"2022-05-06T20:05:00Z","timestamp":1651867500000},"page":"1821-1849","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["MAGMA: inference and prediction using multi-task Gaussian processes with common mean"],"prefix":"10.1007","volume":"111","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0806-8934","authenticated-orcid":false,"given":"Arthur","family":"Leroy","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Pierre","family":"Latouche","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1237-7430","authenticated-orcid":false,"given":"Benjamin","family":"Guedj","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Servane","family":"Gey","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2022,5,6]]},"reference":[{"key":"6172_CR1","unstructured":"Alaa, A. M., & van der Schaar, M. (2017). Bayesian inference of individualized treatment effects using multi-task Gaussian processes. In Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., & Garnett, R. (Eds.) Advances in neural information processing systems 30, Curran Associates, Inc., pp. 3424\u20133432."},{"issue":"41","key":"6172_CR2","first-page":"1459","volume":"12","author":"MA \u00c1lvarez","year":"2011","unstructured":"\u00c1lvarez, M. A., & Lawrence, N. D. (2011). Computationally efficient convolved multiple output Gaussian processes. Journal of Machine Learning Research, 12(41), 1459\u20131500.","journal-title":"Journal of Machine Learning Research"},{"key":"6172_CR3","doi-asserted-by":"publisher","unstructured":"\u00c1lvarez, M. A., Rosasco, L., & Lawrence, N.D. (2012). Kernels for vector-valued functions: A review. Foundations and Trends\u00ae in Machine Learning, 4(3), 195\u2013266. https:\/\/doi.org\/10.1561\/2200000036","DOI":"10.1561\/2200000036"},{"issue":"3","key":"6172_CR4","doi-asserted-by":"publisher","first-page":"561","DOI":"10.1016\/S0167-9473(02)00163-9","volume":"41","author":"C Biernacki","year":"2003","unstructured":"Biernacki, C., Celeux, G., & Govaert, G. (2003). Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models. Computational Statistics & Data Analysis, 41(3), 561\u2013575. https:\/\/doi.org\/10.1016\/S0167-9473(02)00163-9","journal-title":"Computational Statistics & Data Analysis"},{"key":"6172_CR5","volume-title":"Pattern Recognition and Machine Learning","author":"CM Bishop","year":"2006","unstructured":"Bishop, C. M. (2006). Pattern recognition and machine learning, information science and statistics. Springer."},{"key":"6172_CR6","unstructured":"Bonilla, E. V., Chai, K. M., & Williams, C. (2008). Multi-task Gaussian process prediction. In Platt, J. C., Koller, D., Singer, Y., Roweis, S. T. (Eds.) Advances in neural information processing systems 20, Curran Associates, Inc., pp. 153\u2013160."},{"issue":"1","key":"6172_CR7","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1023\/A:1007379606734","volume":"28","author":"R Caruana","year":"1997","unstructured":"Caruana, R. (1997). Multitask learning. Machine Learning, 28(1), 41\u201375. https:\/\/doi.org\/10.1023\/A:1007379606734","journal-title":"Machine Learning"},{"issue":"2","key":"6172_CR8","doi-asserted-by":"publisher","first-page":"83","DOI":"10.2307\/2682801","volume":"39","author":"G Casella","year":"1985","unstructured":"Casella, G. (1985). An introduction to empirical Bayes data analysis. The American Statistician, 39(2), 83\u201387. https:\/\/doi.org\/10.2307\/2682801","journal-title":"The American Statistician"},{"key":"6172_CR9","doi-asserted-by":"publisher","unstructured":"Clingerman, C., & Eaton, E. (2017). Lifelong learning with Gaussian processes. In: Ceci, M., Hollm\u00e9n, J., Todorovski, L., Vens, C., D\u017eeroski, S. (Eds) Machine learning and knowledge discovery in databases (Vol. 10535, pp 690\u2013704). Springer. https:\/\/doi.org\/10.1007\/978-3-319-71246-8_42","DOI":"10.1007\/978-3-319-71246-8_42"},{"key":"6172_CR10","doi-asserted-by":"crossref","unstructured":"Crainiceanu, C. M., & Goldsmith, A. J. (2010). Bayesian functional data analysis using WinBUGS. Journal of Statistical Software, 32(11).","DOI":"10.18637\/jss.v032.i11"},{"issue":"1","key":"6172_CR11","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","volume":"39","author":"AP Dempster","year":"1977","unstructured":"Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B (Methodological), 39(1), 1\u201338.","journal-title":"Journal of the Royal Statistical Society Series B (Methodological)"},{"key":"6172_CR12","doi-asserted-by":"publisher","unstructured":"Duvenaud, D. (2014). Automatic model construction with Gaussian processes. Thesis, University of Cambridge, https:\/\/doi.org\/10.17863\/CAM.14087","DOI":"10.17863\/CAM.14087"},{"key":"6172_CR13","unstructured":"Ferraty, F., & Vieu, P. (2006). Nonparametric functional data analysis: Theory and practice. Springer."},{"issue":"3","key":"6172_CR14","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1527\/tjsai.27.103","volume":"27","author":"K Hayashi","year":"2012","unstructured":"Hayashi, K., Takenouchi, T., Tomioka, R., & Kashima, H. (2012). Self-measuring similarity for multi-task Gaussian process. Transactions of the Japanese Society for Artificial Intelligence, 27(3), 103\u2013110. https:\/\/doi.org\/10.1527\/tjsai.27.103","journal-title":"Transactions of the Japanese Society for Artificial Intelligence"},{"key":"6172_CR15","volume-title":"The EM Algorithm and Extensions","author":"GJ McLachlan","year":"2007","unstructured":"McLachlan, G. J., & Krishnan, T. (2007). The EM algorithm and extensions. Wiley."},{"key":"6172_CR16","doi-asserted-by":"publisher","unstructured":"Morales, J. L., & Nocedal, J. (2011). Remark on algorithm L-BFGS-B: Fortran subroutines for large-scale bound constrained optimization. ACM Transactions on Mathematical Software, 38(1), 7:1\u20137:4. https:\/\/doi.org\/10.1145\/2049662.2049669","DOI":"10.1145\/2049662.2049669"},{"key":"6172_CR17","unstructured":"Moreno-Mu\u00f1oz, P., Art\u00e9s-Rodr\u00edguez, A., & \u00c1lvarez, M. A. (2019). Continual multi-task Gaussian processes. arXiv:1911.00002 [cs, stat] arXiv:1911.00002"},{"key":"6172_CR18","unstructured":"Nguyen, T. V., & Bonilla, E. V. (2014). Collaborative multi-output Gaussian processes. In Proceedings of the thirtieth conference on uncertainty in artificial intelligence, AUAI Press, UAI\u201914, pp. 643\u2013652"},{"issue":"151","key":"6172_CR19","doi-asserted-by":"publisher","first-page":"773","DOI":"10.1090\/S0025-5718-1980-0572855-7","volume":"35","author":"J Nocedal","year":"1980","unstructured":"Nocedal, J. (1980). Updating quasi-Newton matrices with limited storage. Mathematics of Computation, 35(151), 773\u2013782. https:\/\/doi.org\/10.1090\/S0025-5718-1980-0572855-7","journal-title":"Mathematics of Computation"},{"key":"6172_CR20","volume-title":"Approximation Methods for Gaussian Process Regression","author":"J Qui\u00f1onero-Candela","year":"2007","unstructured":"Qui\u00f1onero-Candela, J., Rasmussen, C. E., & Williams, C. K. I. (2007). Approximation methods for Gaussian process regression. MIT Press."},{"key":"6172_CR21","unstructured":"Rakitsch, B., Lippert, C., Borgwardt, K., & Stegle, O. (2013). It is all in the noise: Efficient multi-task Gaussian process inference with structured residuals. In Advances in neural information processing systems 26, Curran Associates, Inc., pp. 1466\u20131474"},{"key":"6172_CR22","doi-asserted-by":"publisher","DOI":"10.1007\/b98888","volume-title":"Functional Data Analysis","author":"JO Ramsay","year":"2005","unstructured":"Ramsay, J. O., & Silverman, B. W. (2005). Functional data analysis. Springer."},{"key":"6172_CR23","volume-title":"Gaussian Processes for Machine Learning","author":"CE Rasmussen","year":"2006","unstructured":"Rasmussen, C. E., & Williams, C. K. I. (2006). Gaussian processes for machine learning, adaptive computation and machine learning. MIT Press."},{"issue":"1","key":"6172_CR24","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1111\/j.2517-6161.1991.tb01821.x","volume":"53","author":"JA Rice","year":"1991","unstructured":"Rice, J. A., & Silverman, B. W. (1991). Estimating the mean and covariance structure nonparametrically when the data are curves. Journal of the Royal Statistical Society Series B (Methodological), 53(1), 233\u2013243.","journal-title":"Journal of the Royal Statistical Society Series B (Methodological)"},{"key":"6172_CR25","first-page":"8","volume":"17","author":"A Schwaighofer","year":"2004","unstructured":"Schwaighofer, A., Tresp, V., & Yu, K. (2004). Learning Gaussian process kernels via hierarchical bayes. Advances in Neural Information Processing Systems, 17, 8.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"6172_CR27","unstructured":"Shi, J. Q., & Cheng, Y. (2014). Gaussian process function data analysis R package \u2018GPFDA\u2019. https:\/\/cran.r-project.org\/web\/packages\/GPFDA\/GPFDA.pdf"},{"key":"6172_CR28","doi-asserted-by":"publisher","DOI":"10.1201\/b11038","volume-title":"Gaussian Process Regression Analysis for Functional Data","author":"JQ Shi","year":"2011","unstructured":"Shi, J. Q., & Choi, T. (2011). Gaussian process regression analysis for functional data. CRC Press."},{"issue":"1","key":"6172_CR26","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1007\/s11222-005-4787-7","volume":"15","author":"J Shi","year":"2005","unstructured":"Shi, J., Murray-Smith, R., & Titterington, D. (2005). Hierarchical Gaussian process mixtures for regression. Statistics and Computing, 15(1), 31\u201341. https:\/\/doi.org\/10.1007\/s11222-005-4787-7","journal-title":"Statistics and Computing"},{"issue":"3","key":"6172_CR29","doi-asserted-by":"publisher","first-page":"714","DOI":"10.1111\/j.1541-0420.2007.00758.x","volume":"63","author":"JQ Shi","year":"2007","unstructured":"Shi, J. Q., Wang, B., Murray-Smith, R., & Titterington, D. M. (2007). Gaussian process functional regression modeling for batch data. Biometrics, 63(3), 714\u2013723. https:\/\/doi.org\/10.1111\/j.1541-0420.2007.00758.x","journal-title":"Biometrics"},{"key":"6172_CR30","unstructured":"Snelson, E., & Ghahramani, Z. (2006). Sparse Gaussian processes using pseudo-inputs. In Advances in neural information processing systems (Vol\u00a018), MIT Press"},{"key":"6172_CR31","first-page":"2004","volume":"26","author":"K Swersky","year":"2013","unstructured":"Swersky, K., Snoek, J., & Adams, R. P. (2013). Multi-task Bayesian optimization. Advances in Neural Information Processing Systems, 26, 2004\u20132012.","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"1","key":"6172_CR32","doi-asserted-by":"publisher","first-page":"54","DOI":"10.1111\/j.1541-0420.2007.00829.x","volume":"64","author":"WK Thompson","year":"2008","unstructured":"Thompson, W. K., & Rosen, O. (2008). A Bayesian model for sparse functional data. Biometrics, 64(1), 54\u201363. https:\/\/doi.org\/10.1111\/j.1541-0420.2007.00829.x","journal-title":"Biometrics"},{"key":"6172_CR33","unstructured":"Titsias, M. (2009). Variational learning of inducing variables in sparse Gaussian processes. In Proceedings of the twelth international conference on artificial intelligence and statistics, PMLR, pp. 567\u2013574."},{"issue":"2","key":"6172_CR34","doi-asserted-by":"publisher","first-page":"271","DOI":"10.1016\/S0893-6080(97)00133-0","volume":"11","author":"N Ueda","year":"1998","unstructured":"Ueda, N., & Nakano, R. (1998). Deterministic annealing EM algorithm. Neural Networks, 11(2), 271\u2013282. https:\/\/doi.org\/10.1016\/S0893-6080(97)00133-0","journal-title":"Neural Networks"},{"key":"6172_CR35","first-page":"265","volume":"21","author":"C Williams","year":"2009","unstructured":"Williams, C., Klanke, S., Vijayakumar, S., & Chai, K. M. (2009). Multi-task Gaussian process learning of robot inverse dynamics. Advances in Neural Information Processing Systems, 21, 265\u2013272.","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"3","key":"6172_CR36","doi-asserted-by":"publisher","first-page":"649","DOI":"10.1214\/15-BA967","volume":"11","author":"J Yang","year":"2016","unstructured":"Yang, J., Zhu, H., Choi, T., & Cox, D. D. (2016). Smoothing and mean-covariance estimation of functional data with a Bayesian hierarchical model. Bayesian Analysis, 11(3), 649\u2013670. https:\/\/doi.org\/10.1214\/15-BA967","journal-title":"Bayesian Analysis"},{"issue":"4","key":"6172_CR37","doi-asserted-by":"publisher","first-page":"1082","DOI":"10.1111\/biom.12705","volume":"73","author":"J Yang","year":"2017","unstructured":"Yang, J., Cox, D. D., Lee, J. S., Ren, P., & Choi, T. (2017). Efficient Bayesian hierarchical functional data analysis with basis function approximations using Gaussian-Wishart processes. Biometrics, 73(4), 1082\u20131091. https:\/\/doi.org\/10.1111\/biom.12705","journal-title":"Biometrics"},{"key":"6172_CR38","doi-asserted-by":"publisher","unstructured":"Yu, K., Tresp, V., & Schwaighofer, A. (2005). Learning Gaussian processes from multiple tasks. In Proceedings of the 22nd international conference on machine learning, ACM, ICML \u201905, pp. 1012\u20131019. https:\/\/doi.org\/10.1145\/1102351.1102479","DOI":"10.1145\/1102351.1102479"},{"key":"6172_CR39","doi-asserted-by":"publisher","unstructured":"Zhu, J., & Sun, S. (2014). Multi-task sparse Gaussian processes with improved multi-task sparsity regularization. In Pattern recognition, Springer, pp. 54\u201362, https:\/\/doi.org\/10.1007\/978-3-662-45646-0_6","DOI":"10.1007\/978-3-662-45646-0_6"}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-022-06172-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10994-022-06172-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-022-06172-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,20]],"date-time":"2022-05-20T20:18:45Z","timestamp":1653077925000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10994-022-06172-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5]]},"references-count":39,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,5]]}},"alternative-id":["6172"],"URL":"https:\/\/doi.org\/10.1007\/s10994-022-06172-1","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"value":"0885-6125","type":"print"},{"value":"1573-0565","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5]]},"assertion":[{"value":"12 April 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 January 2022","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 March 2022","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 May 2022","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}