{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T02:53:00Z","timestamp":1770519180907,"version":"3.49.0"},"reference-count":50,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T00:00:00Z","timestamp":1763424000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T00:00:00Z","timestamp":1763424000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"TU Wien"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Stat Comput"],"published-print":{"date-parts":[[2026,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Sparse and outlier-robust principal component analysis (PCA) has been a very active field of research recently. Yet, most existing methods apply PCA to a single data set whereas multi-source data\u2014i.e. multiple related data sets requiring joint analysis\u2014arise across many scientific areas. We introduce a novel PCA methodology that simultaneously (i) selects important features, (ii) allows for the detection of global sparse patterns across multiple data sources as well as local source-specific patterns, and (iii) is resistant to outliers. To this end, we develop a regularization problem with a penalty that accommodates global-local structured sparsity patterns, and where an outlier-robust covariance estimator, namely the ssMRCD, is used as plug-in to permit joint, robust analysis across multiple data sources. We provide an efficient implementation of our proposal via the alternating direction method of multipliers and illustrate its practical advantages in simulations and in applications.<\/jats:p>","DOI":"10.1007\/s11222-025-10771-y","type":"journal-article","created":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T19:37:52Z","timestamp":1763494672000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Sparse outlier-robust PCA for multi-source data"],"prefix":"10.1007","volume":"36","author":[{"given":"Patricia","family":"Puchhammer","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ines","family":"Wilms","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter","family":"Filzmoser","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,11,18]]},"reference":[{"issue":"13","key":"10771_CR1","first-page":"1","volume":"23","author":"D Bertsimas","year":"2022","unstructured":"Bertsimas, D., Cory-Wright, R., Pauphilet, J.: Solving large-scale sparse PCA to certifiable (near) optimality. J. Mach. Learn. Res. 23(13), 1\u201335 (2022)","journal-title":"J. Mach. Learn. Res."},{"issue":"32","key":"10771_CR2","first-page":"1","volume":"24","author":"D Bertsimas","year":"2023","unstructured":"Bertsimas, D., Kitane, D.L.: Sparse PCA: A geometric approach. J. Mach. Learn. Res. 24(32), 1\u201333 (2023)","journal-title":"J. Mach. Learn. Res."},{"issue":"1","key":"10771_CR3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1561\/2200000016","volume":"3","author":"S Boyd","year":"2011","unstructured":"Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, J.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends\u00ae in Machine Learning 3(1), 1\u2013122 (2011)","journal-title":"Foundations and Trends\u00ae in Machine Learning"},{"issue":"1","key":"10771_CR4","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1007\/s11222-019-09869-x","volume":"30","author":"K Boudt","year":"2020","unstructured":"Boudt, K., Rousseeuw, P.J., Vanduffel, S., Verdonck, T.: The minimum regularized covariance determinant estimator. Stat. Comput. 30(1), 113\u2013128 (2020)","journal-title":"Stat. Comput."},{"key":"10771_CR5","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1016\/j.eneco.2016.09.013","volume":"60","author":"L Barbaglia","year":"2016","unstructured":"Barbaglia, L., Wilms, I., Croux, C.: Commodity dynamics: A sparse multi-class approach. Energy Economics 60, 62\u201372 (2016)","journal-title":"Energy Economics"},{"issue":"2","key":"10771_CR6","doi-asserted-by":"publisher","first-page":"202","DOI":"10.1080\/00401706.2012.727746","volume":"55","author":"C Croux","year":"2013","unstructured":"Croux, C., Filzmoser, P., Fritz, H.: Robust sparse principal component analysis. Technometrics 55(2), 202\u2013214 (2013)","journal-title":"Technometrics"},{"issue":"2","key":"10771_CR7","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1006\/jmva.1999.1839","volume":"71","author":"C Croux","year":"1999","unstructured":"Croux, C., Haesbroeck, G.: Influence function and efficiency of the minimum covariance determinant scatter matrix estimator. J. Multivar. Anal. 71(2), 161\u2013190 (1999)","journal-title":"J. Multivar. Anal."},{"issue":"3","key":"10771_CR8","doi-asserted-by":"publisher","first-page":"603","DOI":"10.1093\/biomet\/87.3.603","volume":"87","author":"C Croux","year":"2000","unstructured":"Croux, C., Haesbroeck, G.: Principal component analysis based on robust estimators of the covariance or correlation matrix: Influence functions and efficiencies. Biometrika 87(3), 603\u2013618 (2000)","journal-title":"Biometrika"},{"issue":"2","key":"10771_CR9","doi-asserted-by":"publisher","first-page":"203","DOI":"10.1080\/757584614","volume":"22","author":"J Cadima","year":"1995","unstructured":"Cadima, J., Jolliffe, I.T.: Loading and correlations in the interpretation of principle compenents. J. Appl. Stat. 22(2), 203\u2013214 (1995)","journal-title":"J. Appl. Stat."},{"issue":"3","key":"10771_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/1970392.1970395","volume":"58","author":"EJ Cand\u00e8s","year":"2011","unstructured":"Cand\u00e8s, E.J., Li, X., Ma, Y., Wright, J.: Robust principal component analysis? Journal of the ACM (JACM) 58(3), 1\u201337 (2011)","journal-title":"Journal of the ACM (JACM)"},{"issue":"1","key":"10771_CR11","doi-asserted-by":"publisher","first-page":"206","DOI":"10.1016\/j.jmva.2004.08.002","volume":"95","author":"C Croux","year":"2005","unstructured":"Croux, C., Ruiz-Gazen, A.: High breakdown estimators for principal components: The projection-pursuit approach revisited. J. Multivar. Anal. 95(1), 206\u2013226 (2005)","journal-title":"J. Multivar. Anal."},{"key":"10771_CR12","unstructured":"d\u2019Aspremont, A., Bach, F., El\u00a0Ghaoui, L.: Optimal solutions for sparse principal component analysis. Journal of Machine Learning Research 9(7) (2008)"},{"issue":"2","key":"10771_CR13","doi-asserted-by":"publisher","first-page":"373","DOI":"10.1111\/rssb.12033","volume":"76","author":"P Danaher","year":"2014","unstructured":"Danaher, P., Wang, P., Witten, D.M.: The joint graphical lasso for inverse covariance estimation across multiple classes. J. R. Stat. Soc. Ser. B Stat Methodol. 76(2), 373\u2013397 (2014)","journal-title":"J. R. Stat. Soc. Ser. B Stat Methodol."},{"key":"10771_CR14","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-96422-5","volume-title":"Applied compositional data analysis","author":"P Filzmoser","year":"2018","unstructured":"Filzmoser, P., Hron, K., Templ, M.: Applied compositional data analysis. Springer, Cham (2018)"},{"issue":"1","key":"10771_CR15","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1007\/s11222-023-10328-x","volume":"34","author":"A Fayomi","year":"2024","unstructured":"Fayomi, A., Pantazis, Y., Tsagris, M., Wood, A.T.: Cauchy robust principal component analysis with applications to high-dimensional data sets. Stat. Comput. 34(1), 26 (2024)","journal-title":"Stat. Comput."},{"key":"10771_CR16","unstructured":"GeoSphere Austria https:\/\/data.hub.geosphere.at (2024)"},{"issue":"1","key":"10771_CR17","doi-asserted-by":"publisher","first-page":"18608","DOI":"10.1038\/s41598-019-54673-2","volume":"9","author":"Z Gu","year":"2019","unstructured":"Gu, Z., Schipper, N.C.D., Van Deun, K.: Variable selection in the regularized simultaneous component analysis method for multi-source data integration. Scientific reports 9(1), 18608 (2019)","journal-title":"Scientific reports"},{"issue":"3","key":"10771_CR18","doi-asserted-by":"publisher","first-page":"644","DOI":"10.1109\/TAC.2014.2354892","volume":"60","author":"E Ghadimi","year":"2014","unstructured":"Ghadimi, E., Teixeira, A., Shames, I., Johansson, M.: Optimal parameter selection for the alternating direction method of multipliers (ADMM): Quadratic problems. IEEE Trans. Autom. Control 60(3), 644\u2013658 (2014)","journal-title":"IEEE Trans. Autom. Control"},{"issue":"5","key":"10771_CR19","doi-asserted-by":"publisher","first-page":"2268","DOI":"10.3758\/s13428-018-1163-z","volume":"51","author":"Z Gu","year":"2019","unstructured":"Gu, Z., Van Deun, K.: Regularizedsca: Regularized simultaneous component analysis of multiblock data in R. Behav. Res. Methods 51(5), 2268\u20132289 (2019)","journal-title":"Behav. Res. Methods"},{"issue":"4","key":"10771_CR20","doi-asserted-by":"publisher","first-page":"424","DOI":"10.1080\/00401706.2015.1093962","volume":"58","author":"M Hubert","year":"2016","unstructured":"Hubert, M., Reynkens, T., Schmitt, E., Verdonck, T.: Sparse PCA for high-dimensional data with outliers. Technometrics 58(4), 424\u2013434 (2016)","journal-title":"Technometrics"},{"issue":"1\u20132","key":"10771_CR21","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1016\/S0169-7439(01)00188-5","volume":"60","author":"M Hubert","year":"2002","unstructured":"Hubert, M., Rousseeuw, P.J., Verboven, S.: A fast method for robust principal components with applications to chemometrics. Chemom. Intell. Lab. Syst. 60(1\u20132), 101\u2013111 (2002)","journal-title":"Chemom. Intell. Lab. Syst."},{"issue":"1","key":"10771_CR22","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1198\/004017004000000563","volume":"47","author":"M Hubert","year":"2005","unstructured":"Hubert, M., Rousseeuw, P.J., Vanden Branden, K.: ROBPCA: A new approach to robust principal component analysis. Technometrics 47(1), 64\u201379 (2005)","journal-title":"Technometrics"},{"key":"10771_CR23","unstructured":"Journ\u00e9e, M., Nesterov, Y., Richt\u00e1rik, P., Sepulchre, R.: Generalized power method for sparse principal component analysis. Journal of Machine Learning Research 11(2) (2010)"},{"key":"10771_CR24","unstructured":"Jenatton, R., Obozinski, G., Bach, F.: Structured sparse principal component analysis. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 366\u2013373 (2010). JMLR Workshop and Conference Proceedings"},{"issue":"3","key":"10771_CR25","doi-asserted-by":"publisher","first-page":"531","DOI":"10.1198\/1061860032148","volume":"12","author":"IT Jolliffe","year":"2003","unstructured":"Jolliffe, I.T., Trendafilov, N.T., Uddin, M.: A modified principal component technique based on the lasso. J. Comput. Graph. Stat. 12(3), 531\u2013547 (2003)","journal-title":"J. Comput. Graph. Stat."},{"issue":"391","key":"10771_CR26","doi-asserted-by":"publisher","first-page":"759","DOI":"10.1080\/01621459.1985.10478181","volume":"80","author":"G Li","year":"1985","unstructured":"Li, G., Chen, Z.: Projection-pursuit approach to robust dispersion matrices and principal components: Primary theory and monte carlo. J. Am. Stat. Assoc. 80(391), 759\u2013766 (1985)","journal-title":"J. Am. Stat. Assoc."},{"issue":"1","key":"10771_CR27","doi-asserted-by":"publisher","first-page":"523","DOI":"10.1214\/12-AOAS597","volume":"7","author":"EF Lock","year":"2013","unstructured":"Lock, E.F., Hoadley, K.A., Marron, J.S., Nobel, A.B.: Joint and individual variation explained (JIVE) for integrated analysis of multiple data types. Ann. Appl. Stat. 7(1), 523 (2013)","journal-title":"Ann. Appl. Stat."},{"issue":"3","key":"10771_CR28","doi-asserted-by":"publisher","first-page":"104","DOI":"10.1007\/s11222-024-10413-9","volume":"34","author":"S Leyder","year":"2024","unstructured":"Leyder, S., Raymaekers, J., Verdonck, T.: Generalized spherical principal component analysis. Stat. Comput. 34(3), 104 (2024)","journal-title":"Stat. Comput."},{"issue":"2","key":"10771_CR29","doi-asserted-by":"publisher","first-page":"772","DOI":"10.1214\/13-AOS1097","volume":"41","author":"Z Ma","year":"2013","unstructured":"Ma, Z.: Sparse principal component analysis and iterative thresholding. Ann. Stat. 41(2), 772\u2013801 (2013)","journal-title":"Ann. Stat."},{"key":"10771_CR30","doi-asserted-by":"crossref","unstructured":"NEXT Grant agreement Nr: 776804. https:\/\/doi.org\/10.3030\/776804 (2021)","DOI":"10.3030\/776804"},{"key":"10771_CR31","doi-asserted-by":"crossref","unstructured":"Puchhammer, P., Filzmoser, P.: Spatially smoothed robust covariance estimation for local outlier detection. Journal of Computational and Graphical Statistics, 1\u201313 (2023)","DOI":"10.1080\/10618600.2023.2277875"},{"issue":"4","key":"10771_CR32","doi-asserted-by":"publisher","first-page":"823","DOI":"10.1080\/10618600.2021.1874963","volume":"30","author":"BS Price","year":"2021","unstructured":"Price, B.S., Molstad, A.J., Sherwood, B.: Estimating multiple precision matrices with cluster fusion regularization. J. Comput. Graph. Stat. 30(4), 823\u2013834 (2021)","journal-title":"J. Comput. Graph. Stat."},{"issue":"232","key":"10771_CR33","first-page":"1","volume":"18","author":"BS Price","year":"2018","unstructured":"Price, B.S., Sherwood, B.: A cluster elastic net for multivariate regression. J. Mach. Learn. Res. 18(232), 1\u201339 (2018)","journal-title":"J. Mach. Learn. Res."},{"key":"10771_CR34","unstructured":"Puchhammer, P.: ssMRCD: Spatially Smoothed MRCD Estimator. (2023). R package version 1.1.0. https:\/\/CRAN.R-project.org\/package=ssMRCD"},{"issue":"3","key":"10771_CR35","doi-asserted-by":"publisher","first-page":"212","DOI":"10.1080\/00401706.1999.10485670","volume":"41","author":"PJ Rousseeuw","year":"1999","unstructured":"Rousseeuw, P.J., Driessen, K.V.: A fast algorithm for the minimum covariance determinant estimator. Technometrics 41(3), 212\u2013223 (1999)","journal-title":"Technometrics"},{"key":"10771_CR36","doi-asserted-by":"crossref","unstructured":"Reynkens, T.: rospca: Robust Sparse PCA Using the ROSPCA Algorithm. (2018). R package version 1.0.4. https:\/\/CRAN.R-project.org\/package=rospca","DOI":"10.32614\/CRAN.package.rospca"},{"issue":"283\u2013297","key":"10771_CR37","first-page":"37","volume":"8","author":"PJ Rousseeuw","year":"1985","unstructured":"Rousseeuw, P.J.: Multivariate estimation with high breakdown point. Math. Stat. Appl. 8(283\u2013297), 37 (1985)","journal-title":"Math. Stat. Appl."},{"issue":"2","key":"10771_CR38","doi-asserted-by":"publisher","first-page":"231","DOI":"10.1080\/10618600.2012.681250","volume":"22","author":"N Simon","year":"2013","unstructured":"Simon, N., Friedman, J., Hastie, T., Tibshirani, R.: A sparse-group lasso. J. Comput. Graph. Stat. 22(2), 231\u2013245 (2013)","journal-title":"J. Comput. Graph. Stat."},{"issue":"6","key":"10771_CR39","doi-asserted-by":"publisher","first-page":"1015","DOI":"10.1016\/j.jmva.2007.06.007","volume":"99","author":"H Shen","year":"2008","unstructured":"Shen, H., Huang, J.Z.: Sparse principal component analysis via regularized low rank matrix approximation. J. Multivar. Anal. 99(6), 1015\u20131034 (2008)","journal-title":"J. Multivar. Anal."},{"key":"10771_CR40","first-page":"1","volume":"25","author":"N Shi","year":"2024","unstructured":"Shi, N., Kontar, R.A.: Personalized PCA: Decoupling shared and unique features. J. Mach. Learn. Res. 25, 1\u201382 (2024)","journal-title":"J. Mach. Learn. Res."},{"key":"10771_CR41","doi-asserted-by":"crossref","unstructured":"Soetaert, K.: rootSolve: Nonlinear Root Finding, Equilibrium and Steady-state Analysis of Ordinary Differential Equations. (2009). R package version 1.6","DOI":"10.32614\/CRAN.package.rootSolve"},{"issue":"6","key":"10771_CR42","doi-asserted-by":"publisher","first-page":"1229","DOI":"10.1007\/s11222-015-9602-5","volume":"26","author":"E Schmitt","year":"2016","unstructured":"Schmitt, E., Vakili, K.: The FastHCS algorithm for robust PCA. Stat. Comput. 26(6), 1229\u20131242 (2016)","journal-title":"Stat. Comput."},{"issue":"198","key":"10771_CR43","first-page":"1","volume":"22","author":"TM Tang","year":"2021","unstructured":"Tang, T.M., Allen, G.I.: Integrated principal components analysis. J. Mach. Learn. Res. 22(198), 1\u201371 (2021)","journal-title":"J. Mach. Learn. Res."},{"issue":"2","key":"10771_CR44","doi-asserted-by":"publisher","first-page":"435","DOI":"10.1111\/rssc.12231","volume":"67","author":"I Wilms","year":"2018","unstructured":"Wilms, I., Barbaglia, L., Croux, C.: Multiclass vector auto-regressive models for multistore sales data. J. R. Stat. Soc.: Ser. C: Appl. Stat. 67(2), 435\u2013452 (2018)","journal-title":"J. R. Stat. Soc.: Ser. C: Appl. Stat."},{"key":"10771_CR45","unstructured":"Wang, W., Liang, Y., Xing, E.: Block regularized lasso for multivariate multi-response linear regression. In: Carvalho, C.M., Ravikumar, P. (eds.) Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. Proceedings of Machine Learning Research, vol. 31, pp. 608\u2013617. PMLR, Scottsdale, Arizona, USA (2013). https:\/\/proceedings.mlr.press\/v31\/wang13c.html"},{"issue":"4","key":"10771_CR46","doi-asserted-by":"publisher","first-page":"473","DOI":"10.1080\/00401706.2019.1671234","volume":"62","author":"Y Wang","year":"2020","unstructured":"Wang, Y., Van Aelst, S.: Sparse principal component analysis based on least trimmed squares. Technometrics 62(4), 473\u2013485 (2020)","journal-title":"Technometrics"},{"key":"10771_CR47","doi-asserted-by":"publisher","first-page":"524","DOI":"10.1016\/j.patcog.2016.08.025","volume":"61","author":"S Yi","year":"2017","unstructured":"Yi, S., Lai, Z., He, Z., Cheung, Y.-M., Liu, Y.: Joint sparse principal component analysis. Pattern Recogn. 61, 524\u2013536 (2017)","journal-title":"Pattern Recogn."},{"issue":"77","key":"10771_CR48","first-page":"1","volume":"25","author":"Y Yao","year":"2024","unstructured":"Yao, Y., Peng, L., Tsakiris, M.C.: Unlabeled principal component analysis and matrix completion. J. Mach. Learn. Res. 25(77), 1\u201338 (2024)","journal-title":"J. Mach. Learn. Res."},{"key":"10771_CR49","doi-asserted-by":"crossref","unstructured":"Zhou, G., Cichocki, A., Zhang, Y., Mandic, D.P.: Group component analysis for multiblock data: Common and individual feature extraction. EEE Trans. Neural Netw. Learn. Syst. 27(11), 2426\u20132439 (2016)","DOI":"10.1109\/TNNLS.2015.2487364"},{"issue":"2","key":"10771_CR50","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1198\/106186006X113430","volume":"15","author":"H Zou","year":"2006","unstructured":"Zou, H., Hastie, T., Tibshirani, R.: Sparse principal component analysis. J. Comput. Graph. Stat. 15(2), 265\u2013286 (2006)","journal-title":"J. Comput. Graph. Stat."}],"container-title":["Statistics and Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11222-025-10771-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11222-025-10771-y","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11222-025-10771-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,7]],"date-time":"2026-02-07T03:53:25Z","timestamp":1770436405000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11222-025-10771-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,18]]},"references-count":50,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2026,2]]}},"alternative-id":["10771"],"URL":"https:\/\/doi.org\/10.1007\/s11222-025-10771-y","relation":{},"ISSN":["0960-3174","1573-1375"],"issn-type":[{"value":"0960-3174","type":"print"},{"value":"1573-1375","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,18]]},"assertion":[{"value":"6 February 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 November 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 November 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}},{"value":"All code for simulations and the weather data example is part of the supplementary material.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Code availability"}}],"article-number":"22"}}