{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T02:08:33Z","timestamp":1760148513050,"version":"build-2065373602"},"reference-count":40,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2023,5,12]],"date-time":"2023-05-12T00:00:00Z","timestamp":1683849600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Foundation for Science and Technology (FCT, Portugal)","award":["UIDB\/05757\/2020","UIDP\/05757\/2020","LA\/P\/0007\/2021"],"award-info":[{"award-number":["UIDB\/05757\/2020","UIDP\/05757\/2020","LA\/P\/0007\/2021"]}]},{"name":"FCT\/MCTES (PIDDAC)","award":["UIDB\/05757\/2020","UIDP\/05757\/2020","LA\/P\/0007\/2021"],"award-info":[{"award-number":["UIDB\/05757\/2020","UIDP\/05757\/2020","LA\/P\/0007\/2021"]}]},{"name":"SusTEC","award":["UIDB\/05757\/2020","UIDP\/05757\/2020","LA\/P\/0007\/2021"],"award-info":[{"award-number":["UIDB\/05757\/2020","UIDP\/05757\/2020","LA\/P\/0007\/2021"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computation"],"abstract":"<jats:p>The reconstruction or prediction of meteorological records through the Analog Ensemble (AnEn) method is very efficient when the number of predictor time series is small. Thus, in order to take advantage of the richness and diversity of information contained in a large number of predictors, it is necessary to reduce their dimensions. This study presents methods to accomplish such reduction, allowing the use of a high number of predictor variables. In particular, the techniques of Principal Component Analysis (PCA) and Partial Least Squares (PLS) are used to reduce the dimension of the predictor dataset without loss of essential information. The combination of the AnEn and PLS techniques results in a very efficient hybrid method (PLSAnEn) for reconstructing or forecasting unstable meteorological variables, such as wind speed. This hybrid method is computationally demanding but its performance can be improved via parallelization or the introduction of variants in which all possible analogs are previously clustered. The multivariate linear regression methods used on the new variables resulting from the PCA or PLS techniques also proved to be efficient, especially for the prediction of meteorological variables without local oscillations, such as the pressure.<\/jats:p>","DOI":"10.3390\/computation11050098","type":"journal-article","created":{"date-parts":[[2023,5,15]],"date-time":"2023-05-15T02:56:57Z","timestamp":1684119417000},"page":"98","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Reconstruction of Meteorological Records by Methods Based on Dimension Reduction of the Predictor Dataset"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2431-8665","authenticated-orcid":false,"given":"Carlos","family":"Balsa","sequence":"first","affiliation":[{"name":"Research Centre in Digitalization and Intelligent Robotics (CeDRI), Laborat\u00f3rio para a Sustentabilidade e Tecnologia em Regi\u00f5es de Montanha (SusTEC), Instituto Polit\u00e9cnico de Bragan\u00e7a, Campus de Santa Apol\u00f3nia, 5300-253 Bragan\u00e7a, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5781-0480","authenticated-orcid":false,"given":"Murilo M.","family":"Breve","sequence":"additional","affiliation":[{"name":"Research Centre in Digitalization and Intelligent Robotics (CeDRI), Laborat\u00f3rio para a Sustentabilidade e Tecnologia em Regi\u00f5es de Montanha (SusTEC), Instituto Polit\u00e9cnico de Bragan\u00e7a, Campus de Santa Apol\u00f3nia, 5300-253 Bragan\u00e7a, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9030-7158","authenticated-orcid":false,"given":"Carlos V.","family":"Rodrigues","sequence":"additional","affiliation":[{"name":"Vestas Wind Systems A\/S, Design Centre Porto, 4465-671 Le\u00e7a do Balio, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1344-8264","authenticated-orcid":false,"given":"Jos\u00e9","family":"Rufino","sequence":"additional","affiliation":[{"name":"Research Centre in Digitalization and Intelligent Robotics (CeDRI), Laborat\u00f3rio para a Sustentabilidade e Tecnologia em Regi\u00f5es de Montanha (SusTEC), Instituto Polit\u00e9cnico de Bragan\u00e7a, Campus de Santa Apol\u00f3nia, 5300-253 Bragan\u00e7a, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2023,5,12]]},"reference":[{"key":"ref_1","first-page":"50","article-title":"Big data: A revolution that will transform how we live, work, and think","volume":"50","author":"Cukier","year":"2013","journal-title":"Choice Rev. Online"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Jain, H., and Jain, R. (2017, January 23\u201325). Big data in weather forecasting: Applications and challenges. Proceedings of the 2017 International Conference on Big Data Analytics and Computational Intelligence (ICBDAC), Chirala, Andhra Pradesh, India.","DOI":"10.1109\/ICBDACI.2017.8070824"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1016\/0038-092X(91)90023-P","article-title":"A methodology for the synthesis of hourly weather data","volume":"46","author":"Knight","year":"1991","journal-title":"Sol. Energy"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"3554","DOI":"10.1175\/2011MWR3653.1","article-title":"Kalman Filter and Analog Schemes to Postprocess Numerical Weather Predictions","volume":"139","author":"Monache","year":"2011","journal-title":"Mon. Weather Rev."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"3498","DOI":"10.1175\/MWR-D-12-00281.1","article-title":"Probabilistic Weather Prediction with an Analog Ensemble","volume":"141","author":"Monache","year":"2013","journal-title":"Mon. Weather Rev."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"768","DOI":"10.1016\/j.renene.2014.11.061","article-title":"A novel application of an analog ensemble for short-term wind power forecasting","volume":"76","author":"Alessandrini","year":"2015","journal-title":"Renew. Energy"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"274","DOI":"10.1016\/j.renene.2017.02.052","article-title":"Short-term photovoltaic power forecasting using Artificial Neural Networks and an Analog Ensemble","volume":"108","author":"Cervone","year":"2017","journal-title":"Renew. Energy"},{"key":"ref_8","unstructured":"Mercer, D. (2022, July 28). Clustering Large Datasets. Technical Report. Available online: http:\/\/pzs.dstu.dp.ua\/DataMining\/cluster\/bibl\/LargeClaster.pdf."},{"key":"ref_9","unstructured":"Balsa, C., Rodrigues, C.V., Ara\u00fajo, L., and Rufino, J. (2021). Communications in Computer and Information Science, Springer International Publishing."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"053703","DOI":"10.1063\/1.5124711","article-title":"Ultra-fast analog ensemble using kd-tree","volume":"11","author":"Yang","year":"2019","journal-title":"J. Renew. Sustain. Energy"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.55969\/paradigmplus.v1n2a1","article-title":"Using Analog Ensembles with Alternative Metrics for Hindcasting with Multistations","volume":"1","author":"Balsa","year":"2020","journal-title":"ParadigmPlus"},{"key":"ref_12","unstructured":"Ara\u00fajo, L., Balsa, C., Rodrigues, C.V., and Rufino, J. (2021). Advances in Intelligent Systems and Computing, Springer International Publishing."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Balsa, C., Rodrigues, C.V., Ara\u00fajo, L., and Rufino, J. (2022). Cluster-Based Analogue Ensembles for Hindcasting with Multistations. Computation, 10.","DOI":"10.3390\/computation10060091"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1016\/j.apenergy.2015.08.011","article-title":"An analog ensemble for short-term probabilistic solar power forecast","volume":"157","author":"Alessandrini","year":"2015","journal-title":"Appl. Energy"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"373","DOI":"10.1007\/s42865-020-00028-z","article-title":"Post-processing rainfall in a high-resolution simulation of the 1994 Piedmont flood","volume":"1","author":"Meech","year":"2020","journal-title":"Bull. Atmos. Sci. Technol."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Rozoff, C.M., and Alessandrini, S. (2022). A Comparison between Analog Ensemble and Convolutional Neural Network Empirical-Statistical Downscaling Techniques for Reconstructing High-Resolution Near-Surface Wind. Energies, 15.","DOI":"10.3390\/en15051718"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1016\/j.solener.2021.11.033","article-title":"Predicting rare events of solar power production with the analog ensemble","volume":"231","author":"Alessandrini","year":"2022","journal-title":"Sol. Energy"},{"key":"ref_18","unstructured":"Jolliffe, I.T. (2002). Principal Component Analysis, Springer. [2nd ed.]."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Zhang, X.D. (2020). A Matrix Algebra Approach to Artificial Intelligence, Springer.","DOI":"10.1007\/978-981-15-2770-8"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Eld\u00e9n, L. (2007). Matrix Methods in Data Mining and Pattern Recognition, SIAM.","DOI":"10.1137\/1.9780898718867"},{"key":"ref_21","unstructured":"Spence, L., Insel, A., and Friedberg, S. (2013). Elementary Linear Algebra: A matrix Approach, Pearson Education Limited."},{"key":"ref_22","unstructured":"Balsa, C., Breve, M.M., Rodrigues, C.V., Costa, L.S., and Rufino, J. (2022). Communications in Computer and Information Science, Springer Nature."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1080\/01621459.1965.10480787","article-title":"Principal Components Regression in Exploratory Statistical Research","volume":"60","author":"Massy","year":"1965","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1002\/wics.51","article-title":"Partial least squares regression and projection on latent structure regression (PLS Regression)","volume":"2","author":"Abdi","year":"2010","journal-title":"WIREs Comput. Stat."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v018.i02","article-title":"The pls Package: Principal Component and PartialLeast Squares Regression in R","volume":"18","author":"Mevik","year":"2007","journal-title":"J. Stat. Softw."},{"key":"ref_26","unstructured":"Wold, H. (1973). Multivariate Analysis\u2013III, Elsevier."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1016\/0169-7439(93)85002-X","article-title":"SIMPLS: An alternative approach to partial least squares regression","volume":"18","author":"Jong","year":"1993","journal-title":"Chemom. Intell. Lab. Syst."},{"key":"ref_28","unstructured":"Rosipal, R., and Kr\u00e4mer, N. (2006). Subspace, Latent Structure and Feature Selection, Springer."},{"key":"ref_29","unstructured":"National Oceanic and Atmospheric Administration, National Weather Service (2022, July 28). National Data Buoy Center, Available online: https:\/\/www.ndbc.noaa.gov."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"743","DOI":"10.1016\/j.solener.2022.03.004","article-title":"Verifying operational intra-day solar forecasts from ECMWF and NOAA","volume":"236","author":"Yang","year":"2022","journal-title":"Sol. Energy"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1016\/j.solener.2016.04.049","article-title":"Post-processing techniques and principal component analysis for regional wind power and solar irradiance forecasting","volume":"134","author":"Alessandrini","year":"2016","journal-title":"Sol. Energy"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1247","DOI":"10.5194\/gmd-7-1247-2014","article-title":"Root mean square error (RMSE) or mean absolute error (MAE)?\u2014Arguments against avoiding RMSE in the literature","volume":"7","author":"Chai","year":"2014","journal-title":"Geosci. Model Dev."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1111\/j.2517-6161.1974.tb00994.x","article-title":"Cross-Validatory Choice and Assessment of Statistical Predictions","volume":"36","author":"Stone","year":"1974","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/00401706.1968.10490530","article-title":"Estimation of Error Rates in Discriminant Analysis","volume":"10","author":"Lachenbruch","year":"1968","journal-title":"Technometrics"},{"key":"ref_35","first-page":"171","article-title":"A survey of forecast error measures","volume":"24","author":"Shcherbakov","year":"2013","journal-title":"World Appl. Sci. J."},{"key":"ref_36","unstructured":"Tenenhaus, M. (1998). La Regression PLS\u2014Th\u00e9orie et Pratique, Editions Technip."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"668","DOI":"10.1175\/1520-0493(1996)124<0668:TSLLJA>2.0.CO;2","article-title":"The Summertime Low-Level Jet and Marine Boundary Layer Structure along the California Coast","volume":"124","author":"Burk","year":"1996","journal-title":"Mon. Weather Rev."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1175\/1520-0450(1966)005<0573:MAPICC>2.0.CO;2","article-title":"Marine Air Penetration in Central California","volume":"5","author":"Fosberg","year":"1966","journal-title":"J. Appl. Meteorol. Climatol."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.cageo.2013.09.015","article-title":"psd: Adaptive, sine multitaper power spectral density estimation for R","volume":"63","author":"Barbour","year":"2014","journal-title":"Comput. Geosci."},{"key":"ref_40","unstructured":"R Core Team (2022). R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing."}],"container-title":["Computation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2079-3197\/11\/5\/98\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:34:09Z","timestamp":1760124849000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2079-3197\/11\/5\/98"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,12]]},"references-count":40,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2023,5]]}},"alternative-id":["computation11050098"],"URL":"https:\/\/doi.org\/10.3390\/computation11050098","relation":{},"ISSN":["2079-3197"],"issn-type":[{"type":"electronic","value":"2079-3197"}],"subject":[],"published":{"date-parts":[[2023,5,12]]}}}