{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,18]],"date-time":"2026-06-18T15:44:12Z","timestamp":1781797452476,"version":"3.54.5"},"reference-count":312,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2025,4,23]],"date-time":"2025-04-23T00:00:00Z","timestamp":1745366400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,4,23]],"date-time":"2025-04-23T00:00:00Z","timestamp":1745366400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Artif Intell Rev"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Time series forecasting is a critical task that provides key information for decision-making across various fields, such as economic planning, supply chain management, and medical diagnosis. After the use of traditional statistical methodologies and machine learning in the past, various fundamental deep learning architectures such as MLPs, CNNs, RNNs, and GNNs have been developed and applied to solve time series forecasting problems. However, the structural limitations caused by the inductive biases of each deep learning architecture constrained their performance. Transformer models, which excel at handling long-term dependencies, have become significant architectural components for time series forecasting. However, recent research has shown that alternatives such as simple linear layers can outperform Transformers. These findings have opened up new possibilities for using diverse architectures, ranging from fundamental deep learning models to emerging architectures and hybrid approaches. 
In this context of exploration into various models, the architectural modeling of time series forecasting has now entered a renaissance. This survey not only provides a historical context for time series forecasting but also offers comprehensive and timely analysis of the movement toward architectural diversification. By comparing and re-examining various deep learning models, we uncover new perspectives and present the latest trends in time series forecasting, including the emergence of hybrid models, diffusion models, Mamba models, and foundation models. By focusing on the inherent characteristics of time series data, we also address open challenges that have gained attention in time series forecasting, such as channel dependency, distribution shift, causality, and feature extraction. This survey explores vital elements that can enhance forecasting performance through diverse approaches. These contributions help lower entry barriers for newcomers by providing a systematic understanding of the diverse research areas in time series forecasting (TSF), while offering seasoned researchers broader perspectives and new opportunities through in-depth exploration of TSF challenges.<\/jats:p>","DOI":"10.1007\/s10462-025-11223-9","type":"journal-article","created":{"date-parts":[[2025,4,23]],"date-time":"2025-04-23T08:45:40Z","timestamp":1745397940000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":182,"title":["A comprehensive survey of deep learning for time series forecasting: architectural diversity and open 
challenges"],"prefix":"10.1007","volume":"58","author":[{"given":"Jongseon","family":"Kim","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hyungjoon","family":"Kim","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"HyunGi","family":"Kim","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dongjun","family":"Lee","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sungroh","family":"Yoon","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,4,23]]},"reference":[{"key":"11223_CR1","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1007\/BF00126626","volume":"6","author":"YS Abu-Mostafa","year":"1996","unstructured":"Abu-Mostafa YS, Atiya AF (1996) Introduction to financial forecasting. Appl Intell 6:205\u2013213","journal-title":"Appl Intell"},{"key":"11223_CR2","unstructured":"Achiam J, Adler S, Agarwal S, et\u00a0al (2023) Gpt-4 technical report. Preprint at arXiv:2303.08774"},{"key":"11223_CR3","doi-asserted-by":"publisher","DOI":"10.3233\/faia240677","author":"MA Ahamed","year":"2024","unstructured":"Ahamed MA, Cheng Q (2024) TimeMachine: a time series is worth 4 Mambas for long-term forecasting. IOS Press. https:\/\/doi.org\/10.3233\/faia240677 (http:\/\/dx.doi.org\/10.3233\/faia240677)","journal-title":"IOS Press"},{"key":"11223_CR4","first-page":"6596397","volume":"1","author":"DM Ahmed","year":"2022","unstructured":"Ahmed DM, Hassan MM, Mstafa RJ (2022) A review on deep sequential models for forecasting time series data. 
Appl Comput Intell Soft Comput 1:6596397","journal-title":"Appl Comput Intell Soft Comput"},{"key":"11223_CR5","unstructured":"Alcaraz JL, Strodthoff N (2023) Diffusion-based time series imputation and forecasting with structured state space models. Transactions on Machine Learning Research https:\/\/openreview.net\/forum?id=hHiIbk7ApW"},{"key":"11223_CR6","doi-asserted-by":"crossref","unstructured":"Alghamdi T, Elgazzar K, Bayoumi M, et\u00a0al (2019) Forecasting traffic congestion using arima modeling. In: 2019 15th international wireless communications & mobile computing conference (IWCMC), IEEE, pp 1227\u20131232","DOI":"10.1109\/IWCMC.2019.8766698"},{"key":"11223_CR7","unstructured":"Ansari AF, Stella L, Turkmen AC, et\u00a0al (2024) Chronos: Learning the language of time series. Transactions on Machine Learning Research https:\/\/openreview.net\/forum?id=gerNCVqqtR, Expert Certification"},{"issue":"3","key":"11223_CR8","doi-asserted-by":"crossref","first-page":"822","DOI":"10.1016\/j.ijforecast.2010.04.009","volume":"27","author":"G Athanasopoulos","year":"2011","unstructured":"Athanasopoulos G, Hyndman RJ, Song H et al (2011) The tourism forecasting competition. Int J Forecast 27(3):822\u2013844","journal-title":"Int J Forecast"},{"key":"11223_CR9","unstructured":"Bai S, Kolter JZ, Koltun V (2018) An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. Preprint at arXiv:1803.01271"},{"issue":"3","key":"11223_CR10","doi-asserted-by":"crossref","first-page":"452","DOI":"10.1016\/j.ygyno.2013.06.009","volume":"130","author":"JN Barlin","year":"2013","unstructured":"Barlin JN, Zhou Q, Clair CMS et al (2013) Classification and regression tree (cart) analysis of endometrial carcinoma: seeing the forest for the trees. 
Gynecol Oncol 130(3):452\u2013456","journal-title":"Gynecol Oncol"},{"key":"11223_CR11","unstructured":"Behrouz A, Santacatterina M, Zabih R (2024) Chimera: Effectively modeling multivariate time series with 2-dimensional state space models. In: The Thirty-eighth Annual Conference on Neural Information Processing Systems, https:\/\/openreview.net\/forum?id=ncYGjx2vnE"},{"key":"11223_CR12","unstructured":"Beltagy I, Peters ME, Cohan A (2020) Longformer: The long-document transformer. Preprint at arXiv:2004.05150"},{"issue":"6","key":"11223_CR13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3533382","volume":"55","author":"K Benidis","year":"2022","unstructured":"Benidis K, Rangapuram SS, Flunkert V et al (2022) Deep learning for time series forecasting: Tutorial and literature survey. ACM Comput Surv 55(6):1\u201336","journal-title":"ACM Comput Surv"},{"key":"11223_CR14","first-page":"30518","volume":"36","author":"S Bergsma","year":"2023","unstructured":"Bergsma S, Zeyl T, Guo L (2023) Sutranets: sub-series autoregressive networks for long-sequence, probabilistic forecasting. Adv Neural Inf Process Syst 36:30518\u201330533","journal-title":"Adv Neural Inf Process Syst"},{"issue":"75","key":"11223_CR15","first-page":"1970","volume":"3","author":"GE Box","year":"1970","unstructured":"Box GE, Jenkins GM, Reinsel G et al (1970) Forecasting and control. Time Series. Analysis 3(75):1970","journal-title":"Analysis"},{"key":"11223_CR16","first-page":"123","volume":"24","author":"L Breiman","year":"1996","unstructured":"Breiman L (1996) Bagging predictors. Mach Learning 24:123\u2013140","journal-title":"Mach Learning"},{"key":"11223_CR17","unstructured":"Brown RG (1959) Statistical forecasting for inventory control."},{"key":"11223_CR18","first-page":"1877","volume":"33","author":"T Brown","year":"2020","unstructured":"Brown T, Mann B, Ryder N et al (2020) Language models are few-shot learners. 
Adv Neural Inf Process Syst 33:1877\u20131901","journal-title":"Adv Neural Inf Process Syst"},{"key":"11223_CR19","doi-asserted-by":"crossref","unstructured":"Cai W, Liang Y, Liu X, et\u00a0al (2024a) Msgnet: Learning multi-scale inter-series correlations for multivariate time series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 11141\u201311149","DOI":"10.1609\/aaai.v38i10.28991"},{"key":"11223_CR20","unstructured":"Cai W, Wang K, Wu H, et\u00a0al (2024b) Forecastgrapher: Redefining multivariate time series forecasting with graph neural networks. Preprint at arXiv:2405.18036"},{"key":"11223_CR21","unstructured":"Cai X, Zhu Y, Wang X, et\u00a0al (2024c) Mambats: Improved selective state space models for long-term time series forecasting. Preprint at arXiv:2405.16440"},{"key":"11223_CR22","doi-asserted-by":"crossref","unstructured":"Cao H, Tan C, Gao Z, et\u00a0al (2024) A survey on generative diffusion models. IEEE Transactions on Knowledge and Data Engineering","DOI":"10.1109\/TKDE.2024.3361474"},{"key":"11223_CR23","unstructured":"Center JMK (2020) Dominick\u2019s dataset"},{"key":"11223_CR24","doi-asserted-by":"crossref","unstructured":"Challu C, Olivares KG, Oreshkin BN, et\u00a0al (2023) Nhits: Neural hierarchical interpolation for time series forecasting. In: Proceedings of the AAAI conference on artificial intelligence, pp 6989\u20136997","DOI":"10.1609\/aaai.v37i6.25854"},{"key":"11223_CR25","unstructured":"Chang S, Zhang Y, Han W, et\u00a0al (2017) Dilated recurrent neural networks. Adv Neural Inform Process Syst 30"},{"issue":"7","key":"11223_CR26","doi-asserted-by":"crossref","first-page":"6950","DOI":"10.1109\/TITS.2021.3065404","volume":"23","author":"P Chen","year":"2021","unstructured":"Chen P, Fu X, Wang X (2021) A graph convolutional stacked bidirectional unidirectional-LSTM neural network for metro ridership prediction. 
IEEE Trans Intell Transp Syst 23(7):6950\u20136962","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"11223_CR27","doi-asserted-by":"crossref","first-page":"476","DOI":"10.1016\/j.ins.2020.06.026","volume":"542","author":"Y Chen","year":"2021","unstructured":"Chen Y, Zou X, Li K et al (2021) Multiple local 3D CNNs for region-based prediction in smart cities. Inf Sci 542:476\u2013491","journal-title":"Inf Sci"},{"issue":"1","key":"11223_CR28","doi-asserted-by":"crossref","first-page":"4512","DOI":"10.1038\/s41467-022-31384-3","volume":"13","author":"H Chen","year":"2022","unstructured":"Chen H, Lundberg SM, Lee SI (2022) Explaining a series of models by propagating Shapley values. Nat Commun 13(1):4512","journal-title":"Nat Commun"},{"key":"11223_CR29","volume":"97","author":"Z Chen","year":"2023","unstructured":"Chen Z, Ma M, Li T et al (2023) Long sequence time-series forecasting with deep learning: a survey. Inform Fusion 97:101819","journal-title":"Inform Fusion"},{"key":"11223_CR30","unstructured":"Chen K, Chen G, Xu D, et\u00a0al (2021a) Nast: Non-autoregressive spatial-temporal transformer for time series forecasting. Preprint at arXiv:2102.05624"},{"key":"11223_CR31","unstructured":"Cheng X, Chen X, Li S, et\u00a0al (2024c) Leveraging 2D information for long-term time series forecasting with vanilla transformers. Preprint at arXiv:2405.13810"},{"key":"11223_CR32","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin C (2016) Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 785\u2013794","DOI":"10.1145\/2939672.2939785"},{"key":"11223_CR33","unstructured":"Cheng H, Wen Q, Liu Y, et\u00a0al (2024a) Robusttsf: Towards theory and design of robust time series forecasting with anomalies. 
ICLR"},{"key":"11223_CR34","unstructured":"Cheng M, Yang J, Pan T, et\u00a0al (2024b) Convtimenet: A deep hierarchical fully convolutional model for multivariate time series analysis. Preprint at arXiv:2403.01493"},{"key":"11223_CR35","unstructured":"Chen J, Lenssen JE, Feng A, et\u00a0al (2024a) From similarity to superiority: Channel clustering for time series forecasting. In: The Thirty-eighth Annual Conference on Neural Information Processing Systems, https:\/\/openreview.net\/forum?id=MDgn9aazo0"},{"key":"11223_CR36","unstructured":"Chen X, Li X, Liu B, et\u00a0al (2024c) Biased temporal convolution graph network for time series forecasting with missing values. In: The Twelfth International Conference on Learning Representations"},{"key":"11223_CR37","unstructured":"Chen Y, Ren K, Wang Y, et\u00a0al (2024d) Contiformer: Continuous-time transformer for irregular time series modeling. Adv Neural Inform Process Syst 36"},{"key":"11223_CR38","unstructured":"Chen P, ZHANG Y, Cheng Y, et\u00a0al (2024b) Pathformer: Multi-scale transformers with adaptive pathways for time series forecasting. In: The Twelfth International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=lJkOCMP2aW"},{"key":"11223_CR39","volume":"7","author":"D Chicco","year":"2021","unstructured":"Chicco D, Warrens MJ, Jurman G (2021) The coefficient of determination r-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. Peer J Comput Sci 7:e623","journal-title":"Peer J Comput Sci"},{"key":"11223_CR40","doi-asserted-by":"publisher","unstructured":"Cho K, van Merri\u00ebnboer B, Gulcehre C, et\u00a0al (2014) Learning phrase representations using RNN encoder\u2013decoder for statistical machine translation. In: Moschitti A, Pang B, Daelemans W (eds) Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 
Association for Computational Linguistics, Doha, Qatar, pp 1724\u20131734, https:\/\/doi.org\/10.3115\/v1\/D14-1179, https:\/\/aclanthology.org\/D14-1179\/","DOI":"10.3115\/v1\/D14-1179"},{"key":"11223_CR41","unstructured":"Clark K, Luong MT, Le QV, et\u00a0al (2020) Electra: Pre-training text encoders as discriminators rather than generators. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=r1xMH1BtvB"},{"key":"11223_CR42","doi-asserted-by":"crossref","unstructured":"Cortes C (1995) Support-vector networks. Machine Learning","DOI":"10.1007\/BF00994018"},{"key":"11223_CR43","doi-asserted-by":"crossref","unstructured":"Coskunuzer B, Segovia-Dominguez I, Chen Y, et\u00a0al (2024) Time-aware knowledge representations of dynamic objects with multidimensional persistence. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 11678\u201311686","DOI":"10.1609\/aaai.v38i10.29051"},{"key":"11223_CR44","unstructured":"Crabb\u00e9 J, Huynh N, Stanczuk JP, et\u00a0al (2024) Time series diffusion in the frequency domain. In: Forty-first International Conference on Machine Learning, https:\/\/openreview.net\/forum?id=W9GaJUVLCT"},{"key":"11223_CR45","unstructured":"Cryer JD (1986) Time series analysis, vol 286. Duxbury Press Boston"},{"key":"11223_CR46","doi-asserted-by":"publisher","unstructured":"Dai Z, Yang Z, Yang Y, et\u00a0al (2019) Transformer-XL: Attentive language models beyond a fixed-length context. In: Korhonen A, Traum D, M\u00e0rquez L (eds) Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 
Association for Computational Linguistics, Florence, Italy, pp 2978\u20132988, https:\/\/doi.org\/10.18653\/v1\/P19-1285, https:\/\/aclanthology.org\/P19-1285","DOI":"10.18653\/v1\/P19-1285"},{"issue":"1","key":"11223_CR47","doi-asserted-by":"publisher","first-page":"204","DOI":"10.1016\/j.ijpe.2010.09.006","volume":"131","author":"P Danese","year":"2011","unstructured":"Danese P, Kalchschmidt M (2011) The role of the forecasting process in improving forecast accuracy and operational performance. Int J Prod Econ 131(1):204\u2013214. https:\/\/doi.org\/10.1016\/j.ijpe.2010.09.006 (https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0925527310003282, innsbruck 2008)","journal-title":"Int J Prod Econ"},{"key":"11223_CR48","unstructured":"Das A, Kong W, Sen R, et\u00a0al (2024) A decoder-only foundation model for time-series forecasting. In: Forty-first International Conference on Machine Learning, https:\/\/openreview.net\/forum?id=jn2iTJas6h"},{"key":"11223_CR49","doi-asserted-by":"publisher","unstructured":"Devlin J, Chang MW, Lee K, et\u00a0al (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. In: Burstein J, Doran C, Solorio T (eds) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, pp 4171\u20134186, https:\/\/doi.org\/10.18653\/v1\/N19-1423, https:\/\/aclanthology.org\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"key":"11223_CR50","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s12040-020-01408-x","volume":"129","author":"T Dimri","year":"2020","unstructured":"Dimri T, Ahmad S, Sharif M (2020) Time series analysis of climate variables using seasonal ARIMA approach. 
J Earth Syst Sci 129:1\u201316","journal-title":"J Earth Syst Sci"},{"key":"11223_CR51","unstructured":"Dosovitskiy A, Beyer L, Kolesnikov A, et\u00a0al (2021) An image is worth 16x16 words: Transformers for image recognition at scale. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=YicbFdNTTy"},{"issue":"3","key":"11223_CR52","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1109\/TSP.2013.2288675","volume":"62","author":"K Dragomiretskiy","year":"2013","unstructured":"Dragomiretskiy K, Zosso D (2013) Variational mode decomposition. IEEE Trans Signal Process 62(3):531\u2013544","journal-title":"IEEE Trans Signal Process"},{"key":"11223_CR53","unstructured":"Dubey A, Jauhri A, Pandey A, et\u00a0al (2024) The llama 3 herd of models. Preprint at arXiv:2407.21783"},{"key":"11223_CR54","doi-asserted-by":"crossref","unstructured":"Dudek G (2024) Stacking for probabilistic short-term load forecasting. In: International Conference on Computational Science, Springer, pp 3\u201318","DOI":"10.1007\/978-3-031-63751-3_1"},{"key":"11223_CR55","unstructured":"Ekambaram V, Jati A, Dayama P, et\u00a0al (2024) Tiny time mixers (ttms): Fast pre-trained models for enhanced zero\/few-shot forecasting of multivariate time series. CoRR"},{"key":"11223_CR56","doi-asserted-by":"crossref","unstructured":"Ekambaram V, Jati A, Nguyen N, et\u00a0al (2023) TSMIXER: Lightweight MLP-mixer model for multivariate time series forecasting. In: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp 459\u2013469","DOI":"10.1145\/3580305.3599533"},{"key":"11223_CR57","doi-asserted-by":"crossref","unstructured":"Eldele E, Ragab M, Chen Z, et\u00a0al (2021) Time-series representation learning via temporal and contextual contrasting. 
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21)","DOI":"10.24963\/ijcai.2021\/324"},{"key":"11223_CR58","unstructured":"Eldele E, Ragab M, Chen Z, et\u00a0al (2024) Tslanet: Rethinking transformers for time series representation learning. International Conference on Machine Learning"},{"key":"11223_CR59","doi-asserted-by":"crossref","unstructured":"Fan W, Wang P, Wang D, et\u00a0al (2023) DISH-TS: a general paradigm for alleviating distribution shift in time series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 7522\u20137529","DOI":"10.1609\/aaai.v37i6.25914"},{"key":"11223_CR60","unstructured":"Fan X, Wu Y, Xu C, et\u00a0al (2024b) MG-TSD: Multi-granularity time series diffusion models with guided learning process. In: The Twelfth International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=CZiY6OLktd"},{"key":"11223_CR61","doi-asserted-by":"crossref","unstructured":"Fan H, Xiong B, Mangalam K, et\u00a0al (2021) Multiscale vision transformers. In: Proceedings of the IEEE\/CVF international conference on computer vision, pp 6824\u20136835","DOI":"10.1109\/ICCV48922.2021.00675"},{"key":"11223_CR62","doi-asserted-by":"crossref","unstructured":"Fan W, Yi K, Ye H, et\u00a0al (2024a) Deep frequency derivative learning for non-stationary time series forecasting. IJCAI","DOI":"10.24963\/ijcai.2024\/436"},{"key":"11223_CR63","volume":"240","author":"R Feng","year":"2024","unstructured":"Feng R, Chen M, Song Y (2024) Learning traffic as videos: short-term traffic flow prediction using mixed-pointwise convolution and channel attention mechanism. Expert Syst Appl 240:122468","journal-title":"Expert Syst Appl"},{"key":"11223_CR64","doi-asserted-by":"crossref","unstructured":"Feng S, Miao C, Zhang Z, et\u00a0al (2024b) Latent diffusion transformer for probabilistic time series forecasting. 
In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 11979\u201311987","DOI":"10.1609\/aaai.v38i11.29085"},{"issue":"1","key":"11223_CR65","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1006\/jcss.1997.1504","volume":"55","author":"Y Freund","year":"1997","unstructured":"Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119\u2013139","journal-title":"J Comput Syst Sci"},{"key":"11223_CR66","doi-asserted-by":"crossref","unstructured":"Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Annals of statistics pp 1189\u20131232","DOI":"10.1214\/aos\/1013203451"},{"key":"11223_CR67","unstructured":"Fu DY, Dao T, Saab KK, et\u00a0al (2023) Hungry hungry hippos: Towards language modeling with state space models. In The International Conference on Learning Representations (ICLR)"},{"issue":"4","key":"11223_CR68","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1007\/BF00344251","volume":"36","author":"K Fukushima","year":"1980","unstructured":"Fukushima K (1980) Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36(4):193\u2013202","journal-title":"Biol Cybern"},{"issue":"1","key":"11223_CR69","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pdig.0000417","volume":"3","author":"J Gallifant","year":"2024","unstructured":"Gallifant J, Fiske A, Levites Strekalova YA et al (2024) Peer review of GPT-4 technical report and systems card. PLOS Digital Health 3(1):e0000417","journal-title":"PLOS Digital Health"},{"key":"11223_CR70","doi-asserted-by":"crossref","DOI":"10.1016\/j.engappai.2022.105151","volume":"115","author":"MA Ganaie","year":"2022","unstructured":"Ganaie MA, Hu M, Malik AK et al (2022) Ensemble deep learning: a review. 
Eng Appl Artif Intell 115:105151","journal-title":"Eng Appl Artif Intell"},{"key":"11223_CR71","volume":"56","author":"MV Garc\u00eda","year":"2020","unstructured":"Garc\u00eda MV, Aznarte JL (2020) Shapley additive explanations for no2 forecasting. Eco Inform 56:101039","journal-title":"Eco Inform"},{"key":"11223_CR82","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G, et\u00a0al (2021a) Solar power dataset (4 seconds observations). https:\/\/doi.org\/10.5281\/zenodo.4656027","DOI":"10.5281\/zenodo.4656027"},{"key":"11223_CR73","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021b) Wind farms dataset (with missing values). https:\/\/doi.org\/10.5281\/zenodo.4654909","DOI":"10.5281\/zenodo.4654909"},{"key":"11223_CR74","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021c) Wind farms dataset (without missing values). https:\/\/doi.org\/10.5281\/zenodo.4654858","DOI":"10.5281\/zenodo.4654858"},{"key":"11223_CR75","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021e) Australian electricity demand dataset. https:\/\/doi.org\/10.5281\/zenodo.4659727","DOI":"10.5281\/zenodo.4659727"},{"key":"11223_CR76","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021f) Bitcoin dataset with missing values. https:\/\/doi.org\/10.5281\/zenodo.5121965","DOI":"10.5281\/zenodo.5121965"},{"key":"11223_CR77","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021g) Bitcoin dataset without missing values. https:\/\/doi.org\/10.5281\/zenodo.5122101","DOI":"10.5281\/zenodo.5122101"},{"key":"11223_CR78","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021h) Rideshare dataset with missing values. 
https:\/\/doi.org\/10.5281\/zenodo.5122114","DOI":"10.5281\/zenodo.5122114"},{"key":"11223_CR79","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021i) Rideshare dataset without missing values. https:\/\/doi.org\/10.5281\/zenodo.5122232","DOI":"10.5281\/zenodo.5122232"},{"key":"11223_CR80","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021j) Temperature rain dataset with missing values. https:\/\/doi.org\/10.5281\/zenodo.5129073","DOI":"10.5281\/zenodo.5129073"},{"key":"11223_CR81","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G et al (2021k) Temperature rain dataset without missing values. https:\/\/doi.org\/10.5281\/zenodo.5129091","DOI":"10.5281\/zenodo.5129091"},{"key":"11223_CR72","unstructured":"Godahewa RW, Bergmeir C, Webb GI, et\u00a0al (2021l) Monash time series forecasting archive. In: Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2), https:\/\/openreview.net\/forum?id=wEc1mgAjU-"},{"key":"11223_CR83","doi-asserted-by":"publisher","unstructured":"Godahewa R, Bergmeir C, Webb G, et\u00a0al (2021d) Wind power dataset (4 seconds observations). https:\/\/doi.org\/10.5281\/zenodo.4656032","DOI":"10.5281\/zenodo.4656032"},{"key":"11223_CR84","unstructured":"Gong Z, Tang Y, Liang J (2023) Patchmixer: A patch-mixing architecture for long-term time series forecasting. Preprint at arXiv:2310.00655"},{"key":"11223_CR85","doi-asserted-by":"crossref","unstructured":"Granger CW (1969) Investigating causal relations by econometric models and cross-spectral methods. 
Econometrica: J Econom Soc 424\u2013438","DOI":"10.2307\/1912791"},{"issue":"4","key":"11223_CR86","doi-asserted-by":"crossref","first-page":"3473","DOI":"10.1007\/s10462-022-10256-8","volume":"56","author":"M Graziani","year":"2023","unstructured":"Graziani M, Dutkiewicz L, Calvaresi D et al (2023) A global taxonomy of interpretable AI: unifying the terminology for the technical and social sciences. Artif Intell Rev 56(4):3473\u20133504","journal-title":"Artif Intell Rev"},{"key":"11223_CR87","unstructured":"Gruver N, Finzi MA, Qiu S, et\u00a0al (2023) Large language models are zero-shot time series forecasters. In: Thirty-seventh Conference on Neural Information Processing Systems, https:\/\/openreview.net\/forum?id=md68e8iZK1"},{"key":"11223_CR88","first-page":"572","volume":"34","author":"A Gu","year":"2021","unstructured":"Gu A, Johnson I, Goel K et al (2021) Combining recurrent, convolutional, and continuous-time models with linear state space layers. Adv Neural Inf Process Syst 34:572\u2013585","journal-title":"Adv Neural Inf Process Syst"},{"key":"11223_CR89","unstructured":"Gu A, Dao T (2024) Mamba: Linear-time sequence modeling with selective state spaces. In: First Conference on Language Modeling, https:\/\/openreview.net\/forum?id=tEYskw1VY2"},{"key":"11223_CR90","unstructured":"Gu A, Goel K, Re C (2022) Efficiently modeling long sequences with structured state spaces. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=uYLFoz1vlAC"},{"issue":"1","key":"11223_CR91","doi-asserted-by":"crossref","first-page":"315","DOI":"10.3390\/forecast5010017","volume":"5","author":"Y Hahn","year":"2023","unstructured":"Hahn Y, Langer T, Meyes R et al (2023) Time series dataset survey for forecasting with deep learning. 
Forecasting 5(1):315\u2013335","journal-title":"Forecasting"},{"issue":"1","key":"11223_CR92","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1109\/MCI.2021.3129960","volume":"17","author":"R Hamon","year":"2022","unstructured":"Hamon R, Junklewitz H, Sanchez I et al (2022) Bridging the gap between AI and explainability in the GDPR: towards trustworthiness-by-design in automated decision-making. IEEE Comput Intell Mag 17(1):72\u201385","journal-title":"IEEE Comput Intell Mag"},{"issue":"5","key":"11223_CR93","volume":"44","author":"H Han","year":"2023","unstructured":"Han H, Park S, Min S et al (2023) Improving generalization performance of electrocardiogram classification models. Physiol Meas 44(5):054003","journal-title":"Physiol Meas"},{"key":"11223_CR94","unstructured":"Han L, Chen XY, Ye HJ, et\u00a0al (2024) SOFTS: Efficient multivariate time series forecasting with series-core fusion. In: The Thirty-eighth Annual Conference on Neural Information Processing Systems, https:\/\/openreview.net\/forum?id=89AUi5L1uA"},{"key":"11223_CR95","unstructured":"Han H, Kim S, Choi HS, et\u00a0al (2023a) On the impact of knowledge distillation for model interpretability. In: Proceedings of the 40th International Conference on Machine Learning. JMLR.org, ICML\u201923"},{"key":"11223_CR96","unstructured":"Hasson H, Maddix DC, Wang B, et\u00a0al (2023) Theoretical guarantees of learning ensembling strategies with applications to time series forecasting. In: International Conference on Machine Learning, PMLR, pp 12616\u201312632"},{"key":"11223_CR97","first-page":"6840","volume":"33","author":"J Ho","year":"2020","unstructured":"Ho J, Jain A, Abbeel P (2020) Denoising diffusion probabilistic models. 
Adv Neural Inf Process Syst 33:6840\u20136851","journal-title":"Adv Neural Inf Process Syst"},{"issue":"8","key":"11223_CR98","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735\u20131780","journal-title":"Neural Comput"},{"key":"11223_CR99","unstructured":"Holt CC (1957) Forecasting trends and seasonals by exponentially weighted averages. Carnegie institute of technology. Pittsburgh ONR memorandum"},{"issue":"8","key":"11223_CR100","doi-asserted-by":"crossref","first-page":"2554","DOI":"10.1073\/pnas.79.8.2554","volume":"79","author":"JJ Hopfield","year":"1982","unstructured":"Hopfield JJ (1982) Neural networks and physical systems with emergent collective computational abilities. Proc Natl Acad Sci 79(8):2554\u20132558","journal-title":"Proc Natl Acad Sci"},{"key":"11223_CR101","unstructured":"Hounie I, Porras-Valenzuela J, Ribeiro A (2024) Transformers with loss shaping constraints for long-term time series forecasting. In: Forty-first International Conference on Machine Learning"},{"key":"11223_CR102","unstructured":"Hou H, Yu FR (2024) Rwkv-ts: Beyond traditional recurrent neural network for time series tasks. Preprint at arXiv:2401.09093"},{"key":"11223_CR103","doi-asserted-by":"crossref","unstructured":"Huang NE, Shen Z, Long SR et al (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc Lond Ser A 454(1971):903\u2013995","DOI":"10.1098\/rspa.1998.0193"},{"key":"11223_CR104","volume":"238","author":"S Huang","year":"2024","unstructured":"Huang S, Liu Y, Zhang F et al (2024) Crosswavenet: a dual-channel network with deep cross-decomposition for long-term time series forecasting. 
Expert Syst Appl 238:121642","journal-title":"Expert Syst Appl"},{"key":"11223_CR105","doi-asserted-by":"crossref","unstructured":"Huang Q, Shen L, Zhang R, et\u00a0al (2024a) HDMIXER: Hierarchical dependency with extendable patch for multivariate time series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 12608\u201312616","DOI":"10.1609\/aaai.v38i11.29155"},{"key":"11223_CR106","doi-asserted-by":"crossref","unstructured":"Huber PJ (1992) Robust estimation of a location parameter. In: Breakthroughs in statistics: Methodology and distribution. Springer, pp 492\u2013518","DOI":"10.1007\/978-1-4612-4380-9_35"},{"key":"11223_CR107","unstructured":"Hu J, Lan D, Zhou Z, et\u00a0al (2024a) Time-ssm: Simplifying and unifying state space models for time series forecasting. CoRR abs\/2405.16312. https:\/\/doi.org\/10.48550\/arXiv.2405.16312"},{"key":"11223_CR108","unstructured":"Hu Y, Liu P, Zhu P, et\u00a0al (2024b) Adaptive multi-scale decomposition framework for time series forecasting. CoRR abs\/2406.03751. https:\/\/doi.org\/10.48550\/arXiv.2406.03751"},{"key":"11223_CR109","unstructured":"Ilbert R, Odonnat A, Feofanov V, et\u00a0al (2024) SAMformer: Unlocking the potential of transformers in time series forecasting with sharpness-aware minimization and channel-wise attention. In: Salakhutdinov R, Kolter Z, Heller K, et\u00a0al (eds) Proceedings of the 41st International Conference on Machine Learning, Proceedings of Machine Learning Research, vol 235. PMLR, pp 20924\u201320954, https:\/\/proceedings.mlr.press\/v235\/ilbert24a.html"},{"key":"11223_CR110","unstructured":"Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift. 
In: International conference on machine learning, pmlr, pp 448\u2013456"},{"key":"11223_CR111","doi-asserted-by":"crossref","unstructured":"Jhin SY, Kim S, Park N (2024) Addressing prediction delays in time series forecasting: A continuous gru approach with derivative regularization. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp 1234\u20131245","DOI":"10.1145\/3637528.3671969"},{"key":"11223_CR112","unstructured":"Jia Y, Lin Y, Hao X, et\u00a0al (2024) Witran: Water-wave information transmission and recurrent acceleration network for long-range time series forecasting. Adv Neural Inform Process Syst 36"},{"key":"11223_CR113","unstructured":"Jin M, Wang S, Ma L, et\u00a0al (2024) Time-LLM: Time series forecasting by reprogramming large language models. In: International Conference on Learning Representations (ICLR)"},{"key":"11223_CR114","unstructured":"Jin M, Wen Q, Liang Y, et\u00a0al (2023) Large models for time series and Spatio-temporal data: a survey and outlook. Preprint at arXiv:2310.10196"},{"key":"11223_CR115","doi-asserted-by":"crossref","unstructured":"Jung D, Lee J, Yi J, et\u00a0al (2020) ICAPS: an interpretable classifier via disentangled capsule networks. In: European Conference on Computer Vision, Springer, pp 314\u2013330","DOI":"10.1007\/978-3-030-58529-7_19"},{"issue":"1","key":"11223_CR116","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1115\/1.3662552","volume":"82","author":"RE Kalman","year":"1960","unstructured":"Kalman RE (1960) A new approach to linear filtering and prediction problems. Transact ASME-J Basic Eng 82(1):35\u201345","journal-title":"Transact ASME-J Basic Eng"},{"key":"11223_CR117","unstructured":"Kang BG, Lee D, Kim H, et\u00a0al (2024) Introducing spectral attention for long-range dependency in time series forecasting. 
Adv Neural Inform Process Syst"},{"key":"11223_CR118","unstructured":"Ke G, Meng Q, Finley T, et\u00a0al (2017) LIGHTGBM: A highly efficient gradient boosting decision tree. Adv Neural Inform Process Syst 30"},{"issue":"2","key":"11223_CR119","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1109\/TITS.2011.2106209","volume":"12","author":"A Khosravi","year":"2011","unstructured":"Khosravi A, Mazloumi E, Nahavandi S et al (2011) Prediction intervals to account for uncertainties in travel time prediction. IEEE Trans Intell Transp Syst 12(2):537\u2013547","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"11223_CR120","volume":"203","author":"D Kim","year":"2022","unstructured":"Kim D, Baek JG (2022) Bagging ensemble-based novel data generation method for univariate time series forecasting. Expert Syst Appl 203:117366","journal-title":"Expert Syst Appl"},{"key":"11223_CR121","unstructured":"Kim T, Kim J, Tae Y, et\u00a0al (2021) Reversible instance normalization for accurate time-series forecasting against distribution shift. In: International Conference on Learning Representations"},{"key":"11223_CR122","unstructured":"Kim D, Park J, Lee J, et\u00a0al (2024) Are self-attentions effective for time series forecasting? In: The Thirty-eighth Annual Conference on Neural Information Processing Systems, https:\/\/openreview.net\/forum?id=iN43sJoib7"},{"key":"11223_CR123","doi-asserted-by":"publisher","unstructured":"Kim S, Yi J, Kim E, et\u00a0al (2020) Interpretation of NLP models through input marginalization. In: Webber B, Cohn T, He Y, et\u00a0al (eds) Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 
Association for Computational Linguistics, Online, pp 3154\u20133167, https:\/\/doi.org\/10.18653\/v1\/2020.emnlp-main.255, https:\/\/aclanthology.org\/2020.emnlp-main.255\/","DOI":"10.18653\/v1\/2020.emnlp-main.255"},{"key":"11223_CR124","unstructured":"Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=SJU4ayYgl"},{"key":"11223_CR125","unstructured":"Kitaev N, Kaiser L, Levskaya A (2020) Reformer: The efficient transformer. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=rkgNKkHtvB"},{"key":"11223_CR126","doi-asserted-by":"crossref","unstructured":"Koenker R, Bassett\u00a0Jr G (1978) Regression quantiles. Econometrica: J Econom Soc 33\u201350","DOI":"10.2307\/1913643"},{"key":"11223_CR127","unstructured":"Kollovieh M, Ansari AF, Bohlke-Schneider M, et\u00a0al (2024) Predict, refine, synthesize: Self-guiding diffusion models for probabilistic time series forecasting. Adv Neural Inform Process Syst 36"},{"key":"11223_CR128","unstructured":"Kong Z, Ping W, Huang J, et\u00a0al (2021) Diffwave: A versatile diffusion model for audio synthesis. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=a-xFK8Ymz5J"},{"issue":"8","key":"11223_CR129","doi-asserted-by":"crossref","first-page":"255","DOI":"10.3390\/fi15080255","volume":"15","author":"VI Kontopoulou","year":"2023","unstructured":"Kontopoulou VI, Panagopoulos AD, Kakkos I et al (2023) A review of Arima vs. machine learning approaches for time series forecasting in data driven networks. 
Future Internet 15(8):255","journal-title":"Future Internet"},{"issue":"11","key":"11223_CR130","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","volume":"86","author":"Y LeCun","year":"1998","unstructured":"LeCun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278\u20132324","journal-title":"Proc IEEE"},{"key":"11223_CR131","unstructured":"Liang A, Jiang X, Sun Y, et\u00a0al (2024a) Bi-mamba4ts: Bidirectional mamba for time series forecasting. Preprint at arXiv:2404.15772"},{"key":"11223_CR132","doi-asserted-by":"crossref","unstructured":"Liang Y, Wen H, Nie Y, et\u00a0al (2024c) Foundation models for time series analysis: A tutorial and survey. Preprint at arXiv:2403.14735","DOI":"10.1145\/3637528.3671451"},{"key":"11223_CR133","unstructured":"Liang D, Zhang H, Yuan D, et\u00a0al (2024b) Minusformer: Improving time series forecasting by progressively learning residuals. Preprint at arXiv:2402.02332"},{"key":"11223_CR134","unstructured":"Li Y, Chen W, Hu X, et\u00a0al (2024b) Transformer-modulated diffusion models for probabilistic multivariate time series forecasting. In: The Twelfth International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=qae04YACHs"},{"key":"11223_CR135","unstructured":"Li S, Jin X, Xuan Y, et\u00a0al (2019) Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting. Adv Neural Inform Process Syst 32"},{"key":"11223_CR136","unstructured":"Li J, Li D, Savarese S, et\u00a0al (2023) Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models. 
In: International conference on machine learning, PMLR, pp 19730\u201319742"},{"issue":"2194","key":"11223_CR137","doi-asserted-by":"crossref","first-page":"20200209","DOI":"10.1098\/rsta.2020.0209","volume":"379","author":"B Lim","year":"2021","unstructured":"Lim B, Zohren S (2021) Time-series forecasting with deep learning: a survey. Phil Trans R Soc A 379(2194):20200209","journal-title":"Phil Trans R Soc A"},{"issue":"4","key":"11223_CR138","doi-asserted-by":"crossref","first-page":"1748","DOI":"10.1016\/j.ijforecast.2021.03.012","volume":"37","author":"B Lim","year":"2021","unstructured":"Lim B, Ar\u0131k S\u00d6, Loeff N et al (2021) Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int J Forecast 37(4):1748\u20131764","journal-title":"Int J Forecast"},{"issue":"1","key":"11223_CR139","first-page":"44","volume":"1","author":"K Lin","year":"2022","unstructured":"Lin K, Huang C (2022) Ensemble learning applications in multiple industries: a review. Inf Dyn Appl 1(1):44\u201358","journal-title":"Inf Dyn Appl"},{"issue":"1","key":"11223_CR140","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1631\/FITEE.2300310","volume":"25","author":"L Lin","year":"2024","unstructured":"Lin L, Li Z, Li R et al (2024) Diffusion models for time-series applications: a survey. Front Inform Technol Electron Eng 25(1):19\u201341","journal-title":"Front Inform Technol Electron Eng"},{"key":"11223_CR141","unstructured":"Lin S, Lin W, Wu W et al (2024) SPARSETSF: Modeling long-term time series forecasting with 1k parameters"},{"key":"11223_CR142","unstructured":"Lin S, Lin W, Wu W, et\u00a0al (2024c) PETformer: Long-term time series forecasting via placeholder-enhanced transformer. 
https:\/\/openreview.net\/forum?id=u3RJbzzBZj"},{"issue":"3","key":"11223_CR143","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1145\/3236386.3241340","volume":"16","author":"ZC Lipton","year":"2018","unstructured":"Lipton ZC (2018) The mythos of model interpretability: in machine learning, the concept of interpretability is both important and slippery. Queue 16(3):31\u201357","journal-title":"Queue"},{"key":"11223_CR144","unstructured":"Li Z, Qin Y, Cheng X, et\u00a0al (2024d) Ftmixer: Frequency and time domain representations fusion for time series modeling. Preprint at arXiv:2405.15256"},{"key":"11223_CR145","doi-asserted-by":"crossref","first-page":"91896","DOI":"10.1109\/ACCESS.2021.3091162","volume":"9","author":"Z Liu","year":"2021","unstructured":"Liu Z, Zhu Z, Gao J et al (2021) Forecast methods for time series data: a survey. IEEE Access 9:91896\u201391912","journal-title":"IEEE Access"},{"key":"11223_CR146","first-page":"9881","volume":"35","author":"Y Liu","year":"2022","unstructured":"Liu Y, Wu H, Wang J et al (2022) Non-stationary transformers: exploring the stationarity in time series forecasting. Adv Neural Inf Process Syst 35:9881\u20139893","journal-title":"Adv Neural Inf Process Syst"},{"key":"11223_CR147","unstructured":"Liu Z, Cheng M, Li Z, et\u00a0al (2024e) Adaptive normalization for non-stationary time series forecasting: a temporal slice perspective. Adv Neural Inform Process Syst 36"},{"key":"11223_CR148","unstructured":"Liu Y, Hu T, Zhang H, et\u00a0al (2024c) itransformer: Inverted transformers are effective for time series forecasting. In: The Twelfth International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=JePfAI8fah"},{"key":"11223_CR149","unstructured":"Liu J, Liu C, Woo G, et\u00a0al (2024b) Unitst: Effectively modeling inter-series and intra-series dependencies for multivariate time series forecasting. 
Preprint at arXiv:2406.04975"},{"key":"11223_CR150","unstructured":"Liu Y, Li C, Wang J, et\u00a0al (2024d) Koopa: Learning non-stationary time series dynamics with Koopman predictors. Adv Neural Inform Process Syst 36"},{"key":"11223_CR151","unstructured":"Liu H, Li C, Wu Q, et\u00a0al (2024a) Visual instruction tuning. Adv Neural Inform Process Syst 36"},{"key":"11223_CR152","unstructured":"Liu S, Yu H, Liao C, et\u00a0al (2021a) Pyraformer: Low-complexity pyramidal attention for long-range time series modeling and forecasting. In: International conference on learning representations"},{"key":"11223_CR153","doi-asserted-by":"crossref","unstructured":"Li Y, Xu J, Anastasiu D (2024c) Learning from polar representation: An extreme-adaptive model for long-term time series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 171\u2013179","DOI":"10.1609\/aaai.v38i1.27768"},{"key":"11223_CR154","doi-asserted-by":"crossref","unstructured":"Li J, Xu X, Ding X, et\u00a0al (2024a) Bayesian spatio-temporal graph convolutional network for railway train delay prediction. IEEE Transactions on Intelligent Transportation Systems","DOI":"10.1109\/TITS.2024.3409754"},{"key":"11223_CR155","unstructured":"Li Y, Yu R, Shahabi C, et\u00a0al (2018) Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=SJiHXGWAZ"},{"key":"11223_CR156","doi-asserted-by":"crossref","unstructured":"Han L, Ye HJ, Zhan DC (2024) The capacity and robustness trade-off: Revisiting the channel independent strategy for multivariate time series forecasting. IEEE Transactions on Knowledge and Data Engineering pp 1\u201314","DOI":"10.1109\/TKDE.2024.3400008"},{"key":"11223_CR157","unstructured":"Lu J, Han X, Sun Y, et\u00a0al (2024) Cats: Enhancing multivariate time series forecasting by constructing auxiliary time series as exogenous variables. 
Int Conf Mach Learn"},{"key":"11223_CR158","unstructured":"Lundberg SM, Lee SI (2017) A unified approach to interpreting model predictions. In: Guyon I, Luxburg UV, Bengio S, et\u00a0al (eds) Advances in Neural Information Processing Systems, vol\u00a030. Curran Associates, Inc., https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2017\/file\/8a20a8621978632d76c43dfd28b67767-Paper.pdf"},{"key":"11223_CR159","unstructured":"Luo D, Wang X (2024) Moderntcn: A modern pure convolution structure for general time series analysis. In: The Twelfth International Conference on Learning Representations"},{"key":"11223_CR160","doi-asserted-by":"crossref","unstructured":"Ma Y, Guo Z, Ren Z, et\u00a0al (2020) Streaming graph neural networks. In: Proceedings of the 43rd international ACM SIGIR conference on research and development in information retrieval, pp 719\u2013728","DOI":"10.1145\/3397271.3401092"},{"issue":"4","key":"11223_CR161","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1016\/S0169-2070(00)00057-1","volume":"16","author":"S Makridakis","year":"2000","unstructured":"Makridakis S, Hibon M (2000) The M3-competition: results, conclusions and implications. Int J Forecast 16(4):451\u2013476","journal-title":"Int J Forecast"},{"issue":"2","key":"11223_CR162","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1002\/for.3980010202","volume":"1","author":"S Makridakis","year":"1982","unstructured":"Makridakis S, Andersen A, Carbone R et al (1982) The accuracy of extrapolation (time series) methods: Results of a forecasting competition. J Forecast 1(2):111\u2013153","journal-title":"J Forecast"},{"issue":"1","key":"11223_CR163","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1016\/j.ijforecast.2019.04.014","volume":"36","author":"S Makridakis","year":"2020","unstructured":"Makridakis S, Spiliotis E, Assimakopoulos V (2020) The M4 competition: 100,000 time series and 61 forecasting methods. 
Int J Forecast 36(1):54\u201374","journal-title":"Int J Forecast"},{"key":"11223_CR164","doi-asserted-by":"crossref","unstructured":"Ma X, Li X, Fang L, et\u00a0al (2024) U-mixer: An unet-mixer architecture with stationarity correction for time series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 14255\u201314262","DOI":"10.1609\/aaai.v38i13.29337"},{"key":"11223_CR165","unstructured":"Marisca I, Alippi C, Bianchi FM (2024) Graph-based forecasting with missing data through spatiotemporal downsampling"},{"issue":"1","key":"11223_CR166","doi-asserted-by":"crossref","first-page":"76","DOI":"10.1111\/joes.12429","volume":"37","author":"RP Masini","year":"2023","unstructured":"Masini RP, Medeiros MC, Mendes EF (2023) Machine learning advances for time series forecasting. J Econom Surv 37(1):76\u2013111","journal-title":"J Econom Surv"},{"key":"11223_CR167","doi-asserted-by":"crossref","DOI":"10.1016\/j.energy.2020.118874","volume":"214","author":"M Massaoudi","year":"2021","unstructured":"Massaoudi M, Refaat SS, Chihi I et al (2021) A novel stacked generalization ensemble-based hybrid LGBM-XGB-MLP model for short-term load forecasting. Energy 214:118874","journal-title":"Energy"},{"issue":"10","key":"11223_CR168","doi-asserted-by":"crossref","first-page":"1087","DOI":"10.1287\/mnsc.22.10.1087","volume":"22","author":"JE Matheson","year":"1976","unstructured":"Matheson JE, Winkler RL (1976) Scoring rules for continuous probability distributions. Manage Sci 22(10):1087\u20131096","journal-title":"Manage Sci"},{"issue":"4","key":"11223_CR169","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1080\/07350015.2015.1086655","volume":"34","author":"MW McCracken","year":"2016","unstructured":"McCracken MW, Ng S (2016) FRED-MD: a monthly database for macroeconomic research. 
J Business Econom Stat 34(4):574\u2013589","journal-title":"J Business Econom Stat"},{"issue":"11","key":"11223_CR170","first-page":"1","volume":"4","author":"A McLeod","year":"2013","unstructured":"McLeod A, Gweon H (2013) Optimal deseasonalization for monthly and daily geophysical time series. J Environ Stat 4(11):1\u201311","journal-title":"J Environ Stat"},{"key":"11223_CR171","unstructured":"Meijer C, Chen LY (2024) The rise of diffusion models in time-series forecasting. Preprint at arXiv:2401.03006"},{"key":"11223_CR172","doi-asserted-by":"crossref","DOI":"10.1016\/j.enbuild.2023.113022","volume":"288","author":"N Mounir","year":"2023","unstructured":"Mounir N, Ouadi H, Jrhilifa I (2023) Short-term electric load forecasting using an EMD-BI-LSTM approach for smart grid energy management system. Energy Build 288:113022","journal-title":"Energy Build"},{"issue":"5","key":"11223_CR173","doi-asserted-by":"crossref","first-page":"792","DOI":"10.3390\/atmos14050792","volume":"14","author":"B Mu","year":"2023","unstructured":"Mu B, Jiang X, Yuan S et al (2023) Nao seasonal forecast using a multivariate air-sea coupled deep learning model combined with causal discovery. Atmosphere 14(5):792","journal-title":"Atmosphere"},{"issue":"2","key":"11223_CR174","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1111\/1467-9868.00231","volume":"62","author":"GP Nason","year":"2000","unstructured":"Nason GP, Von Sachs R, Kroisandt G (2000) Wavelet processes and adaptive estimation of the evolutionary wavelet spectrum. J R Stat Soc 62(2):271\u2013292","journal-title":"J R Stat Soc"},{"key":"11223_CR175","doi-asserted-by":"crossref","unstructured":"Nawrot P, Tworkowski S, Tyrolski M, et\u00a0al (2021) Hierarchical transformers are more efficient language models. 
Preprint at arXiv:2110.13711","DOI":"10.18653\/v1\/2022.findings-naacl.117"},{"key":"11223_CR176","doi-asserted-by":"crossref","unstructured":"Nie T, Mei Y, Qin G, et\u00a0al (2024) Channel-aware low-rank adaptation in time series forecasting. Conference on Information and Knowledge Management","DOI":"10.1145\/3627673.3679884"},{"key":"11223_CR177","unstructured":"Nie Y, Nguyen NH, Sinthong P, et\u00a0al (2023) A time series is worth 64 words: Long-term forecasting with transformers. In: The Eleventh International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=Jbdc0vTOcol"},{"key":"11223_CR178","unstructured":"Ni Z, Yu H, Liu S, et\u00a0al (2024) Basisformer: Attention-based time series forecasting with learnable and interpretable basis. Adv Neural Inform Process Syst 36"},{"key":"11223_CR179","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s43067-019-0008-x","volume":"7","author":"IK Nti","year":"2020","unstructured":"Nti IK, Teimeh M, Nyarko-Boateng O et al (2020) Electricity load forecasting: a systematic review. J Electric Syst Inform Technol 7:1\u201319","journal-title":"J Electric Syst Inform Technol"},{"key":"11223_CR180","unstructured":"Oord Avd, Li Y, Vinyals O (2018) Representation learning with contrastive predictive coding. Preprint at arXiv:1807.03748"},{"key":"11223_CR181","unstructured":"Oreshkin BN, Carpov D, Chapados N, et\u00a0al (2020) N-beats: Neural basis expansion analysis for interpretable time series forecasting. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=r1ecqn4YwB"},{"issue":"10","key":"11223_CR182","doi-asserted-by":"crossref","first-page":"1345","DOI":"10.1109\/TKDE.2009.191","volume":"22","author":"SJ Pan","year":"2009","unstructured":"Pan SJ, Yang Q (2009) A survey on transfer learning. 
IEEE Trans Knowl Data Eng 22(10):1345\u20131359","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"11223_CR183","doi-asserted-by":"crossref","unstructured":"Papapetrou P, Lee Z (2024) Interpretable and explainable time series mining. In: 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA), IEEE, pp 1\u20133","DOI":"10.1109\/DSAA61799.2024.10722788"},{"key":"11223_CR184","doi-asserted-by":"crossref","unstructured":"Park C, Lee C, Bahng H, et\u00a0al (2020) St-grat: A novel spatio-temporal graph attention networks for accurately forecasting dynamically changing road speed. In: Proceedings of the 29th ACM international conference on information & knowledge management, pp 1215\u20131224","DOI":"10.1145\/3340531.3411940"},{"issue":"9","key":"11223_CR185","doi-asserted-by":"crossref","first-page":"3760","DOI":"10.1109\/TNNLS.2019.2944933","volume":"31","author":"N Passalis","year":"2019","unstructured":"Passalis N, Tefas A, Kanniainen J et al (2019) Deep adaptive input normalization for time series forecasting. IEEE Transact Neural Netw Learn Syst 31(9):3760\u20133765","journal-title":"IEEE Transact Neural Netw Learn Syst"},{"key":"11223_CR186","doi-asserted-by":"crossref","unstructured":"Patro BN, Agneeswaran VS (2024a) Mamba-360: Survey of state space models as transformer alternative for long sequence modelling: Methods, applications, and challenges. Preprint at arXiv:2404.16112","DOI":"10.2139\/ssrn.4930035"},{"key":"11223_CR187","unstructured":"Patro BN, Agneeswaran VS (2024b) Simba: Simplified mamba-based architecture for vision and multivariate time series. Preprint at arXiv:2403.15360"},{"issue":"4","key":"11223_CR188","doi-asserted-by":"crossref","first-page":"242","DOI":"10.3390\/info14040242","volume":"14","author":"N Patwardhan","year":"2023","unstructured":"Patwardhan N, Marrone S, Sansone C (2023) Transformers in the real world: a survey on NLP applications. 
Information 14(4):242","journal-title":"Information"},{"key":"11223_CR189","doi-asserted-by":"publisher","unstructured":"Pavlyshenko BM (2020) Using bayesian regression for stacking time series predictive models. In: 2020 IEEE Third International Conference on Data Stream Mining & Processing (DSMP), pp 305\u2013309, https:\/\/doi.org\/10.1109\/DSMP47368.2020.9204312","DOI":"10.1109\/DSMP47368.2020.9204312"},{"issue":"4","key":"11223_CR190","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1093\/biomet\/82.4.669","volume":"82","author":"J Pearl","year":"1995","unstructured":"Pearl J (1995) Causal diagrams for empirical research. Biometrika 82(4):669\u2013688","journal-title":"Biometrika"},{"issue":"2","key":"11223_CR191","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1177\/0049124198027002004","volume":"27","author":"J Pearl","year":"1998","unstructured":"Pearl J (1998) Graphs, causality, and structural equation models. Sociol Methods Res 27(2):226\u2013284","journal-title":"Sociol Methods Res"},{"key":"11223_CR192","first-page":"3","volume-title":"Models, reasoning and inference","author":"J Pearl","year":"2000","unstructured":"Pearl J et al (2000) Models, reasoning and inference, vol 19. Cambridge University Press, Cambridge, p 3"},{"key":"11223_CR193","doi-asserted-by":"crossref","unstructured":"Peng Zj, Zhang C, Tian Yx (2023) Crude oil price time series forecasting: a novel approach based on variational mode decomposition, time-series imaging, and deep learning. IEEE Access","DOI":"10.1109\/ACCESS.2023.3301576"},{"issue":"2","key":"11223_CR194","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1016\/j.ejor.2018.01.045","volume":"268","author":"F Petropoulos","year":"2018","unstructured":"Petropoulos F, Hyndman RJ, Bergmeir C (2018) Exploring the sources of uncertainty: why does bagging for time series forecasting work? 
Eur J Oper Res 268(2):545\u2013554","journal-title":"Eur J Oper Res"},{"key":"11223_CR195","doi-asserted-by":"crossref","unstructured":"Piao X, Chen Z, Murayama T, et\u00a0al (2024) Fredformer: Frequency debiased transformer for time series forecasting. In: KDD, pp 2400\u20132410, https:\/\/doi.org\/10.1145\/3637528.3671928","DOI":"10.1145\/3637528.3671928"},{"key":"11223_CR196","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1016\/j.idm.2019.12.010","volume":"5","author":"S Portet","year":"2020","unstructured":"Portet S (2020) A primer on model selection using the Akaike information criterion. Infectious Disease Modelling 5:111\u2013128","journal-title":"Infectious Disease Modelling"},{"key":"11223_CR197","doi-asserted-by":"crossref","unstructured":"Qian J, Wang Q, Wu Y, et\u00a0al (2023) Causality-based deep learning forecast of the Kuroshio volume transport in the east china sea. Earth Space Sci 10(2):e2022EA002722","DOI":"10.1029\/2022EA002722"},{"key":"11223_CR198","doi-asserted-by":"publisher","unstructured":"Qin Y, Song D, Chen H, et\u00a0al (2017) A dual-stage attention-based recurrent neural network for time series prediction. In: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI-17, pp 2627\u20132633, https:\/\/doi.org\/10.24963\/ijcai.2017\/366","DOI":"10.24963\/ijcai.2017\/366"},{"key":"11223_CR199","unstructured":"Qi S, Wen L, Li Y, et\u00a0al (2024) Enhancing multivariate time series forecasting with mutual information-driven cross-variable and temporal modeling. Preprint at arXiv:2403.00869"},{"key":"11223_CR200","first-page":"81","volume":"1","author":"JR Quinlan","year":"1986","unstructured":"Quinlan JR (1986) Induction of decision trees. Mach learning 1:81\u2013106","journal-title":"Mach learning"},{"key":"11223_CR201","unstructured":"Radford A, Kim JW, Hallacy C, et\u00a0al (2021) Learning transferable visual models from natural language supervision. 
In: International conference on machine learning, PMLR, pp 8748\u20138763"},{"key":"11223_CR202","unstructured":"Ramesh A, Dhariwal P, Nichol A, et\u00a0al (2022) Hierarchical text-conditional image generation with CLIP latents. Preprint at arXiv:2204.06125"},{"key":"11223_CR203","unstructured":"Rasul K, Ashok A, Williams AR, et\u00a0al (2024) Lag-llama: Towards foundation models for probabilistic time series forecasting. Preprint at arXiv:2310.08278"},{"key":"11223_CR204","unstructured":"Rasul K, Seward C, Schuster I, et\u00a0al (2021) Autoregressive denoising diffusion models for multivariate probabilistic time series forecasting. In: International Conference on Machine Learning, PMLR, pp 8857\u20138868"},{"key":"11223_CR205","doi-asserted-by":"crossref","unstructured":"Ribeiro MT, Singh S, Guestrin C (2016) \"Why should I trust you?\": Explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135\u20131144","DOI":"10.1145\/2939672.2939778"},{"key":"11223_CR206","doi-asserted-by":"crossref","unstructured":"Rombach R, Blattmann A, Lorenz D, et\u00a0al (2022) High-resolution image synthesis with latent diffusion models. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 10684\u201310695","DOI":"10.1109\/CVPR52688.2022.01042"},{"issue":"1","key":"11223_CR207","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1093\/biomet\/70.1.41","volume":"70","author":"PR Rosenbaum","year":"1983","unstructured":"Rosenbaum PR, Rubin DB (1983) The central role of the propensity score in observational studies for causal effects. Biometrika 70(1):41\u201355","journal-title":"Biometrika"},{"key":"11223_CR208","unstructured":"Rossi E, Chamberlain B, Frasca F, et\u00a0al (2020) Temporal graph networks for deep learning on dynamic graphs. 
In: ICML 2020 Workshop on Graph Representation Learning"},{"issue":"6088","key":"11223_CR209","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1038\/323533a0","volume":"323","author":"DE Rumelhart","year":"1986","unstructured":"Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533\u2013536","journal-title":"Nature"},{"key":"11223_CR210","doi-asserted-by":"crossref","unstructured":"Runge J, Nowack P, Kretschmer M, et\u00a0al (2019) Detecting and quantifying causal associations in large nonlinear time series datasets. Sci Adv 5(11):eaau4996","DOI":"10.1126\/sciadv.aau4996"},{"key":"11223_CR211","first-page":"36479","volume":"35","author":"C Saharia","year":"2022","unstructured":"Saharia C, Chan W, Saxena S et al (2022) Photorealistic text-to-image diffusion models with deep language understanding. Adv Neural Inf Process Syst 35:36479\u201336494","journal-title":"Adv Neural Inf Process Syst"},{"key":"11223_CR212","doi-asserted-by":"crossref","unstructured":"\u015eAHiN E, Arslan NN, \u00d6zdemir D (2024) Unlocking the black box: an in-depth review on interpretability, explainability, and reliability in deep learning. Neural Computing and Applications pp 1\u2013107","DOI":"10.1007\/s00521-024-10437-2"},{"key":"11223_CR213","doi-asserted-by":"crossref","unstructured":"Sankaranarayanan S, Balaji Y, Castillo CD, et\u00a0al (2018) Generate to adapt: Aligning domains using generative adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8503\u20138512","DOI":"10.1109\/CVPR.2018.00887"},{"issue":"1","key":"11223_CR214","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1109\/TNN.2008.2005605","volume":"20","author":"F Scarselli","year":"2008","unstructured":"Scarselli F, Gori M, Tsoi AC et al (2008) The graph neural network model. 
IEEE Trans Neural Networks 20(1):61\u201380","journal-title":"IEEE Trans Neural Networks"},{"issue":"5","key":"11223_CR215","doi-asserted-by":"crossref","first-page":"612","DOI":"10.1109\/JPROC.2021.3058954","volume":"109","author":"B Sch\u00f6lkopf","year":"2021","unstructured":"Sch\u00f6lkopf B, Locatello F, Bauer S et al (2021) Toward causal representation learning. Proc IEEE 109(5):612\u2013634","journal-title":"Proc IEEE"},{"key":"11223_CR216","unstructured":"Shabani MA, Abdi AH, Meng L, et\u00a0al (2023) Scaleformer: Iterative multi-scale refining transformers for time series forecasting. In: The Eleventh International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=sCrnllCtjoE"},{"key":"11223_CR217","doi-asserted-by":"crossref","DOI":"10.1515\/9780691214696","volume-title":"A Mathematical Theory of Evidence","author":"G Shafer","year":"1976","unstructured":"Shafer G (1976) A Mathematical Theory of Evidence. Princeton University Press, Princeton"},{"key":"11223_CR218","first-page":"2075","volume":"13","author":"N Sharma","year":"2021","unstructured":"Sharma N, Mangla M, Mohanty SN et al (2021) Employing stacked ensemble approach for time series forecasting. Int J Inf Technol 13:2075\u20132080","journal-title":"Int J Inf Technol"},{"key":"11223_CR219","doi-asserted-by":"crossref","unstructured":"Sharma K, Dwivedi YK, Metri B (2022) Incorporating causality in energy consumption forecasting using deep neural networks. Annals of Operations Research pp 1\u201336","DOI":"10.1007\/s10479-022-04857-3"},{"key":"11223_CR220","unstructured":"Shen L, Chen W, Kwok J (2024) Multi-resolution diffusion models for time series forecasting. In: The Twelfth International Conference on Learning Representations"},{"key":"11223_CR221","unstructured":"Shen L, Kwok J (2023) Non-autoregressive conditional diffusion models for time series prediction. 
In: International Conference on Machine Learning, PMLR, pp 31016\u201331029"},{"key":"11223_CR222","unstructured":"Shi X, Chen Z, Wang H, et\u00a0al (2015) Convolutional LSTM network: a machine learning approach for precipitation nowcasting. Adv Neural Inform Process Syst 28"},{"key":"11223_CR223","doi-asserted-by":"crossref","first-page":"1421","DOI":"10.1007\/s10994-019-05815-0","volume":"108","author":"SY Shih","year":"2019","unstructured":"Shih SY, Sun FK, Hy Lee (2019) Temporal pattern attention for multivariate time series forecasting. Mach Learn 108:1421\u20131441","journal-title":"Mach Learn"},{"key":"11223_CR224","unstructured":"Shu Y, Lampos V (2024) Deformtime: Capturing variable dependencies with deformable attention for time series forecasting. Preprint at arXiv:2406.07438"},{"issue":"1","key":"11223_CR225","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.ijforecast.2019.03.017","volume":"36","author":"S Smyl","year":"2020","unstructured":"Smyl S (2020) A hybrid method of exponential smoothing and recurrent neural networks for time series forecasting. Int J Forecast 36(1):75\u201385","journal-title":"Int J Forecast"},{"key":"11223_CR226","doi-asserted-by":"crossref","unstructured":"Smyl S, Oreshkin BN, Pe\u0142ka P, et\u00a0al (2024) Any-quantile probabilistic forecasting of short-term electricity demand. Preprint at arXiv:2404.17451","DOI":"10.2139\/ssrn.4995513"},{"key":"11223_CR227","unstructured":"Sohl-Dickstein J, Weiss E, Maheswaranathan N, et\u00a0al (2015) Deep unsupervised learning using nonequilibrium thermodynamics. In: International conference on machine learning, PMLR, pp 2256\u20132265"},{"key":"11223_CR228","unstructured":"Song Y, Ermon S (2019) Generative modeling by estimating gradients of the data distribution. 
Adv Neural Inform Process Syst 32"},{"key":"11223_CR229","doi-asserted-by":"crossref","unstructured":"Song C, Lin Y, Guo S, et\u00a0al (2020) Spatial-temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting. In: Proceedings of the AAAI conference on artificial intelligence, pp 914\u2013921","DOI":"10.1609\/aaai.v34i01.5438"},{"key":"11223_CR230","unstructured":"Song Y, Sohl-Dickstein J, Kingma DP, et\u00a0al (2021) Score-based generative modeling through stochastic differential equations. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=PxTIG12RRHS"},{"key":"11223_CR231","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s12199-012-0294-6","volume":"18","author":"IN Soyiri","year":"2013","unstructured":"Soyiri IN, Reidpath DD (2013) An overview of health forecasting. Environ Health Prev Med 18:1\u20139","journal-title":"Environ Health Prev Med"},{"key":"11223_CR232","unstructured":"Sparks AH, Carroll J, Goldie J, et\u00a0al (2020) Bomrang: Australian government bureau of meteorology (BOM) data client. R package version 07.0 https:\/\/CRAN.R-project.org\/package=bomrang"},{"issue":"1","key":"11223_CR233","first-page":"1929","volume":"15","author":"N Srivastava","year":"2014","unstructured":"Srivastava N, Hinton G, Krizhevsky A et al (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929\u20131958","journal-title":"J Mach Learn Res"},{"key":"11223_CR234","doi-asserted-by":"crossref","unstructured":"\u0160t\u011bpni\u010dka M, Burda M (2017) On the results and observations of the time series forecasting competition cif 2016. 
In: 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), IEEE, pp 1\u20136","DOI":"10.1109\/FUZZ-IEEE.2017.8015455"},{"key":"11223_CR235","first-page":"793","volume-title":"2024 IEEE 6th International Conference on Power","author":"M Sui","year":"2024","unstructured":"Sui M, Zhang C, Zhou L et al (2024) An ensemble approach to stock price prediction using deep learning and time series models. In: 2024 IEEE 6th International Conference on Power, Intelligent Computing and Systems (ICPICS), IEEE, pp 793\u2013797"},{"key":"11223_CR236","doi-asserted-by":"crossref","unstructured":"Sun F, Hao W, Zou A, et\u00a0al (2024a) A survey on spatio-temporal series prediction with deep learning: taxonomy, applications, and future directions. Neural Computing and Applications pp 1\u201325","DOI":"10.1007\/s00521-024-09659-1"},{"key":"11223_CR237","unstructured":"Sun Y, Xie Z, Chen D, et\u00a0al (2024b) Hierarchical classification auxiliary network for time series forecasting. Preprint at arXiv:2405.18975"},{"issue":"8","key":"11223_CR238","doi-asserted-by":"crossref","first-page":"7067","DOI":"10.1016\/j.eswa.2012.01.039","volume":"39","author":"SB Taieb","year":"2012","unstructured":"Taieb SB, Bontempi G, Atiya AF et al (2012) A review and comparison of strategies for multi-step ahead time series forecasting based on the NN5 forecasting competition. Expert Syst Appl 39(8):7067\u20137083","journal-title":"Expert Syst Appl"},{"key":"11223_CR239","unstructured":"Tang P, Zhang W (2024) PDMLP: Patch-based decomposed MLP for long-term time series forecasting. Preprint at arXiv:2405.13575"},{"key":"11223_CR240","first-page":"24804","volume":"34","author":"Y Tashiro","year":"2021","unstructured":"Tashiro Y, Song J, Song Y et al (2021) CSDI: Conditional score-based diffusion models for probabilistic time series imputation. 
Adv Neural Inf Process Syst 34:24804\u201324816","journal-title":"Adv Neural Inf Process Syst"},{"key":"11223_CR241","doi-asserted-by":"crossref","unstructured":"Tay Y, Dehghani M, Bahri D, et\u00a0al (2020) Efficient transformers: a survey. ACM Comput Surv 55:1\u201328. https:\/\/api.semanticscholar.org\/CorpusID:221702858","DOI":"10.1145\/3530811"},{"issue":"2","key":"11223_CR242","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1016\/0169-2070(90)90007-X","volume":"6","author":"PA Thompson","year":"1990","unstructured":"Thompson PA (1990) An MSE statistic for comparing forecast accuracy across series. Int J Forecast 6(2):219\u2013227","journal-title":"Int J Forecast"},{"key":"11223_CR243","unstructured":"Touvron H, Martin L, Stone K, et\u00a0al (2023) LLAMA 2: Open foundation and fine-tuned chat models. Preprint at arXiv:2307.09288"},{"key":"11223_CR244","doi-asserted-by":"crossref","unstructured":"Truchan H, Kalfar C, Ahmadi Z (2024) LTBOOST: Boosted hybrids of ensemble linear and gradient algorithms for the long-term time series forecasting. In: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, pp 2271\u20132281","DOI":"10.1145\/3627673.3679527"},{"key":"11223_CR245","unstructured":"Ulyanov D, Vedaldi A, Lempitsky V (2016) Instance normalization: The missing ingredient for fast stylization. Preprint at arXiv:1607.08022"},{"key":"11223_CR246","unstructured":"Van Den\u00a0Oord A, Dieleman S, Zen H, et\u00a0al (2016) Wavenet: a generative model for raw audio. Preprint at arXiv:1609.03499 12"},{"key":"11223_CR247","unstructured":"Vapnik V, Golowich S, Smola A (1996) Support vector method for function approximation, regression estimation and signal processing. Adv Neural Inform Process Syst 9"},{"key":"11223_CR248","unstructured":"Vaswani A, Shazeer N, Parmar N, et\u00a0al (2017) Attention is all you need. 
Adv Neural Inform Process Syst 30"},{"key":"11223_CR249","unstructured":"Veli\u010dkovi\u0107 P, Cucurull G, Casanova A, et\u00a0al (2018) Graph attention networks. In: International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=rJXMpikCZ"},{"key":"11223_CR250","first-page":"841","volume":"31","author":"S Wachter","year":"2017","unstructured":"Wachter S, Mittelstadt B, Russell C (2017) Counterfactual explanations without opening the black box: Automated decisions and the GDPR. Harv JL Tech 31:841","journal-title":"Harv JL Tech"},{"issue":"1","key":"11223_CR251","doi-asserted-by":"publisher","first-page":"264","DOI":"10.1109\/TNNLS.2021.3093468","volume":"34","author":"W Wang","year":"2023","unstructured":"Wang W, Li H, Ding Z et al (2023) Rethinking maximum mean discrepancy for visual domain adaptation. IEEE Transact Neural Netw Learn Syst 34(1):264\u2013277. https:\/\/doi.org\/10.1109\/TNNLS.2021.3093468","journal-title":"IEEE Transact Neural Netw Learn Syst"},{"key":"11223_CR252","volume":"288","author":"Z Wang","year":"2024","unstructured":"Wang Z, Ruan S, Huang T et al (2024) A lightweight multi-layer perceptron for efficient multivariate time series forecasting. Knowl-Based Syst 288:111463","journal-title":"Knowl-Based Syst"},{"key":"11223_CR253","doi-asserted-by":"crossref","unstructured":"Wang L, Adiga A, Chen J, et\u00a0al (2022b) Causalgnn: Causal-based graph neural networks for spatio-temporal epidemic forecasting. In: Proceedings of the AAAI conference on artificial intelligence, pp 12191\u201312199","DOI":"10.1609\/aaai.v36i11.21479"},{"key":"11223_CR254","doi-asserted-by":"crossref","unstructured":"Wang J, Jatowt A, Yoshikawa M (2022a) Timebert: Extending pre-trained language representations with temporal information. 
Preprint at arXiv:2204.13032","DOI":"10.1145\/3539618.3591686"},{"key":"11223_CR255","doi-asserted-by":"crossref","unstructured":"Wang Z, Kong F, Feng S, et\u00a0al (2024d) Is mamba effective for time series forecasting? Preprint at arXiv:2403.11144","DOI":"10.2139\/ssrn.4832898"},{"key":"11223_CR256","doi-asserted-by":"crossref","unstructured":"Wang Z, Miliou I, Samsten I, et\u00a0al (2023b) Counterfactual explanations for time series forecasting. In: 2023 IEEE International Conference on Data Mining (ICDM), IEEE, pp 1391\u20131396","DOI":"10.1109\/ICDM58522.2023.00180"},{"key":"11223_CR257","unstructured":"Wang Y, Wu H, Dong J, et\u00a0al (2024c) Timexer: Empowering transformers for time series forecasting with exogenous variables. In: The Thirty-eighth Annual Conference on Neural Information Processing Systems, https:\/\/openreview.net\/forum?id=INAeUQ04lT"},{"key":"11223_CR258","unstructured":"Wang S, Wu H, Shi X, et\u00a0al (2024a) Timemixer: Decomposable multiscale mixing for time series forecasting. International Conference on Learning Representations"},{"key":"11223_CR259","unstructured":"Wang X, Zhou T, Wen Q, et\u00a0al (2024b) Card: Channel aligned robust blend transformer for time series forecasting. In: The Twelfth International Conference on Learning Representations"},{"issue":"11","key":"11223_CR260","doi-asserted-by":"crossref","first-page":"8441","DOI":"10.1109\/TNNLS.2022.3151160","volume":"34","author":"C Wei","year":"2022","unstructured":"Wei C, Niu C, Tang Y et al (2022) NPENAS: Neural predictor guided evolution for neural architecture search. IEEE Transact Neural Netw Learn Syst 34(11):8441\u20138455","journal-title":"IEEE Transact Neural Netw Learn Syst"},{"key":"11223_CR261","unstructured":"Wen R, Torkkola K, Narayanaswamy B, et\u00a0al (2017) A multi-horizon quantile recurrent forecaster. 
Preprint at arXiv:1711.11053"},{"key":"11223_CR262","doi-asserted-by":"publisher","unstructured":"Wen Q, Zhou T, Zhang C, et\u00a0al (2023) Transformers in time series: A survey. In: Elkind E (ed) Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI-23. International Joint Conferences on Artificial Intelligence Organization, pp 6778\u20136786, https:\/\/doi.org\/10.24963\/ijcai.2023\/759, (survey Track)","DOI":"10.24963\/ijcai.2023\/759"},{"issue":"3","key":"11223_CR263","doi-asserted-by":"crossref","first-page":"324","DOI":"10.1287\/mnsc.6.3.324","volume":"6","author":"PR Winters","year":"1960","unstructured":"Winters PR (1960) Forecasting sales by exponentially weighted moving averages. Manage Sci 6(3):324\u2013342","journal-title":"Manage Sci"},{"issue":"2","key":"11223_CR264","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/S0893-6080(05)80023-1","volume":"5","author":"DH Wolpert","year":"1992","unstructured":"Wolpert DH (1992) Stacked generalization. Neural Netw 5(2):241\u2013259","journal-title":"Neural Netw"},{"key":"11223_CR265","doi-asserted-by":"crossref","unstructured":"Wong SC, Gatt A, Stamatescu V, et\u00a0al (2016) Understanding data augmentation for classification: when to warp? In: 2016 international conference on digital image computing: techniques and applications (DICTA), IEEE, pp 1\u20136","DOI":"10.1109\/DICTA.2016.7797091"},{"key":"11223_CR266","unstructured":"Woo G, Liu C, Kumar A, et\u00a0al (2024) Unified training of universal time series forecasting transformers. In: Forty-first International Conference on Machine Learning"},{"issue":"01","key":"11223_CR267","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1142\/S1793536909000047","volume":"1","author":"Z Wu","year":"2009","unstructured":"Wu Z, Huang NE (2009) Ensemble empirical mode decomposition: a noise-assisted data analysis method. 
Adv Adapt Data Anal 1(01):1\u201341","journal-title":"Adv Adapt Data Anal"},{"key":"11223_CR268","volume":"288","author":"B Wu","year":"2024","unstructured":"Wu B, Wang L (2024) Two-stage decomposition and temporal fusion transformers for interpretable wind speed forecasting. Energy 288:129728","journal-title":"Energy"},{"key":"11223_CR269","first-page":"22419","volume":"34","author":"H Wu","year":"2021","unstructured":"Wu H, Xu J, Wang J et al (2021) Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Adv Neural Inf Process Syst 34:22419\u201322430","journal-title":"Adv Neural Inf Process Syst"},{"key":"11223_CR270","volume":"294","author":"B Wu","year":"2024","unstructured":"Wu B, Yu S, Peng L et al (2024) Interpretable wind speed forecasting with meteorological feature exploring and two-stage decomposition. Energy 294:130782","journal-title":"Energy"},{"key":"11223_CR271","unstructured":"Wu Z, Gong Y, Zhang A (2024b) Dtmamba: Dual twin mamba for time series forecasting. Preprint at arXiv:2405.07022"},{"key":"11223_CR272","unstructured":"Wu H, Hu T, Liu Y, et\u00a0al (2023) Timesnet: Temporal 2d-variation modeling for general time series analysis. International Conference on Learning Representations"},{"key":"11223_CR273","unstructured":"Xiong Q, Tang K, Ma M, et\u00a0al (2024) Tdt loss takes it all: Integrating temporal dependencies among targets into non-autoregressive time series forecasting. Preprint at arXiv:2406.04777"},{"key":"11223_CR274","doi-asserted-by":"crossref","first-page":"158","DOI":"10.21595\/vp.2018.20250","volume":"22","author":"B Xu","year":"2019","unstructured":"Xu B, Sheng Y, Li P et al (2019) Causes and classification of EMD mode mixing. Vibroengineering Procedia 22:158\u2013164","journal-title":"Vibroengineering Procedia"},{"key":"11223_CR275","doi-asserted-by":"crossref","unstructured":"Xu B, Yang G (2024) Interpretability research of deep learning: a literature survey. 
Information Fusion p 102721","DOI":"10.1016\/j.inffus.2024.102721"},{"key":"11223_CR276","unstructured":"Xu Z, Bian Y, Zhong J, et\u00a0al (2024b) Beyond trend and periodicity: Guiding time series forecasting with textual cues. Preprint at arXiv:2405.13522"},{"key":"11223_CR277","unstructured":"Xu X, Chen Z, Xu D, et\u00a0al (2022) Mixing signals: Data augmentation approach for deep learning based modulation recognition. Preprint at arXiv:2204.03737"},{"key":"11223_CR278","doi-asserted-by":"crossref","unstructured":"Xue H, Salim FD (2023) Promptcast: a new prompt-based learning paradigm for time series forecasting. IEEE Transactions on Knowledge and Data Engineering","DOI":"10.1109\/TKDE.2023.3342137"},{"key":"11223_CR279","unstructured":"Xu X, Liang Y, Huang B, et\u00a0al (2024a) Integrating mamba and transformer for long-short range time series forecasting. Preprint at arXiv:2404.14757"},{"key":"11223_CR280","unstructured":"Xu Z, Zeng A, Xu Q (2024) Fits: Modeling time series with 10k parameters"},{"key":"11223_CR281","doi-asserted-by":"crossref","first-page":"61915","DOI":"10.1109\/ACCESS.2020.2983588","volume":"8","author":"Y Yang","year":"2020","unstructured":"Yang Y, Yang Y (2020) Hybrid method for short-term time series forecasting based on EEMD. IEEE Access 8:61915\u201361928","journal-title":"IEEE Access"},{"key":"11223_CR282","unstructured":"Yang Y, Jin M, Wen H, et\u00a0al (2024a) A survey on diffusion models for time series and Spatio-temporal data. Preprint at arXiv:2404.18886"},{"key":"11223_CR283","doi-asserted-by":"crossref","unstructured":"Yang F, Li X, Wang M, et\u00a0al (2023a) Waveform: Graph enhanced wavelet learning for long sequence forecasting of multivariate time series. 
In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 10754\u201310761","DOI":"10.1609\/aaai.v37i9.26276"},{"key":"11223_CR284","unstructured":"Yan T, Gong H, Yongping H, et\u00a0al (2024) Probabilistic time series modeling with decomposable denoising diffusion model. In: Salakhutdinov R, Kolter Z, Heller K, et\u00a0al (eds) Proceedings of the 41st International Conference on Machine Learning, Proceedings of Machine Learning Research, vol 235. PMLR, pp 55759\u201355777, https:\/\/proceedings.mlr.press\/v235\/yan24b.html"},{"key":"11223_CR285","unstructured":"Yang M, Xu C, Bai Y, et\u00a0al (2023b) Investigating black-box model for wind power forecasting using local interpretable model-agnostic explanations algorithm: why should a model be trusted? CSEE J Power Energy Syst"},{"key":"11223_CR286","doi-asserted-by":"publisher","unstructured":"Yang Y, Zhu Q, Chen J (2024b) Vcformer: Variable correlation transformer with inherent lagged correlation for multivariate time series forecasting. In: Larson K (ed) Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, IJCAI-24. International Joint Conferences on Artificial Intelligence Organization, pp 5335\u20135343, https:\/\/doi.org\/10.24963\/ijcai.2024\/590, (Main Track)","DOI":"10.24963\/ijcai.2024\/590"},{"key":"11223_CR287","doi-asserted-by":"crossref","unstructured":"Yan S, Xiong Y, Lin D (2018) Spatial temporal graph convolutional networks for skeleton-based action recognition. In: Proceedings of the AAAI conference on artificial intelligence","DOI":"10.1609\/aaai.v32i1.12328"},{"key":"11223_CR288","unstructured":"Yan T, Zhang H, Zhou T, et\u00a0al (2021) Scoregrad: Multivariate probabilistic time series forecasting with continuous energy-based generative models. 
Preprint at arXiv:2106.10121"},{"key":"11223_CR289","unstructured":"Ye J, Zhang W, Yi K, et\u00a0al (2024) A survey of time series foundation models: Generalizing time series representation with large language model. Preprint at arXiv:2405.02358"},{"key":"11223_CR290","unstructured":"Yi K, Zhang Q, Fan W, et\u00a0al (2024) Frequency-domain mlps are more effective learners in time series forecasting. Adv Neural Inform Process Syst 36"},{"key":"11223_CR291","unstructured":"Yuan X, Qiao Y (2024) Diffusion-TS: Interpretable diffusion for general time series generation. In: The Twelfth International Conference on Learning Representations, https:\/\/openreview.net\/forum?id=4h1apFjO99"},{"key":"11223_CR292","doi-asserted-by":"crossref","unstructured":"Yu C, Wang F, Shao Z, et\u00a0al (2023) Dsformer: A double sampling transformer for multivariate time series long-term prediction. In: Proceedings of the 32nd ACM international conference on information and knowledge management, pp 3062\u20133072","DOI":"10.1145\/3583780.3614851"},{"key":"11223_CR293","unstructured":"Yu G, Zou J, Hu X, et\u00a0al (2024) Revitalizing multivariate time series forecasting: Learnable decomposition with inter-series dependencies and intra-series variations modeling. Int Conf Mach Learn"},{"key":"11223_CR294","doi-asserted-by":"crossref","unstructured":"Zeng A, Chen M, Zhang L, et\u00a0al (2023) Are transformers effective for time series forecasting? In: Proceedings of the AAAI conference on artificial intelligence, pp 11121\u201311128","DOI":"10.1609\/aaai.v37i9.26317"},{"key":"11223_CR295","unstructured":"Zeng C, Liu Z, Zheng G, et\u00a0al (2024) C-mamba: Channel correlation enhanced state space models for multivariate time series forecasting. 
Preprint at arXiv:2406.05316"},{"key":"11223_CR296","doi-asserted-by":"crossref","first-page":"688","DOI":"10.1016\/j.renene.2021.04.091","volume":"174","author":"S Zhang","year":"2021","unstructured":"Zhang S, Chen Y, Xiao J et al (2021) Hybrid wind speed forecasting model based on multivariate data secondary decomposition approach and deep learning algorithm with attention mechanism. Renew Energy 174:688\u2013704","journal-title":"Renew Energy"},{"key":"11223_CR297","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1016\/j.ins.2020.08.053","volume":"544","author":"S Zhang","year":"2021","unstructured":"Zhang S, Chen Y, Zhang W et al (2021) A novel ensemble deep learning model with dynamic error correction and multi-objective ensemble pruning for time series forecasting. Inf Sci 544:427\u2013445","journal-title":"Inf Sci"},{"key":"11223_CR298","doi-asserted-by":"crossref","unstructured":"Zhang Q, Chang J, Meng G, et\u00a0al (2020) Spatio-temporal graph structure learning for traffic forecasting. In: Proceedings of the AAAI conference on artificial intelligence, pp 1177\u20131185","DOI":"10.1609\/aaai.v34i01.5470"},{"key":"11223_CR299","doi-asserted-by":"publisher","unstructured":"Zhang X, Chowdhury RR, Gupta RK, et\u00a0al (2024d) Large language models for time series: A survey. In: Larson K (ed) Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, IJCAI-24. International Joint Conferences on Artificial Intelligence Organization, pp 8335\u20138343, https:\/\/doi.org\/10.24963\/ijcai.2024\/921, (Survey Track)","DOI":"10.24963\/ijcai.2024\/921"},{"key":"11223_CR300","unstructured":"Zhang Y, Ma L, Pal S, et\u00a0al (2024e) Multi-resolution time-series transformer for long-term forecasting. In: International Conference on Artificial Intelligence and Statistics, PMLR, pp 4222\u20134230"},{"key":"11223_CR301","unstructured":"Zhang M, Sun Y, Liang F (2024c) Sparse deep learning for time series data: theory and applications. 
Adv Neural Inform Process Syst 36"},{"key":"11223_CR302","doi-asserted-by":"crossref","unstructured":"Zhang D, Wang Y (2024) Adaptive extraction network for multivariate long sequence time-series forecasting. Preprint at arXiv:2405.12038","DOI":"10.1016\/j.knosys.2023.111321"},{"key":"11223_CR303","doi-asserted-by":"crossref","unstructured":"Zhang K, Wen Q, Zhang C, et\u00a0al (2024a) Self-supervised learning for time series analysis: Taxonomy, progress, and prospects. IEEE Transactions on Pattern Analysis and Machine Intelligence","DOI":"10.1109\/TPAMI.2024.3387317"},{"key":"11223_CR304","unstructured":"Zhang Y, Yan J (2023) Crossformer: Transformer utilizing cross-dimension dependency for multivariate time series forecasting. In: The eleventh international conference on learning representations"},{"key":"11223_CR305","doi-asserted-by":"crossref","unstructured":"Zhang J, Zheng Y, Qi D (2017) Deep spatio-temporal residual networks for citywide crowd flows prediction. In: Proceedings of the AAAI conference on artificial intelligence","DOI":"10.1609\/aaai.v31i1.10735"},{"key":"11223_CR306","unstructured":"Zhang K, Zou X, Tang Y (2024b) Caformer: Rethinking time series analysis from causal perspective. Preprint at arXiv:2403.08572"},{"key":"11223_CR307","unstructured":"Zhan T, He Y, Li Z, et\u00a0al (2024) Time evidence fusion network: Multi-source view in long-term time series forecasting. Preprint at arXiv:2405.06419"},{"key":"11223_CR308","unstructured":"Zhao S, Ma T, Ermon S (2020) Individual calibration with randomized forecasting. In: International Conference on Machine Learning, PMLR, pp 11387\u201311397"},{"key":"11223_CR309","doi-asserted-by":"crossref","unstructured":"Zheng C, Fan X, Wang C, et\u00a0al (2020) Gman: A graph multi-attention network for traffic prediction. 
In: Proceedings of the AAAI conference on artificial intelligence, pp 1234\u20131241","DOI":"10.1609\/aaai.v34i01.5477"},{"key":"11223_CR310","unstructured":"Zhou T, Ma Z, Wen Q, et\u00a0al (2022) Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting. In: International conference on machine learning, PMLR, pp 27268\u201327286"},{"key":"11223_CR311","unstructured":"Zhou T, Niu P, Wang X, et\u00a0al (2023) One fits all: Power general time series analysis by pretrained LM. In: Thirty-seventh Conference on Neural Information Processing Systems, https:\/\/openreview.net\/forum?id=gMS6FVZvmF"},{"key":"11223_CR312","doi-asserted-by":"crossref","unstructured":"Zhou H, Zhang S, Peng J, et\u00a0al (2021) Informer: Beyond efficient transformer for long sequence time-series forecasting. In: Proceedings of the AAAI conference on artificial intelligence, pp 11106\u201311115","DOI":"10.1609\/aaai.v35i12.17325"}],"container-title":["Artificial Intelligence Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10462-025-11223-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10462-025-11223-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10462-025-11223-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,13]],"date-time":"2025-05-13T17:44:36Z","timestamp":1747158276000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10462-025-11223-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4,23]]},"references-count":312,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2025,7]]}},"alternative-id":["11223"],"URL":"https:\/
\/doi.org\/10.1007\/s10462-025-11223-9","relation":{},"ISSN":["1573-7462"],"issn-type":[{"value":"1573-7462","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,4,23]]},"assertion":[{"value":"4 April 2025","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 April 2025","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"216"}}