{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,15]],"date-time":"2025-10-15T18:02:53Z","timestamp":1760551373456},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2021,2,5]],"date-time":"2021-02-05T00:00:00Z","timestamp":1612483200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,2,5]],"date-time":"2021-02-05T00:00:00Z","timestamp":1612483200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Knowl Inf Syst"],"published-print":{"date-parts":[[2021,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Knowledge discovery systems are nowadays supposed to store and process very large data. When working with big time series, multivariate prediction becomes more and more complicated because the use of all the variables does not allow to have the most accurate predictions and poses certain problems for classical prediction models. In this article, we present a scalable prediction process for large time series prediction, including a new algorithm for identifying time series predictors, which analyses the dependencies between time series using the mutual reinforcement principle between Hubs and Authorities of the Hits (Hyperlink-Induced Topic Search) algorithm. The proposed framework is evaluated on 3 real datasets. The results show that the best predictions are obtained using a very small number of predictors compared to the initial number of variables. The proposed feature selection algorithm shows promising results compared to widely known algorithms, such as the classic and the kernel principle component analysis, factor analysis, and the fast correlation-based filter method, and improves the prediction accuracy of many time series of the used datasets.<\/jats:p>","DOI":"10.1007\/s10115-021-01544-w","type":"journal-article","created":{"date-parts":[[2021,2,5]],"date-time":"2021-02-05T08:26:30Z","timestamp":1612513590000},"page":"1093-1116","update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["A scalable framework for large time series prediction"],"prefix":"10.1007","volume":"63","author":[{"given":"Youssef","family":"Hmamouche","sequence":"first","affiliation":[]},{"given":"Lotfi","family":"Lakhal","sequence":"additional","affiliation":[]},{"given":"Alain","family":"Casali","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,2,5]]},"reference":[{"issue":"1","key":"1544_CR1","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1109\/TPWRS.2016.2556620","volume":"32","author":"O Abedinia","year":"2017","unstructured":"Abedinia O, Amjady N, Zareipour H (2017) A new feature selection technique for load and price forecast of electrical power systems. IEEE Trans Power Syst 32(1):62\u201374. https:\/\/doi.org\/10.1109\/TPWRS.2016.2556620","journal-title":"IEEE Trans Power Syst"},{"issue":"6","key":"1544_CR2","doi-asserted-by":"publisher","first-page":"716","DOI":"10.1109\/TAC.1974.1100705","volume":"19","author":"H Akaike","year":"1974","unstructured":"Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19(6):716\u2013723. https:\/\/doi.org\/10.1109\/TAC.1974.1100705","journal-title":"IEEE Trans Autom Control"},{"issue":"5","key":"1544_CR3","doi-asserted-by":"publisher","first-page":"2447","DOI":"10.1016\/j.laa.2012.10.022","volume":"438","author":"M Benzi","year":"2013","unstructured":"Benzi M, Estrada E, Klymko C (2013) Ranking hubs and authorities using matrix functions. Linear Algebra Appl 438(5):2447\u20132474. https:\/\/doi.org\/10.1016\/j.laa.2012.10.022","journal-title":"Linear Algebra Appl"},{"issue":"10","key":"1544_CR4","doi-asserted-by":"publisher","first-page":"1923","DOI":"10.1016\/j.neucom.2010.01.017","volume":"73","author":"SF Crone","year":"2010","unstructured":"Crone SF, Kourentzes N (2010) Feature selection for time series prediction\u2014A combined filter and wrapper approach for neural networks. Neurocomputing 73(10):1923\u20131936. https:\/\/doi.org\/10.1016\/j.neucom.2010.01.017","journal-title":"Neurocomputing"},{"key":"1544_CR5","unstructured":"Dua D, Graff C (2017) UCI machine learning repository. http:\/\/archive.ics.uci.edu\/ml"},{"issue":"4","key":"1544_CR6","doi-asserted-by":"publisher","first-page":"540","DOI":"10.1162\/003465300559037","volume":"82","author":"M Forni","year":"2000","unstructured":"Forni M, Hallin M, Lippi M, Reichlin L (2000) The generalized dynamic-factor model: identification and estimation. Rev Econ Stat 82(4):540\u2013554. https:\/\/doi.org\/10.1162\/003465300559037","journal-title":"Rev Econ Stat"},{"issue":"2","key":"1544_CR7","doi-asserted-by":"publisher","first-page":"262","DOI":"10.1088\/1742-2132\/12\/2\/262","volume":"12","author":"S Gan","year":"2015","unstructured":"Gan S, Chen Y, Zu S, Qu S, Zhong W (2015) Structure-oriented singular value decomposition for random noise attenuation of seismic data. J Geophys Eng 12(2):262. https:\/\/doi.org\/10.1088\/1742-2132\/12\/2\/262","journal-title":"J Geophys Eng"},{"key":"1544_CR8","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1016\/0165-1889(80)90069-X","volume":"2","author":"CWJ Granger","year":"1980","unstructured":"Granger CWJ (1980) Testing for causality. J Econ Dyn Control 2:329\u2013352. https:\/\/doi.org\/10.1016\/0165-1889(80)90069-X","journal-title":"J Econ Dyn Control"},{"key":"1544_CR9","doi-asserted-by":"publisher","unstructured":"Hmamouche Y, Lakhal L, Casali A (2018) Predictors extraction in time series using authorities-hubs ranking. In: 2018 IEEE international conference on data mining workshops (ICDMW), pp 1070\u20131079. https:\/\/doi.org\/10.1109\/ICDMW.2018.00155","DOI":"10.1109\/ICDMW.2018.00155"},{"issue":"8","key":"1544_CR10","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735\u20131780. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735","journal-title":"Neural Comput"},{"issue":"1","key":"1544_CR11","doi-asserted-by":"publisher","first-page":"55","DOI":"10.2307\/1267351","volume":"12","author":"AE Hoerl","year":"1970","unstructured":"Hoerl AE, Kennard RW (1970) Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12(1):55\u201367. https:\/\/doi.org\/10.2307\/1267351","journal-title":"Technometrics"},{"key":"1544_CR12","doi-asserted-by":"publisher","unstructured":"Hyndman RJ (2017) Australia datasets website. https:\/\/doi.org\/10.1109\/TPWRS.2016.25566201, (accessed 2020-04-12)","DOI":"10.1109\/TPWRS.2016.2556620"},{"issue":"4","key":"1544_CR13","first-page":"43","volume":"4","author":"RJ Hyndman","year":"2006","unstructured":"Hyndman RJ et al (2006) Another look at forecast-accuracy metrics for intermittent demand. Foresight: Int J Appl Forecast 4(4):43\u201346","journal-title":"Foresight: Int J Appl Forecast"},{"issue":"2","key":"1544_CR14","doi-asserted-by":"publisher","first-page":"231","DOI":"10.1016\/0165-1889(88)90041-3","volume":"12","author":"S Johansen","year":"1988","unstructured":"Johansen S (1988) Statistical analysis of cointegration vectors. J Econ Dyn Control 12(2):231\u2013254. https:\/\/doi.org\/10.1016\/0165-1889(88)90041-3","journal-title":"J Econ Dyn Control"},{"key":"1544_CR15","unstructured":"John GH, Kohavi R, Pfleger K (1994) Irrelevant features and the subset selection problem. In: Machine learning: proceedings of the eleventh international, Morgan Kaufmann, pp 121\u2013129"},{"key":"1544_CR16","series-title":"Springer series in statistics","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1007\/978-1-4757-1904-8_7","volume-title":"Principal component analysis and factor analysis","author":"IT Jolliffe","year":"1986","unstructured":"Jolliffe IT (1986) Principal component analysis and factor analysis. Springer series in statistics. Springer, New York, pp 115\u2013128"},{"issue":"1","key":"1544_CR17","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1016\/S0024-3795(98)10024-1","volume":"284","author":"J Kamm","year":"1998","unstructured":"Kamm J, Nagy JG (1998) Kronecker product and SVD approximations in image restoration. Linear Algebra Appl 284(1):177\u2013192. https:\/\/doi.org\/10.1109\/TPWRS.2016.25566203","journal-title":"Linear Algebra Appl"},{"issue":"4","key":"1544_CR18","doi-asserted-by":"publisher","first-page":"264","DOI":"10.3390\/e20040264","volume":"20","author":"Z Karevan","year":"2018","unstructured":"Karevan Z, Suykens JA (2018) Transductive feature selection using clustering-based sample entropy for temperature prediction in weather forecasting. Entropy 20(4):264","journal-title":"Entropy"},{"key":"1544_CR19","unstructured":"Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. CoRR abs\/1412.6980"},{"issue":"5","key":"1544_CR20","doi-asserted-by":"publisher","first-page":"604","DOI":"10.1145\/324133.324140","volume":"46","author":"JM Kleinberg","year":"1999","unstructured":"Kleinberg JM (1999) Authoritative sources in a hyperlinked environment. J ACM 46(5):604\u2013632. https:\/\/doi.org\/10.1145\/324133.324140","journal-title":"J ACM"},{"issue":"1","key":"1544_CR21","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1016\/S0004-3702(97)00043-X","volume":"97","author":"R Kohavi","year":"1997","unstructured":"Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97(1):273\u2013324. https:\/\/doi.org\/10.1109\/TPWRS.2016.25566205","journal-title":"Artif Intell"},{"key":"1544_CR22","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1016\/j.knosys.2015.02.017","volume":"82","author":"I Koprinska","year":"2015","unstructured":"Koprinska I, Rana M, Agelidis VG (2015) Correlation and instance based feature selection for electricity load forecasting. Knowl-Based Syst 82:29\u201340. https:\/\/doi.org\/10.1109\/TPWRS.2016.25566206","journal-title":"Knowl-Based Syst"},{"issue":"1","key":"1544_CR23","first-page":"1235","volume":"17","author":"X Meng","year":"2016","unstructured":"Meng X, Bradley J, Yavuz B, Sparks E, Venkataraman S, Liu D, Freeman J, Tsai D, Amde M, Owen S et al (2016) Mllib: machine learning in apache spark. J Mach Learn Res 17(1):1235\u20131241","journal-title":"J Mach Learn Res"},{"issue":"C\u2014-26","key":"1544_CR24","doi-asserted-by":"publisher","first-page":"917","DOI":"10.1109\/TC.1977.1674939","volume":"9","author":"PM Narendra","year":"1977","unstructured":"Narendra PM, Fukunaga K (1977) A branch and bound algorithm for feature subset selection. IEEE Trans Comput 9(C\u2014-26):917\u2013922. https:\/\/doi.org\/10.1109\/TC.1977.1674939","journal-title":"IEEE Trans Comput"},{"issue":"2","key":"1544_CR25","doi-asserted-by":"publisher","first-page":"616","DOI":"10.1016\/j.ijforecast.2018.12.002","volume":"35","author":"A Panagiotelis","year":"2019","unstructured":"Panagiotelis A, Athanasopoulos G, Hyndman RJ, Jiang B, Vahid F (2019) Macroeconomic forecasting for Australia using a large number of predictors. Int J Forecast 35(2):616\u2013633","journal-title":"Int J Forecast"},{"key":"1544_CR26","volume-title":"Numerical recipes in C: the art of scientific computing","author":"WH Press","year":"1988","unstructured":"Press WH, Flannery BP, Teukolsky SA, Vetterling WT (1988) Numerical recipes in C: the art of scientific computing. Cambridge University Press, Cambridge"},{"issue":"1","key":"1544_CR27","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1006\/jmva.1995.1069","volume":"55","author":"H Schneeweiss","year":"1995","unstructured":"Schneeweiss H, Mathes H (1995) Factor analysis and principal components. J Multivar Anal 55(1):105\u2013124. https:\/\/doi.org\/10.1006\/jmva.1995.1069","journal-title":"J Multivar Anal"},{"issue":"2","key":"1544_CR28","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1103\/PhysRevLett.85.461","volume":"85","author":"T Schreiber","year":"2000","unstructured":"Schreiber T (2000) Measuring information transfer. Phys Rev Lett 85(2):461\u2013464. https:\/\/doi.org\/10.1103\/PhysRevLett.85.461","journal-title":"Phys Rev Lett"},{"issue":"4","key":"1544_CR29","doi-asserted-by":"publisher","first-page":"904","DOI":"10.1109\/TASSP.1981.1163632","volume":"29","author":"Y Shim","year":"1981","unstructured":"Shim Y, Cho Z (1981) SVD pseudoinversion image reconstruction. IEEE Trans Acoust Speech Signal Process 29(4):904\u2013909. https:\/\/doi.org\/10.1109\/TPWRS.2016.25566209","journal-title":"IEEE Trans Acoust Speech Signal Process"},{"key":"1544_CR30","doi-asserted-by":"publisher","first-page":"515","DOI":"10.1016\/S1574-0706(05)01010-4","volume-title":"Handbook of economic forecasting","author":"JH Stock","year":"2006","unstructured":"Stock JH, Watson MW (2006) Chapter 10 forecasting with many predictors. In: Elliott CWJG, Timmermann A (eds) Handbook of economic forecasting, vol 1. Elsevier, Amsterdam, pp 515\u2013554"},{"issue":"4","key":"1544_CR31","doi-asserted-by":"publisher","first-page":"481","DOI":"10.1080\/07350015.2012.715956","volume":"30","author":"JH Stock","year":"2012","unstructured":"Stock JH, Watson MW (2012) Generalized shrinkage methods for forecasting using many predictors. J Bus Econ Stat 30(4):481\u2013493. https:\/\/doi.org\/10.1080\/07350015.2012.715956","journal-title":"J Bus Econ Stat"},{"key":"1544_CR32","doi-asserted-by":"crossref","unstructured":"Tan SC, San\u00a0Lau JP (2014) Time series clustering: a superior alternative for market basket analysis. In: Proceedings of the first international conference on advanced data and information engineering (DaEng-2013), Springer, Singapore, pp 241\u2013248","DOI":"10.1007\/978-981-4585-18-7_28"},{"key":"1544_CR33","doi-asserted-by":"publisher","unstructured":"Turchenko V, Grandinetti L, Sachenko A (2012) Parallel batch pattern training of neural networks on computational clusters. In: 2012 International conference on high performance computing simulation (HPCS), pp 202\u2013208. https:\/\/doi.org\/10.1109\/HPCSim.2012.6266912","DOI":"10.1109\/HPCSim.2012.6266912"},{"key":"1544_CR34","doi-asserted-by":"publisher","unstructured":"Wang K, Su MYT (2002) Item selection by \u201cHub-authority\u201d profit ranking. In: Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining, ACM, New York, NY, USA, KDD \u201902, pp 652\u2013657. https:\/\/doi.org\/10.1145\/775047.775144","DOI":"10.1145\/775047.775144"},{"key":"1544_CR35","unstructured":"Yu L, Liu H (2003) Feature selection for high-dimensional data: a fast correlation-based filter solution. In: Fawcett T, Mishra N (eds) Proceedings, twentieth international conference on machine learning, vol 2, pp 856\u2013863"},{"issue":"4","key":"1544_CR36","doi-asserted-by":"publisher","first-page":"2309","DOI":"10.3390\/e16042309","volume":"16","author":"A Zaremba","year":"2014","unstructured":"Zaremba A, Aste T (2014) Measures of causality in complex datasets with application to financial data. Entropy 16(4):2309\u20132349. https:\/\/doi.org\/10.3390\/e16042309","journal-title":"Entropy"},{"key":"1544_CR37","doi-asserted-by":"publisher","first-page":"126","DOI":"10.1016\/j.eswa.2016.09.027","volume":"67","author":"X Zhong","year":"2017","unstructured":"Zhong X, Enke D (2017) Forecasting daily stock market return using dimensionality reduction. Expert Syst Appl 67:126\u2013139. https:\/\/doi.org\/10.1109\/TAC.1974.11007052","journal-title":"Expert Syst Appl"}],"container-title":["Knowledge and Information Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-021-01544-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10115-021-01544-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-021-01544-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,4,5]],"date-time":"2021-04-05T12:04:34Z","timestamp":1617624274000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10115-021-01544-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,5]]},"references-count":37,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,5]]}},"alternative-id":["1544"],"URL":"https:\/\/doi.org\/10.1007\/s10115-021-01544-w","relation":{},"ISSN":["0219-1377","0219-3116"],"issn-type":[{"value":"0219-1377","type":"print"},{"value":"0219-3116","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,2,5]]},"assertion":[{"value":"2 November 2018","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 May 2020","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 December 2020","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 February 2021","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}