{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T08:05:02Z","timestamp":1772093102355,"version":"3.50.1"},"reference-count":50,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,4,14]],"date-time":"2022-04-14T00:00:00Z","timestamp":1649894400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,4,14]],"date-time":"2022-04-14T00:00:00Z","timestamp":1649894400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001778","name":"Deakin University","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001778","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["World Wide Web"],"published-print":{"date-parts":[[2023,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In recent years, artificial intelligence technologies have been successfully applied in time series prediction and analytic tasks. At the same time, a lot of attention has been paid to financial time series prediction, which targets the development of novel deep learning models or optimize the forecasting results. To optimize the accuracy of stock price prediction, in this paper, we propose a clustering-enhanced deep learning framework to predict stock prices with three matured deep learning forecasting models, such as Long Short-Term Memory (LSTM), Recurrent Neural Network (RNN) and Gated Recurrent Unit (GRU). The proposed framework considers the clustering as the forecasting pre-processing, which can improve the quality of the training models. 
To achieve the effective clustering, we propose a new similarity measure, called Logistic Weighted Dynamic Time Warping (LWDTW), by extending a Weighted Dynamic Time Warping (WDTW) method to capture the relative importance of return observations when calculating distance matrices. Especially, based on the empirical distributions of stock returns, the cost weight function of WDTW is modified with logistic probability density distribution function. In addition, we further implement the clustering-based forecasting framework with the above three deep learning models. Finally, extensive experiments on daily US stock price data sets show that our framework has achieved excellent forecasting performance with overall best results for the combination of Logistic WDTW clustering and LSTM model using 5 different evaluation metrics.<\/jats:p>","DOI":"10.1007\/s11280-021-01003-0","type":"journal-article","created":{"date-parts":[[2022,4,14]],"date-time":"2022-04-14T17:02:41Z","timestamp":1649955761000},"page":"207-232","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":54,"title":["Clustering-enhanced stock price prediction using deep learning"],"prefix":"10.1007","volume":"26","author":[{"given":"Man","family":"Li","sequence":"first","affiliation":[]},{"given":"Ye","family":"Zhu","sequence":"additional","affiliation":[]},{"given":"Yuxin","family":"Shen","sequence":"additional","affiliation":[]},{"given":"Maia","family":"Angelova","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,4,14]]},"reference":[{"key":"1003_CR1","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1016\/j.is.2015.04.007","volume":"53","author":"S Aghabozorgi","year":"2015","unstructured":"Aghabozorgi, S., Shirkhorshidi, A.S., Wah, T.Y.: Time-series clustering\u2013a decade review. Inf. Syst. 53, 16\u201338 (2015)","journal-title":"Inf. 
Syst."},{"key":"1003_CR2","doi-asserted-by":"crossref","unstructured":"Agrawal, R., Faloutsos, C., Swami, A.: Efficient similarity search in sequence databases. In: International conference on foundations of data organization and algorithms, pp 69\u201384. Springer (1993)","DOI":"10.1007\/3-540-57301-1_5"},{"issue":"2","key":"1003_CR3","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1109\/72.279181","volume":"5","author":"Y Bengio","year":"1994","unstructured":"Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 5(2), 157\u2013166 (1994)","journal-title":"IEEE Trans. Neural Netw."},{"key":"1003_CR4","doi-asserted-by":"crossref","unstructured":"Chen, J., Zhong, M., Li, J., Wang, D., Qian, T., Tu, H.: Effective deep attributed network representation learning with topology adapted smoothing. IEEE Transactions on Cybernetics (2021)","DOI":"10.1109\/TCYB.2021.3064092"},{"key":"1003_CR5","doi-asserted-by":"crossref","unstructured":"Chen, L., Ng, R.: On the marriage of lp-norms and edit distance. In: Proceedings of the Thirtieth international conference on Very large data bases, vol. 30, pp 792\u2013803 (2004)","DOI":"10.1016\/B978-012088469-8.50070-X"},{"key":"1003_CR6","doi-asserted-by":"crossref","unstructured":"Chu, S., Keogh, E., Hart, D., Pazzani, M.: Iterative Deepening Dynamic Time Warping for Time Series. In: Proceedings of the 2002 SIAM International Conference on Data Mining, pp 195\u2013212. SIAM (2002)","DOI":"10.1137\/1.9781611972726.12"},{"key":"1003_CR7","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1109\/TPAMI.1979.4766909","volume":"2","author":"DL Davies","year":"1979","unstructured":"Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Machine Intell. 2, 224\u2013227 (1979)","journal-title":"IEEE Trans. Pattern Anal. 
Machine Intell."},{"key":"1003_CR8","unstructured":"Dezsi, E., Nistor, I.A.: Can deep machine learning outsmart the market? a comparison between econometric modelling and long-short term memory. Romanian Economic and Business Review (2016)"},{"key":"1003_CR9","unstructured":"Dong, X., Qian, L., Huang, L.: Short-term load forecasting in smart grid: A combined cnn and k-means clustering approach. In: 2017 IEEE International Conference on Big Data and Smart Computing (BigComp), pp 119\u2013125. IEEE (2017)"},{"issue":"3","key":"1003_CR10","doi-asserted-by":"publisher","first-page":"32","DOI":"10.1080\/01969727308546046","volume":"3","author":"JC Dunn","year":"1973","unstructured":"Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3(3), 32\u201357 (1973)","journal-title":"J. Cybern."},{"key":"1003_CR11","first-page":"200","volume":"14","author":"S Dupond","year":"2019","unstructured":"Dupond, S.: A thorough review on the current advance of neural network structures. Annu. Rev. Control. 14, 200\u2013230 (2019)","journal-title":"Annu. Rev. Control."},{"key":"1003_CR12","doi-asserted-by":"crossref","unstructured":"Eom, C., Kaizoji, T., Scalas, E.: Fat tails in financial return distributions revisited: Evidence from the korean stock market, vol. 526, p 121055 (2019)","DOI":"10.1016\/j.physa.2019.121055"},{"key":"1003_CR13","unstructured":"Ester, M., Kriegel, H.-P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, vol. 96, pp 226\u2013231 (1996)"},{"issue":"2","key":"1003_CR14","doi-asserted-by":"publisher","first-page":"419","DOI":"10.1145\/191843.191925","volume":"23","author":"C Faloutsos","year":"1994","unstructured":"Faloutsos, C., Ranganathan, M., Manolopoulos, Y.: Fast subsequence matching in time-series databases. 
Acm Sigmod Record 23(2), 419\u2013429 (1994)","journal-title":"Acm Sigmod Record"},{"issue":"2","key":"1003_CR15","doi-asserted-by":"publisher","first-page":"654","DOI":"10.1016\/j.ejor.2017.11.054","volume":"270","author":"T Fischer","year":"2018","unstructured":"Fischer, T., Krauss, C.: Deep learning with long short-term memory networks for financial market predictions. Eur. J. Oper. Res. 270(2), 654\u2013669 (2018)","journal-title":"Eur. J. Oper. Res."},{"issue":"10","key":"1003_CR16","doi-asserted-by":"publisher","first-page":"2451","DOI":"10.1162\/089976600300015015","volume":"12","author":"FA Gers","year":"2000","unstructured":"Gers, F.A., Schmidhuber, J., Cummins, F.: Learning to forget: Continual prediction with LSTM. Neural computation 12(10), 2451\u20132471 (2000)","journal-title":"Neural computation"},{"key":"1003_CR17","doi-asserted-by":"crossref","unstructured":"Graves, A., Jaitly, N., Mohamed, A.-R.: Hybrid speech recognition with deep bidirectional lstm. In: 2013 IEEE workshop on automatic speech recognition and understanding, pp 273\u2013278. IEEE (2013)","DOI":"10.1109\/ASRU.2013.6707742"},{"issue":"10","key":"1003_CR18","doi-asserted-by":"publisher","first-page":"2222","DOI":"10.1109\/TNNLS.2016.2582924","volume":"28","author":"K Greff","year":"2016","unstructured":"Greff, K., Srivastava, R.K., Koutn\u00edk, J., Steunebrink, B.R., Schmidhuber, J.: Lstm: a search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. 28(10), 2222\u20132232 (2016)","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"1003_CR19","doi-asserted-by":"publisher","first-page":"1351","DOI":"10.1016\/j.procs.2018.05.050","volume":"132","author":"M Hiransha","year":"2018","unstructured":"Hiransha, M., Gopalakrishnan, E.A., Menon, V.K., Soman, K.: Nse stock market prediction using deep-learning models. Procedia Comput. Sci. 132, 1351\u20131362 (2018)","journal-title":"Procedia Comput. 
Sci."},{"issue":"8","key":"1003_CR20","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735\u20131780 (1997)","journal-title":"Neural Comput."},{"issue":"1","key":"1003_CR21","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1109\/TASSP.1975.1162641","volume":"23","author":"F Itakura","year":"1975","unstructured":"Itakura, F.: Minimum prediction residual principle applied to speech recognition. IEEE Trans. Acoustics Speech Signal Process. 23(1), 67\u201372 (1975)","journal-title":"IEEE Trans. Acoustics Speech Signal Process."},{"issue":"9","key":"1003_CR22","doi-asserted-by":"publisher","first-page":"2231","DOI":"10.1016\/j.patcog.2010.09.022","volume":"44","author":"Y-S Jeong","year":"2011","unstructured":"Jeong, Y.-S., Jeong, M.K., Omitaomu, O.A.: Weighted dynamic time warping for time series classification. Pattern Recognit. 44(9), 2231\u20132240 (2011)","journal-title":"Pattern Recognit."},{"issue":"8","key":"1003_CR23","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1109\/2.781637","volume":"32","author":"G Karypis","year":"1999","unstructured":"Karypis, G., Han, E.-H., Kumar, V.: Chameleon: Hierarchical clustering using dynamic modeling. Computer 32(8), 68\u201375 (1999)","journal-title":"Computer"},{"key":"1003_CR24","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316801","volume-title":"Finding groups in data: an introduction to cluster analysis","author":"L Kaufman","year":"1990","unstructured":"Kaufman, L., Rousseeuw, P.J.: Finding groups in data: an introduction to cluster analysis. 
Wiley, Hoboken (1990)"},{"issue":"1","key":"1003_CR25","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1007\/s10618-006-0049-3","volume":"14","author":"E Keogh","year":"2007","unstructured":"Keogh, E., Lonardi, S., Ratanamahatana, C.A., Wei, L., Lee, S.-H., Handley, J.: Compression-based data mining of sequential data. Data Min. Knowl. Disc. 14(1), 99\u2013129 (2007)","journal-title":"Data Min. Knowl. Disc."},{"issue":"2","key":"1003_CR26","doi-asserted-by":"publisher","first-page":"689","DOI":"10.1016\/j.ejor.2016.10.031","volume":"259","author":"C Krauss","year":"2017","unstructured":"Krauss, C., Do, X.A., Huck, N.: Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the s&p 500. Europ. J.Oper. Res. 259(2), 689\u2013702 (2017)","journal-title":"Europ. J.Oper. Res."},{"issue":"10","key":"1003_CR27","doi-asserted-by":"publisher","first-page":"8040","DOI":"10.1007\/s11227-018-2577-1","volume":"76","author":"SI Lee","year":"2020","unstructured":"Lee, S.I., Yoo, S.J.: Threshold-based portfolio: the role of the threshold and its applications. J. Supercomput. 76(10), 8040\u20138057 (2020)","journal-title":"J. Supercomput."},{"key":"1003_CR28","doi-asserted-by":"publisher","first-page":"106618","DOI":"10.1016\/j.knosys.2020.106618","volume":"212","author":"Z Li","year":"2021","unstructured":"Li, Z., Wang, X., Li, J., Zhang, Q.: Deep attributed network representation learning of complex coupling and interaction. Knowledge-Based Systems 212, 106618 (2021)","journal-title":"Knowledge-Based Systems"},{"key":"1003_CR29","unstructured":"MacQueen, J., et al.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol. 1, pp 281\u2013297. 
Oakland, USA (1967)"},{"issue":"5","key":"1003_CR30","doi-asserted-by":"publisher","first-page":"1003","DOI":"10.1109\/TKDE.2002.1033770","volume":"14","author":"RT Ng","year":"2002","unstructured":"Ng, R.T., Han, J.: Clarans: a method for clustering objects for spatial data mining. IEEE Trans. Knowl. Data Eng. 14(5), 1003\u20131016 (2002)","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"1003_CR31","doi-asserted-by":"publisher","first-page":"506","DOI":"10.1016\/j.patcog.2018.12.007","volume":"88","author":"I Oregi","year":"2019","unstructured":"Oregi, I., P\u00e9rez, A., Del Ser, J., Lozano, J.A.: On-line elastic similarity measures for time series. Pattern Recogn. 88, 506\u2013517 (2019)","journal-title":"Pattern Recogn."},{"key":"1003_CR32","doi-asserted-by":"publisher","first-page":"106384","DOI":"10.1016\/j.asoc.2020.106384","volume":"93","author":"AM Ozbayoglu","year":"2020","unstructured":"Ozbayoglu, A.M., Gudelek, M.U., Sezer, O.B.: Deep learning for financial applications: A survey. Appl. Soft Comput. 93, 106384 (2020)","journal-title":"Appl. Soft Comput."},{"issue":"1","key":"1003_CR33","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1089\/big.2013.1508","volume":"1","author":"F Provost","year":"2013","unstructured":"Provost, F., Fawcett, T.: Data science and its relationship to big data and data-driven decision making. Big data 1(1), 51\u201359 (2013)","journal-title":"Big data"},{"key":"1003_CR34","unstructured":"Sakoe, H.: Dynamic-Programming Approach to Continuous Speech Recognition. In: 1971 Proc. the International Congress of Acoustics, Budapest (1971)"},{"issue":"1","key":"1003_CR35","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1109\/TASSP.1978.1163055","volume":"26","author":"H Sakoe","year":"1978","unstructured":"Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. 26(1), 43\u201349 (1978)","journal-title":"IEEE Trans. Acoust. 
Speech Signal Process."},{"key":"1003_CR36","doi-asserted-by":"crossref","unstructured":"Samarawickrama, A., Fernando, T.: A recurrent neural network approach in predicting daily stock prices an application to the sri lankan stock market. In: 2017 IEEE International Conference on Industrial and Information Systems (ICIIS), pp 1\u20136. IEEE (2017)","DOI":"10.1109\/ICIINFS.2017.8300345"},{"key":"1003_CR37","unstructured":"Senin, P.: Dynamic time warping algorithm review. In: Information and computer science department university of hawaii at manoa honolulu USA 855, 1-23, 40 (2008)"},{"key":"1003_CR38","doi-asserted-by":"publisher","first-page":"106181","DOI":"10.1016\/j.asoc.2020.106181","volume":"90","author":"OB Sezer","year":"2020","unstructured":"Sezer, O.B., Gudelek, M.U., Ozbayoglu, A.M.: Financial time series forecasting with deep learning: A systematic literature review: 2005\u20132019. Appl. Soft Comput. 90, 106181 (2020)","journal-title":"Appl. Soft Comput."},{"key":"1003_CR39","unstructured":"Smyth, P.: Clustering sequences with hidden markov models. In: Advances in neural information processing systems, pp 648\u2013654 (1997)"},{"key":"1003_CR40","doi-asserted-by":"publisher","first-page":"510","DOI":"10.1016\/j.ins.2021.08.100","volume":"580","author":"X Song","year":"2021","unstructured":"Song, X., Li, J., Tang, Y., Zhao, T., Chen, Y., Guan, Z.: Jkt: a joint graph convolutional network based deep knowledge tracing. Inform. Sci. 580, 510\u2013523 (2021)","journal-title":"Inform. Sci."},{"issue":"2","key":"1003_CR41","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1016\/j.fcij.2018.10.003","volume":"3","author":"A Tealab","year":"2018","unstructured":"Tealab, A.: Time series forecasting using artificial neural networks methodologies a systematic review. Future Comput. Inform. J. 3(2), 334\u2013340 (2018)","journal-title":"Future Comput. Inform. 
J."},{"key":"1003_CR42","unstructured":"Vlachos, M., Kollios, G., Gunopulos, D.: Discovering similar multidimensional trajectories. In: Proceedings 18th international conference on data engineering, pp 673\u2013684. IEEE (2002)"},{"issue":"1","key":"1003_CR43","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1016\/j.dsm.2021.01.001","volume":"1","author":"S Wang","year":"2021","unstructured":"Wang, S.: An interview with shouyang wang: research frontier of big data-driven economic and financial forecasting. Data Science and Management 1(1), 10\u201312 (2021)","journal-title":"Data Science and Management"},{"key":"1003_CR44","unstructured":"Wang, W., Yang, J., Muntz, R., et al.: Sting: A statistical information grid approach to spatial data mining. In: VLDB, vol. 97, pp 186\u2013195 (1997)"},{"issue":"2","key":"1003_CR45","doi-asserted-by":"publisher","first-page":"275","DOI":"10.1007\/s10618-012-0250-5","volume":"26","author":"X Wang","year":"2013","unstructured":"Wang, X., Mueen, A., Ding, H., Trajcevski, G., Scheuermann, P., Keogh, E.: Experimental comparison of representation methods and distance measures for time series data. Data Min. Knowl. Disc. 26(2), 275\u2013309 (2013)","journal-title":"Data Min. Knowl. Disc."},{"key":"1003_CR46","doi-asserted-by":"crossref","unstructured":"Wen, T.-H., Gasic, M., Mrksic, N., Su, P.-H., Vandyke, D., Young, S.: Semantically conditioned lstm-based natural language generation for spoken dialogue systems. arXiv:1508.01745 (2015)","DOI":"10.18653\/v1\/D15-1199"},{"key":"1003_CR47","doi-asserted-by":"crossref","unstructured":"Xue, G., Zhong, M., Li, J., Chen, J., Zhai, C., Kong, R.: Dynamic network embedding survey. arXiv:2103.15447 (2021)","DOI":"10.1016\/j.neucom.2021.03.138"},{"key":"1003_CR48","doi-asserted-by":"crossref","unstructured":"Zell, A., Mache, N., H\u00fcbner, R., Mamier, G., Vogt, M., Schmalzl, M., Herrmann, K.-U.: Snns (stuttgart neural network simulator). 
In: Neural Network Simulation Environments, pp 165\u2013186. Springer (1994)","DOI":"10.1007\/978-1-4615-2736-7_9"},{"key":"1003_CR49","doi-asserted-by":"publisher","first-page":"147653","DOI":"10.1109\/ACCESS.2019.2941987","volume":"7","author":"J Zhang","year":"2019","unstructured":"Zhang, J., Chen, F., Shen, Q.: Cluster-based lstm network for short-term passenger flow forecasting in urban rail transit. IEEE Access 7, 147653\u2013147671 (2019)","journal-title":"IEEE Access"},{"key":"1003_CR50","doi-asserted-by":"publisher","first-page":"165279","DOI":"10.1109\/ACCESS.2019.2952555","volume":"7","author":"B Zhou","year":"2019","unstructured":"Zhou, B., Ma, X., Luo, Y., Yang, D.: Wind power prediction based on lstm networks and nonparametric kernel density estimation. IEEE Access 7, 165279\u2013165292 (2019)","journal-title":"IEEE Access"}],"container-title":["World Wide Web"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11280-021-01003-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11280-021-01003-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11280-021-01003-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,20]],"date-time":"2023-01-20T22:08:04Z","timestamp":1674252484000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11280-021-01003-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,14]]},"references-count":50,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,1]]}},"alternative-id":["1003"],"URL":"https:\/\/doi.org\/10.1007\/s11280-021-01003-0","relation":{},"ISSN":["1386-145X","1573-1413"],"issn-type":[{"value":"
1386-145X","type":"print"},{"value":"1573-1413","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,4,14]]},"assertion":[{"value":"14 October 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 December 2021","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 December 2021","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 April 2022","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"We declare that we have no known competing financial interests or personal relationships that are directly or indirectly related to the work. This is an original research work and hasn\u2019t been submitted to any other journals or conferences.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of Interests"}}]}}