{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,29]],"date-time":"2026-03-29T17:12:44Z","timestamp":1774804364189,"version":"3.50.1"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,9,15]],"date-time":"2020-09-15T00:00:00Z","timestamp":1600128000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,9,15]],"date-time":"2020-09-15T00:00:00Z","timestamp":1600128000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In the case of real-world data streams, the underlying data distribution will not be static; it is subject to variation over time, which is known as the primary reason for concept drift. Concept drift poses severe problems to the accuracy of a model in online learning scenarios. The recurring concept is a particular case of concept drift where the concepts already seen in the past reappear as the stream evolves. This problem is not yet studied in the context of stream clustering. This paper proposes a novel algorithm for identifying the recurring concepts in data stream clustering. During concept recurrence, the most matching model is retrieved from the repository and reused. The algorithm has minimum memory requirements and works online with the stream. Some of the concepts and definitions, already familiar in concept recurrence studies of stream classification have been redefined for clustering. The experiments conducted on real and synthetic data streams reveal that the proposed algorithm has the potential to identify recurring concepts.<\/jats:p>","DOI":"10.1186\/s40537-020-00354-1","type":"journal-article","created":{"date-parts":[[2020,9,15]],"date-time":"2020-09-15T13:03:46Z","timestamp":1600175026000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Learning in the presence of concept recurrence in data stream clustering"],"prefix":"10.1186","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6091-6991","authenticated-orcid":false,"given":"K.","family":"Namitha","sequence":"first","affiliation":[]},{"given":"G.","family":"Santhosh Kumar","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,9,15]]},"reference":[{"key":"354_CR1","doi-asserted-by":"publisher","first-page":"173","DOI":"10.1145\/2133803.2184450","volume":"17","author":"MR Ackermann","year":"2012","unstructured":"Ackermann MR, Lammersen C, Sohler C, Swierkot K, Raupach C. StreamKM++: a clustering algorithm for data streams. J Exp Algorithmics. 2012;17:173\u201387. https:\/\/doi.org\/10.1145\/2133803.2184450.","journal-title":"J Exp Algorithmics"},{"key":"354_CR2","doi-asserted-by":"publisher","unstructured":"Aggarwal CC, Watson TJ, Ctr R, Han J, Wang J, Yu PS. A framework for clustering evolving data streams. In: Proceedings of the 29th international conference on very large data bases. 2003. pp. 81\u201392. https:\/\/doi.org\/10.1016\/B978-012722442-8\/50016-1","DOI":"10.1016\/B978-012722442-8\/50016-1"},{"key":"354_CR3","doi-asserted-by":"publisher","first-page":"228","DOI":"10.1016\/j.eswa.2016.09.020","volume":"67","author":"J De Andrade Silva","year":"2017","unstructured":"De Andrade Silva J, Hruschka ER, Gama J. An evolutionary algorithm for clustering data streams with a variable number of clusters. Expert Syst Appl. 2017;67:228\u201338. https:\/\/doi.org\/10.1016\/j.eswa.2016.09.020.","journal-title":"Expert Syst Appl"},{"issue":"2","key":"354_CR4","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1145\/507515.507519","volume":"3","author":"D Barbar\u00e1","year":"2002","unstructured":"Barbar\u00e1 D. Requirements for clustering data streams. ACM SIGKDD Explor Newsl. 2002;3(2):23. https:\/\/doi.org\/10.1145\/507515.507519.","journal-title":"ACM SIGKDD Explor Newsl"},{"key":"354_CR5","doi-asserted-by":"publisher","unstructured":"Cao F, Ester M, Qian W, Zhou A. Density-based clustering over an evolving data stream with noise. In: SDM. 2006. pp. 326\u201337. https:\/\/doi.org\/10.1145\/1552303.1552307.","DOI":"10.1145\/1552303.1552307"},{"key":"354_CR6","doi-asserted-by":"publisher","unstructured":"Chen Y, Tu L. Density-based clustering for real-time stream data. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining\u2014KDD \u201907. 2007;133. https:\/\/doi.org\/10.1145\/1281192.1281210","DOI":"10.1145\/1281192.1281210"},{"key":"354_CR7","doi-asserted-by":"publisher","unstructured":"De Andrade Silva J, Hruschka ER. Extending k-means-based algorithms for evolving data streams with variable number of clusters. In: Proceedings\u201410th international conference on machine learning and applications, ICMLA 2011. 2011;2:14\u201319. https:\/\/doi.org\/10.1109\/ICMLA.2011.67.","DOI":"10.1109\/ICMLA.2011.67"},{"key":"354_CR8","doi-asserted-by":"publisher","unstructured":"Ditzler G, Polikar R. Hellinger distance based drift detection for nonstationary environments. In: IEEE symposium on computational intelligence in dynamic and uncertain environments. 2011. pp. 41\u20138. https:\/\/doi.org\/10.1109\/CIDUE.2011.5948491.","DOI":"10.1109\/CIDUE.2011.5948491"},{"key":"354_CR9","doi-asserted-by":"crossref","unstructured":"Faria ER, Uberl\u00e2ndia SC, Barros RC, Carlos S, Carvalho ACPLF, Carlos S. Improving the offline clustering stage of data stream algorithms in scenarios with variable number of clusters. 2012;829\u201330.","DOI":"10.1145\/2245276.2245437"},{"issue":"44","key":"354_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2523813","volume":"46","author":"JA Gama","year":"2014","unstructured":"Gama JA. A survey on concept drift adaptation. ACM Comput Surv. 2014;46(44):1\u201337. https:\/\/doi.org\/10.1145\/2523813.","journal-title":"ACM Comput Surv"},{"key":"354_CR11","doi-asserted-by":"publisher","unstructured":"Gama J, Kosina P. Learning about the learning process. In: Proc. of the 10th int. conf. on advances in intelligent data analysis. 2011. pp. 162\u201372. https:\/\/doi.org\/10.1007\/978-3-642-24800-9_17","DOI":"10.1007\/978-3-642-24800-9_17"},{"key":"354_CR12","doi-asserted-by":"publisher","unstructured":"Gomes JB, Menasalvas E, Sousa PAC. Learning recurring concepts from data streams with a context-aware ensemble. In: Proceedings of the ACM symposium on applied computing (SAC\u201911). 2011. pp. 994\u20139. https:\/\/doi.org\/10.1145\/1982185.1982403.","DOI":"10.1145\/1982185.1982403"},{"key":"354_CR13","doi-asserted-by":"publisher","first-page":"803","DOI":"10.3233\/IDA-2012-0552","volume":"16","author":"JB Gomes","year":"2012","unstructured":"Gomes JB, Sousa PAC, Menasalvas E. Tracking recurrent concepts using context. Intell Data Anal. 2012;16:803\u201325. https:\/\/doi.org\/10.3233\/IDA-2012-0552.","journal-title":"Intell Data Anal"},{"key":"354_CR14","doi-asserted-by":"publisher","unstructured":"Guha S, Mishra N, Motwani R, O'Callaghan L. Clustering data streams. In: Proceedings 41st annual symposium on foundations of computer science, Redondo Beach, CA, USA, 2000, pp. 359\u201366. https:\/\/doi.org\/10.1109\/SFCS.2000.892124.","DOI":"10.1109\/SFCS.2000.892124"},{"issue":"Section III","key":"354_CR15","doi-asserted-by":"publisher","first-page":"1532","DOI":"10.1109\/ACCESS.2018.2886026","volume":"7","author":"AS Iwashita","year":"2019","unstructured":"Iwashita AS. An overview on concept drift learning. IEEE Access. 2019;7(Section III):1532\u201347. https:\/\/doi.org\/10.1109\/ACCESS.2018.2886026.","journal-title":"IEEE Access"},{"issue":"3","key":"354_CR16","doi-asserted-by":"publisher","first-page":"371","DOI":"10.1007\/s10115-009-0206-2","volume":"22","author":"I Katakis","year":"2010","unstructured":"Katakis I, Tsoumakas G, Vlahavas I. Tracking recurring contexts using ensemble classifiers: an application to email filtering. Knowl Inf Syst. 2010;22(3):371\u201391. https:\/\/doi.org\/10.1007\/s10115-009-0206-2.","journal-title":"Knowl Inf Syst"},{"key":"354_CR17","doi-asserted-by":"crossref","unstructured":"Kifer D, Ben-david S, Gehrke J. Detecting change in data streams. In: Proceedings of the 30th VLDB conference. 2004. pp. 180\u201391.","DOI":"10.1016\/B978-012088469-8.50019-X"},{"key":"354_CR18","unstructured":"Kubat M. Second tier for decision trees. In: ICML'96: proceedings of the thirteenth international conference on international conference on machine learning. 1996. pp. 293\u2013301."},{"key":"354_CR19","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1016\/j.neucom.2015.04.024","volume":"166","author":"P Li","year":"2015","unstructured":"Li P, Wu X, Hu X, Wang H. Learning concept-drifting data streams with random ensemble decision trees. Neurocomputing. 2015;166:68\u201383. https:\/\/doi.org\/10.1016\/j.neucom.2015.04.024.","journal-title":"Neurocomputing"},{"issue":"6","key":"354_CR20","doi-asserted-by":"publisher","first-page":"859","DOI":"10.1109\/TKDE.2010.61","volume":"23","author":"M Masud","year":"2011","unstructured":"Masud M, Gao J, Khan L, Han J, Thuraisingham BM. Classification and novel class detection in concept-drifting data streams under time constraints. IEEE Trans Knowl Data Eng. 2011;23(6):859\u201374. https:\/\/doi.org\/10.1109\/TKDE.2010.61.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"354_CR21","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1016\/j.eswa.2015.10.022","volume":"46","author":"A Miguel \u00c1ngel","year":"2016","unstructured":"Miguel \u00c1ngel A, Jo\u00e3o B\u00e1rtolo G, Ernestina M. Predicting recurring concepts on data-streams by means of a meta-model and a fuzzy similarity function. Expert Syst Appl. 2016;46:87\u2013105. https:\/\/doi.org\/10.1016\/j.eswa.2015.10.022.","journal-title":"Expert Syst Appl"},{"key":"354_CR22","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/J.NEUNET.2017.10.004","volume":"98","author":"S Mohamad","year":"2017","unstructured":"Mohamad S, Sayed-mouchaweh M, Bouchachia A. Active learning for classifying data streams with unknown number of classes. Neural Netw. 2017;98:1\u201315. https:\/\/doi.org\/10.1016\/J.NEUNET.2017.10.004.","journal-title":"Neural Netw"},{"key":"354_CR23","doi-asserted-by":"publisher","unstructured":"Naldi MC, Fontana A, Campello RJGB. Comparison among methods for k estimation in k-means. In: ISDA 2009\u20149th international conference on intelligent systems design and applications. 2009. pp. 1006\u201313. https:\/\/doi.org\/10.1109\/ISDA.2009.78.","DOI":"10.1109\/ISDA.2009.78"},{"key":"354_CR24","doi-asserted-by":"publisher","first-page":"67","DOI":"10.4018\/IJAEIS.2020010104","volume":"11","author":"K Namitha","year":"2020","unstructured":"Namitha K, Santhosh G. Concept drift detection in data stream clustering and its application on weather data. Int J Agric Environ Inf Syst. 2020;11:67\u201385. https:\/\/doi.org\/10.4018\/IJAEIS.2020010104.","journal-title":"Int J Agric Environ Inf Syst"},{"key":"354_CR25","unstructured":"Paulo B. Learning recurring concepts from data streams in ubiquitous environments. Philosophy. 2011; September."},{"issue":"5","key":"354_CR26","doi-asserted-by":"publisher","first-page":"615","DOI":"10.1109\/TKDE.2007.190727","volume":"20","author":"PP Rodrigues","year":"2008","unstructured":"Rodrigues PP, Gama J, Pedroso JP. ODAC: hierarchical clustering of time series data streams. IEEE Trans Knowl Data Eng. 2008;20(5):615\u201327. https:\/\/doi.org\/10.1109\/TKDE.2007.190727.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"354_CR27","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","volume":"20","author":"PJ Rousseeuw","year":"1987","unstructured":"Rousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987;20:53\u201365. https:\/\/doi.org\/10.1016\/0377-0427(87)90125-7.","journal-title":"J Comput Appl Math"},{"key":"354_CR28","doi-asserted-by":"publisher","unstructured":"Sakamoto Y, Fukui KI, Gama J, Nicklas D, Moriyama K, Numao M. Concept drift detection with clustering via statistical change detection methods. In: Proceedings\u20142015 IEEE international conference on knowledge and systems engineering, KSE. 2015. pp. 37\u201342. https:\/\/doi.org\/10.1109\/KSE.2015.19.","DOI":"10.1109\/KSE.2015.19"},{"key":"354_CR29","unstructured":"Serakiotou N. Change detection. Stanford Exploration Project. 1987."},{"key":"354_CR30","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1016\/j.eswa.2017.04.008","volume":"82","author":"TS Sethi","year":"2017","unstructured":"Sethi TS, Kantardzic M. On the reliable detection of concept drift from streaming unlabeled data. Expert Syst Appl. 2017;82:77\u201399. https:\/\/doi.org\/10.1016\/j.eswa.2017.04.008.","journal-title":"Expert Syst Appl"},{"issue":"1","key":"354_CR31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2522968.2522981","volume":"46","author":"JA Silva","year":"2013","unstructured":"Silva JA, Faria ER, Barros RC, Hruschka ER, De Carvalho ACPLF, Gama J. Data stream clustering. ACM Comput Surv. 2013;46(1):1\u201331. https:\/\/doi.org\/10.1145\/2522968.2522981.","journal-title":"ACM Comput Surv"},{"key":"354_CR32","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1016\/j.ins.2016.03.043","volume":"357","author":"G Song","year":"2016","unstructured":"Song G, Ye Y, Zhang H, Xu X, Lau R, Liu F. Dynamic clustering forest: an ensemble framework to efficiently classify textual data stream with concept drift. Inf Sci. 2016;357:125\u201343. https:\/\/doi.org\/10.1016\/j.ins.2016.03.043.","journal-title":"Inf Sci"},{"key":"354_CR33","unstructured":"szrlee DJIA 30 Stock Time Series. 2018. www.kaggle.com\/szrlee\/stock-time-series-20050101-to-20171231. Accessed 03 Oct 2019."},{"key":"354_CR34","doi-asserted-by":"publisher","unstructured":"Vendramin L, Jaskowiak PA, Campello RJGB. On the combination of relative clustering validity criteria. In: Proceedings of the 25th international conference on scientific and statistical database management\u2014SSDBM. 2013;1. https:\/\/doi.org\/10.1145\/2484838.2484844","DOI":"10.1145\/2484838.2484844"},{"issue":"4","key":"354_CR35","doi-asserted-by":"publisher","first-page":"964","DOI":"10.1007\/s10618-015-0448-4","volume":"30","author":"GI Webb","year":"2016","unstructured":"Webb GI, Hyde R, Cao H, Nguyen HL, Petitjean F. Characterizing concept drift. Data Min Knowl Discov. 2016;30(4):964\u201394. https:\/\/doi.org\/10.1007\/s10618-015-0448-4.","journal-title":"Data Min Knowl Discov"},{"issue":"3","key":"354_CR36","doi-asserted-by":"publisher","first-page":"259","DOI":"10.1023\/A:1007365809034","volume":"27","author":"G Widmer","year":"1997","unstructured":"Widmer G. Tracking context changes through meta-learning. Mach Learn. 1997;27(3):259\u201386. https:\/\/doi.org\/10.1023\/A:1007365809034.","journal-title":"Mach Learn"},{"issue":"1","key":"354_CR37","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1007\/BF00116900","volume":"23","author":"G Widmer","year":"1996","unstructured":"Widmer G, Kubat M. Learning in the presence of concept drift and hidden contexts. Mach Learn. 1996;23(1):69\u2013101. https:\/\/doi.org\/10.1007\/BF00116900.","journal-title":"Mach Learn"},{"issue":"3","key":"354_CR38","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1007\/s10618-006-0050-x","volume":"13","author":"Y Yang","year":"2006","unstructured":"Yang Y, Wu X, Zhu X. Mining in anticipation for concept change: proactive-reactive prediction in data streams. Data Min Knowl Discov. 2006;13(3):261\u201389. https:\/\/doi.org\/10.1007\/s10618-006-0050-x.","journal-title":"Data Min Knowl Discov"},{"key":"354_CR39","doi-asserted-by":"publisher","unstructured":"Zhang T, Ramakrishnan R, Livny M. BIRCH: an efficient data clustering databases method for very large. In: ACM SIGMOD international conference on management of data. 1996;1:103\u2013114. https:\/\/doi.org\/10.1145\/233269.233324.","DOI":"10.1145\/233269.233324"},{"key":"354_CR40","doi-asserted-by":"publisher","unstructured":"Zhang X, Germain C, Sebag M. Adaptively detecting changes in autonomic grid computing. In: Proceedings\u2014IEEE\/ACM international workshop on grid computing, 2010. pp. 387\u201392. https:\/\/doi.org\/10.1109\/GRID.2010.5698017.","DOI":"10.1109\/GRID.2010.5698017"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00354-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-020-00354-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00354-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,14]],"date-time":"2021-09-14T23:17:34Z","timestamp":1631661454000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-020-00354-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,15]]},"references-count":40,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["354"],"URL":"https:\/\/doi.org\/10.1186\/s40537-020-00354-1","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,9,15]]},"assertion":[{"value":"26 May 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 September 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 September 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"75"}}