{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,5]],"date-time":"2026-02-05T07:56:43Z","timestamp":1770278203588,"version":"3.49.0"},"reference-count":31,"publisher":"SAGE Publications","issue":"6","license":[{"start":{"date-parts":[[2019,8,10]],"date-time":"2019-08-10T00:00:00Z","timestamp":1565395200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2019,12,23]]},"abstract":"<jats:p>Data stream mining seeks to extract useful information from quickly-arriving, infinitely-sized and evolving data streams. Although these challenges have been addressed throughout the literature, none of them can be considered \u201csolved.\u201d We contribute to closing this gap for the task of data stream clustering by proposing two modifications to the well-known ClusTree data stream clustering algorithm: pruning unused branches and detecting concept drift. Our experimental results show the difficulty in tackling these aspects of data stream mining and the sensitivity of stream mining algorithms to parameter values. We conclude that further research is required to better equip stream learners for the data stream clustering task.<\/jats:p>","DOI":"10.3233\/jifs-179372","type":"journal-article","created":{"date-parts":[[2019,8,13]],"date-time":"2019-08-13T11:29:27Z","timestamp":1565695767000},"page":"7679-7688","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":0,"title":["Adapting ClusTree for more challenging data stream environments"],"prefix":"10.1177","volume":"37","author":[{"given":"Jakub","family":"Zgraja","sequence":"first","affiliation":[{"name":"Department of Systems and Computer Networks, Wroc\u0142aw University of Science and Technology, Wroc\u0142aw, Poland"}]},{"given":"Richard Hugh","family":"Moulton","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Queen\u2019s University, Kingston ON, Canada"}]},{"given":"Jo\u00e3o","family":"Gama","sequence":"additional","affiliation":[{"name":"Laboratory of Artificial Intelligence and Decision Support and Faculty of Economics, University of Porto, Porto, Portugal"}]},{"given":"Andrzej","family":"Kasprzak","sequence":"additional","affiliation":[{"name":"Department of Systems and Computer Networks, Wroc\u0142aw University of Science and Technology, Wroc\u0142aw, Poland"}]},{"given":"Micha\u0142","family":"Wo\u017aniak","sequence":"additional","affiliation":[{"name":"Department of Systems and Computer Networks, Wroc\u0142aw University of Science and Technology, Wroc\u0142aw, Poland"}]}],"member":"179","published-online":{"date-parts":[[2019,8,10]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1201\/EBK1439826119"},{"key":"e_1_3_2_3_2","doi-asserted-by":"crossref","unstructured":"ZhangP. ZhuX. TanJ. and GuoL. Classifier and cluster ensembles for mining concept drifting data streams in: Proc of IEEE International Conference on Data Mining pp. 1175\u20131180.","DOI":"10.1109\/ICDM.2010.125"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2017.02.004"},{"key":"e_1_3_2_5_2","unstructured":"EverittB.S. LandauS. and LeeseM. Cluster Analysis Wiley Publishing 4th edition 2009."},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/2522968.2522981"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2007.190727"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/2674026.2674028"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/2523813"},{"key":"e_1_3_2_10_2","unstructured":"\u017dliobait\u0117I. Learning under Concept Drift: An Overview Technical Report Vilnius University 2010."},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-015-0448-4"},{"key":"e_1_3_2_12_2","doi-asserted-by":"crossref","unstructured":"MoultonR.H. ViktorH.L. JapkowiczN. and GamaJ. Clustering in the presence of concept drift in: Machine Learning and Knowledge Discovery in Databases \u2013 European Conference ECML PKDD 2018 Dublin Ireland Setember 10\u201314 2018 Proceedings Part I pp. 339\u2013355.","DOI":"10.1007\/978-3-030-10925-7_21"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-010-0342-8"},{"key":"e_1_3_2_14_2","first-page":"338","volume-title":"Drifted data stream clustering based on clustree algorithm","author":"Zgraja J.","year":"2018","unstructured":"ZgrajaJ. and Wo\u017aniakM. Drifted data stream clustering based on clustree algorithm, in: de Cos JuezF.J., VillarJ.R., de la Cal, \u00c1E. A., Herrero\u00c1., Quintia\u0144H., S\u00e1ezJ. A., CorchadoE., (Eds.), Hybrid Artificial Intelligent Systems, Springer International Publishing, Cham 2018, pp. 338\u2013349."},{"key":"e_1_3_2_15_2","doi-asserted-by":"crossref","unstructured":"AggarwalC. HanJ. WangJ. and YuP. A framework for clustering evolving data streams in: Proc of the 29th Int Conf. on Very Large Data Bases \u0213 Volume 29 VLDB \u201903 VLDB Endowment 2003 pp. 81\u201392.","DOI":"10.1016\/B978-012722442-8\/50016-1"},{"key":"e_1_3_2_16_2","first-page":"1601","article-title":"MOA: Massive online analysis","volume":"11","author":"Bifet A.","year":"2010","unstructured":"BifetA., HolmesG., KirkbyR. and PfahringerB., MOA: Massive online analysis, J Mach Learn Res 11 (2010), 1601\u20131604.","journal-title":"J Mach Learn Res"},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","unstructured":"KremerH. KranenP. JansenT. SeidlT. BifetA. HolmesG. and PfahringerB. An effective evaluation measure for clustering on evolving data streams in: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD \u201911 ACM New York NY USA 2011 pp. 868\u2013876.","DOI":"10.1145\/2020408.2020555"},{"key":"e_1_3_2_18_2","first-page":"414","article-title":"P3C: A robust projected clustering algorithm","volume":"00","author":"Ester M.","year":"2006","unstructured":"EsterM., SanderJ. and MoiseG., P3C: A robust projected clustering algorithm, 2013 IEEE 13th International Conference on Data Mining 00 (2006), 414\u2013425.","journal-title":"2013 IEEE 13th International Conference on Data Mining"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.2307\/3001968"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1937.10503522"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-008-0323-y"},{"key":"e_1_3_2_22_2","first-page":"255","article-title":"KEEL data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework","volume":"17","author":"Alcal\u00e1-Fdez J.","year":"2011","unstructured":"Alcal\u00e1-FdezJ., FernandezA., LuengoJ., DerracJ., Garc\u00edaS., S\u00e1nchezL. and HerreraF., KEEL data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework, Journal of Multiple-Valued Logic and Soft Computing 17 (2011), 255\u2013287.","journal-title":"Journal of Multiple-Valued Logic and Soft Computing"},{"key":"e_1_3_2_23_2","unstructured":"HultenG. SpencerL. and DomingosP. Mining time-changing data streams in: Proceedings of the seventh ACM SIGKDD in ternational conference on Knowledge discovery and data mining ACM pp. 97\u2013106."},{"key":"e_1_3_2_24_2","doi-asserted-by":"crossref","unstructured":"StreetW.N. and KimY. A streaming ensemble algorithm (sea) for large-scale classification in: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD \u201901 ACM New York NY USA 2001 pp. 377\u2013382.","DOI":"10.1145\/502512.502568"},{"key":"e_1_3_2_25_2","doi-asserted-by":"crossref","unstructured":"DomingosP. and HultenG. Mining high-speed data streams in: Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD \u201900 ACM New York NY USA 2000 pp. 71\u201380.","DOI":"10.1145\/347090.347107"},{"key":"e_1_3_2_26_2","first-page":"000","article-title":"Fifty Years of Pulsar Candidate Selection: From simple filters to a new principled real-time classification approach","volume":"000","author":"Lyon R.J.","year":"2015","unstructured":"LyonR.J., StappersB.W., CooperS., BrookeJ.D. and KnowlesJ.M., Fifty Years of Pulsar Candidate Selection: From simple filters to a new principled real-time classification approach, MNRAS 000 (2015), 000\u2013000.","journal-title":"MNRAS"},{"key":"e_1_3_2_27_2","doi-asserted-by":"crossref","unstructured":"BifetA. Gavald\u00e1R. Adaptive learning from evolving data streams in: N.M. Adams C. Robardet A. Siebes J.-F. Boulicaut (Eds.) Advances in Intelligent Data Analysis VIII Springer Berlin Heidelberg Berlin Heidelberg 2009 pp. 249\u2013260.","DOI":"10.1007\/978-3-642-03915-7_22"},{"key":"e_1_3_2_28_2","unstructured":"ZhuX.H. Stream data mining repository http:\/\/www.cse.fau.edu\/~xqzhu\/stream.html 2010. Accessed: 2019-02-21."},{"key":"e_1_3_2_29_2","unstructured":"R\u00f6slerO. and SuendermannD. A first step towards eye state prediction using eeg Proc. of the AIHLS (2013)."},{"key":"e_1_3_2_30_2","unstructured":"MoultonR.H. and ZgrajaJ. The Wilderness Area Data Set: Adapting the Covertype data set for unsupervised learning arXiv e-prints (2019) arXiv:1901.11040."},{"key":"e_1_3_2_31_2","unstructured":"DuaD. and Karra TaniskidouE. UCI machine learning repository 2017."},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0168-1699(99)00046-0"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179372","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-179372","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179372","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T18:19:12Z","timestamp":1770229152000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-179372"}},"subtitle":[],"editor":[{"given":"Ngoc Thanh","family":"Nguyen","sequence":"additional","affiliation":[]},{"given":"Edward","family":"Szczerbicki","sequence":"additional","affiliation":[]},{"given":"Bogdan","family":"Trawi\u0144ski","sequence":"additional","affiliation":[]},{"given":"Van Du","family":"Nguyen","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,8,10]]},"references-count":31,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2019,12,23]]}},"alternative-id":["10.3233\/JIFS-179372"],"URL":"https:\/\/doi.org\/10.3233\/jifs-179372","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,8,10]]}}}