{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T17:11:06Z","timestamp":1774113066882,"version":"3.50.1"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2024,6,20]],"date-time":"2024-06-20T00:00:00Z","timestamp":1718841600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,6,20]],"date-time":"2024-06-20T00:00:00Z","timestamp":1718841600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62002187"],"award-info":[{"award-number":["62002187"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Science Research Project of Hebei Education Department of China under Grant","award":["QN2023184"],"award-info":[{"award-number":["QN2023184"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2024,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Stream data mining aims to handle the continuous and ongoing generation of data flows (e.g. weather, stock and traffic data), which often encounters concept drift as time progresses. Traditional offline algorithms struggle with learning from real-time data, making online algorithms more fitting for mining the stream data with dynamic concepts. Among families of the online learning algorithms, single pass stands out for its efficiency in processing one sample point at a time, and inspecting it only once at most. Currently, there exist online algorithms tailored for single pass over the stream data by converting the problems of classification into minimum enclosing ball. However, these methods mainly focus on expanding the ball to enclose the new data. An excessively large ball might overwrite data of the new concept, creating difficulty in triggering the model updating process. This paper proposes a new online single pass framework for stream data mining, namely Scalable Concept Drift Adaptation (SCDA), and presents three distinct online methods (SCDA-I, SCDA-II and SCDA-III) based on that framework. These methods dynamically adjust the ball by expanding or contracting when new sample points arrive, thereby effectively avoiding the issue of excessively large balls. To evaluate their performance, we conduct the experiments on 7 synthetic and 5 real-world benchmark datasets and compete with the state-of-the-arts. The experiments demonstrate the applicability and flexibility of the SCDA methods in stream data mining by comparing three aspects: predictive performance, memory usage and scalability of the ball. Among them, the SCDA-III method performs best in all these aspects.<\/jats:p>","DOI":"10.1007\/s40747-024-01524-x","type":"journal-article","created":{"date-parts":[[2024,6,20]],"date-time":"2024-06-20T14:02:11Z","timestamp":1718892131000},"page":"6725-6743","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Scalable concept drift adaptation for stream data mining"],"prefix":"10.1007","volume":"10","author":[{"given":"Lisha","family":"Hu","sequence":"first","affiliation":[]},{"given":"Wenxiu","family":"Li","sequence":"additional","affiliation":[]},{"given":"Yaru","family":"Lu","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3238-9888","authenticated-orcid":false,"given":"Chunyu","family":"Hu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,6,20]]},"reference":[{"key":"1524_CR1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2021.115303","volume":"183","author":"I Baidari","year":"2021","unstructured":"Baidari I, Honnikoll N (2021) Bhattacharyya distance based concept drift detection method for evolving data stream. Expert Syst Appl 183:115303. https:\/\/doi.org\/10.1016\/j.eswa.2021.115303","journal-title":"Expert Syst Appl"},{"key":"1524_CR2","doi-asserted-by":"publisher","unstructured":"Bifet A, Gavalda R (2007) Learning from time-changing data with adaptive windowing. In: Proceedings of the 2007 SIAM international conference on data mining. SIAM, pp 443\u2013448. https:\/\/doi.org\/10.1137\/1.9781611972771.42","DOI":"10.1137\/1.9781611972771.42"},{"key":"1524_CR3","unstructured":"Bifet A, Holmes G, Pfahringer B et\u00a0al (2010) Moa: Massive online analysis, a framework for stream classification and clustering. In: Proceedings of the first workshop on applications of pattern analysis. PMLR, pp 44\u201350"},{"issue":"3","key":"1524_CR4","doi-asserted-by":"publisher","first-page":"3043","DOI":"10.1007\/s40747-022-00760-3","volume":"9","author":"UA Butt","year":"2023","unstructured":"Butt UA, Amin R, Aldabbas H et al (2023) Cloud-based email phishing attack using machine and deep learning algorithm. Complex Intell Syst 9(3):3043\u20133070. https:\/\/doi.org\/10.1007\/s40747-022-00760-3","journal-title":"Complex Intell Syst"},{"key":"1524_CR5","doi-asserted-by":"publisher","first-page":"515","DOI":"10.1016\/j.ijinfomgt.2019.03.010","volume":"50","author":"H Fu","year":"2020","unstructured":"Fu H, Manogaran G, Wu K et al (2020) Intelligent decision-making of online shopping behavior based on internet of things. Int J Inf Manag 50:515\u2013525. https:\/\/doi.org\/10.1016\/j.ijinfomgt.2019.03.010","journal-title":"Int J Inf Manag"},{"key":"1524_CR6","doi-asserted-by":"publisher","first-page":"528","DOI":"10.1016\/j.neunet.2022.05.018","volume":"152","author":"H G\u00e2lmeanu","year":"2022","unstructured":"G\u00e2lmeanu H, Andonie R (2022) Weighted incremental-decremental support vector machines for concept drift with shifting window. Neural Netw 152:528\u2013541. https:\/\/doi.org\/10.1016\/j.neunet.2022.05.018","journal-title":"Neural Netw"},{"issue":"3","key":"1524_CR7","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1007\/s10994-012-5320-9","volume":"90","author":"J Gama","year":"2013","unstructured":"Gama J, Sebastiao R, Rodrigues PP (2013) On evaluating stream learning algorithms. Mach Learn 90(3):317\u2013346. https:\/\/doi.org\/10.1007\/s10994-012-5320-9","journal-title":"Mach Learn"},{"issue":"1","key":"1524_CR8","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1007\/s00521-020-05242-6","volume":"34","author":"M Heusinger","year":"2022","unstructured":"Heusinger M, Raab C, Schleif FM (2022) Passive concept drift handling via variations of learning vector quantization. Neural Comput Appl 34(1):89\u2013100. https:\/\/doi.org\/10.1007\/s00521-020-05242-6","journal-title":"Neural Comput Appl"},{"issue":"17","key":"1524_CR9","doi-asserted-by":"publisher","first-page":"3113","DOI":"10.3390\/math10173113","volume":"10","author":"L Hu","year":"2022","unstructured":"Hu L, Hu C, Huo Z et al (2022) Online support vector machine with a single pass for streaming data. Mathematics 10(17):3113. https:\/\/doi.org\/10.3390\/math10173113","journal-title":"Mathematics"},{"key":"1524_CR10","doi-asserted-by":"publisher","first-page":"413","DOI":"10.1016\/j.ymssp.2016.11.004","volume":"88","author":"Y Hu","year":"2017","unstructured":"Hu Y, Baraldi P, Di Maio F et al (2017) A systematic semi-supervised self-adaptable fault diagnostics approach in an evolving environment. Mech Syst Signal Process 88:413\u2013427. https:\/\/doi.org\/10.1016\/j.ymssp.2016.11.004","journal-title":"Mech Syst Signal Process"},{"key":"1524_CR11","doi-asserted-by":"publisher","unstructured":"Hulten G, Spencer L, Domingos P (2001) Mining time-changing data streams. In: Proceedings of the seventh ACM international conference on knowledge discovery and data mining, San Francisco USA, pp 97\u2013106. https:\/\/doi.org\/10.1145\/502512.502529","DOI":"10.1145\/502512.502529"},{"key":"1524_CR12","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.116510","volume":"193","author":"M Jain","year":"2022","unstructured":"Jain M, Kaur G, Saxena V (2022) A k-means clustering and svm based hybrid concept drift detection technique for network anomaly detection. Expert Syst Appl 193:116510. https:\/\/doi.org\/10.1016\/j.eswa.2022.116510","journal-title":"Expert Syst Appl"},{"key":"1524_CR13","doi-asserted-by":"crossref","unstructured":"Kuhn HW, Tucker AW (2014) Nonlinear programming. In: Traces and emergence of nonlinear programming. Springer, pp 247\u2013258","DOI":"10.1007\/978-3-0348-0439-4_11"},{"issue":"1","key":"1524_CR14","doi-asserted-by":"publisher","first-page":"183","DOI":"10.1007\/s40745-021-00344-x","volume":"10","author":"A Kurani","year":"2023","unstructured":"Kurani A, Doshi P, Vakharia A et al (2023) A comprehensive comparative study of artificial neural network (ann) and support vector machines (svm) on stock forecasting. Ann Data Sci 10(1):183\u2013208. https:\/\/doi.org\/10.1007\/s40745-021-00344-x","journal-title":"Ann Data Sci"},{"key":"1524_CR15","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1016\/j.patcog.2018.11.006","volume":"88","author":"Y Li","year":"2019","unstructured":"Li Y, Wang Y, Liu Q et al (2019) Incremental semi-supervised learning on streaming data. Pattern Recognit 88:383\u2013396. https:\/\/doi.org\/10.1016\/j.patcog.2018.11.006","journal-title":"Pattern Recognit"},{"key":"1524_CR16","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2020.105694","volume":"195","author":"Z Li","year":"2020","unstructured":"Li Z, Huang W, Xiong Y et al (2020) Incremental learning imbalanced data streams with concept drift: the dynamic updated ensemble algorithm. Knowl Based Syst 195:105694. https:\/\/doi.org\/10.1016\/j.knosys.2020.105694","journal-title":"Knowl Based Syst"},{"key":"1524_CR17","doi-asserted-by":"publisher","first-page":"45410","DOI":"10.1109\/ACCESS.2022.3169785","volume":"10","author":"M Lima","year":"2022","unstructured":"Lima M, Neto M, Silva Filho T et al (2022) Learning under concept drift for regression\u2014a systematic literature review. IEEE Access 10:45410\u201345429. https:\/\/doi.org\/10.1109\/ACCESS.2022.3169785","journal-title":"IEEE Access"},{"key":"1524_CR18","doi-asserted-by":"publisher","first-page":"56198","DOI":"10.1109\/ACCESS.2019.2912631","volume":"7","author":"CC Lin","year":"2019","unstructured":"Lin CC, Deng DJ, Kuo CH et al (2019) Concept drift detection and adaption in big imbalance industrial iot data using an ensemble learning method of offline classifiers. IEEE Access 7:56198\u201356207. https:\/\/doi.org\/10.1109\/ACCESS.2019.2912631","journal-title":"IEEE Access"},{"key":"1524_CR19","doi-asserted-by":"crossref","unstructured":"Liu A, Song Y, Zhang G et al (2017) Regional concept drift detection and density synchronized drift adaptation. In: IJCAI international joint conference on artificial intelligence. http:\/\/hdl.handle.net\/10453\/126374. Accessed 8 Jan 2024","DOI":"10.24963\/ijcai.2017\/317"},{"issue":"1","key":"1524_CR20","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1109\/TNNLS.2020.2978523","volume":"32","author":"A Liu","year":"2020","unstructured":"Liu A, Lu J, Zhang G (2020) Diverse instance-weighting ensemble based on region drift disagreement for concept drift adaptation. IEEE Trans Neural Netw Learn Syst 32(1):293\u2013307. https:\/\/doi.org\/10.1109\/TNNLS.2020.2978523","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"1524_CR21","doi-asserted-by":"publisher","unstructured":"Liu A, Zhang G, Wang K et\u00a0al (2020) Fast switch na\u00efve bayes to avoid redundant update for concept drift learning. In: 2020 International joint conference on neural networks (IJCNN). IEEE, pp 1\u20137. https:\/\/doi.org\/10.1109\/IJCNN48605.2020.9207077","DOI":"10.1109\/IJCNN48605.2020.9207077"},{"key":"1524_CR22","doi-asserted-by":"publisher","unstructured":"Lu J, Liu A, Dong F et al (2018) Learning under concept drift: a review. IEEE Trans Knowl Data Eng 31(12):2346\u20132363. https:\/\/doi.org\/10.1109\/TKDE.2018.2876857","DOI":"10.1109\/TKDE.2018.2876857"},{"key":"1524_CR23","doi-asserted-by":"publisher","DOI":"10.1007\/s40747-021-00456-0","author":"S Priya","year":"2021","unstructured":"Priya S, Uthra RA (2021) Deep learning framework for handling concept drift and class imbalanced complex decision-making on streaming data. Complex Intell Syst. https:\/\/doi.org\/10.1007\/s40747-021-00456-0","journal-title":"Complex Intell Syst"},{"key":"1524_CR24","unstructured":"Rai P, Daume H, Venkatasubramanian S (2009) Streamed learning: one-pass svms. In: Proceedings of the 21st international joint conference on artificial intelligence, Pasadena USA, pp 1211\u20131216"},{"key":"1524_CR25","doi-asserted-by":"publisher","unstructured":"da\u00a0Silva CA, Krohling RA (2019) Semi-supervised online elastic extreme learning machine with forgetting parameter to deal with concept drift in data streams. In: 2019 International joint conference on neural networks (IJCNN). IEEE, pp 1\u20138. https:\/\/doi.org\/10.1109\/IJCNN.2019.8852361","DOI":"10.1109\/IJCNN.2019.8852361"},{"issue":"1","key":"1524_CR26","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1023\/B:MACH.0000008084.60811.49","volume":"54","author":"DM Tax","year":"2004","unstructured":"Tax DM, Duin RP (2004) Support vector data description. Mach Learn 54(1):45\u201366","journal-title":"Mach Learn"},{"key":"1524_CR27","unstructured":"Tsang IW, Kwok JT, Cheung PM et\u00a0al (2005) Core vector machines: Fast svm training on very large data sets. J Mach Learn Res 6(4)"},{"issue":"10","key":"1524_CR28","doi-asserted-by":"publisher","first-page":"4802","DOI":"10.1109\/TNNLS.2017.2771290","volume":"29","author":"S Wang","year":"2018","unstructured":"Wang S, Minku LL, Yao X (2018) A systematic study of online class imbalance learning with concept drift. IEEE Trans Neural Netw Learn Syst 29(10):4802\u20134821. https:\/\/doi.org\/10.1109\/TNNLS.2017.2771290","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"1524_CR29","doi-asserted-by":"publisher","first-page":"3248","DOI":"10.1007\/s10489-018-1149-7","volume":"48","author":"Y Wang","year":"2018","unstructured":"Wang Y, Li T (2018) Improving semi-supervised co-forest algorithm in evolving data streams. Appl Intell 48:3248\u20133262. https:\/\/doi.org\/10.1007\/s10489-018-1149-7","journal-title":"Appl Intell"},{"issue":"1","key":"1524_CR30","doi-asserted-by":"publisher","first-page":"1201","DOI":"10.1080\/09540091.2022.2051434","volume":"34","author":"Z Wang","year":"2022","unstructured":"Wang Z, Han D, Li M et al (2022) The abnormal traffic detection scheme based on pca and ssh. Connect Sci 34(1):1201\u20131220. https:\/\/doi.org\/10.1080\/09540091.2022.2051434","journal-title":"Connect Sci"},{"key":"1524_CR31","doi-asserted-by":"crossref","unstructured":"Wen Y, Zhou Q, Xue Y et\u00a0al (2020) Transfer learning for semi-supervised classification of non-stationary data streams. In: Neural information processing: 27th international conference, ICONIP 2020, Bangkok, Thailand, November 18\u201322, 2020, Proceedings, Part V 27. Springer, pp 468\u2013477","DOI":"10.1007\/978-3-030-63823-8_54"},{"key":"1524_CR32","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1016\/j.neucom.2022.05.025","volume":"496","author":"E Yu","year":"2022","unstructured":"Yu E, Song Y, Zhang G et al (2022) Learn-to-adapt: concept drift adaptation for hybrid multiple streams. Neurocomputing 496:121\u2013130. https:\/\/doi.org\/10.1016\/j.neucom.2022.05.025","journal-title":"Neurocomputing"},{"issue":"5","key":"1524_CR33","doi-asserted-by":"publisher","first-page":"3592","DOI":"10.1109\/TCYB.2020.3015266","volume":"52","author":"H Yu","year":"2020","unstructured":"Yu H, Lu J, Zhang G (2020) Continuous support vector regression for nonstationary streaming data. IEEE Trans Cybern 52(5):3592\u20133605","journal-title":"IEEE Trans Cybern"},{"key":"1524_CR34","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2022.109113","volume":"134","author":"H Yu","year":"2023","unstructured":"Yu H, Liu W, Lu J et al (2023) Detecting group concept drift from multiple data streams. Pattern Recognit 134:109113","journal-title":"Pattern Recognit"},{"key":"1524_CR35","doi-asserted-by":"publisher","DOI":"10.1016\/j.cose.2021.102221","volume":"104","author":"S Yuan","year":"2021","unstructured":"Yuan S, Wu X (2021) Deep learning for insider threat detection: review, challenges and opportunities. Comput Secur 104:102221. https:\/\/doi.org\/10.1016\/j.cose.2021.102221","journal-title":"Comput Secur"},{"key":"1524_CR36","doi-asserted-by":"publisher","first-page":"73815","DOI":"10.1109\/ACCESS.2018.2882872","volume":"6","author":"H Zhang","year":"2018","unstructured":"Zhang H, Liu W, Shan J et al (2018) Online active learning paired ensemble for concept drift and class imbalance. IEEE Access 6:73815\u201373828. https:\/\/doi.org\/10.1109\/ACCESS.2018.2882872","journal-title":"IEEE Access"},{"issue":"3","key":"1524_CR37","doi-asserted-by":"publisher","first-page":"790","DOI":"10.1109\/JAS.2020.1003099","volume":"7","author":"H Zhang","year":"2020","unstructured":"Zhang H, Li Y, Lv Z et al (2020) A real-time and ubiquitous network attack detection based on deep belief network and support vector machine. IEEE\/CAA J Autom Sin 7(3):790\u2013799. https:\/\/doi.org\/10.1109\/JAS.2020.1003099","journal-title":"IEEE\/CAA J Autom Sin"},{"key":"1524_CR38","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-15-1967-3","volume-title":"Machine learning","author":"ZH Zhou","year":"2021","unstructured":"Zhou ZH (2021) Machine learning. Springer Nature, Berlin"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01524-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-024-01524-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01524-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,14]],"date-time":"2024-09-14T15:15:44Z","timestamp":1726326944000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-024-01524-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,20]]},"references-count":38,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,10]]}},"alternative-id":["1524"],"URL":"https:\/\/doi.org\/10.1007\/s40747-024-01524-x","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,6,20]]},"assertion":[{"value":"10 January 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 May 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 June 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. On behalf of all authors, the corresponding author states that there is no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}