{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T11:33:05Z","timestamp":1753356785810,"version":"3.40.4"},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2025,2,13]],"date-time":"2025-02-13T00:00:00Z","timestamp":1739404800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,2,13]],"date-time":"2025-02-13T00:00:00Z","timestamp":1739404800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100009567","name":"Budapest University of Technology and Economics","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100009567","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Knowl Inf Syst"],"published-print":{"date-parts":[[2025,5]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Concept drift (and data drift) is a common phenomenon in machine learning models, where the statistical properties of the input data change over time, leading to a decrease in model performance. Detecting data drift is crucial for maintaining the accuracy and reliability of machine learning models in real-world applications. While previous data drift detector approaches can identify if a drift has occurred, these approaches cannot localize which specific features have caused the drift. Feature drift detectors solve this deficiency, but the required number of detectors is equal to the number of dimensions, which is a resource-intensive solution in high-dimensional data. In this paper, we propose a novel approach for feature drift analysis and drift detection based on a domino effect caused by the correlation of features. Our approach, the so-called Domino drift effect (DDE), is based on the empirically proven assumption that an initial reference correlation can be utilized as a proxy for detecting other drifting features. The method analyzes the correlating and drifting behavior, and by using only a subset of all features, it derives inference about the drifting of the remaining features, if co-drifting phenomena occur in the data stream. At co-drifting phenomena, the DDE method can estimate the probability of feature drift, which is particularly useful in high-dimensional datasets. To evaluate the effectiveness of our approach, we conducted experiments on four real-world datasets. The results show that our approach can effectively be used to predict feature drift in the whole dataset, and it has potential industrial applications.\n<\/jats:p>","DOI":"10.1007\/s10115-025-02362-0","type":"journal-article","created":{"date-parts":[[2025,2,13]],"date-time":"2025-02-13T07:37:37Z","timestamp":1739432257000},"page":"4597-4621","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Domino drift effect approach for probability estimation of feature drift in high-dimensional data"],"prefix":"10.1007","volume":"67","author":[{"given":"G\u00e1bor","family":"Sz\u0171cs","sequence":"first","affiliation":[]},{"given":"Marcell","family":"N\u00e9meth","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,2,13]]},"reference":[{"issue":"3","key":"2362_CR1","doi-asserted-by":"publisher","first-page":"1005","DOI":"10.1007\/s10115-022-01790-6","volume":"65","author":"H Guo","year":"2022","unstructured":"Guo H, Li H, Sun N, Ren Q, Zhang A, Wang W (2022) Concept drift detection and accelerated convergence of online learning. Knowl Inf Syst 65(3):1005\u20131043. https:\/\/doi.org\/10.1007\/s10115-022-01790-6","journal-title":"Knowl Inf Syst"},{"key":"2362_CR2","doi-asserted-by":"publisher","unstructured":"N\u00e9meth M, Sz\u0171cs G (2022) Split feature space ensemble method using deep reinforcement learning for algorithmic trading. In: Proceedings of the 2022 8th international conference on computer technology applications, pp 188\u2013194. https:\/\/doi.org\/10.1145\/3543712.3543722","DOI":"10.1145\/3543712.3543722"},{"key":"2362_CR3","doi-asserted-by":"publisher","unstructured":"Parasteh S, Sadaoui S (2024) A probabilistic approach for detecting real concept drift. In: ICAART (2), pp 301\u2013311. https:\/\/doi.org\/10.5220\/0012378800003636","DOI":"10.5220\/0012378800003636"},{"issue":"3","key":"2362_CR4","doi-asserted-by":"publisher","first-page":"1549","DOI":"10.1007\/s10115-023-02018-x","volume":"66","author":"S Agrahari","year":"2024","unstructured":"Agrahari S, Srivastava S, Singh AK (2024) Review on novelty detection in the non-stationary environment. Knowl Inf Syst 66(3):1549\u20131574. https:\/\/doi.org\/10.1007\/s10115-023-02018-x","journal-title":"Knowl Inf Syst"},{"key":"2362_CR5","doi-asserted-by":"publisher","DOI":"10.1007\/s11634-022-00515-5","author":"GM Paldino","year":"2022","unstructured":"Paldino GM, Lebichot B, Le Borgne Y-A, Siblini W, Obl\u00e9 F, Boracchi G, Bontempi G (2022) The role of diversity and ensemble learning in credit card fraud detection. Adv Data Anal Classif. https:\/\/doi.org\/10.1007\/s11634-022-00515-5","journal-title":"Adv Data Anal Classif"},{"key":"2362_CR6","doi-asserted-by":"publisher","first-page":"291","DOI":"10.1007\/s10844-017-0460-9","volume":"50","author":"M Olorunnimbe","year":"2018","unstructured":"Olorunnimbe M, Viktor H, Paquet E (2018) Dynamic adaptation of online ensembles for drifting data streams. J Intell Inf Syst 50:291\u2013313. https:\/\/doi.org\/10.1007\/s10844-017-0460-9","journal-title":"J Intell Inf Syst"},{"issue":"12","key":"2362_CR7","doi-asserted-by":"publisher","first-page":"2346","DOI":"10.1109\/TKDE.2018.2876857","volume":"31","author":"J Lu","year":"2019","unstructured":"Lu J, Liu A, Dong F, Gu F, Gama J, Zhang G (2019) Learning under concept drift: a review. IEEE Trans Knowl Data Eng 31(12):2346\u20132363. https:\/\/doi.org\/10.1109\/TKDE.2018.2876857","journal-title":"IEEE Trans Knowl Data Eng"},{"issue":"1","key":"2362_CR8","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1109\/TNNLS.2013.2248094","volume":"25","author":"LI Kuncheva","year":"2014","unstructured":"Kuncheva LI, Faithfull WJ (2014) PCA feature extraction for change detection in multidimensional unlabeled data. IEEE Trans Neural Netw Learn Syst 25(1):69\u201380. https:\/\/doi.org\/10.1109\/TNNLS.2013.2248094","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"2362_CR9","first-page":"19523","volume":"33","author":"S Kulinski","year":"2020","unstructured":"Kulinski S, Bagchi S, Inouye DI (2020) Feature shift detection: localizing which features have shifted via conditional distribution tests. Adv Neural Inf Process Syst 33:19523\u201319533","journal-title":"Adv Neural Inf Process Syst"},{"issue":"5","key":"2362_CR10","doi-asserted-by":"publisher","first-page":"1175","DOI":"10.1109\/TKDE.2011.226","volume":"25","author":"LI Kuncheva","year":"2013","unstructured":"Kuncheva LI (2013) Change detection in streaming multivariate data using likelihood detectors. IEEE Trans Knowl Data Eng 25(5):1175\u20131180. https:\/\/doi.org\/10.1109\/TKDE.2011.226","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"2362_CR11","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1016\/j.eswa.2017.04.008","volume":"82","author":"TS Sethi","year":"2017","unstructured":"Sethi TS, Kantardzic M (2017) On the reliable detection of concept drift from streaming unlabeled data. Expert Syst Appl 82:77\u201399. https:\/\/doi.org\/10.1016\/j.eswa.2017.04.008","journal-title":"Expert Syst Appl"},{"key":"2362_CR12","doi-asserted-by":"publisher","first-page":"459","DOI":"10.1007\/s10844-020-00634-5","volume":"56","author":"S Micevska","year":"2021","unstructured":"Micevska S, Awad A, Sakr S (2021) SDDM: an interpretable statistical concept drift detection method for data streams. J Intell Inf Syst 56:459\u2013484. https:\/\/doi.org\/10.1007\/s10844-020-00634-5","journal-title":"J Intell Inf Syst"},{"key":"2362_CR13","doi-asserted-by":"publisher","unstructured":"Coil C, Corizzo R (2024) Assessing distance measures for change point detection in continual learning scenarios. In: International symposium on methodologies for intelligent systems, pp 260\u2013270. https:\/\/doi.org\/10.1007\/978-3-031-62700-2_23. Springer","DOI":"10.1007\/978-3-031-62700-2_23"},{"key":"2362_CR14","doi-asserted-by":"publisher","unstructured":"Bandara WGC, Patel VM (2022) A transformer-based siamese network for change detection. In: IGARSS 2022-2022 IEEE international geoscience and remote sensing symposium, pp 207\u2013210. https:\/\/doi.org\/10.1109\/IGARSS46834.2022.9883686. IEEE","DOI":"10.1109\/IGARSS46834.2022.9883686"},{"key":"2362_CR15","doi-asserted-by":"crossref","unstructured":"G\u00f6z\u00fca\u00e7\u0131k \u00d6, B\u00fcy\u00fck\u00e7ak\u0131r A, Bonab H, Can F (2019) Unsupervised concept drift detection with a discriminative classifier. In: Proceedings of the 28th ACM international conference on information and knowledge management, pp 2365\u20132368","DOI":"10.1145\/3357384.3358144"},{"key":"2362_CR16","doi-asserted-by":"crossref","unstructured":"Shen P, Ming Y, Li H, Gao J, Zhang W (2022) Unsupervised concept drift detectors: a survey. In: The international conference on natural computation, fuzzy systems and knowledge discovery, pp 1117\u20131124. Springer","DOI":"10.1007\/978-3-031-20738-9_121"},{"issue":"6","key":"2362_CR17","doi-asserted-by":"publisher","first-page":"1381","DOI":"10.1002\/widm.1381","volume":"10","author":"RN Gemaque","year":"2020","unstructured":"Gemaque RN, Costa AFJ, Giusti R, Dos Santos EM (2020) An overview of unsupervised drift detection methods. Wiley Interdiscip Rev Data Min Knowl Discov 10(6):1381","journal-title":"Wiley Interdiscip Rev Data Min Knowl Discov"},{"key":"2362_CR18","doi-asserted-by":"crossref","unstructured":"Cerqueira V, Gomes HM, Bifet A (2020) Unsupervised concept drift detection using a student\u2013teacher approach. In: Discovery science: 23rd international conference, DS 2020, Thessaloniki, Greece, October 19\u201321, 2020, Proceedings 23, pp 190\u2013204. Springer","DOI":"10.1007\/978-3-030-61527-7_13"},{"issue":"5","key":"2362_CR19","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1080\/1206212X.2023.2289706","volume":"46","author":"NA Ali Abdu","year":"2024","unstructured":"Ali Abdu NA, Basulaim KO (2024) Machine learning in concept drift detection using statistical measures. Int J Comput Appl 46(5):281\u2013291. https:\/\/doi.org\/10.1080\/1206212X.2023.2289706","journal-title":"Int J Comput Appl"},{"key":"2362_CR20","doi-asserted-by":"publisher","first-page":"108632","DOI":"10.1016\/j.knosys.2022.108632","volume":"245","author":"F Bayram","year":"2022","unstructured":"Bayram F, Ahmed BS, Kassler A (2022) From concept drift to model degradation: an overview on performance-aware drift detectors. Knowl-Based Syst 245:108632. https:\/\/doi.org\/10.1016\/j.knosys.2022.108632","journal-title":"Knowl-Based Syst"},{"key":"2362_CR21","doi-asserted-by":"publisher","unstructured":"Salem O, Na\u00eft-Abdesselam F, Mehaoua A (2012) Anomaly detection in network traffic using Jensen\u2013Shannon divergence. In: 2012 IEEE international conference on communications (ICC), pp 5200\u20135204. https:\/\/doi.org\/10.1109\/ICC.2012.6364602","DOI":"10.1109\/ICC.2012.6364602"},{"key":"2362_CR22","doi-asserted-by":"crossref","unstructured":"Bifet A, Gavalda R (2007) Learning from time-changing data with adaptive windowing. In: Proceedings of the 2007 SIAM international conference on data mining, pp 443\u2013448. SIAM","DOI":"10.1137\/1.9781611972771.42"},{"key":"2362_CR23","unstructured":"Viehmann T (2021) Partial wasserstein and maximum mean discrepancy distances for bridging the gap between outlier detection and drift detection. arXiv preprint arXiv:2106.12893"},{"key":"2362_CR24","doi-asserted-by":"publisher","first-page":"168","DOI":"10.1016\/j.procs.2022.09.049","volume":"207","author":"P Porwik","year":"2022","unstructured":"Porwik P, Dadzie BM (2022) Detection of data drift in a two-dimensional stream using the Kolmogorov\u2013Smirnov test. Procedia Comput Sci 207:168\u2013175","journal-title":"Procedia Comput Sci"},{"key":"2362_CR25","doi-asserted-by":"crossref","unstructured":"Korycki L, Krawczyk B (2019) Unsupervised drift detector ensembles for data stream mining. In: 2019 IEEE international conference on data science and advanced analytics (DSAA), pp 317\u2013325 IEEE","DOI":"10.1109\/DSAA.2019.00047"},{"issue":"18","key":"2362_CR26","doi-asserted-by":"publisher","first-page":"8144","DOI":"10.1016\/j.eswa.2014.07.019","volume":"41","author":"PM Gon\u00e7alves","year":"2014","unstructured":"Gon\u00e7alves PM, de Carvalho Santos SGT, Barros RSM, Vieira DCL (2014) A comparative study on concept drift detectors. Expert Syst Appl 41(18):8144\u20138156. https:\/\/doi.org\/10.1016\/j.eswa.2014.07.019","journal-title":"Expert Syst Appl"},{"key":"2362_CR27","doi-asserted-by":"crossref","unstructured":"Yang W, Su R, Cheng Y, Guo J (2022) A concept drift detection approach based on Jensen-Shannon divergence for network traffic classification. In: Proceedings of the 2022 5th international conference on artificial intelligence and pattern recognition, pp 982\u2013987","DOI":"10.1145\/3573942.3573979"},{"key":"2362_CR28","doi-asserted-by":"crossref","unstructured":"Fan Q, Liu C, Zhao Y, Li Y (2022) Unsupervised online concept drift detection based on divergence and EWMA. In: Asia-Pacific Web (APWeb) and web-age information management (WAIM) joint international conference on web and big data, pp 121\u2013134. Springer","DOI":"10.1007\/978-3-031-25158-0_10"},{"key":"2362_CR29","doi-asserted-by":"crossref","unstructured":"Wang Z, Wang W (2020) Concept drift detection based on kolmogorov\u2013smirnov test. In: Artificial intelligence in China: proceedings of the international conference on artificial intelligence in China, pp 273\u2013280. Springer","DOI":"10.1007\/978-981-15-0187-6_31"},{"key":"2362_CR30","doi-asserted-by":"crossref","unstructured":"Dos\u00a0Reis DM, Flach P, Matwin S, Batista G (2016) Fast unsupervised online drift detection using incremental Kolmogorov-Smirnov test. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1545\u20131554","DOI":"10.1145\/2939672.2939836"},{"key":"2362_CR31","doi-asserted-by":"crossref","unstructured":"Jafseer K, Shailesh S, Sreekumar A (2023) Modeling concept drift detection as machine learning model using overlapping window and Kolmogorov\u2013Smirnov test. In: Machine learning, image processing, network security and data sciences: select proceedings of 3rd international conference on MIND 2021, pp 113\u2013129. Springer","DOI":"10.1007\/978-981-19-5868-7_10"},{"key":"2362_CR32","doi-asserted-by":"crossref","unstructured":"Basterrech S, Plato\u0161 J, Rubino G, Wo\u017aniak M (2022) Experimental analysis on dissimilarity metrics and sudden concept drift detection. In: International conference on intelligent systems design and applications, pp 190\u2013199. Springer","DOI":"10.1007\/978-3-031-35501-1_19"},{"issue":"17","key":"2362_CR33","doi-asserted-by":"publisher","first-page":"3703","DOI":"10.3390\/s19173703","volume":"19","author":"Y Tao","year":"2019","unstructured":"Tao Y, Li C, Liang Z, Yang H, Xu J (2019) Wasserstein distance learns domain invariant feature representations for drift compensation of e-nose. Sensors 19(17):3703","journal-title":"Sensors"},{"key":"2362_CR34","doi-asserted-by":"crossref","unstructured":"Huang DTJ, Koh YS, Dobbie G, Bifet A (2015) Drift detection using stream volatility. In: Machine learning and knowledge discovery in databases: European conference, ECML PKDD 2015, Porto, Portugal, September 7\u201311, 2015, Proceedings, Part I 15, pp 417\u2013432. Springer","DOI":"10.1007\/978-3-319-23528-8_26"},{"key":"2362_CR35","doi-asserted-by":"publisher","unstructured":"Alberge F, Feutry C, Duhamel P, Piantanida P (2019) Detecting covariate shift with black box predictors. In: 2019 26th international conference on telecommunications (ICT), pp 324\u2013329. https:\/\/doi.org\/10.1109\/ICT.2019.8798827","DOI":"10.1109\/ICT.2019.8798827"},{"key":"2362_CR36","unstructured":"Jang S, Park S, Lee I, Bastani O (2022) Sequential covariate shift detection using classifier two-sample tests. In: International conference on machine learning, pp 9845\u20139880. PMLR"},{"issue":"7","key":"2362_CR37","doi-asserted-by":"publisher","first-page":"2835","DOI":"10.1007\/s10115-020-01438-3","volume":"62","author":"I Goldenberg","year":"2020","unstructured":"Goldenberg I, Webb GI (2020) PCA-based drift and shift quantification framework for multidimensional data. Knowl Inf Syst 62(7):2835\u20132854. https:\/\/doi.org\/10.1007\/s10115-020-01438-3","journal-title":"Knowl Inf Syst"},{"issue":"3","key":"2362_CR38","doi-asserted-by":"publisher","first-page":"2401","DOI":"10.1007\/s10462-022-10232-2","volume":"56","author":"EB Gulcan","year":"2022","unstructured":"Gulcan EB, Can F (2022) Unsupervised concept drift detection for multi-label data streams. Artif Intell Rev 56(3):2401\u20132434. https:\/\/doi.org\/10.1007\/s10462-022-10232-2","journal-title":"Artif Intell Rev"},{"key":"2362_CR39","doi-asserted-by":"publisher","first-page":"102177","DOI":"10.1016\/j.is.2023.102177","volume":"114","author":"JN Adams","year":"2023","unstructured":"Adams JN, van Zelst SJ, Rose T, van der Aalst WMP (2023) Explainable concept drift in process mining. Inf Syst 114:102177. https:\/\/doi.org\/10.1016\/j.is.2023.102177","journal-title":"Inf Syst"},{"key":"2362_CR40","doi-asserted-by":"publisher","first-page":"131","DOI":"10.1016\/S0168-1699(99)00046-0","volume":"24","author":"J Blackard","year":"1999","unstructured":"Blackard J, Dean D (1999) Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables. Comput Electron Agric 24:131\u2013151. https:\/\/doi.org\/10.1016\/S0168-1699(99)00046-0","journal-title":"Comput Electron Agric"},{"key":"2362_CR41","doi-asserted-by":"publisher","first-page":"108","DOI":"10.5220\/0006639801080116","volume":"1","author":"I Sharafaldin","year":"2018","unstructured":"Sharafaldin I, Lashkari AH, Ghorbani AA (2018) Toward generating a new intrusion detection dataset and intrusion traffic characterization. ICISSp 1:108\u2013116. https:\/\/doi.org\/10.5220\/0006639801080116","journal-title":"ICISSp"},{"key":"2362_CR42","doi-asserted-by":"publisher","unstructured":"Ulanova L, Begum N, Shokoohi-Yekta M, Keogh E, Clustering in the face of fast changing streams. In: Proceedings of the 2016 SIAM international conference on data mining, pp 1\u20139. https:\/\/doi.org\/10.1137\/1.9781611974348.1https:\/\/epubs.siam.org\/doi\/pdf\/10.1137\/1.9781611974348.1","DOI":"10.1137\/1.9781611974348.1"},{"issue":"3","key":"2362_CR43","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1109\/51.932724","volume":"20","author":"GB Moody","year":"2001","unstructured":"Moody GB, Mark RG (2001) The impact of the MIT-BIH arrhythmia database. IEEE Eng Med Biol Mag 20(3):45\u201350. https:\/\/doi.org\/10.1109\/51.932724","journal-title":"IEEE Eng Med Biol Mag"}],"container-title":["Knowledge and Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-025-02362-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10115-025-02362-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-025-02362-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,12]],"date-time":"2025-04-12T03:42:10Z","timestamp":1744429330000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10115-025-02362-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,13]]},"references-count":43,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2025,5]]}},"alternative-id":["2362"],"URL":"https:\/\/doi.org\/10.1007\/s10115-025-02362-0","relation":{},"ISSN":["0219-1377","0219-3116"],"issn-type":[{"type":"print","value":"0219-1377"},{"type":"electronic","value":"0219-3116"}],"subject":[],"published":{"date-parts":[[2025,2,13]]},"assertion":[{"value":"9 May 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 January 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 January 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 February 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors, G\u00e1bor Sz\u0171cs and Marcell N\u00e9meth declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical approval"}}]}}