{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,18]],"date-time":"2026-04-18T16:58:40Z","timestamp":1776531520413,"version":"3.51.2"},"reference-count":109,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2013,7,11]],"date-time":"2013-07-11T00:00:00Z","timestamp":1373500800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2013,10]]},"abstract":"<jats:p>Data stream mining is an active research area that has recently emerged to discover knowledge from large amounts of continuously generated data. In this context, several data stream clustering algorithms have been proposed to perform unsupervised learning. Nevertheless, data stream clustering imposes several challenges to be addressed, such as dealing with nonstationary, unbounded data that arrive in an online fashion. The intrinsic nature of stream data requires the development of algorithms capable of performing fast and incremental processing of data objects, suitably addressing time and memory limitations. In this article, we present a survey of data stream clustering algorithms, providing a thorough discussion of the main design components of state-of-the-art algorithms. In addition, this work addresses the temporal aspects involved in data stream clustering, and presents an overview of the usually employed experimental methodologies. A number of references are provided that describe applications of data stream clustering in different domains, such as network intrusion detection, sensor networks, and stock market analysis. Information regarding software packages and data repositories are also available for helping researchers and practitioners. Finally, some important issues and open questions that can be subject of future research are discussed.<\/jats:p>","DOI":"10.1145\/2522968.2522981","type":"journal-article","created":{"date-parts":[[2013,11,6]],"date-time":"2013-11-06T14:09:19Z","timestamp":1383746959000},"page":"1-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":419,"title":["Data stream clustering"],"prefix":"10.1145","volume":"46","author":[{"given":"Jonathan A.","family":"Silva","sequence":"first","affiliation":[{"name":"University of S\u00e3o Paulo, S\u00e3o Paulo, Brazil"}]},{"given":"Elaine R.","family":"Faria","sequence":"additional","affiliation":[{"name":"University of S\u00e3o Paulo and Federal University of Uberl\u00e2ndia, Brazil"}]},{"given":"Rodrigo C.","family":"Barros","sequence":"additional","affiliation":[{"name":"University of S\u00e3o Paulo, S\u00e3o Paulo, Brazil"}]},{"given":"Eduardo R.","family":"Hruschka","sequence":"additional","affiliation":[{"name":"University of S\u00e3o Paulo, S\u00e3o Paulo, Brazil"}]},{"given":"Andr\u00e9 C. P. L. F. de","family":"Carvalho","sequence":"additional","affiliation":[{"name":"University of S\u00e3o Paulo, Brazil"}]},{"given":"Jo\u00e3o","family":"Gama","sequence":"additional","affiliation":[{"name":"University of Porto, Portugal"}]}],"member":"320","published-online":{"date-parts":[[2013,7,11]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2133803.2184450"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1008731.1008736"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/872757.872826"},{"key":"e_1_2_2_4_1","volume-title":"Data Streams -- Models and Algorithms","author":"Aggarwal C. C.","unstructured":"Aggarwal , C. C. 2007. Data Streams -- Models and Algorithms . Springer . Aggarwal, C. C. 2007. Data Streams -- Models and Algorithms. Springer."},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-010-0366-0"},{"key":"e_1_2_2_6_1","volume-title":"Proceedings of the 29th Conference on Very Large Data Bases (VLDB'03)","volume":"29","author":"Aggarwal C. C.","unstructured":"Aggarwal , C. C. , Han , J. , Wang , J. , and Yu , P. S . 2003. A framework for clustering evolving data streams . In Proceedings of the 29th Conference on Very Large Data Bases (VLDB'03) . Vol. 29 , 81--92. Aggarwal, C. C., Han, J., Wang, J., and Yu, P. S. 2003. A framework for clustering evolving data streams. In Proceedings of the 29th Conference on Very Large Data Bases (VLDB'03). Vol. 29, 81--92."},{"key":"e_1_2_2_7_1","volume-title":"Proceedings of the 30th Conference on Very Large Data Bases (VLDB'04)","volume":"30","author":"Aggarwal C. C.","unstructured":"Aggarwal , C. C. , Han , J. , Wang , J. , and Yu , P. S . 2004b. A framework for projected clustering of high dimensional data streams . In Proceedings of the 30th Conference on Very Large Data Bases (VLDB'04) . Vol. 30 , 852--863. Aggarwal, C. C., Han, J., Wang, J., and Yu, P. S. 2004b. A framework for projected clustering of high dimensional data streams. In Proceedings of the 30th Conference on Very Large Data Bases (VLDB'04). Vol. 30, 852--863."},{"key":"e_1_2_2_8_1","volume-title":"Proceedings of the 6th International Conference on Data Mining (SDM'06)","author":"Aggarwal C. C.","unstructured":"Aggarwal , C. C. and Yu , P. S . 2006. A framework for clustering massive text and categorical data streams . In Proceedings of the 6th International Conference on Data Mining (SDM'06) . 479. Aggarwal, C. C. and Yu, P. S. 2006. A framework for clustering massive text and categorical data streams. In Proceedings of the 6th International Conference on Data Mining (SDM'06). 479."},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2008.4497423"},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/2595503.2595508"},{"key":"e_1_2_2_11_1","volume-title":"Proceedings of the 8th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD'11)","author":"Amini A.","unstructured":"Amini , A. , Wah , T. Y. , Saybani , M. R. , Aghabozorgi , S. R. , and Yazdi , S . 2011. A study of density-grid based clustering algorithms on data streams . In Proceedings of the 8th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD'11) . IEEE Press, Los Alamitos, CA, 1652--1656. Amini, A., Wah, T. Y., Saybani, M. R., Aghabozorgi, S. R., and Yazdi, S. 2011. A study of density-grid based clustering algorithms on data streams. In Proceedings of the 8th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD'11). IEEE Press, Los Alamitos, CA, 1652--1656."},{"key":"e_1_2_2_12_1","unstructured":"Arabie P. and Hubert L. J. 1999. An Overview of Combinatorial Data Analysis. World Scientific Publishing.  Arabie P. and Hubert L. J. 1999. An Overview of Combinatorial Data Analysis. World Scientific Publishing."},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1137856.1137880"},{"key":"e_1_2_2_14_1","volume-title":"Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms. 1027--1035","author":"Arthur D.","unstructured":"Arthur , D. and Vassilvitskii , S . 2007. K-means&plus;&plus;: The advantages of careful seeding . In Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms. 1027--1035 . Arthur, D. and Vassilvitskii, S. 2007. K-means&plus;&plus;: The advantages of careful seeding. In Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms. 1027--1035."},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/543613.543615"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/773153.773176"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/509907.509947"},{"key":"e_1_2_2_18_1","unstructured":"Ball G. H. and Hall D. J. 1965. ISODATA. A novel method of data analysis and pattern classification. Tech. rep. Stanford Research Institute Menlo Park.  Ball G. H. and Hall D. J. 1965. ISODATA. A novel method of data analysis and pattern classification. Tech. rep. Stanford Research Institute Menlo Park."},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/507515.507519"},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/361002.361007"},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/0196-6774(80)90015-2"},{"key":"e_1_2_2_22_1","unstructured":"Bhat U. N. and Miller G. K. 2002. Elements of Applied Stochastic Processes 3rd Ed. John Wiley and Sons New Jersey.  Bhat U. N. and Miller G. K. 2002. Elements of Applied Stochastic Processes 3 rd Ed. John Wiley and Sons New Jersey."},{"key":"e_1_2_2_23_1","volume-title":"Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams","author":"Bifet A.","unstructured":"Bifet , A. 2010. Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams . IOS Press . Bifet, A. 2010. Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams. IOS Press."},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/1756006.1859903"},{"key":"e_1_2_2_25_1","volume-title":"Proceedings of the 15th International Conference on Machine Learning (ICML'98)","author":"Bradley P. S.","unstructured":"Bradley , P. S. and Fayyad , U. M . 1998a. Refining initial points for k-means clustering . In Proceedings of the 15th International Conference on Machine Learning (ICML'98) . Morgan Kaufmann Publishers, San Francisco, 91--99. Bradley, P. S. and Fayyad, U. M. 1998a. Refining initial points for k-means clustering. In Proceedings of the 15th International Conference on Machine Learning (ICML'98). Morgan Kaufmann Publishers, San Francisco, 91--99."},{"key":"e_1_2_2_26_1","volume-title":"Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining (KDD'98)","author":"Bradley P. S.","unstructured":"Bradley , P. S. , Fayyad , U. M. , and Reina , C . 1998b. Scaling clustering algorithms to large databases . In Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining (KDD'98) . AAAI Press, 9--15. Bradley, P. S., Fayyad, U. M., and Reina, C. 1998b. Scaling clustering algorithms to large databases. In Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining (KDD'98). AAAI Press, 9--15."},{"key":"e_1_2_2_27_1","volume-title":"Proceedings of the 6th SIAM International Conference on Data Mining. 328--339","author":"Cao F.","unstructured":"Cao , F. , Ester , M. , Qian , W. , and Zhou , A . 2006. Density-based clustering over an evolving data stream with noise . In Proceedings of the 6th SIAM International Conference on Data Mining. 328--339 . Cao, F., Ester, M., Qian, W., and Zhou, A. 2006. Density-based clustering over an evolving data stream with noise. In Proceedings of the 6th SIAM International Conference on Data Mining. 328--339."},{"key":"e_1_2_2_28_1","volume-title":"Proceedings of the 40th Annual Symposium on Foundations of Computer Science. 378--388","author":"Charikar M.","unstructured":"Charikar , M. and Guha , S . 1999. Improved combinatorial algorithms for the facility location and k-median problems . In Proceedings of the 40th Annual Symposium on Foundations of Computer Science. 378--388 . Charikar, M. and Guha, S. 1999. Improved combinatorial algorithms for the facility location and k-median problems. In Proceedings of the 40th Annual Symposium on Foundations of Computer Science. 378--388."},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/780542.780548"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281210"},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/11827405_12"},{"key":"e_1_2_2_32_1","volume-title":"Proceedings of the Knowledge Discovery from Data Streams Workshop (ECML\/PKDD).","author":"Csernel B.","unstructured":"Csernel , B. , Clerot , F. , and Hebrail , G . 2006. Datastream clustering over tilted windows through sampling . In Proceedings of the Knowledge Discovery from Data Streams Workshop (ECML\/PKDD). Csernel, B., Clerot, F., and Hebrail, G. 2006. Datastream clustering over tilted windows through sampling. In Proceedings of the Knowledge Discovery from Data Streams Workshop (ECML\/PKDD)."},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-00887-0_18"},{"key":"e_1_2_2_34_1","volume-title":"Proceedings of the 8th International Conference on Machine Learning. Morgan Kaufmann Publishers","author":"Domingos P.","unstructured":"Domingos , P. and Hulten , G . 2001. A general method for scaling up machine learning algorithms and its application to clustering . In Proceedings of the 8th International Conference on Machine Learning. Morgan Kaufmann Publishers , San Francisco, 106--113. Domingos, P. and Hulten, G. 2001. A general method for scaling up machine learning algorithms and its application to clustering. In Proceedings of the 8th International Conference on Machine Learning. Morgan Kaufmann Publishers, San Francisco, 106--113."},{"key":"e_1_2_2_35_1","volume-title":"Proceedings of the 4th IEEE International Conference on Data Mining (ICDM'04)","author":"Dunham M. H.","unstructured":"Dunham , M. H. , Menga , Y. , and Huang , J . 2004. Extensible markov model . In Proceedings of the 4th IEEE International Conference on Data Mining (ICDM'04) . 371--374. Dunham, M. H., Menga, Y., and Huang, J. 2004. Extensible markov model. In Proceedings of the 4th IEEE International Conference on Data Mining (ICDM'04). 371--374."},{"key":"e_1_2_2_36_1","volume-title":"Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining. 226--231","author":"Ester M.","unstructured":"Ester , M. , Kriegel , H.-P. , Sander , J. , and Xu , X . 1996. A density-based algorithm for discovering clusters in large spatial databases with noise . In Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining. 226--231 . Ester, M., Kriegel, H.-P., Sander, J., and Xu, X. 1996. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining. 226--231."},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2245276.2245437"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/360402.360419"},{"key":"e_1_2_2_39_1","unstructured":"Fayyad U. Piatetsky-Shapiro G. and Smyth P. 1996. From data mining to knowledge discovery: An overview. In Advances in Knowledge Discovery and Data Mining. American Association for Artificial Intelligence Menlo Park CA 1--34.   Fayyad U. Piatetsky-Shapiro G. and Smyth P. 1996. From data mining to knowledge discovery: An overview. In Advances in Knowledge Discovery and Data Mining. American Association for Artificial Intelligence Menlo Park CA 1--34."},{"key":"e_1_2_2_40_1","unstructured":"Frank A. and Asuncion A. 2010. UCI machine learning repository. http:\/\/archive.ics.uci.edu\/ml.  Frank A. and Asuncion A. 2010. UCI machine learning repository. http:\/\/archive.ics.uci.edu\/ml."},{"key":"e_1_2_2_41_1","doi-asserted-by":"crossref","unstructured":"Gaber M. M. Vatsavai R. R. Omitaomu O. A. Gama J. Chawla N. V. and Ganguly A. R. 2010. Knowledge Discovery from Sensor Data. Springer.   Gaber M. M. Vatsavai R. R. Omitaomu O. A. Gama J. Chawla N. V. and Ganguly A. R. 2010. Knowledge Discovery from Sensor Data. Springer.","DOI":"10.1007\/978-3-642-12519-5"},{"key":"e_1_2_2_42_1","volume-title":"Knowledge Discovery from Data Streams","author":"Gama J.","unstructured":"Gama , J. 2010. Knowledge Discovery from Data Streams . Chapman Hall\/CRC. Gama, J. 2010. Knowledge Discovery from Data Streams. Chapman Hall\/CRC."},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.5555\/1349782"},{"key":"e_1_2_2_44_1","volume-title":"Proceedings of the 17th Brazilian Symposium on Artificial Intelligence (SBIA'04)","volume":"3171","author":"Gama J.","unstructured":"Gama , J. , Medas , P. , Castillo , G. , and Rodrigues , P. P . 2004. Learning with drift detection . In Proceedings of the 17th Brazilian Symposium on Artificial Intelligence (SBIA'04) . Vol. 3171 ., 286--295. Gama, J., Medas, P., Castillo, G., and Rodrigues, P. P. 2004. Learning with drift detection. In Proceedings of the 17th Brazilian Symposium on Artificial Intelligence (SBIA'04). Vol. 3171., 286--295."},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.5555\/1937721.1937723"},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/1141277.1141429"},{"key":"e_1_2_2_47_1","volume-title":"Data Clustering: Theory, Algorithms, and Applications. ASA-SIAM Series on Statistics and Applied Probability. SIAM.","author":"Gan G.","year":"2007","unstructured":"Gan , G. , Ma , C. , and Wu , J . 2007 . Data Clustering: Theory, Algorithms, and Applications. ASA-SIAM Series on Statistics and Applied Probability. SIAM. Gan, G., Ma, C., and Wu, J. 2007. Data Clustering: Theory, Algorithms, and Applications. ASA-SIAM Series on Statistics and Applied Probability. SIAM."},{"key":"e_1_2_2_48_1","volume-title":"Proceedings of the International Conference on Computer Application and System Modeling (ICCASM'10)","author":"Gao M.-M.","unstructured":"Gao , M.-M. , Liu , J.-Z. , and Gao , X . -X. 2010. Application of compound gaussian mixture model clustering in the data stream . In Proceedings of the International Conference on Computer Application and System Modeling (ICCASM'10) . Gao, M.-M., Liu, J.-Z., and Gao, X.-X. 2010. Application of compound gaussian mixture model clustering in the data stream. In Proceedings of the International Conference on Computer Application and System Modeling (ICCASM'10)."},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1016\/0304-3975(85)90224-5"},{"key":"e_1_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1514894.1514926"},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2003.1198387"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.5555\/795666.796588"},{"key":"e_1_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.18637\/jss.v035.i05"},{"key":"e_1_2_2_54_1","volume-title":"Proceedings of the SIAM Conference on Data Mining. SIAM\/Omnipress, 664--675","author":"Hahsler M.","unstructured":"Hahsler , M. and Dunham , M. H . 2011. Temporal structure learning for clustering massive data streams in real-time . In Proceedings of the SIAM Conference on Data Mining. SIAM\/Omnipress, 664--675 . Hahsler, M. and Dunham, M. H. 2011. Temporal structure learning for clustering massive data streams in real-time. In Proceedings of the SIAM Conference on Data Mining. SIAM\/Omnipress, 664--675."},{"key":"e_1_2_2_55_1","volume-title":"Data Mining: Concepts and Techniques. Morgan Kaufmann.","author":"Han J.","year":"2000","unstructured":"Han , J. and Kamber , M . 2000 . Data Mining: Concepts and Techniques. Morgan Kaufmann. Han, J. and Kamber, M. 2000. Data Mining: Concepts and Techniques. Morgan Kaufmann."},{"key":"e_1_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/1007352.1007400"},{"key":"e_1_2_2_57_1","unstructured":"Hulten G. and Domingos P. 2003. VFML -- A toolkit for mining high-speed time-changing data streams. Tech. rep. University of Washington. http:\/\/www.cs.washington.edu\/dm\/vfml\/.  Hulten G. and Domingos P. 2003. VFML -- A toolkit for mining high-speed time-changing data streams. Tech. rep. University of Washington. http:\/\/www.cs.washington.edu\/dm\/vfml\/."},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-31537-4_21"},{"key":"e_1_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2009.09.011"},{"key":"e_1_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/1121995.1121998"},{"key":"e_1_2_2_61_1","volume-title":"Data: An Introduction to Cluster Analysis","author":"Kaufman L.","year":"1990","unstructured":"Kaufman , L. and Rousseeuw , P. J . 1990 . Finding Groups in Data: An Introduction to Cluster Analysis . Wiley Interscience . Kaufman, L. and Rousseeuw, P. J. 1990. Finding Groups in Data: An Introduction to Cluster Analysis. Wiley Interscience."},{"key":"e_1_2_2_62_1","first-page":"289","article-title":"Clustering time series data stream - A literature survey","volume":"8","author":"Kavitha V.","year":"2010","unstructured":"Kavitha , V. and Punithavalli , M. 2010 . Clustering time series data stream - A literature survey . Int. J. Comput. Sci. Inf. Secur. 8 , 1, 289 -- 294 . Kavitha, V. and Punithavalli, M. 2010. Clustering time series data stream - A literature survey. Int. J. Comput. Sci. Inf. Secur. 8, 1, 289--294.","journal-title":"Int. J. Comput. Sci. Inf. Secur."},{"key":"e_1_2_2_63_1","volume-title":"Proceedings of International Multi Conference of Engineers and Computer Scientists. 566--569","author":"Khalilian M.","unstructured":"Khalilian , M. and Mustapha , N . 2010. Data stream clustering: challenges and issues . In Proceedings of International Multi Conference of Engineers and Computer Scientists. 566--569 . Khalilian, M. and Mustapha, N. 2010. Data stream clustering: challenges and issues. In Proceedings of International Multi Conference of Engineers and Computer Scientists. 566--569."},{"key":"e_1_2_2_64_1","volume-title":"Introduction to Clustering Large and High-Dimensional Data","author":"Kogan J.","unstructured":"Kogan , J. 2007. Introduction to Clustering Large and High-Dimensional Data . Cambridge University Press . Kogan, J. 2007. Introduction to Clustering Large and High-Dimensional Data. Cambridge University Press."},{"key":"e_1_2_2_65_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-85836-2_24"},{"key":"e_1_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-010-0342-8"},{"key":"e_1_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/2020408.2020555"},{"key":"e_1_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-25853-4_18"},{"key":"e_1_2_2_69_1","doi-asserted-by":"publisher","DOI":"10.4028\/www.scientific.net\/AMR.267.444"},{"key":"e_1_2_2_70_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11390-008-9115-1"},{"key":"e_1_2_2_71_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1982.1056489"},{"key":"e_1_2_2_72_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2008.08.006"},{"key":"e_1_2_2_73_1","volume-title":"Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability. L. M. Le Cam and J. Neyman, Eds.","volume":"1","author":"MacQueen J. B.","year":"1967","unstructured":"MacQueen , J. B. 1967 . Some methods for classification and analysis of multivariate observations . In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability. L. M. Le Cam and J. Neyman, Eds. , Vol. 1 ., 281--297. MacQueen, J. B. 1967. Some methods for classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability. L. M. Le Cam and J. Neyman, Eds., Vol. 1., 281--297."},{"key":"e_1_2_2_74_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0169-7439(99)00047-7"},{"key":"e_1_2_2_75_1","doi-asserted-by":"publisher","DOI":"10.5555\/1609984.1609985"},{"key":"e_1_2_2_76_1","volume-title":"Dynamic Probabilistic Systems","author":"Markov A.","unstructured":"Markov , A. 1971. Extension of the limit theorems of probability theory to a sum of variables connected in a chain . In Dynamic Probabilistic Systems , Vol. 1 , R. Howard, Ed ., John Wiley and Sons , Chapter Appendix B, 552--577. Markov, A. 1971. Extension of the limit theorems of probability theory to a sum of variables connected in a chain. In Dynamic Probabilistic Systems, Vol. 1, R. Howard, Ed., John Wiley and Sons, Chapter Appendix B, 552--577."},{"key":"e_1_2_2_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/1060745.1060753"},{"key":"e_1_2_2_78_1","doi-asserted-by":"publisher","DOI":"10.5555\/874063.875567"},{"key":"e_1_2_2_79_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150531"},{"key":"e_1_2_2_80_1","volume-title":"Proceedings of the 18th International Conference on Data Engineering. 685--694","author":"O'Callaghan L.","unstructured":"O'Callaghan , L. , Mishra , N. , Meyerson , A. , Guha , S. , and Motwani , R . 2002. Streaming data algorithms for high-quality clustering . In Proceedings of the 18th International Conference on Data Engineering. 685--694 . O'Callaghan, L., Mishra, N., Meyerson, A., Guha, S., and Motwani, R. 2002. Streaming data algorithms for high-quality clustering. In Proceedings of the 18th International Conference on Data Engineering. 685--694."},{"key":"e_1_2_2_81_1","volume-title":"Proceedings of the 5th Starting AI Researchers Symposium. IOS Press, 212--224","author":"Oliveira M.","unstructured":"Oliveira , M. and Gama , J . 2010. MEC --Monitoring clusters' transitions . In Proceedings of the 5th Starting AI Researchers Symposium. IOS Press, 212--224 . Oliveira, M. and Gama, J. 2010. MEC --Monitoring clusters' transitions. In Proceedings of the 5th Starting AI Researchers Symposium. IOS Press, 212--224."},{"key":"e_1_2_2_82_1","volume-title":"Proceedings of the 6th International Conference on Data Warehousing and Knowledge Discovery (KDD'04)","author":"Ong K. L","unstructured":"Ong , K. L , Li , W. , Ng , W.-K. , and Lim , E . -P. 2004. SCLOPE: An algorithm for clustering data streams of categorical attributes . In Proceedings of the 6th International Conference on Data Warehousing and Knowledge Discovery (KDD'04) . 209--218. Ong, K. L, Li, W., Ng, W.-K., and Lim, E.-P. 2004. SCLOPE: An algorithm for clustering data streams of categorical attributes. In Proceedings of the 6th International Conference on Data Warehousing and Knowledge Discovery (KDD'04). 209--218."},{"key":"e_1_2_2_83_1","doi-asserted-by":"publisher","DOI":"10.5555\/2595503.2595509"},{"key":"e_1_2_2_84_1","doi-asserted-by":"publisher","DOI":"10.1061\/(ASCE)0733-9496(2008)134:6(556)"},{"key":"e_1_2_2_85_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2007.04.003"},{"key":"e_1_2_2_86_1","doi-asserted-by":"publisher","DOI":"10.1109\/FSKD.2009.553"},{"key":"e_1_2_2_87_1","volume-title":"Proceedings of the 6th SIAM International Conference on Data Mining. 499--503","author":"Rodrigues P.","unstructured":"Rodrigues , P. , Gama , J. , and Pedroso , J. P . 2006. ODAC: Hierarchical clustering of time series data streams . In Proceedings of the 6th SIAM International Conference on Data Mining. 499--503 . Rodrigues, P., Gama, J., and Pedroso, J. P. 2006. ODAC: Hierarchical clustering of time series data streams. In Proceedings of the 6th SIAM International Conference on Data Mining. 499--503."},{"key":"e_1_2_2_88_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2007.190727"},{"key":"e_1_2_2_89_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijar.2012.01.009"},{"key":"e_1_2_2_90_1","volume-title":"Proceedings of the 2nd International Workshop on Knowledge Discovery in Data Streams, Held in Conjunction with the 16th European Conference on Machine Learning (ECML'05)","author":"Shah R.","unstructured":"Shah , R. , Krishnaswamy , S. , and Gaber , M. M . 2005. Resource-aware very fast k-means for ubiquitous data stream mining . In Proceedings of the 2nd International Workshop on Knowledge Discovery in Data Streams, Held in Conjunction with the 16th European Conference on Machine Learning (ECML'05) . Shah, R., Krishnaswamy, S., and Gaber, M. M. 2005. Resource-aware very fast k-means for ubiquitous data stream mining. In Proceedings of the 2nd International Workshop on Knowledge Discovery in Data Streams, Held in Conjunction with the 16th European Conference on Machine Learning (ECML'05)."},{"key":"e_1_2_2_91_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-011-0448-7"},{"key":"e_1_2_2_92_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICMLA.2011.67"},{"key":"e_1_2_2_93_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150491"},{"key":"e_1_2_2_94_1","first-page":"801","article-title":"Sur la division des corp materiels en parties","volume":"1","author":"Steinhaus H.","year":"1956","unstructured":"Steinhaus , H. 1956 . Sur la division des corp materiels en parties . Bull. Acad. Polon. Sci 1 , 801 -- 804 . Steinhaus, H. 1956. Sur la division des corp materiels en parties. Bull. Acad. Polon. Sci 1, 801--804.","journal-title":"Bull. Acad. Polon. Sci"},{"key":"e_1_2_2_95_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2006.165"},{"key":"e_1_2_2_96_1","volume-title":"Proceedings of the 2nd IEEE International Conference on Computational Intelligence for Security and Defense Applications. 53--58","author":"Tavallaee M.","unstructured":"Tavallaee , M. , Bagheri , E. , Lu , W. , and Ghorbani , A. A . 2009. A detailed analysis of the kdd cup 99 data set . In Proceedings of the 2nd IEEE International Conference on Computational Intelligence for Security and Defense Applications. 53--58 . Tavallaee, M., Bagheri, E., Lu, W., and Ghorbani, A. A. 2009. A detailed analysis of the kdd cup 99 data set. In Proceedings of the 2nd IEEE International Conference on Computational Intelligence for Security and Defense Applications. 53--58."},{"key":"e_1_2_2_97_1","doi-asserted-by":"publisher","DOI":"10.1145\/1542362.1542419"},{"key":"e_1_2_2_98_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCCM.2008.186"},{"key":"e_1_2_2_99_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-007-0114-2"},{"key":"e_1_2_2_100_1","unstructured":"Xu R. and Wunsch D. 2009. Clustering. Computational Intelligence Series Wiley-IEEE Press.   Xu R. and Wunsch D. 2009. Clustering. Computational Intelligence Series Wiley-IEEE Press."},{"key":"e_1_2_2_101_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2006.89"},{"key":"e_1_2_2_102_1","doi-asserted-by":"publisher","DOI":"10.1145\/233269.233324"},{"key":"e_1_2_2_103_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009783824328"},{"key":"e_1_2_2_104_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCGRID.2009.20"},{"key":"e_1_2_2_105_1","unstructured":"Zhang X. and Wang W. 2010. Self-adaptive change detection in streaming data with nonstationary distribution. In Advanced Data Mining and Applications. Springer 1--12.   Zhang X. and Wang W. 2010. Self-adaptive change detection in streaming data with nonstationary distribution. In Advanced Data Mining and Applications. Springer 1--12."},{"key":"e_1_2_2_106_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2006.142"},{"key":"e_1_2_2_107_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-007-0070-x"},{"key":"e_1_2_2_108_1","doi-asserted-by":"publisher","DOI":"10.1109\/DSDE.2010.63"},{"key":"e_1_2_2_109_1","volume-title":"Proceedings of the 28th International Conference on Very Large Data Bases (VLDB'02)","author":"Zhu Y.","unstructured":"Zhu , Y. and Shasha , D . 2002. StatStream: Statistical monitoring of thousands of data streams in real time . In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB'02) . VLBD Endowment, 358--369. Zhu, Y. and Shasha, D. 2002. StatStream: Statistical monitoring of thousands of data streams in real time. In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB'02). VLBD Endowment, 358--369."}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2522968.2522981","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2522968.2522981","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:34:53Z","timestamp":1750232093000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2522968.2522981"}},"subtitle":["A survey"],"short-title":[],"issued":{"date-parts":[[2013,7,11]]},"references-count":109,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,10]]}},"alternative-id":["10.1145\/2522968.2522981"],"URL":"https:\/\/doi.org\/10.1145\/2522968.2522981","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,7,11]]},"assertion":[{"value":"2012-06-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-01-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-07-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}