{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T08:57:53Z","timestamp":1770541073562,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":46,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,4,20]],"date-time":"2020-04-20T00:00:00Z","timestamp":1587340800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,4,20]]},"DOI":"10.1145\/3366423.3380183","type":"proceedings-article","created":{"date-parts":[[2020,5,4]],"date-time":"2020-05-04T08:11:44Z","timestamp":1588579904000},"page":"1049-1059","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Real-Time Clustering for Large Sparse Online Visitor Data"],"prefix":"10.1145","author":[{"given":"Gromit Yeuk-Yin","family":"Chan","sequence":"first","affiliation":[{"name":"New York University"}]},{"given":"Fan","family":"Du","sequence":"additional","affiliation":[{"name":"Adobe Research"}]},{"given":"Ryan A.","family":"Rossi","sequence":"additional","affiliation":[{"name":"Adobe Research"}]},{"given":"Anup B.","family":"Rao","sequence":"additional","affiliation":[{"name":"Adobe Research"}]},{"given":"Eunyee","family":"Koh","sequence":"additional","affiliation":[{"name":"Adobe Research"}]},{"given":"Cl\u00e1udio T.","family":"Silva","sequence":"additional","affiliation":[{"name":"New York University"}]},{"given":"Juliana","family":"Freire","sequence":"additional","affiliation":[{"name":"New York 
University"}]}],"member":"320","published-online":{"date-parts":[[2020,4,20]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-44681-8_46"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1080\/1206212X.2019.1624314"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242591"},{"key":"e_1_3_2_1_4_1","volume-title":"Grouping multidimensional data","author":"Berkhin Pavel","unstructured":"Pavel Berkhin. 2006. A survey of clustering data mining techniques. In Grouping multidimensional data. Springer, 25\u201371."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00779-016-0954-4"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1645953.1646038"},{"key":"e_1_3_2_1_7_1","volume-title":"Clustrophile 2: guided visual clustering analysis","author":"Cavallo Marco","year":"2018","unstructured":"Marco Cavallo and \u00c7a\u011fatay Demiralp. 2018. Clustrophile 2: guided visual clustering analysis. IEEE transactions on visualization and computer graphics 25, 1(2018), 267\u2013276."},{"key":"e_1_3_2_1_8_1","volume-title":"ViBr: Visualizing Bipartite Relations at Scale with the Minimum Description Length Principle","author":"Yeuk-Yin Chan Gromit","year":"2018","unstructured":"Gromit Yeuk-Yin Chan, Panpan Xu, Zeng Dai, and Liu Ren. 2018. 
ViBr: Visualizing Bipartite Relations at Scale with the Minimum Description Length Principle. IEEE transactions on visualization and computer graphics 25, 1(2018), 321\u2013330."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2017.2763620"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Ondrej Chum James Philbin Andrew Zisserman 2008. Near duplicate image detection: min-hash and tf-idf weighting.. In Bmvc Vol.\u00a0810. 812\u2013815.","DOI":"10.5244\/C.22.50"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.14778\/2856318.2856330"},{"key":"e_1_3_2_1_12_1","volume-title":"Numerical analysis in modern scientific computing: an introduction","author":"Deuflhard Peter","unstructured":"Peter Deuflhard and Andreas Hohmann. 2003. Numerical analysis in modern scientific computing: an introduction. Springer."},{"key":"e_1_3_2_1_13_1","volume-title":"Large-scale parallel data mining","author":"Dhillon S","unstructured":"Inderjit\u00a0S Dhillon and Dharmendra\u00a0S Modha. 2002. A data-clustering algorithm on distributed memory multiprocessors. In Large-scale parallel data mining. Springer, 245\u2013260."},{"key":"e_1_3_2_1_14_1","first-page":"9","article-title":"Visual interfaces for recommendation systems: Finding similar and dissimilar peers","volume":"10","author":"Du Fan","year":"2018","unstructured":"Fan Du, Catherine Plaisant, Neil Spring, and Ben Shneiderman. 2018. 
Visual interfaces for recommendation systems: Finding similar and dissimilar peers. ACM Transactions on Intelligent Systems and Technology (TIST) 10, 1(2018), 9.","journal-title":"ACM Transactions on Intelligent Systems and Technology (TIST)"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45591-4_51"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5555\/3305381.3305511"},{"key":"e_1_3_2_1_17_1","unstructured":"Aristides Gionis Piotr Indyk Rajeev Motwani 1999. Similarity search in high dimensions via hashing. In Vldb Vol.\u00a099. 518\u2013529."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/376284.375670"},{"key":"e_1_3_2_1_19_1","volume-title":"Design considerations for collaborative visual analytics. Information visualization 7, 1","author":"Heer Jeffrey","year":"2008","unstructured":"Jeffrey Heer and Maneesh Agrawala. 2008. Design considerations for collaborative visual analytics. Information visualization 7, 1 (2008), 49\u201362."},{"key":"e_1_3_2_1_20_1","volume-title":"June 15-17","author":"Hendler James","year":"1992","unstructured":"James Hendler. 1992. Artificial intelligence planning systems: proceedings of the first international conference, June 15-17, 1992, College Park, Maryland. 
Morgan Kaufmann."},{"key":"e_1_3_2_1_21_1","volume-title":"Computer Graphics Forum, Vol.\u00a028","author":"Jeong Dong\u00a0Hyun","unstructured":"Dong\u00a0Hyun Jeong, Caroline Ziemkiewicz, Brian Fisher, William Ribarsky, and Remco Chang. 2009. iPCA: An Interactive System for PCA-based Visual Analytics. In Computer Graphics Forum, Vol.\u00a028. Wiley Online Library, 767\u2013774."},{"key":"e_1_3_2_1_22_1","volume-title":"Information visualization","author":"Keim Daniel","unstructured":"Daniel Keim, Gennady Andrienko, Jean-Daniel Fekete, Carsten G\u00f6rg, J\u00f6rn Kohlhammer, and Guy Melan\u00e7on. 2008. Visual analytics: Definition, process, and challenges. In Information visualization. Springer, 154\u2013175."},{"key":"e_1_3_2_1_23_1","volume-title":"Visual data mining","author":"Keim A","unstructured":"Daniel\u00a0A Keim, Florian Mansmann, J\u00f6rn Schneidewind, Jim Thomas, and Hartmut Ziegler. 2008. Visual analytics: Scope and challenges. In Visual data mining. Springer, 76\u201390."},{"key":"e_1_3_2_1_24_1","unstructured":"Nathan Korda Bal\u00e1zs Sz\u00f6r\u00e9nyi and Li Shuai. 2016. Distributed clustering of linear bandits in peer to peer networks. In Journal of machine learning research workshop and conference proceedings Vol.\u00a048. 
International Machine Learning Society 1301\u20131309."},{"key":"e_1_3_2_1_25_1","volume-title":"Clustervision: Visual supervision of unsupervised clustering","author":"Kwon Bum\u00a0Chul","year":"2017","unstructured":"Bum\u00a0Chul Kwon, Ben Eysenbach, Janu Verma, Kenney Ng, Christopher De\u00a0Filippi, Walter\u00a0F Stewart, and Adam Perer. 2017. Clustervision: Visual supervision of unsupervised clustering. IEEE transactions on visualization and computer graphics 24, 1(2017), 142\u2013151."},{"key":"e_1_3_2_1_26_1","volume-title":"Computer graphics forum, Vol.\u00a031","author":"Lee Hanseung","unstructured":"Hanseung Lee, Jaeyeon Kihm, Jaegul Choo, John Stasko, and Haesun Park. 2012. iVisClustering: An interactive visual document clustering via topic modeling. In Computer graphics forum, Vol.\u00a031. 
Wiley Online Library, 1155\u20131164."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2911451.2911548"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2014.2346452"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1982.1056489"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Rashid Mehmood Saeed El-Ashram Rongfang Bie Hussain Dawood and Anton Kos. 2017. Clustering by fast search and merge of local density peaks for gene expression microarray data. Scientific reports 7(2017) 45602.","DOI":"10.1038\/srep45602"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2016.01.102"},{"key":"e_1_3_2_1_32_1","volume-title":"A nested process model for visualization design and validation","author":"Munzner Tamara","year":"2009","unstructured":"Tamara Munzner. 2009. A nested process model for visualization design and validation. IEEE Transactions on Visualization and Computer Graphics 6 (2009), 921\u2013928."},{"key":"e_1_3_2_1_33_1","volume-title":"X-means: Extending k-means with efficient estimation of the number of clusters.. In ICML, Vol.\u00a01. 727\u2013734.","author":"Pelleg Dan","year":"2000","unstructured":"Dan Pelleg, Andrew\u00a0W Moore, 2000. X-means: Extending k-means with efficient estimation of the number of clusters.. In ICML, Vol.\u00a01. 
727\u2013734."},{"key":"e_1_3_2_1_34_1","volume-title":"Mining of massive datasets","author":"Rajaraman Anand","unstructured":"Anand Rajaraman and Jeffrey\u00a0David Ullman. 2011. Chapter 3, Mining of massive datasets. Cambridge University Press."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1971.10482356"},{"key":"e_1_3_2_1_36_1","volume-title":"Data Clustering","author":"Reddy K","unstructured":"Chandan\u00a0K Reddy and Bhanukiran Vinzamuri. 2018. A survey of partitional and hierarchical clustering algorithms. In Data Clustering. Chapman and Hall\/CRC, 87\u2013110."},{"key":"e_1_3_2_1_37_1","volume-title":"Clustering by fast search and find of density peaks. Science 344, 6191","author":"Rodriguez Alex","year":"2014","unstructured":"Alex Rodriguez and Alessandro Laio. 2014. Clustering by fast search and find of density peaks. Science 344, 6191 (2014), 1492\u20131496."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1057\/ivs.2008.29"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-016-2300-1"},{"key":"e_1_3_2_1_40_1","unstructured":"Anshumali Shrivastava and Ping Li. 2014. In defense of minhash over simhash. In Artificial Intelligence and Statistics. 
886\u2013894."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807222"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1963.10500845"},{"key":"e_1_3_2_1_43_1","volume-title":"Towards a systematic combination of dimension reduction and clustering in visual analytics","author":"Wenskovitch John","year":"2017","unstructured":"John Wenskovitch, Ian Crandell, Naren Ramakrishnan, Leanna House, and Chris North. 2017. Towards a systematic combination of dimension reduction and clustering in visual analytics. IEEE transactions on visualization and computer graphics 24, 1(2017), 131\u2013141."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2016.2609423"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.4236\/jcc.2018.612012"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-10665-1_71"}],"event":{"name":"WWW '20: The Web Conference 2020","location":"Taipei Taiwan","acronym":"WWW '20","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Proceedings of The Web Conference 
2020"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366423.3380183","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366423.3380183","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:33:00Z","timestamp":1750199580000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366423.3380183"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,20]]},"references-count":46,"alternative-id":["10.1145\/3366423.3380183","10.1145\/3366423"],"URL":"https:\/\/doi.org\/10.1145\/3366423.3380183","relation":{},"subject":[],"published":{"date-parts":[[2020,4,20]]},"assertion":[{"value":"2020-04-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}