{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T15:54:32Z","timestamp":1769010872809,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":46,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T00:00:00Z","timestamp":1628467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001839","name":"University Grants Committee","doi-asserted-by":"publisher","award":["R5060-19"],"award-info":[{"award-number":["R5060-19"]}],"id":[{"id":"10.13039\/501100001839","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,9]]},"DOI":"10.1145\/3472456.3473518","type":"proceedings-article","created":{"date-parts":[[2021,10,5]],"date-time":"2021-10-05T18:46:04Z","timestamp":1633459564000},"page":"1-11","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Accelerating DBSCAN Algorithm with AI Chips for Large Datasets"],"prefix":"10.1145","author":[{"given":"Zhuoran","family":"Ji","sequence":"first","affiliation":[{"name":"The University of Hong Kong, Hong Kong"}]},{"given":"Cho-Li","family":"Wang","sequence":"additional","affiliation":[{"name":"The University of Hong Kong, Hong Kong"}]}],"member":"320","published-online":{"date-parts":[[2021,10,5]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"International Symposium on Computer Architecture (ISCA), http:\/\/www. gpgpu-sim. org\/isca2012-tutorial.","author":"Aaamodt T","year":"2012","unstructured":"T Aaamodt and A Boktor . 2012 . Gpgpu-sim 3. x: A performance simulator for many-core accelerator research . In International Symposium on Computer Architecture (ISCA), http:\/\/www. gpgpu-sim. org\/isca2012-tutorial. T Aaamodt and A Boktor. 2012. Gpgpu-sim 3. x: A performance simulator for many-core accelerator research. In International Symposium on Computer Architecture (ISCA), http:\/\/www. gpgpu-sim. org\/isca2012-tutorial."},{"key":"e_1_3_2_1_2_1","first-page":"978","article-title":"Guide to convolutional neural networks","volume":"10","author":"Aghdam Hamed\u00a0Habibi","year":"2017","unstructured":"Hamed\u00a0Habibi Aghdam and Elnaz\u00a0Jahani Heravi . 2017 . Guide to convolutional neural networks . New York, NY: Springer 10 (2017), 978 \u2013 973 . Hamed\u00a0Habibi Aghdam and Elnaz\u00a0Jahani Heravi. 2017. Guide to convolutional neural networks. New York, NY: Springer 10 (2017), 978\u2013973.","journal-title":"New York, NY: Springer"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-60936-8_6"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2013.05.200"},{"key":"e_1_3_2_1_5_1","volume-title":"Search, and Set Operations on GPUs. Online","author":"Baxter S","year":"2013","unstructured":"S Baxter and D Merrill . 2013. Efficient Merge , Search, and Set Operations on GPUs. Online ( 2013 ). S Baxter and D Merrill. 2013. Efficient Merge, Search, and Set Operations on GPUs. Online (2013)."},{"key":"e_1_3_2_1_6_1","unstructured":"Cambricon. 2021. Cambricon BANG C Developer Guide. https:\/\/www.cambricon.com\/docs\/bangc\/developer_guide_html\/  Cambricon. 2021. Cambricon BANG C Developer Guide. https:\/\/www.cambricon.com\/docs\/bangc\/developer_guide_html\/"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIME.2010.5477926"},{"key":"e_1_3_2_1_8_1","volume-title":"NVIDIA A100 Tensor Core GPU: Performance and Innovation","author":"Choquette Jack","year":"2021","unstructured":"Jack Choquette , Wishwesh Gandhi , Olivier Giroux , Nick Stam , and Ronny Krashinsky . 2021. NVIDIA A100 Tensor Core GPU: Performance and Innovation . IEEE Micro ( 2021 ). Jack Choquette, Wishwesh Gandhi, Olivier Giroux, Nick Stam, and Ronny Krashinsky. 2021. NVIDIA A100 Tensor Core GPU: Performance and Innovation. IEEE Micro (2021)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3330345.3331057"},{"key":"e_1_3_2_1_10_1","unstructured":"Mohamad Dolatshah Ali Hadian and Behrouz Minaei-Bidgoli. 2015. Ball*-tree: Efficient spatial indexing for constrained nearest-neighbor search in metric spaces. arXiv preprint arXiv:1511.00628(2015).  Mohamad Dolatshah Ali Hadian and Behrouz Minaei-Bidgoli. 2015. Ball*-tree: Efficient spatial indexing for constrained nearest-neighbor search in metric spaces. arXiv preprint arXiv:1511.00628(2015)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2347736.2347755"},{"key":"e_1_3_2_1_12_1","unstructured":"Martin Ester Hans-Peter Kriegel J\u00f6rg Sander Xiaowei Xu 1996. A density-based algorithm for discovering clusters in large spatial databases with noise.. In Kdd Vol.\u00a096. 226\u2013231.  Martin Ester Hans-Peter Kriegel J\u00f6rg Sander Xiaowei Xu 1996. A density-based algorithm for discovering clusters in large spatial databases with noise.. In Kdd Vol.\u00a096. 226\u2013231."},{"key":"e_1_3_2_1_13_1","volume-title":"Joint International Conference on Pervasive Computing and the Networked World. Springer, 73\u201387","author":"Fu Xiufen","year":"2013","unstructured":"Xiufen Fu , Yaguang Wang , Yanna Ge , Peiwen Chen , and Shaohua Teng . 2013 . Research and application of DBSCAN algorithm based on Hadoop platform . In Joint International Conference on Pervasive Computing and the Networked World. Springer, 73\u201387 . Xiufen Fu, Yaguang Wang, Yanna Ge, Peiwen Chen, and Shaohua Teng. 2013. Research and application of DBSCAN algorithm based on Hadoop platform. In Joint International Conference on Pervasive Computing and the Networked World. Springer, 73\u201387."},{"key":"e_1_3_2_1_14_1","volume-title":"Approximate nearest neighbor: Towards removing the curse of dimensionality. Theory of computing 8, 1","author":"Har-Peled Sariel","year":"2012","unstructured":"Sariel Har-Peled , Piotr Indyk , and Rajeev Motwani . 2012. Approximate nearest neighbor: Towards removing the curse of dimensionality. Theory of computing 8, 1 ( 2012 ), 321\u2013350. Sariel Har-Peled, Piotr Indyk, and Rajeev Motwani. 2012. Approximate nearest neighbor: Towards removing the curse of dimensionality. Theory of computing 8, 1 (2012), 321\u2013350."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPADS.2011.83"},{"key":"e_1_3_2_1_16_1","unstructured":"Huawei. 2021. Tensor Boost Engine Operator Development Guide. https:\/\/support.huaweicloud.com\/odevg-Inference-cann\/odevg-Inference-cann.pdf  Huawei. 2021. Tensor Boost Engine Operator Development Guide. https:\/\/support.huaweicloud.com\/odevg-Inference-cann\/odevg-Inference-cann.pdf"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/3122009.3242044"},{"key":"e_1_3_2_1_18_1","unstructured":"Zhe Jia Marco Maggioni Benjamin Staiger and Daniele\u00a0P Scarpazza. 2018. Dissecting the NVIDIA volta GPU architecture via microbenchmarking. arXiv preprint arXiv:1804.06826(2018).  Zhe Jia Marco Maggioni Benjamin Staiger and Daniele\u00a0P Scarpazza. 2018. Dissecting the NVIDIA volta GPU architecture via microbenchmarking. arXiv preprint arXiv:1804.06826(2018)."},{"key":"e_1_3_2_1_19_1","unstructured":"Heinrich Jiang Jennifer Jang and Jakub \u0141\u0105cki. 2020. Faster DBSCAN via subsampled similarity queries. arXiv preprint arXiv:2006.06743(2020).  Heinrich Jiang Jennifer Jang and Jakub \u0141\u0105cki. 2020. Faster DBSCAN via subsampled similarity queries. arXiv preprint arXiv:2006.06743(2020)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/HCS49909.2020.9220619"},{"key":"e_1_3_2_1_21_1","unstructured":"Gary\u00a0J Katz and Joseph\u00a0T Kider. 2008. All-pairs shortest-paths for large graphs on the GPU. (2008).  Gary\u00a0J Katz and Joseph\u00a0T Kider. 2008. All-pairs shortest-paths for large graphs on the GPU. (2008)."},{"key":"e_1_3_2_1_22_1","first-page":"15172","article-title":"Efficient rematerialization for deep networks","volume":"32","author":"Kumar Ravi","year":"2019","unstructured":"Ravi Kumar , Manish Purohit , Zoya Svitkina , Erik Vee , and Joshua Wang . 2019 . Efficient rematerialization for deep networks . Advances in Neural Information Processing Systems 32 (2019), 15172 \u2013 15181 . Ravi Kumar, Manish Purohit, Zoya Svitkina, Erik Vee, and Joshua Wang. 2019. Efficient rematerialization for deep networks. Advances in Neural Information Processing Systems 32 (2019), 15172\u201315181.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTCHIPS.2019.8875654"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.713"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.5754"},{"key":"e_1_3_2_1_26_1","unstructured":"Guangchun Luo Xiaoyu Luo Thomas\u00a0Fairley Gooch Ling Tian and Ke Qin. 2016. A parallel dbscan algorithm based on spark. In 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud) Social Computing and Networking (SocialCom) Sustainable Computing and Communications (SustainCom)(BDCloud-SocialCom-SustainCom). IEEE 548\u2013553.  Guangchun Luo Xiaoyu Luo Thomas\u00a0Fairley Gooch Ling Tian and Ke Qin. 2016. A parallel dbscan algorithm based on spark. In 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud) Social Computing and Networking (SocialCom) Sustainable Computing and Communications (SustainCom)(BDCloud-SocialCom-SustainCom). IEEE 548\u2013553."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1837274.1837289"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2018.00091"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2717511"},{"key":"e_1_3_2_1_30_1","unstructured":"Tomas Mikolov Ilya Sutskever Kai Chen Greg Corrado and Jeffrey Dean. 2013. Distributed representations of words and phrases and their compositionality. arXiv preprint arXiv:1310.4546(2013).  Tomas Mikolov Ilya Sutskever Kai Chen Greg Corrado and Jeffrey Dean. 2013. Distributed representations of words and phrases and their compositionality. arXiv preprint arXiv:1310.4546(2013)."},{"key":"e_1_3_2_1_31_1","volume-title":"Markus Weimer, and Matteo Interlandi.","author":"Nakandala Supun","year":"2020","unstructured":"Supun Nakandala , Karla Saur , Gyeong-In Yu , Konstantinos Karanasos , Carlo Curino , Markus Weimer, and Matteo Interlandi. 2020 . A Tensor Compiler for Unified Machine Learning Prediction Serving. In 14th {USENIX} Symposium on Operating Systems Design and Implementation ( {OSDI} 20). 899\u2013917. Supun Nakandala, Karla Saur, Gyeong-In Yu, Konstantinos Karanasos, Carlo Curino, Markus Weimer, and Matteo Interlandi. 2020. A Tensor Compiler for Unified Machine Learning Prediction Serving. In 14th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI} 20). 899\u2013917."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.363"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/HCS49909.2020.9220735"},{"key":"e_1_3_2_1_34_1","volume-title":"NVIDIA TURING GPU ARCHITECTURE. Online","author":"NVIDIA.","year":"2018","unstructured":"NVIDIA. 2018. NVIDIA TURING GPU ARCHITECTURE. Online ( 2018 ). NVIDIA. 2018. NVIDIA TURING GPU ARCHITECTURE. Online (2018)."},{"key":"e_1_3_2_1_35_1","volume-title":"NVIDIA AMPERE GA102 GPU ARCHITECTURE. Online","author":"NVIDIA.","year":"2020","unstructured":"NVIDIA. 2020. NVIDIA AMPERE GA102 GPU ARCHITECTURE. Online ( 2020 ). NVIDIA. 2020. NVIDIA AMPERE GA102 GPU ARCHITECTURE. Online (2020)."},{"key":"e_1_3_2_1_36_1","unstructured":"NVIDIA. 2020. RAPIDS Memory Manager (RMM). https:\/\/docs.rapids.ai\/api\/rmm\/stable\/basics.html  NVIDIA. 2020. RAPIDS Memory Manager (RMM). https:\/\/docs.rapids.ai\/api\/rmm\/stable\/basics.html"},{"key":"e_1_3_2_1_37_1","volume-title":"Programming Guide: CUDA Toolkit Documentation.","author":"NVIDIA.","year":"2021","unstructured":"NVIDIA. 2021 . Programming Guide: CUDA Toolkit Documentation. NVIDIA. 2021. Programming Guide: CUDA Toolkit Documentation."},{"key":"e_1_3_2_1_38_1","unstructured":"Opendota. 2021. Community-maintained open source Dota 2 data platform. https:\/\/www.opendota.com\/  Opendota. 2021. Community-maintained open source Dota 2 data platform. https:\/\/www.opendota.com\/"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2019.00016"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"crossref","unstructured":"Sebastian Raschka Joshua Patterson and Corey Nolet. 2020. Machine Learning in Python: Main developments and technology trends in data science machine learning and artificial intelligence. arXiv preprint arXiv:2002.04803(2020).  Sebastian Raschka Joshua Patterson and Corey Nolet. 2020. Machine Learning in Python: Main developments and technology trends in data science machine learning and artificial intelligence. arXiv preprint arXiv:2002.04803(2020).","DOI":"10.3390\/info11040193"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3068335"},{"key":"e_1_3_2_1_42_1","unstructured":"SCI-Compiler. 2018. Ping Pong Buffer. http:\/\/www.scicompiler.cloud\/userguide\/PingPongBuffer.html  SCI-Compiler. 2018. Ping Pong Buffer. http:\/\/www.scicompiler.cloud\/userguide\/PingPongBuffer.html"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3196887"},{"key":"e_1_3_2_1_44_1","first-page":"327","article-title":"Parallel implementation of DBSCAN algorithm using multiple graphics accelerators","volume":"1","author":"Sz\u00e9n\u00e1si S\u00e1ndor","year":"2016","unstructured":"S\u00e1ndor Sz\u00e9n\u00e1si . 2016 . Parallel implementation of DBSCAN algorithm using multiple graphics accelerators . International Multidisciplinary Scientific GeoConference: SGEM 1 (2016), 327 \u2013 333 . S\u00e1ndor Sz\u00e9n\u00e1si. 2016. Parallel implementation of DBSCAN algorithm using multiple graphics accelerators. International Multidisciplinary Scientific GeoConference: SGEM 1 (2016), 327\u2013333.","journal-title":"International Multidisciplinary Scientific GeoConference: SGEM"},{"key":"e_1_3_2_1_45_1","volume-title":"STING: A statistical information grid approach to spatial data mining. In VLDB, Vol.\u00a097. 186\u2013195.","author":"Wang Wei","year":"1997","unstructured":"Wei Wang , Jiong Yang , Richard Muntz , 1997 . STING: A statistical information grid approach to spatial data mining. In VLDB, Vol.\u00a097. 186\u2013195. Wei Wang, Jiong Yang, Richard Muntz, 1997. STING: A statistical information grid approach to spatial data mining. In VLDB, Vol.\u00a097. 186\u2013195."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3380582"}],"event":{"name":"ICPP 2021: 50th International Conference on Parallel Processing","location":"Lemont IL USA","acronym":"ICPP 2021"},"container-title":["50th International Conference on Parallel Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472456.3473518","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472456.3473518","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:23Z","timestamp":1750191443000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472456.3473518"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,9]]},"references-count":46,"alternative-id":["10.1145\/3472456.3473518","10.1145\/3472456"],"URL":"https:\/\/doi.org\/10.1145\/3472456.3473518","relation":{},"subject":[],"published":{"date-parts":[[2021,8,9]]},"assertion":[{"value":"2021-10-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}