{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,28]],"date-time":"2026-03-28T08:18:30Z","timestamp":1774685910619,"version":"3.50.1"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,1,16]],"date-time":"2023-01-16T00:00:00Z","timestamp":1673827200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NSF","award":["CAREER 2153851, CCF2153863, and ECCS-2023780"],"award-info":[{"award-number":["CAREER 2153851, CCF2153863, and ECCS-2023780"]}]},{"name":"DOE ORNL contract","award":["4000171342, NSF 2244651, and 2301040"],"award-info":[{"award-number":["4000171342, NSF 2244651, and 2301040"]}]},{"name":"Exascale Computing Project","award":["17-SC-20-SC"],"award-info":[{"award-number":["17-SC-20-SC"]}]},{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"crossref","award":["DE-AC05-00OR22725"],"award-info":[{"award-number":["DE-AC05-00OR22725"]}],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100016818","name":"UT-Battelle, LLC","doi-asserted-by":"crossref","award":["DE-AC05-00OR22725"],"award-info":[{"award-number":["DE-AC05-00OR22725"]}],"id":[{"id":"10.13039\/100016818","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2023,4,30]]},"abstract":"<jats:p>\n            In this article, we present a four-layer distributed simulation system and its adaptation to the Material Point Method (MPM). The system is built upon a performance portable\n            <jats:monospace>C++<\/jats:monospace>\n            programming model targeting major High-Performance-Computing (HPC) platforms. A key ingredient of our system is a hierarchical block-tile-cell sparse grid data structure that is distributable to an arbitrary number of Message Passing Interface (MPI) ranks. We additionally propose strategies for efficient dynamic load balance optimization to maximize the efficiency of MPI tasks. Our simulation pipeline can easily switch among backend programming models, including OpenMP and CUDA, and can be effortlessly dispatched onto supercomputers and the cloud. Finally, we construct benchmark experiments and ablation studies on supercomputers and consumer workstations in a local network to evaluate the scalability and load balancing criteria. We demonstrate massively parallel, highly scalable, and gigascale resolution MPM simulations of up to 1.01 billion particles for less than 323.25 seconds per frame with 8 OpenSSH-connected workstations.\n          <\/jats:p>","DOI":"10.1145\/3570160","type":"journal-article","created":{"date-parts":[[2022,11,2]],"date-time":"2022-11-02T13:22:19Z","timestamp":1667395339000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["A Sparse Distributed Gigascale Resolution Material Point Method"],"prefix":"10.1145","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2390-0282","authenticated-orcid":false,"given":"Yuxing","family":"Qiu","sequence":"first","affiliation":[{"name":"University of California, Los Angeles, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4250-9476","authenticated-orcid":false,"given":"Samuel Temple","family":"Reeve","sequence":"additional","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9868-7311","authenticated-orcid":false,"given":"Minchen","family":"Li","sequence":"additional","affiliation":[{"name":"University of California, Los Angeles &amp; Timestep Technologies"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7645-5931","authenticated-orcid":false,"given":"Yin","family":"Yang","sequence":"additional","affiliation":[{"name":"University of Utah &amp; Timestep Technologies, Salt Lake City, UT"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0103-888X","authenticated-orcid":false,"given":"Stuart Ryan","family":"Slattery","sequence":"additional","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3506-0583","authenticated-orcid":false,"given":"Chenfanfu","family":"Jiang","sequence":"additional","affiliation":[{"name":"University of California, Los Angeles &amp; Timestep Technologies"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,1,16]]},"reference":[{"key":"e_1_3_4_2_1","first-page":"42","volume-title":"Proceedings of the ACM Workshop on General-Purpose Computing on Graphics Processors","volume":"41","author":"Amada T.","year":"2004","unstructured":"T. Amada, M. Imura, Y. Yasumuro, Y. Manabe, and K. Chihara. 2004. Particle-based fluid simulation on GPU. In Proceedings of the ACM Workshop on General-Purpose Computing on Graphics Processors. Vol. 41, 42."},{"key":"e_1_3_4_3_1","first-page":"1","volume-title":"Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis","author":"Bauer Michael","year":"2012","unstructured":"Michael Bauer, Sean Treichler, Elliott Slaughter, and Alex Aiken. 2012. Legion: Expressing locality and independence with logical regions. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. IEEE, 1\u201311."},{"key":"e_1_3_4_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.1987.1676942"},{"key":"e_1_3_4_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3450623.3464642"},{"key":"e_1_3_4_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2007.370258"},{"key":"e_1_3_4_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964977"},{"key":"e_1_3_4_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2013.19"},{"key":"e_1_3_4_9_1","doi-asserted-by":"crossref","unstructured":"N. Chentanez M. M\u00fcller and T. Kim. 2015. Coupling 3D eulerian heightfield and particle methods for interactive simulation of large scale liquid phenomena. IEEE Transactions on Visualization and Computer Graphics 21 10 (2015) 1116\u20131128.","DOI":"10.1109\/TVCG.2015.2449303"},{"key":"e_1_3_4_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1730804.1730807"},{"key":"e_1_3_4_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/99.660313"},{"key":"e_1_3_4_12_1","unstructured":"OpenBSD developers. 2021. OpenSSH. Retrieved October 20 2022 from https:\/\/www.openssh.com\/."},{"key":"e_1_3_4_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2014.07.003"},{"key":"e_1_3_4_14_1","first-page":"195","volume-title":"Computer Graphics Forum","author":"Fang Yu","year":"2018","unstructured":"Yu Fang, Yuanming Hu, Shi-Min Hu, and Chenfanfu Jiang. 2018. A temporally adaptive material point method with regional time stepping. Computer Graphics Forum 37, 8 (2018), 195\u2013204."},{"key":"e_1_3_4_15_1","unstructured":"Yun Fei Yuhan Huang and Ming Gao. 2021. Principles towards real-time simulation of material point method on modern GPUs. arXiv:2111.00699. Retrieved from https:\/\/arxiv.org\/abs\/2111.00699."},{"issue":"6","key":"e_1_3_4_16_1","first-page":"1","article-title":"GPU optimization of material point methods","volume":"37","author":"Gao Ming","year":"2018","unstructured":"Ming Gao, Xinlei Wang, Kui Wu, Andre Pradhana, Eftychios Sifakis, Cem Yuksel, and Chenfanfu Jiang. 2018. GPU optimization of material point methods. ACM Transactions on Graphics 37, 6 (2018), 1\u201312.","journal-title":"ACM Transactions on Graphics"},{"key":"e_1_3_4_17_1","unstructured":"P. Goswami P. Schlegel B. Solenthaler and R. Pajarola. 2010. Interactive SPH simulation and rendering on the GPU. In Proceedings of the 2010 ACM SIGGRAPH\/Eurographics Symposium on Computer Animation . Eurographics Association 55\u201364."},{"key":"e_1_3_4_18_1","first-page":"109","volume-title":"Proceedings of the High Performance Graphics","author":"Hoetzlein R.","year":"2016","unstructured":"R. Hoetzlein. 2016. GVDB: Raytracing sparse voxel database structures on the GPU. In Proceedings of the High Performance Graphics. Eurographics Association, 109\u2013117."},{"issue":"6","key":"e_1_3_4_19_1","first-page":"1","article-title":"Taichi: A language for high-performance computation on spatially sparse data structures","volume":"38","author":"Hu Yuanming","year":"2019","unstructured":"Yuanming Hu, Tzu-Mao Li, Luke Anderson, Jonathan Ragan-Kelley, and Fr\u00e9do Durand. 2019. Taichi: A language for high-performance computation on spatially sparse data structures. ACM Transactions on Graphics 38, 6 (2019), 1\u201316.","journal-title":"ACM Transactions on Graphics"},{"key":"e_1_3_4_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3450626.3459671"},{"key":"e_1_3_4_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897826.2927348"},{"key":"e_1_3_4_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/165854.165874"},{"key":"e_1_3_4_23_1","volume-title":"Proceedings of the 8th SIAM Conference on Parallel Processing for Scientific Computing","author":"Karypis George","year":"1997","unstructured":"George Karypis and Vipin Kumar. 1997. A Coarse-Grain parallel formulation of multilevel k-way graph partitioning algorithm. In Proceedings of the 8th SIAM Conference on Parallel Processing for Scientific Computing."},{"key":"e_1_3_4_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3084363.3085066"},{"key":"e_1_3_4_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925906"},{"key":"e_1_3_4_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3528223.3530058"},{"issue":"4","key":"e_1_3_4_27_1","first-page":"1","article-title":"Energetically consistent inelasticity for optimization time integration","volume":"41","author":"Li Xuan","year":"2022","unstructured":"Xuan Li, Minchen Li, and Chenfanfu Jiang. 2022. Energetically consistent inelasticity for optimization time integration. ACM Transactions on Graphics 41, 4 (2022), 1\u201316.","journal-title":"ACM Transactions on Graphics"},{"key":"e_1_3_4_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3272127.3275012"},{"key":"e_1_3_4_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980179.2982424"},{"key":"e_1_3_4_30_1","first-page":"513","volume-title":"Proceedings of the 2017 USENIX Conference on USENIX Annual Technical Conference","author":"Mashayekhi Omid","year":"2017","unstructured":"Omid Mashayekhi, Hang Qu, Chinmayee Shah, and Philip Levis. 2017. Execution templates: Caching control plane decisions for strong scaling of data analytics. In Proceedings of the 2017 USENIX Conference on USENIX Annual Technical Conference. 513\u2013526."},{"key":"e_1_3_4_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173551"},{"key":"e_1_3_4_32_1","unstructured":"David S. Medina Amik St-Cyr and Tim Warburton. 2014. OCCA: A unified approach to multi-threading languages. arXiv:1403.0968. Retrieved from https:\/\/arxiv.org\/abs\/1403.0968."},{"key":"e_1_3_4_33_1","unstructured":"OpenMPI Team Members. 2021. OpenMPI. Retrieved November 24 2021 fromhttps:\/\/www.open-mpi.org\/"},{"key":"e_1_3_4_34_1","doi-asserted-by":"publisher","DOI":"10.1177\/10943420211022829"},{"key":"e_1_3_4_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487228.2487235"},{"key":"e_1_3_4_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/3450623.3464653"},{"key":"e_1_3_4_37_1","volume-title":"ACM Transactions on Graphics","author":"Pfaff T.","year":"2010","unstructured":"T. Pfaff, N. Thuerey, J. Cohen, S. Tariq, and M. Gross. 2010. Scalable fluid simulation using anisotropic turbulence particles. ACM Transactions on Graphics 29, 6 (2010), 1\u20138."},{"key":"e_1_3_4_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3190508.3190516"},{"key":"e_1_3_4_39_1","first-page":"375","volume-title":"Computer Graphics Forum","author":"Qu Hang","year":"2020","unstructured":"Hang Qu, Omid Mashayekhi, Chinmayee Shah, and Philip Levis. 2020. Accelerating distributed graphical fluid simulations with micro-partitioning. Computer Graphics Forum 39, 1 (2020), 375\u2013388."},{"key":"e_1_3_4_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661269"},{"key":"e_1_3_4_41_1","first-page":"35","volume-title":"Computer Graphics Forum","author":"Shah Chinmayee","year":"2018","unstructured":"Chinmayee Shah, David Hyde, Hang Qu, and Philip Levis. 2018. Distributing and load balancing sparse fluid simulations. Computer Graphics Forum 37, 8 (2018), 35\u201346."},{"key":"e_1_3_4_42_1","doi-asserted-by":"publisher","DOI":"10.21105\/joss.04115"},{"key":"e_1_3_4_43_1","volume-title":"MPI\u2013The Complete Reference: The MPI Core","author":"Snir Marc","year":"1998","unstructured":"Marc Snir, William Gropp, Steve Otto, Steven Huss-Lederman, Jack Dongarra, and David Walker. 1998. MPI\u2013The Complete Reference: The MPI Core. Vol. 1, MIT Press."},{"key":"e_1_3_4_44_1","first-page":"25","volume-title":"Proceedings of the 11th ACM SIGGRAPH\/Eurographics Conference on Computer Animation","author":"Stomakhin Alexey","year":"2012","unstructured":"Alexey Stomakhin, Russell Howes, Craig Schroeder, and Joseph M. Teran. 2012. Energetically consistent invertible elasticity. In Proceedings of the 11th ACM SIGGRAPH\/Eurographics Conference on Computer Animation. 25\u201332."},{"key":"e_1_3_4_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/2461912.2461948"},{"key":"e_1_3_4_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-21909-7_12"},{"key":"e_1_3_4_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073651"},{"key":"e_1_3_4_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2021.3097283"},{"key":"e_1_3_4_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3272127.3275086"},{"key":"e_1_3_4_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3386569.3392442"},{"key":"e_1_3_4_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/1370082.1370085"},{"key":"e_1_3_4_52_1","unstructured":"R. Winchenbach H. Hochstetter and A. Kolb. 2016. Constrained neighbor lists for SPH-based fluid simulations. In Proceedings of the ACM SIGGRAPH\/Eurographics Symposium on Computer Animation . Eurographics Association 49\u201356."},{"key":"e_1_3_4_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3306346.3322949"},{"key":"e_1_3_4_54_1","doi-asserted-by":"crossref","unstructured":"K. Wu N. Truong C. Yuksel and R. Hoetzlein. 2018. Fast fluid simulations with sparse volumes on the GPU. Computer Graphics Forum 37 2 (2018) 157\u2013167.","DOI":"10.1111\/cgf.13350"},{"key":"e_1_3_4_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2016.50"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3570160","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3570160","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3570160","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:37Z","timestamp":1750182577000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3570160"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,16]]},"references-count":54,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,4,30]]}},"alternative-id":["10.1145\/3570160"],"URL":"https:\/\/doi.org\/10.1145\/3570160","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,16]]},"assertion":[{"value":"2022-08-07","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-10-18","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-01-16","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}