{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,29]],"date-time":"2026-05-29T11:59:27Z","timestamp":1780055967340,"version":"3.54.0"},"reference-count":55,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2018,12,4]],"date-time":"2018-12-04T00:00:00Z","timestamp":1543881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["CMMI-1538593, IIS-1253598, IIS-1755544, CCF-1533885, IIS1763638, CCF-1812944"],"award-info":[{"award-number":["CMMI-1538593, IIS-1253598, IIS-1755544, CCF-1533885, IIS1763638, CCF-1812944"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2018,12,31]]},"abstract":"<jats:p>The Material Point Method (MPM) has been shown to facilitate effective simulations of physically complex and topologically challenging materials, with a wealth of emerging applications in computational engineering and visual computing. Borne out of the extreme importance of regularity, MPM is given attractive parallelization opportunities on high-performance modern multiprocessors. Parallelization of MPM that fully leverages computing resources presents challenges that require exploring an extensive design-space for favorable data structures and algorithms. Unlike the conceptually simple CPU parallelization, where the coarse partition of tasks can be easily applied, it takes greater effort to reach the GPU hardware saturation due to its many-core SIMT architecture. In this paper we introduce methods for addressing the computational challenges of MPM and extending the capabilities of general simulation systems based on MPM, particularly concentrating on GPU optimization. In addition to our open-source high-performance framework, we also conduct performance analyses and benchmark experiments to compare against alternative design choices which may superficially appear to be reasonable, but can suffer from suboptimal performance in practice. Our explicit and fully implicit GPU MPM solvers are further equipped with a Moving Least Squares MPM heat solver and a novel sand constitutive model to enable fast simulations of a wide range of materials. We demonstrate that more than an order of magnitude performance improvement can be achieved with our GPU solvers. Practical high-resolution examples with up to ten million particles run in less than one minute per frame.<\/jats:p>","DOI":"10.1145\/3272127.3275044","type":"journal-article","created":{"date-parts":[[2018,11,28]],"date-time":"2018-11-28T19:16:10Z","timestamp":1543432570000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":61,"title":["GPU optimization of material point methods"],"prefix":"10.1145","volume":"37","author":[{"given":"Ming","family":"Gao","sequence":"first","affiliation":[{"name":"University of Pennsylvania"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xinlei","family":"Wang","sequence":"additional","affiliation":[{"name":"University of Pennsylvania"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kui","family":"Wu","sequence":"additional","affiliation":[{"name":"University of Utah"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Andre","family":"Pradhana","sequence":"additional","affiliation":[{"name":"University of Pennsylvania and DreamWorks Animation"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Eftychios","family":"Sifakis","sequence":"additional","affiliation":[{"name":"University of Wisconsin - Madison"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Cem","family":"Yuksel","sequence":"additional","affiliation":[{"name":"University of Utah"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chenfanfu","family":"Jiang","sequence":"additional","affiliation":[{"name":"University of Pennsylvania"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2018,12,4]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073625"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9991(88)90123-4"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1278780.1278807"},{"key":"e_1_2_2_4_1","volume-title":"Fluid simulation for computer graphics","author":"Bridson R.","unstructured":"R. Bridson . 2008. Fluid simulation for computer graphics . Taylor & Francis . R. Bridson. 2008. Fluid simulation for computer graphics. Taylor & Francis."},{"key":"e_1_2_2_5_1","volume-title":"Symp App Accel High Perf Comp","author":"Chiang W-F.","year":"2009","unstructured":"W-F. Chiang , M. DeLisi , T. Hummel , T. Prete , K. Tew , M. Hall , P. Wallstedt , and J. Guilkey . 2009. GPU acceleration of the Generalized Interpolation Material Point method . Symp App Accel High Perf Comp ( 2009 ). W-F. Chiang, M. DeLisi, T. Hummel, T. Prete, K. Tew, M. Hall, P. Wallstedt, and J. Guilkey. 2009. GPU acceleration of the Generalized Interpolation Material Point method. Symp App Accel High Perf Comp (2009)."},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3092818"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925877"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.compgeo.2018.04.001"},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.compgeo.2015.01.009"},{"key":"e_1_2_2_10_1","volume-title":"Proceedings of the ACM SIGGRAPH\/Eurographics","author":"Y.","unstructured":"Y. Fang*, Y. Hu*, S. Hu , and C. Jiang . 2018. A Temporally Adaptive Material Point Method with Regional Time Stepping . In Proceedings of the ACM SIGGRAPH\/Eurographics Y. Fang*, Y. Hu*, S. Hu, and C. Jiang. 2018. A Temporally Adaptive Material Point Method with Regional Time Stepping. In Proceedings of the ACM SIGGRAPH\/Eurographics"},{"key":"e_1_2_2_11_1","volume-title":"Computer Animation (SCA '18)","author":"Symposium","unstructured":"Symposium on Computer Animation (SCA '18) . Eurographics Association. (*Joint First Authors). Symposium on Computer Animation (SCA '18). Eurographics Association. (*Joint First Authors)."},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201309"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3130800.3130879"},{"key":"e_1_2_2_15_1","volume-title":"Supplemental Document: GPU Optimization of Material Point Methods.","author":"Gao M.","year":"2018","unstructured":"M. Gao , X Wang , K Wu , A. Pradhana , E. Sifakis , C. Yuksel , and C. Jiang . 2018 b. Supplemental Document: GPU Optimization of Material Point Methods. (2018). M. Gao, X Wang, K Wu, A. Pradhana, E. Sifakis, C. Yuksel, and C. Jiang. 2018b. Supplemental Document: GPU Optimization of Material Point Methods. (2018)."},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cag.2006.09.005"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073301"},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201346"},{"key":"e_1_2_2_20_1","volume-title":"Comp Graph Int","volume":"40","author":"Harada T.","unstructured":"T. Harada , S. Koshizuka , and Y. Kawaguchi . 2007. Smoothed particle hydrodynamics on GPUs . In Comp Graph Int , Vol. 40 . SBC Petropolis, 63--70. T. Harada, S. Koshizuka, and Y. Kawaguchi. 2007. Smoothed particle hydrodynamics on GPUs. In Comp Graph Int, Vol. 40. SBC Petropolis, 63--70."},{"key":"e_1_2_2_21_1","volume-title":"GVDB: Raytracing sparse voxel database structures on the GPU. In Proc of High Perf Graph","author":"Hoetzlein R. K.","year":"2016","unstructured":"R. K. Hoetzlein . 2016 . GVDB: Raytracing sparse voxel database structures on the GPU. In Proc of High Perf Graph . Eurographics Association , 109--117. R. K. Hoetzlein. 2016. GVDB: Raytracing sparse voxel database structures on the GPU. In Proc of High Perf Graph. Eurographics Association, 109--117."},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531326.1531347"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201293"},{"key":"e_1_2_2_24_1","first-page":"119","article-title":"Shared memory OpenMP parallelization of explicit MPM and its application to hypervelocity impact","volume":"38","author":"Huang P.","year":"2008","unstructured":"P. Huang , X. Zhang , S. Ma , and H. K. Wang . 2008 . Shared memory OpenMP parallelization of explicit MPM and its application to hypervelocity impact . Comp Mod in Eng and Sci 38 (2008), 119 -- 148 . P. Huang, X. Zhang, S. Ma, and H. K. Wang. 2008. Shared memory OpenMP parallelization of explicit MPM and its application to hypervelocity impact. Comp Mod in Eng and Sci 38 (2008), 119--148.","journal-title":"Comp Mod in Eng and Sci"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073623"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766996"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2017.02.050"},{"key":"e_1_2_2_28_1","volume-title":"Speculative Atomics: A Case-Study of the GPU Optimization of the Material Point Method for Graphics. In GPU Technology Conference.","author":"Kl\u00e1r G.","year":"2014","unstructured":"G. Kl\u00e1r . 2014 . Speculative Atomics: A Case-Study of the GPU Optimization of the Material Point Method for Graphics. In GPU Technology Conference. G. Kl\u00e1r. 2014. Speculative Atomics: A Case-Study of the GPU Optimization of the Material Point Method for Graphics. In GPU Technology Conference."},{"key":"e_1_2_2_29_1","volume-title":"Blasting Sand with CUDA: MPM Sand Simulation for VFX. In GPU Technology Conference.","author":"Kl\u00e1r G.","year":"2017","unstructured":"G. Kl\u00e1r . 2017 . Blasting Sand with CUDA: MPM Sand Simulation for VFX. In GPU Technology Conference. G. Kl\u00e1r. 2017. Blasting Sand with CUDA: MPM Sand Simulation for VFX. In GPU Technology Conference."},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3084363.3085066"},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925906"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980179.2982430"},{"key":"e_1_2_2_33_1","volume-title":"Faster parallel reductions on Kepler. Nvidia","author":"Luitjens Justin","year":"2014","unstructured":"Justin Luitjens . 2014. Faster parallel reductions on Kepler. Nvidia ( 2014 ). Justin Luitjens. 2014. Faster parallel reductions on Kepler. Nvidia (2014)."},{"key":"e_1_2_2_34_1","unstructured":"A. McAdams A. Selle R. Tamstorf J. Teran and E. Sifakis. 2011. Computing the singular value decomposition of 3\u00d7 3 matrices with minimal branching and elementary floating point operations. University of Wisconsin Madison (2011).  A. McAdams A. Selle R. Tamstorf J. Teran and E. Sifakis. 2011. Computing the singular value decomposition of 3\u00d7 3 matrices with minimal branching and elementary floating point operations. University of Wisconsin Madison (2011)."},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487228.2487235"},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.5555\/1708430.1708501"},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2786784.2786798"},{"key":"e_1_2_2_38_1","unstructured":"K. P Ruggirello and S. C. Schumacher. 2014. A comparison of parallelization strategies for the material point method. In 11th World Cong on Comp Mech. 20--25.  K. P Ruggirello and S. C. Schumacher. 2014. A comparison of parallelization strategies for the material point method. In 11th World Cong on Comp Mech. 20--25."},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661269"},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2008.05.009"},{"key":"e_1_2_2_41_1","volume-title":"In Proc Symp Comp Anim. 25--32","author":"Stomakhin A.","unstructured":"A. Stomakhin , R. Howes , C. Schroeder , and J. Teran . 2012. Energetically consistent invertible elasticity . In In Proc Symp Comp Anim. 25--32 . A. Stomakhin, R. Howes, C. Schroeder, and J. Teran. 2012. Energetically consistent invertible elasticity. In In Proc Symp Comp Anim. 25--32."},{"key":"e_1_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2461912.2461948"},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2601097.2601176"},{"key":"e_1_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1016\/0010-4655(94)00170-7"},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073651"},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3203188"},{"key":"e_1_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.12208"},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.12851"},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201320"},{"key":"e_1_2_2_50_1","volume-title":"Comp Graph Forum","volume":"37","author":"Wang X.","unstructured":"X. Wang , M. Tang , D. Manocha , and R. Tong . 2018. Efficient BVH-based collision detection scheme with ordering and restructuring . In Comp Graph Forum , Vol. 37 . Wiley Online Library, 227--237. X. Wang, M. Tang, D. Manocha, and R. Tong. 2018. Efficient BVH-based collision detection scheme with ordering and restructuring. In Comp Graph Forum, Vol. 37. Wiley Online Library, 227--237."},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.13113"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cag.2017.10.005"},{"key":"e_1_2_2_53_1","doi-asserted-by":"crossref","unstructured":"K. Wu N. Truong Yuksel C. and Hoetzlein R. 2018. Fast fluid simulations with sparse volumes on the GPU. Comp Graph Forum (2018).  K. Wu N. Truong Yuksel C. and Hoetzlein R. 2018. Fast fluid simulations with sparse volumes on the GPU. Comp Graph Forum (2018).","DOI":"10.1111\/cgf.13350"},{"key":"e_1_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2751541"},{"key":"e_1_2_2_55_1","volume-title":"Proc of the Fifth Eurographics \/ IEEE VGTC Conf on Point-Based Graph. 137--146","author":"Zhang Y.","unstructured":"Y. Zhang , B. Solenthaler , and R. Pajarola . 2008. Adaptive sampling and rendering of fluids on the GPU . In Proc of the Fifth Eurographics \/ IEEE VGTC Conf on Point-Based Graph. 137--146 . Y. Zhang, B. Solenthaler, and R. Pajarola. 2008. Adaptive sampling and rendering of fluids on the GPU. In Proc of the Fifth Eurographics \/ IEEE VGTC Conf on Point-Based Graph. 137--146."},{"key":"e_1_2_2_56_1","first-page":"143","article-title":"An alternated grid updating parallel algorithm for material point method using OpenMP","volume":"69","author":"Zhang Y.","year":"2010","unstructured":"Y. Zhang , X. Zhang , and Y. Liu . 2010 . An alternated grid updating parallel algorithm for material point method using OpenMP . Comp Modeling in Eng and Sci 69 , 2 (2010), 143 -- 165 . Y. Zhang, X. Zhang, and Y. Liu. 2010. An alternated grid updating parallel algorithm for material point method using OpenMP. Comp Modeling in Eng and Sci 69, 2 (2010), 143--165.","journal-title":"Comp Modeling in Eng and Sci"},{"key":"e_1_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073298"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3272127.3275044","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3272127.3275044","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3272127.3275044","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:44:04Z","timestamp":1750207444000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3272127.3275044"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,12,4]]},"references-count":55,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2018,12,31]]}},"alternative-id":["10.1145\/3272127.3275044"],"URL":"https:\/\/doi.org\/10.1145\/3272127.3275044","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,12,4]]},"assertion":[{"value":"2018-12-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}