{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:19:19Z","timestamp":1750220359579,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":35,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T00:00:00Z","timestamp":1628467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"NSF (National Science Foundation)","doi-asserted-by":"publisher","award":["2018758"],"award-info":[{"award-number":["2018758"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,9]]},"DOI":"10.1145\/3472456.3472522","type":"proceedings-article","created":{"date-parts":[[2021,10,5]],"date-time":"2021-10-05T18:39:57Z","timestamp":1633459197000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Optimizing Work Stealing Communication with Structured Atomic Operations"],"prefix":"10.1145","author":[{"given":"Hannah","family":"Cartier","sequence":"first","affiliation":[{"name":"Rhodes College"}]},{"given":"James","family":"Dinan","sequence":"additional","affiliation":[{"name":"NVIDIA"}]},{"given":"D. Brian","family":"Larkins","sequence":"additional","affiliation":[{"name":"Rhodes College"}]}],"member":"320","published-online":{"date-parts":[[2021,10,5]]},"reference":[{"doi-asserted-by":"publisher","key":"e_1_3_2_1_1_1","DOI":"10.1145\/2442516.2442538"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_3_1","DOI":"10.5555\/2388996.2389086"},{"volume-title":"Proc. 1st ACM SIGPLAN Intl. Conf. on Functional Programming (ICFP)","author":"E.","unstructured":"Guy\u00a0 E. 
Blelloch and John Greiner. 1996. A Provable Time and Space Efficient Implementation of NESL . In Proc. 1st ACM SIGPLAN Intl. Conf. on Functional Programming (ICFP) . Philadelphia, Pennsylvania, 213\u2013225. Guy\u00a0E. Blelloch and John Greiner. 1996. A Provable Time and Space Efficient Implementation of NESL. In Proc. 1st ACM SIGPLAN Intl. Conf. on Functional Programming (ICFP). Philadelphia, Pennsylvania, 213\u2013225.","key":"e_1_3_2_1_4_1"},{"volume-title":"Proc. 35th Symposium on Foundations of Computer Science (FOCS). 356\u2013368","author":"D.","unstructured":"Robert\u00a0 D. Blumofe and Charles Leiserson. 1994. Scheduling multithreaded computations by work stealing . In Proc. 35th Symposium on Foundations of Computer Science (FOCS). 356\u2013368 . Robert\u00a0D. Blumofe and Charles Leiserson. 1994. Scheduling multithreaded computations by work stealing. In Proc. 35th Symposium on Foundations of Computer Science (FOCS). 356\u2013368.","key":"e_1_3_2_1_5_1"},{"volume-title":"Proc. USENIX Annual Technical Conference (ATEC)","author":"D.","unstructured":"Robert\u00a0 D. Blumofe and Philip\u00a0A. Lisiecki. 1997. Adaptive and reliable parallel computing on networks of workstations . In Proc. USENIX Annual Technical Conference (ATEC) ( Anaheim, California). 10\u201310. Robert\u00a0D. Blumofe and Philip\u00a0A. Lisiecki. 1997. Adaptive and reliable parallel computing on networks of workstations. In Proc. USENIX Annual Technical Conference (ATEC) (Anaheim, California). 10\u201310.","key":"e_1_3_2_1_6_1"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_7_1","DOI":"10.1109\/IPDPS.2007.370258"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_8_1","DOI":"10.1145\/1094811.1094852"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_9_1","DOI":"10.1109\/ICPP.2008.88"},{"volume-title":"Proc. 37th Intl. Conf. on Parallel Processing (ICPP). 
586\u2013593","author":"Dinan James","unstructured":"James Dinan , Sriram Krishnamoorthy , D.\u00a0 Brian Larkins , Jarek Nieplocha , and P. Sadayappan . 2008. Scioto: A Framework for Global-View Task Parallelism . In Proc. 37th Intl. Conf. on Parallel Processing (ICPP). 586\u2013593 . James Dinan, Sriram Krishnamoorthy, D.\u00a0Brian Larkins, Jarek Nieplocha, and P. Sadayappan. 2008. Scioto: A Framework for Global-View Task Parallelism. In Proc. 37th Intl. Conf. on Parallel Processing (ICPP). 586\u2013593.","key":"e_1_3_2_1_10_1"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_11_1","DOI":"10.1145\/1654059.1654113"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_12_1","DOI":"10.1016\/j.simpat.2008.06.004"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_13_1","DOI":"10.1145\/3095770.3095774"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_14_1","DOI":"10.1145\/277650.277725"},{"key":"e_1_3_2_1_15_1","volume-title":"Proc. 23rd Intl. Parallel and Distributed Processing Symposium (IPDPS).","author":"Guo Yi","year":"2009","unstructured":"Yi Guo , Rajkishore Barik , Raghavan Raman , and Vivek Sarkar . 2009 . Work-First and Help-First Scheduling Policies for Terminally Strict Parallel Programs . In Proc. 23rd Intl. Parallel and Distributed Processing Symposium (IPDPS). Yi Guo, Rajkishore Barik, Raghavan Raman, and Vivek Sarkar. 2009. Work-First and Help-First Scheduling Policies for Terminally Strict Parallel Programs. In Proc. 23rd Intl. Parallel and Distributed Processing Symposium (IPDPS)."},{"key":"e_1_3_2_1_16_1","volume-title":"Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming","author":"Guo Yi","year":"2010","unstructured":"Yi Guo , Jisheng Zhao , Vincent Cave , and Vivek Sarkar . 2010 . SLAW: A Scalable Locality-aware Adaptive Work-stealing Scheduler for Multi-core Systems . In Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming ( Bangalore, India) (PPoPP \u201910). 
ACM, New York, NY, USA, 341\u2013342. https:\/\/doi.org\/10.1145\/1693453.1693504 Yi Guo, Jisheng Zhao, Vincent Cave, and Vivek Sarkar. 2010. SLAW: A Scalable Locality-aware Adaptive Work-stealing Scheduler for Multi-core Systems. In Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (Bangalore, India) (PPoPP \u201910). ACM, New York, NY, USA, 341\u2013342. https:\/\/doi.org\/10.1145\/1693453.1693504"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_17_1","DOI":"10.1145\/571825.571876"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_18_1","DOI":"10.1006\/jpdc.1994.1070"},{"doi-asserted-by":"crossref","unstructured":"Vivek Kumar Karthik Murthy Vivek Sarkar and Yili Zheng. 2016. Optimized Distributed Work-Stealing. 74\u201377. https:\/\/doi.org\/10.1109\/IA3.2016.019  Vivek Kumar Karthik Murthy Vivek Sarkar and Yili Zheng. 2016. Optimized Distributed Work-Stealing. 74\u201377. https:\/\/doi.org\/10.1109\/IA3.2016.019","key":"e_1_3_2_1_19_1","DOI":"10.1109\/IA3.2016.019"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_20_1","DOI":"10.1145\/344588.344618"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_21_1","DOI":"10.1145\/3337821.3337878"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_22_1","DOI":"10.1109\/HPEC.2016.7761580"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_23_1","DOI":"10.1145\/1504176.1504186"},{"key":"e_1_3_2_1_24_1","volume-title":"MPI: A Message-Passing Interface Standard Version 3.1. Technical Report","author":"Forum MPI","year":"2015","unstructured":"MPI Forum . 2015 . MPI: A Message-Passing Interface Standard Version 3.1. Technical Report . University of Tennessee , Knoxville. MPI Forum. 2015. MPI: A Message-Passing Interface Standard Version 3.1. Technical Report. University of Tennessee, Knoxville."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_25_1","DOI":"10.5555\/645611.662053"},{"unstructured":"NVIDIA Inc. 2021. MLNX_OFED Documentation Rev 5.3-1.0.0.1. 
https:\/\/docs.mellanox.com\/display\/MLNXOFEDv461000\/Advanced+Transport.  NVIDIA Inc. 2021. MLNX_OFED Documentation Rev 5.3-1.0.0.1. https:\/\/docs.mellanox.com\/display\/MLNXOFEDv461000\/Advanced+Transport.","key":"e_1_3_2_1_26_1"},{"key":"e_1_3_2_1_27_1","first-page":"12","article-title":"OpenFabrics Interface Application Programming Interface","volume":"1","author":"Alliance OpenFabrics","year":"2021","unstructured":"OpenFabrics Alliance 2021 . OpenFabrics Interface Application Programming Interface , Version 1 . 12 .1. https:\/\/ofiwg.github.io\/libfabric\/. OpenFabrics Alliance 2021. OpenFabrics Interface Application Programming Interface, Version 1.12.1. https:\/\/ofiwg.github.io\/libfabric\/.","journal-title":"Version"},{"unstructured":"OpenSHMEM Specification Committee 2020. OpenSHMEM Application Programming Interface Version 1.5. http:\/\/www.openshmem.org.  OpenSHMEM Specification Committee 2020. OpenSHMEM Application Programming Interface Version 1.5. http:\/\/www.openshmem.org.","key":"e_1_3_2_1_28_1"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_29_1","DOI":"10.1145\/1941553.1941582"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_30_1","DOI":"10.1109\/IPPS.1993.262887"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_31_1","DOI":"10.1145\/2370036.2145850"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_32_1","DOI":"10.1016\/j.jpdc.2007.11.002"},{"key":"e_1_3_2_1_33_1","first-page":"10","article-title":"Unified Communication X Application Programming Interface","volume":"1","author":"Unified Communication Framework Consortium","year":"2021","unstructured":"Unified Communication Framework Consortium 2021 . Unified Communication X Application Programming Interface , Version 1 . 10 . https:\/\/www.openucx.org\/documentation\/. Unified Communication Framework Consortium 2021. Unified Communication X Application Programming Interface, Version 1.10. 
https:\/\/www.openucx.org\/documentation\/.","journal-title":"Version"},{"volume-title":"UPC Language and Library Specifications, v1.3","author":"UPC Consortium","unstructured":"UPC Consortium . 2013. UPC Language and Library Specifications, v1.3 . Technical Report LBNL-6623E. Lawrence Berkeley National Lab . UPC Consortium. 2013. UPC Language and Library Specifications, v1.3. Technical Report LBNL-6623E. Lawrence Berkeley National Lab.","key":"e_1_3_2_1_34_1"},{"doi-asserted-by":"crossref","unstructured":"Ke Wang Xiaobing Zhou Tonglin Li Dongfang Zhao Michael Lang and Ioan Raicu. 2014. Optimizing Load Balancing and Data-Locality with Data-aware Scheduling. https:\/\/doi.org\/10.13140\/2.1.4577.8880  Ke Wang Xiaobing Zhou Tonglin Li Dongfang Zhao Michael Lang and Ioan Raicu. 2014. Optimizing Load Balancing and Data-Locality with Data-aware Scheduling. https:\/\/doi.org\/10.13140\/2.1.4577.8880","key":"e_1_3_2_1_35_1","DOI":"10.1109\/BigData.2014.7004220"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_36_1","DOI":"10.1109\/ICPPW.2010.65"}],"event":{"acronym":"ICPP 2021","name":"ICPP 2021: 50th International Conference on Parallel Processing","location":"Lemont IL USA"},"container-title":["50th International Conference on Parallel 
Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472456.3472522","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472456.3472522","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472456.3472522","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:23Z","timestamp":1750191443000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472456.3472522"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,9]]},"references-count":35,"alternative-id":["10.1145\/3472456.3472522","10.1145\/3472456"],"URL":"https:\/\/doi.org\/10.1145\/3472456.3472522","relation":{},"subject":[],"published":{"date-parts":[[2021,8,9]]},"assertion":[{"value":"2021-10-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}