{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,2]],"date-time":"2025-11-02T13:55:51Z","timestamp":1762091751278,"version":"build-2065373602"},"publisher-location":"New York, NY, USA","reference-count":16,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,7,23]],"date-time":"2023-07-23T00:00:00Z","timestamp":1690070400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["2018627,2007991"],"award-info":[{"award-number":["2018627,2007991"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100008902","name":"Los Alamos National Laboratory","doi-asserted-by":"publisher","award":["19537"],"award-info":[{"award-number":["19537"]}],"id":[{"id":"10.13039\/100008902","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,7,23]]},"DOI":"10.1145\/3569951.3593595","type":"proceedings-article","created":{"date-parts":[[2023,9,10]],"date-time":"2023-09-10T15:34:03Z","timestamp":1694360043000},"page":"94-101","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["DPU-Bench: A Micro-Benchmark Suite to Measure Offload Efficiency Of SmartNICs"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7450-5787","authenticated-orcid":false,"given":"Benjamin","family":"Michalowicz","sequence":"first","affiliation":[{"name":"The Ohio State University, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3705-2387","authenticated-orcid":false,"given":"Kaushik","family":"Kandadi Suresh","sequence":"additional","affiliation":[{"name":"The Ohio State University, 
USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1200-2754","authenticated-orcid":false,"given":"Hari","family":"Subramoni","sequence":"additional","affiliation":[{"name":"The Ohio State University, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0356-1781","authenticated-orcid":false,"given":"Dhabaleswar","family":"Panda","sequence":"additional","affiliation":[{"name":"The Ohio State University, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4531-7453","authenticated-orcid":false,"given":"Steve","family":"Poole","sequence":"additional","affiliation":[{"name":"Los Alamos National Laboratory, USA"}]}],"member":"320","published-online":{"date-parts":[[2023,9,10]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Argonne National Laboratory 2022. MPICH. https:\/\/www.mpich.org\/.  Argonne National Laboratory 2022. MPICH. https:\/\/www.mpich.org\/."},{"key":"e_1_3_2_1_2_1","unstructured":"Dotan Barak. 2014. Verbs Programming Tutorial. https:\/\/www.cs.mtsu.edu\/\u00a0waderholdt\/6430\/papers\/ibverbs.pdf  Dotan Barak. 2014. Verbs Programming Tutorial. https:\/\/www.cs.mtsu.edu\/\u00a0waderholdt\/6430\/papers\/ibverbs.pdf"},{"volume-title":"BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs","author":"Bayatpour Mohammadreza","key":"e_1_3_2_1_3_1","unstructured":"Mohammadreza Bayatpour , Nick Sarkauskas , Hari Subramoni , Jahanzeb Maqbool\u00a0Hashmi , and Dhabaleswar\u00a0 K. Panda . 2021. BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs . In High Performance Computing, Bradford\u00a0L. Chamberlain, Ana-Lucia Varbanescu, Hatem Ltaief, and Piotr Luszczek (Eds.). Springer International Publishing , Cham , 18\u201337. Mohammadreza Bayatpour, Nick Sarkauskas, Hari Subramoni, Jahanzeb Maqbool\u00a0Hashmi, and Dhabaleswar\u00a0K. Panda. 2021. BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs. 
In High Performance Computing, Bradford\u00a0L. Chamberlain, Ana-Lucia Varbanescu, Hatem Ltaief, and Piotr Luszczek (Eds.). Springer International Publishing, Cham, 18\u201337."},{"key":"e_1_3_2_1_4_1","volume-title":"Nvidia Data Center Processing Unit (DPU) Architecture. In 2021 IEEE Hot Chips 33 Symposium (HCS). 1\u201320","author":"Burstein Idan","year":"2021","unstructured":"Idan Burstein . 2021 . Nvidia Data Center Processing Unit (DPU) Architecture. In 2021 IEEE Hot Chips 33 Symposium (HCS). 1\u201320 . https:\/\/doi.org\/10.1109\/HCS52781.2021.9567066 10.1109\/HCS52781.2021.9567066 Idan Burstein. 2021. Nvidia Data Center Processing Unit (DPU) Architecture. In 2021 IEEE Hot Chips 33 Symposium (HCS). 1\u201320. https:\/\/doi.org\/10.1109\/HCS52781.2021.9567066"},{"key":"e_1_3_2_1_5_1","unstructured":"High Performance Compute Availability Group 2022. OpenHPCA Benchmark Suite. https:\/\/github.com\/openucx\/openhpca.  High Performance Compute Availability Group 2022. OpenHPCA Benchmark Suite. https:\/\/github.com\/openucx\/openhpca."},{"key":"e_1_3_2_1_6_1","unstructured":"Intel 2022. Intel MPI Benchmarks. https:\/\/www.intel.com\/content\/www\/us\/en\/developer\/articles\/technical\/intel-mpi-benchmarks.html.  Intel 2022. Intel MPI Benchmarks. https:\/\/www.intel.com\/content\/www\/us\/en\/developer\/articles\/technical\/intel-mpi-benchmarks.html."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTI52880.2021.00017"},{"key":"e_1_3_2_1_8_1","volume-title":"Mansa Kedia, Hari Subramoni, Arpan Jain, Aamir Shafi, Dhabaleswar Panda, Trey Dockendorf, Heechang Na, and Karen Tomko.","author":"Kousha Pouya","year":"2021","unstructured":"Pouya Kousha , Kamal\u00a0Raj Sankarapandian Dayala Ganesh\u00a0Ram , Mansa Kedia, Hari Subramoni, Arpan Jain, Aamir Shafi, Dhabaleswar Panda, Trey Dockendorf, Heechang Na, and Karen Tomko. 2021 . INAM : Cross-Stack Profiling and Analysis of Communication in MPI-Based Applications. 
In Practice and Experience in Advanced Research Computing (Boston, MA, USA) (PEARC \u201921). Association for Computing Machinery , New York, NY, USA, Article 14, 11\u00a0pages. https:\/\/doi.org\/10.1145\/3437359.3465582 10.1145\/3437359.3465582 Pouya Kousha, Kamal\u00a0Raj Sankarapandian Dayala Ganesh\u00a0Ram, Mansa Kedia, Hari Subramoni, Arpan Jain, Aamir Shafi, Dhabaleswar Panda, Trey Dockendorf, Heechang Na, and Karen Tomko. 2021. INAM: Cross-Stack Profiling and Analysis of Communication in MPI-Based Applications. In Practice and Experience in Advanced Research Computing (Boston, MA, USA) (PEARC \u201921). Association for Computing Machinery, New York, NY, USA, Article 14, 11\u00a0pages. https:\/\/doi.org\/10.1145\/3437359.3465582"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMST.2017.2746083"},{"key":"e_1_3_2_1_10_1","volume-title":"MPI: A Message-Passing Interface Standard Version 4.0. https:\/\/www.mpi-forum.org\/docs\/mpi-4.0\/mpi40-report.pdf","author":"Interface Forum Message Passing","year":"2021","unstructured":"Message Passing Interface Forum . 2021 . MPI: A Message-Passing Interface Standard Version 4.0. https:\/\/www.mpi-forum.org\/docs\/mpi-4.0\/mpi40-report.pdf Message Passing Interface Forum. 2021. MPI: A Message-Passing Interface Standard Version 4.0. https:\/\/www.mpi-forum.org\/docs\/mpi-4.0\/mpi40-report.pdf"},{"key":"e_1_3_2_1_11_1","unstructured":"Network-Based Computing Laboratory 2022. MVAPICH: MPI over InfiniBand 10GigE\/iWARP and RoCE. http:\/\/mvapich.cse.ohio-state.edu\/.  Network-Based Computing Laboratory 2022. MVAPICH: MPI over InfiniBand 10GigE\/iWARP and RoCE. http:\/\/mvapich.cse.ohio-state.edu\/."},{"key":"e_1_3_2_1_12_1","unstructured":"Network-Based Computing Laboratory 2022. OSU Microbenchmarks. http:\/\/mvapich.cse.ohio-state.edu\/benchmarks\/.  Network-Based Computing Laboratory 2022. OSU Microbenchmarks. 
http:\/\/mvapich.cse.ohio-state.edu\/benchmarks\/."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/HiPC53243.2021.00054"},{"key":"e_1_3_2_1_14_1","first-page":"287","article-title":"The Tau Parallel Performance System","volume":"20","author":"Shende S.","year":"2006","unstructured":"Sameer\u00a0 S. Shende and Allen\u00a0 D. Malony . 2006 . The Tau Parallel Performance System. Int. J. High Perform. Comput. Appl. 20 , 2 (may 2006), 287\u2013311. https:\/\/doi.org\/10.1177\/1094342006064482 10.1177\/1094342006064482 Sameer\u00a0S. Shende and Allen\u00a0D. Malony. 2006. The Tau Parallel Performance System. Int. J. High Perform. Comput. Appl. 20, 2 (may 2006), 287\u2013311. https:\/\/doi.org\/10.1177\/1094342006064482","journal-title":"Int. J. High Perform. Comput. Appl."},{"key":"e_1_3_2_1_15_1","volume-title":"Tools for High Performance Computing","author":"Terpstra Dan","year":"2009","unstructured":"Dan Terpstra , Heike Jagode , Haihang You , and Jack Dongarra . 2010. Collecting Performance Data with PAPI-C . In Tools for High Performance Computing 2009 , Matthias\u00a0S. M\u00fcller, Michael\u00a0M. Resch, Alexander Schulz, and Wolfgang\u00a0E. Nagel (Eds.). Springer Berlin Heidelberg , Berlin, Heidelberg, 157\u2013173. https:\/\/doi.org\/10.1007\/978-3-642-11261-4_11 10.1007\/978-3-642-11261-4_11 Dan Terpstra, Heike Jagode, Haihang You, and Jack Dongarra. 2010. Collecting Performance Data with PAPI-C. In Tools for High Performance Computing 2009, Matthias\u00a0S. M\u00fcller, Michael\u00a0M. Resch, Alexander Schulz, and Wolfgang\u00a0E. Nagel (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 157\u2013173. https:\/\/doi.org\/10.1007\/978-3-642-11261-4_11"},{"key":"e_1_3_2_1_16_1","unstructured":"Top500 2022. Top500. https:\/\/www.top500.org\/lists\/top500\/2022\/11\/.  Top500 2022. Top500. 
https:\/\/www.top500.org\/lists\/top500\/2022\/11\/."}],"event":{"name":"PEARC '23: Practice and Experience in Advanced Research Computing","sponsor":["SIGAPP ACM Special Interest Group on Applied Computing","SIGHPC ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing"],"location":"Portland OR USA","acronym":"PEARC '23"},"container-title":["Practice and Experience in Advanced Research Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3569951.3593595","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3569951.3593595","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3569951.3593595","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:07:51Z","timestamp":1750183671000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3569951.3593595"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,23]]},"references-count":16,"alternative-id":["10.1145\/3569951.3593595","10.1145\/3569951"],"URL":"https:\/\/doi.org\/10.1145\/3569951.3593595","relation":{},"subject":[],"published":{"date-parts":[[2023,7,23]]},"assertion":[{"value":"2023-09-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}