{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T08:54:56Z","timestamp":1775638496769,"version":"3.50.1"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2024,3,12]],"date-time":"2024-03-12T00:00:00Z","timestamp":1710201600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"DFG priority program 2037","award":["BI2011\/1 & BI2011\/2"],"award-info":[{"award-number":["BI2011\/1 & BI2011\/2"]}]},{"name":"DFG Collaborative Research Center","award":["1053"],"award-info":[{"award-number":["1053"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. ACM Manag. Data"],"published-print":{"date-parts":[[2024,3,12]]},"abstract":"<jats:p>In this paper, we present a novel communication scheme called zero-sided RDMA, enabling data exchange as a native network service using a programmable switch. In contrast to one- or two-sided RDMA, in zero-sided RDMA, neither the sender nor the receiver is actively involved in data exchange. Zero-sided RDMA thus enables efficient RDMA-based data shuffling between heterogeneous hardware devices in a disaggregated setup without the need to implement a complete RDMA stack on each heterogeneous device or the need for a CPU that is co-located with the accelerator to coordinate the data transfer. As such, we think that zero-sided RDMA is a major building block to make efficient use of heterogeneous accelerators in future cloud DBMSs. In our evaluation, we show that zero-sided RDMA can outperform existing one-sided RDMA-based schemes for accelerator-to-accelerator communication and thus speed up typical distributed database operations such as joins.<\/jats:p>","DOI":"10.1145\/3639291","type":"journal-article","created":{"date-parts":[[2024,3,26]],"date-time":"2024-03-26T18:51:32Z","timestamp":1711479092000},"page":"1-28","source":"Crossref","is-referenced-by-count":5,"title":["Zero-sided RDMA: Network-driven Data Shuffling for Disaggregated Heterogeneous Cloud DBMSs"],"prefix":"10.1145","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5068-4645","authenticated-orcid":false,"given":"Matthias","family":"Jasny","sequence":"first","affiliation":[{"name":"Technical University Darmstadt, Darmstadt, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-7243-5507","authenticated-orcid":false,"given":"Lasse","family":"Thostrup","sequence":"additional","affiliation":[{"name":"Technical University Darmstadt, Darmstadt, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8092-2969","authenticated-orcid":false,"given":"Sajjad","family":"Tamimi","sequence":"additional","affiliation":[{"name":"Technical University Darmstadt, Darmstadt, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1164-3082","authenticated-orcid":false,"given":"Andreas","family":"Koch","sequence":"additional","affiliation":[{"name":"Technical University Darmstadt, Darmstadt, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4127-8573","authenticated-orcid":false,"given":"Zsolt","family":"Istv\u00e1n","sequence":"additional","affiliation":[{"name":"Technical University Darmstadt, Darmstadt, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2744-7836","authenticated-orcid":false,"given":"Carsten","family":"Binnig","sequence":"additional","affiliation":[{"name":"Technical University Darmstadt &amp; DFKI, Darmstadt, Germany"}]}],"member":"320","published-online":{"date-parts":[[2024,3,26]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2017.12.007"},{"key":"e_1_2_1_2_1","volume-title":"Empowering Azure Storage with RDMA. In 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023","author":"Bai Wei","year":"2023","unstructured":"Wei Bai, Shanim Sainul Abdeen, Ankit Agrawal, Krishan Kumar Attre, Paramvir Bahl, Ameya Bhagat, Gowri Bhaskara, Tanya Brokhman, Lei Cao, Ahmad Cheema, Rebecca Chow, Jeff Cohen, Mahmoud Elhaddad, Vivek Ette, Igal Figlin, Daniel Firestone, Mathew George, Ilya German, Lakhmeet Ghai, Eric Green, Albert G. Greenberg, Manish Gupta, Randy Haagens, Matthew Hendel, Ridwan Howlader, Neetha John, Julia Johnstone, Tom Jolly, Greg Kramer, David Kruse, Ankit Kumar, Erica Lan, Ivan Lee, Avi Levy, Marina Lipshteyn, Xin Liu, Chen Liu, Guohan Lu, Yuemin Lu, Xiakun Lu, Vadim Makhervaks, Ulad Malashanka, David A. Maltz, Ilias Marinos, Rohan Mehta, Sharda Murthi, Anup Namdhari, Aaron Ogus, Jitendra Padhye, Madhav Pandya, Douglas Phillips, Adrian Power, Suraj Puri, Shachar Raindel, Jordan Rhee, Anthony Russo, Maneesh Sah, Ali Sheriff, Chris Sparacino, Ashutosh Srivastava, Weixiang Sun, Nick Swanson, Fuhou Tian, Lukasz Tomczyk, Vamsi Vadlamuri, Alec Wolman, Ying Xie, Joyce Yom, Lihua Yuan, Yanzhao Zhang, and Brian Zill. 2023. Empowering Azure Storage with RDMA. In 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023, Boston, MA, April 17--19, 2023, Mahesh Balakrishnan and Manya Ghobadi (Eds.). USENIX Association, 49--67. https:\/\/www.usenix.org\/conference\/nsdi23\/presentation\/bai"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2750547"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2750547"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.14778\/2904483.2904485"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3139645.3139648"},{"key":"e_1_2_1_7_1","volume-title":"10th Conference on Innovative Data Systems Research, CIDR 2020, Amsterdam, The Netherlands, January 12--15, 2020, Online Proceedings. www.cidrdb.org. http:\/\/cidrdb.org\/cidr2020\/papers\/p27-chen-cidr20","author":"Chen Xinyu","year":"2020","unstructured":"Xinyu Chen, Yao Chen, Ronak Bajaj, Jiong He, Bingsheng He, Weng-Fai Wong, and Deming Chen. 2020. Is FPGA Useful for Hash Joins?. In 10th Conference on Innovative Data Systems Research, CIDR 2020, Amsterdam, The Netherlands, January 12--15, 2020, Online Proceedings. www.cidrdb.org. http:\/\/cidrdb.org\/cidr2020\/papers\/p27-chen-cidr20.pdf"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2931088.2931091"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 11th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2014","author":"Dragojevic Aleksandar","year":"2014","unstructured":"Aleksandar Dragojevic, Dushyanth Narayanan, Miguel Castro, and Orion Hodson. 2014. FaRM: Fast Remote Memory. In Proceedings of the 11th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2014, Seattle, WA, USA, April 2--4, 2014, Ratul Mahajan and Ion Stoica (Eds.). USENIX Association, 401--414. https:\/\/www.usenix.org\/conference\/nsdi14\/technical-sessions\/dragojevi%C4%87"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/3352063.3352137"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-019-00581-w"},{"key":"e_1_2_1_12_1","volume-title":"Using advanced MPI: Modern features of the message-passing interface","author":"Gropp William","unstructured":"William Gropp, Torsten Hoefler, Rajeev Thakur, and Ewing Lusk. 2014. Using advanced MPI: Modern features of the message-passing interface. MIT Press."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/FTCS.1999.781046"},{"key":"e_1_2_1_14_1","unstructured":"Intel. 2023. Intel P4 Studio. https:\/\/www.intel.com\/content\/www\/us\/en\/products\/details\/network-io\/intelligent-fabric-processors\/p4-studio.html."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2785956.2787488"},{"key":"e_1_2_1_16_1","unstructured":"Matthias Jasny and Lasse Thostrup. 2023. Zerosided RDMA Code. https:\/\/github.com\/DataManagementLab\/zerosided_rdma."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517825"},{"key":"e_1_2_1_18_1","volume-title":"Scalable and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs. In 12th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2016","author":"Kalia Anuj","year":"2016","unstructured":"Anuj Kalia, Michael Kaminsky, and David G. Andersen. 2016. FaSST: Fast, Scalable and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs. In 12th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2016, Savannah, GA, USA, November 2--4, 2016, Kimberly Keeton and Timothy Roscoe (Eds.). USENIX Association, 185--201. https:\/\/www.usenix.org\/conference\/osdi16\/technical-sessions\/presentation\/kalia"},{"key":"e_1_2_1_19_1","volume-title":"16th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2019","author":"Kalia Anuj","year":"2019","unstructured":"Anuj Kalia, Michael Kaminsky, and David G. Andersen. 2019. Datacenter RPCs can be General and Fast. In 16th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2019, Boston, MA, February 26--28, 2019, Jay R. Lorch and Minlan Yu (Eds.). USENIX Association, 1--16. https:\/\/www.usenix.org\/conference\/nsdi19\/presentation\/kalia"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920847"},{"key":"e_1_2_1_21_1","volume-title":"14th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2020","author":"Korolija Dario","year":"2020","unstructured":"Dario Korolija, Timothy Roscoe, and Gustavo Alonso. 2020. Do OS abstractions make sense on FPGAs?. In 14th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2020, Virtual Event, November 4--6, 2020. USENIX Association, 991--1010. https:\/\/www.usenix.org\/conference\/osdi20\/presentation\/roscoe"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.5441\/002\/edbt.2022.03"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3464298.3493393"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2610507"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2882949"},{"key":"e_1_2_1_26_1","volume-title":"12th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2016","author":"Li Jialin","year":"2016","unstructured":"Jialin Li, Ellis Michael, Naveen Kr. Sharma, Adriana Szekeres, and Dan R. K. Ports. 2016b. Just Say NO to Paxos Overhead: Replacing Consensus with Network Ordering. In 12th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2016, Savannah, GA, USA, November 2--4, 2016, Kimberly Keeton and Timothy Roscoe (Eds.). USENIX Association, 467--483. https:\/\/www.usenix.org\/conference\/osdi16\/technical-sessions\/presentation\/li"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389705"},{"key":"e_1_2_1_28_1","unstructured":"Mailinglist. 2023. [RFC 6\/7] IB\/core: Peer memory client for IO memory. https:\/\/www.spinics.net\/lists\/linux-rdma\/msg33298.html."},{"key":"e_1_2_1_29_1","volume-title":"Manpage: MPI_Alltoall. https:\/\/www.mpich.org\/static\/docs\/latest\/www3\/MPI_Alltoall.html.","author":"MPICH.","year":"2023","unstructured":"MPICH. 2023. Manpage: MPI_Alltoall. https:\/\/www.mpich.org\/static\/docs\/latest\/www3\/MPI_Alltoall.html."},{"key":"e_1_2_1_30_1","unstructured":"APS Networks. 2021. Intel Tofino APS Networks BF2556X-1T-A1F. https:\/\/www.aps-networks.com\/wp-content\/uploads\/2021\/07\/210712_APS_BF2556X-1T_V04.pdf."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3458336.3465291"},{"key":"e_1_2_1_32_1","unstructured":"NVIDIA. 2023 a. HowTo Implement PeerDirect Client using MLNX_OFED. https:\/\/enterprise-support.nvidia.com\/s\/article\/howto-implement-peerdirect-client-using-mlnx-ofed."},{"key":"e_1_2_1_33_1","unstructured":"NVIDIA. 2023 b. Nvidia NVSHMEM. https:\/\/developer.nvidia.com\/nvshmem."},{"key":"e_1_2_1_34_1","unstructured":"NVIDIA. 2023 c. RDMA Over Converged Ethernet (RoCE). https:\/\/docs.nvidia.com\/networking\/m\/view-rendered-page.action?abstractPageId=56986516."},{"key":"e_1_2_1_35_1","unstructured":"OpenUCX. 2023. SparkUCX ShuffleManager Plugin. https:\/\/github.com\/openucx\/sparkucx."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807207"},{"key":"e_1_2_1_37_1","unstructured":"Amazon Web Services. 2023. Elastic Fabric Adapter. https:\/\/aws.amazon.com\/hpc\/efa\/."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3380595"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3588709"},{"key":"e_1_2_1_40_1","volume-title":"International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, ADMS@VLDB 2022","author":"Thostrup Lasse","year":"2022","unstructured":"Lasse Thostrup, Daniel Failing, Tobius Ziegler, and Carsten Binnig. 2022a. A DBMS-centric Evaluation of BlueField DPUs on Fast Networks. In International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, ADMS@VLDB 2022, Sydney, Australia, September 5, 2022, Rajesh Bordawekar and Tirthankar Lahiri (Eds.). 1--10. http:\/\/www.adms-conf.org\/2022-camera-ready\/ADMS22_thostrup.pdf"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3542700.3542705"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3373376.3378528"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3492321.3519569"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.14778\/3547305.3547307"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3127479.3128609"},{"key":"e_1_2_1_46_1","volume-title":"2022 USENIX Annual Technical Conference, USENIX ATC 2022","author":"Wang Zeke","year":"2022","unstructured":"Zeke Wang, Hongjing Huang, Jie Zhang, Fei Wu, and Gustavo Alonso. 2022. FpgaNIC: An FPGA-based Versatile 100Gb SmartNIC for GPUs. In 2022 USENIX Annual Technical Conference, USENIX ATC 2022, Carlsbad, CA, USA, July 11--13, 2022, Jiri Schindler and Noa Zilberman (Eds.). USENIX Association, 967--986. https:\/\/www.usenix.org\/conference\/atc22\/presentation\/wang-zeke"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/2815400.2815419"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/1755913.1755940"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.14778\/3055330.3055335"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389724"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3526187"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.18420\/btw2019-ws-06"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1007\/s13222-020-00355--7"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3300081"}],"container-title":["Proceedings of the ACM on Management of Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3639291","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3639291","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,21]],"date-time":"2025-08-21T15:17:32Z","timestamp":1755789452000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3639291"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,12]]},"references-count":54,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,3,12]]}},"alternative-id":["10.1145\/3639291"],"URL":"https:\/\/doi.org\/10.1145\/3639291","relation":{},"ISSN":["2836-6573"],"issn-type":[{"value":"2836-6573","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3,12]]}}}