{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T08:26:06Z","timestamp":1774599966669,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":40,"publisher":"ACM","license":[{"start":{"date-parts":[[2024,8,3]],"date-time":"2024-08-03T00:00:00Z","timestamp":1722643200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2024,8,3]]},"DOI":"10.1145\/3663408.3663413","type":"proceedings-article","created":{"date-parts":[[2024,7,2]],"date-time":"2024-07-02T12:23:29Z","timestamp":1719923009000},"page":"31-37","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Rethinking Intra-host Congestion Control in RDMA Networks"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-1673-8104","authenticated-orcid":false,"given":"Zirui","family":"Wan","sequence":"first","affiliation":[{"name":"Beijing University of Posts and Telecommunications, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5614-3420","authenticated-orcid":false,"given":"Jiao","family":"Zhang","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, China and Purple Mountain Laboratories, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-0276-4538","authenticated-orcid":false,"given":"Yuxiang","family":"Wang","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-5874-6610","authenticated-orcid":false,"given":"Kefei","family":"Liu","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-0212-3780","authenticated-orcid":false,"given":"Haoyu","family":"Pan","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3545-1122","authenticated-orcid":false,"given":"Tao","family":"Huang","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, China and Purple Mountain Laboratories, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,8,3]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2012. Intel\u00ae Data Direct I\/O Technology (Intel\u00ae DDIO): A Primer. https:\/\/www.intel.com\/content\/dam\/www\/public\/us\/en\/documents\/technology-briefs\/data-direct-i-o-technology-brief.pdf."},{"key":"e_1_3_2_1_2_1","unstructured":"2018. Mellanox perftest package.https:\/\/community.mellanox.com\/docs\/DOC- 2802.."},{"key":"e_1_3_2_1_3_1","unstructured":"2019. Intel Xeon Processor Scalable Family Datasheet.https:\/\/www.intel.com\/content\/dam\/www\/public\/us\/en\/documents\/datasheets\/2nd-gen-xeon-scalable-datasheet-vol-1.pdf."},{"key":"e_1_3_2_1_4_1","unstructured":"2019. Introduction to Memory Bandwidth Allocation. https:\/\/www.intel.com\/content\/www\/us\/en\/developer\/articles\/technical\/introduction-to-memory-bandwidth-allocation.html."},{"key":"e_1_3_2_1_5_1","unstructured":"2022. Intel\u00ae performance counter monitor.https:\/\/www.intel.com\/software\/pcm."},{"key":"e_1_3_2_1_6_1","unstructured":"2022. Mellanox NEO-Host.https:\/\/support.mellanox.com\/s\/productdetails\/a2v50000000N2OlAAK\/mellanox-neohost."},{"key":"e_1_3_2_1_7_1","unstructured":"2022. Nvidia dgx a100.https:\/\/www.nvidia.com\/en-us\/data-center\/dgx-a100\/."},{"key":"e_1_3_2_1_8_1","unstructured":"2023. Github-Terabit-Ethernet\/hostcc.https:\/\/github.com\/Terabit-Ethernet\/hostCC\/tree\/main."},{"key":"e_1_3_2_1_9_1","unstructured":"2023. Intel. 2023. Intel\u00ae Memory Latency Checker. (2023).https:\/\/www.intel.com\/content\/www\/us\/en\/developer\/articles\/tool\/intelr-memory-latency-checker.html."},{"key":"e_1_3_2_1_10_1","unstructured":"2023. Intel\u00ae 64 and IA-32 Architectures Software Developer Manuals.https:\/\/www.intel.com\/content\/www\/us\/en\/developer\/articles\/technical\/intel-sdm.html."},{"key":"e_1_3_2_1_11_1","unstructured":"2023. Intel\u00ae Resource Director Technology (Intel\u00ae RDT). https:\/\/www.intel.com\/content\/www\/us\/en\/architecture-and-technology\/resource-director-technology.html."},{"key":"e_1_3_2_1_12_1","unstructured":"2023. NVLink and NVSwitch:Fastest HPC Data Center Platform. https:\/\/www.nvidia.com\/en-us\/data-center\/nvlink\/."},{"key":"e_1_3_2_1_13_1","unstructured":"2023. Understanding MLX5 Linux counters and status parameters. https:\/\/enterprise-support.nvidia.com\/s\/article\/understanding-mlx5-linux-counters-and-status-parameters."},{"key":"e_1_3_2_1_14_1","unstructured":"2024. Intel\u00ae 64 and IA-32 Architectures Software Developer\u2019s Manual. https:\/\/www.intel.com\/content\/www\/us\/en\/developer\/articles\/technical\/intel-sdm.html."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Saksham Agarwal Rachit Agarwal Behnam Montazeri Masoud Moshref Khaled Elmeleegy Luigi Rizzo Marc\u00a0Asher de Kruijf Gautam Kumar Sylvia Ratnasamy David Culler 2022. Understanding host interconnect congestion. In HotNets.","DOI":"10.1145\/3563766.3564110"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Saksham Agarwal Arvind Krishnamurthy and Rachit Agarwal. 2023. Host Congestion Control. In SIGCOMM.","DOI":"10.1145\/3603269.3604878"},{"key":"e_1_3_2_1_17_1","unstructured":"Wei Bai Shanim\u00a0Sainul Abdeen Ankit Agrawal Krishan\u00a0Kumar Attre Paramvir Bahl Ameya Bhagat Gowri Bhaskara Tanya Brokhman Lei Cao Ahmad Cheema 2023. Empowering azure storage with RDMA. In NSDI."},{"key":"e_1_3_2_1_18_1","volume-title":"Eflops: Algorithm and system co-design for a high performance distributed training platform. In HPCA.","author":"Dong Jianbo","year":"2020","unstructured":"Jianbo Dong, Zheng Cao, Tao Zhang, Jianxi Ye, Shaochuang Wang, Fei Feng, Li Zhao, Xiaoyong Liu, Liuyihan Song, Liwei Peng, 2020. Eflops: Algorithm and system co-design for a high performance distributed training platform. In HPCA."},{"key":"e_1_3_2_1_19_1","unstructured":"Alireza Farshin Amir Roozbeh Gerald\u00a0Q Maguire\u00a0Jr and Dejan Kosti\u0107. 2020. Reexamining Direct Cache Access to Optimize { I\/O} Intensive Applications for Multi-hundred-gigabit Networks. In ATC."},{"key":"e_1_3_2_1_20_1","unstructured":"Yixiao Gao Qiang Li Lingbo Tang Yongqing Xi Pengcheng Zhang Wenwen Peng Bo Li Yaohui Wu Shaozong Liu Lei Yan 2021. When cloud storage meets rdma. In NSDI."},{"key":"e_1_3_2_1_21_1","unstructured":"Yimin Jiang Yibo Zhu Chang Lan Bairen Yi Yong Cui and Chuanxiong Guo. 2020. A unified architecture for accelerating distributed { DNN} training in heterogeneous { GPU\/CPU} clusters. In OSDI."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3593856.3595890"},{"key":"e_1_3_2_1_23_1","volume-title":"Collie: Finding Performance Anomalies in { RDMA} Subsystems. In NSDI.","author":"Kong Xinhao","year":"2022","unstructured":"Xinhao Kong, Yibo Zhu, Huaping Zhou, Zhuo Jiang, Jianxi Ye, Chuanxiong Guo, and Danyang Zhuo. 2022. Collie: Finding Performance Anomalies in { RDMA} Subsystems. In NSDI."},{"key":"e_1_3_2_1_24_1","volume-title":"Swift: Delay is simple and effective for congestion control in the datacenter. In SIGCOMM.","author":"Kumar Gautam","year":"2020","unstructured":"Gautam Kumar, Nandita Dukkipati, Keon Jang, Hassan\u00a0MG Wassel, Xian Wu, Behnam Montazeri, Yaogong Wang, Kevin Springborn, Christopher Alfeld, Michael Ryan, 2020. Swift: Delay is simple and effective for congestion control in the datacenter. In SIGCOMM."},{"key":"e_1_3_2_1_25_1","unstructured":"Qiang Li Qiao Xiang Derui Liu Yuxin Wang Haonan Qiu Xiaoliang Wang Jie Zhang Ridi Wen Haohao Song Gexiao Tian Chenyang Huang Lulu Chen Shaozong Liu Yaohui Wu Zhiwu Wu Zicheng Luo Yuchao Shao Chao Han Zhongjie Wu Jianbo Dong Zheng Cao Jinbo Wu Jiwu Shu and Jiesheng Wu. 2023. From RDMA to RDCA: Toward High-Speed Last Mile of Data Center Networks Using Remote Direct Cache Access."},{"key":"e_1_3_2_1_26_1","unstructured":"Qiang Li Qiao Xiang Yuxin Wang Haohao Song Ridi Wen Wenhui Yao Yuanyuan Dong Shuqi Zhao Shuo Huang Zhaosheng Zhu 2023. More than capacity: performance-oriented evolution of Pangu in Alibaba. In FAST."},{"key":"e_1_3_2_1_27_1","volume-title":"Hostping: Diagnosing intra-host network bottlenecks in { RDMA} servers. In NSDI.","author":"Liu Kefei","year":"2023","unstructured":"Kefei Liu, Zhuo Jiang, Jiao Zhang, Haoran Wei, Xiaolong Zhong, Lizhuang Tan, Tian Pan, and Tao Huang. 2023. Hostping: Diagnosing intra-host network bottlenecks in { RDMA} servers. In NSDI."},{"key":"e_1_3_2_1_28_1","volume-title":"High-performance design of hadoop rpc with rdma over infiniband","author":"Lu Xiaoyi","unstructured":"Xiaoyi Lu, Nusrat\u00a0S Islam, Md Wasi-Ur-Rahman, Jithin Jose, Hari Subramoni, Hao Wang, and Dhabaleswar\u00a0K Panda. 2013. High-performance design of hadoop rpc with rdma over infiniband. In IPCC. IEEE."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTI.2014.15"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Rolf Neugebauer Gianni Antichi Jos\u00e9\u00a0Fernando Zazo Yury Audzevich Sergio Lpez-Buedo and Andrew\u00a0W Moore. 2018. Understanding PCIe performance for end host networking. In SIGCOMM.","DOI":"10.1145\/3230543.3230560"},{"key":"e_1_3_2_1_31_1","unstructured":"NVIDIA. 2021. BlueField-2. https:\/\/resources.nvidia.com\/en-us-accelerated-networking-resource-library\/bluefield-2-dpu-datasheet?lx=LbHvpR&topic=networking-cloud."},{"key":"e_1_3_2_1_32_1","unstructured":"NVIDIA. 2022. BlueField-3. https:\/\/www.nvidia.com\/content\/dam\/en-zz\/Solutions\/Data-Center\/documents\/datasheet-nvidia-bluefield-3-dpu.pdf."},{"key":"e_1_3_2_1_33_1","unstructured":"Satadru Pan Theano Stavrinos Yunqiao Zhang Atul Sikaria Pavel Zakharov Abhinav Sharma Mike Shuey Richard Wareing Monika Gangapuram Guanglei Cao 2021. Facebook\u2019s tectonic filesystem: Efficiency from exascale. In FAST."},{"key":"e_1_3_2_1_34_1","volume-title":"Compute Express Link (CXL): Enabling heterogeneous data-centric computing with heterogeneous memory hierarchy","author":"Sharma Debendra\u00a0Das","year":"2022","unstructured":"Debendra\u00a0Das Sharma. 2022. Compute Express Link (CXL): Enabling heterogeneous data-centric computing with heterogeneous memory hierarchy. IEEE Micro (2022)."},{"key":"e_1_3_2_1_35_1","first-page":"428","article-title":"Programmable Congestion Control Communication Scheme","volume":"16","author":"Shpigelman Yuval","year":"2021","unstructured":"Yuval Shpigelman, Idan Burstein, Noam Bloch, Reut Zuck, and Roee Moyal. 2021. Programmable Congestion Control Communication Scheme. US Patent App. 16\/986,428.","journal-title":"US Patent App."},{"key":"e_1_3_2_1_36_1","unstructured":"Amin Tootoonchian Aurojit Panda Chang Lan Melvin Walls Katerina Argyraki Sylvia Ratnasamy and Scott Shenker. 2018. ResQ: Enabling { SLOs} in Network Function Virtualization. In NSDI."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3489048.3522662"},{"key":"e_1_3_2_1_38_1","volume-title":"SRNIC: A Scalable Architecture for { RDMA}{ NICs}. In NSDI.","author":"Wang Zilong","year":"2023","unstructured":"Zilong Wang, Layong Luo, Qingsong Ning, Chaoliang Zeng, Wenxue Li, Xinchen Wan, Peng Xie, Tao Feng, Ke Cheng, Xiongfei Geng, 2023. SRNIC: A Scalable Architecture for { RDMA}{ NICs}. In NSDI."},{"key":"e_1_3_2_1_39_1","volume-title":"Don\u2019t forget the I\/O when allocating your LLC","author":"Yuan Yifan","unstructured":"Yifan Yuan, Mohammad Alian, Yipeng Wang, Ren Wang, Ilia Kurakin, Charlie Tai, and Nam\u00a0Sung Kim. 2021. Don\u2019t forget the I\/O when allocating your LLC. In ISCA. IEEE."},{"key":"e_1_3_2_1_40_1","volume-title":"Congestion control for large-scale RDMA deployments. SIGCOMM","author":"Zhu Yibo","year":"2015","unstructured":"Yibo Zhu, Haggai Eran, Daniel Firestone, Chuanxiong Guo, Marina Lipshteyn, Yehonatan Liron, Jitendra Padhye, Shachar Raindel, Mohamad\u00a0Haj Yahia, and Ming Zhang. 2015. Congestion control for large-scale RDMA deployments. SIGCOMM (2015)."}],"event":{"name":"APNet 2024: The 8th Asia-Pacific Workshop on Networking","location":"Sydney Australia","acronym":"APNet 2024"},"container-title":["Proceedings of the 8th Asia-Pacific Workshop on Networking"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3663408.3663413","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3663408.3663413","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T23:31:42Z","timestamp":1755905502000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3663408.3663413"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8,3]]},"references-count":40,"alternative-id":["10.1145\/3663408.3663413","10.1145\/3663408"],"URL":"https:\/\/doi.org\/10.1145\/3663408.3663413","relation":{},"subject":[],"published":{"date-parts":[[2024,8,3]]},"assertion":[{"value":"2024-08-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}