{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T07:31:02Z","timestamp":1776238262122,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T00:00:00Z","timestamp":1628467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NSF","award":["1704742,2047283"],"award-info":[{"award-number":["1704742,2047283"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,9]]},"DOI":"10.1145\/3452296.3472888","type":"proceedings-article","created":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T18:13:15Z","timestamp":1628532795000},"page":"65-77","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":126,"title":["Understanding host network stack overheads"],"prefix":"10.1145","author":[{"given":"Qizhe","family":"Cai","sequence":"first","affiliation":[{"name":"Cornell University"}]},{"given":"Shubham","family":"Chaudhary","sequence":"additional","affiliation":[{"name":"Cornell University"}]},{"given":"Midhul","family":"Vuppalapati","sequence":"additional","affiliation":[{"name":"Cornell University"}]},{"given":"Jaehyun","family":"Hwang","sequence":"additional","affiliation":[{"name":"Cornell University"}]},{"given":"Rachit","family":"Agarwal","sequence":"additional","affiliation":[{"name":"Cornell University"}]}],"member":"320","published-online":{"date-parts":[[2021,8,9]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"crossref","unstructured":"Mohammad Alizadeh Albert Greenberg David A. Maltz Jitendra Padhye Parveen Patel Balaji Prabhakar Sudipta Sengupta and Murari Sridharan. 2010. Data Center TCP (DCTCP). In ACM SIGCOMM.  Mohammad Alizadeh Albert Greenberg David A. Maltz Jitendra Padhye Parveen Patel Balaji Prabhakar Sudipta Sengupta and Murari Sridharan. 2010. Data Center TCP (DCTCP). In ACM SIGCOMM .","DOI":"10.1145\/1851182.1851192"},{"key":"e_1_3_2_2_2_1","unstructured":"Amazon. 2021. Amazon EC2 F1 Instances. rmhttps:\/\/aws.amazon.com\/ec2\/instance-types\/f1\/. (2021).  Amazon. 2021. Amazon EC2 F1 Instances. rmhttps:\/\/aws.amazon.com\/ec2\/instance-types\/f1\/. (2021)."},{"key":"e_1_3_2_2_3_1","unstructured":"Mina Tahmasbi Arashloo Alexey Lavrov Manya Ghobadi Jennifer Rexford David Walker and David Wentzlaff. 2020. Enabling Programmable Transport Protocols in High-Speed NICs. In USENIX NSDI.  Mina Tahmasbi Arashloo Alexey Lavrov Manya Ghobadi Jennifer Rexford David Walker and David Wentzlaff. 2020. Enabling Programmable Transport Protocols in High-Speed NICs. In USENIX NSDI ."},{"key":"e_1_3_2_2_4_1","volume-title":"IX: A Protected Dataplane Operating System for High Throughput and Low Latency. In USENIX OSDI.","author":"Belay Adam","year":"2014","unstructured":"Adam Belay , George Prekas , Ana Klimovic , Samuel Grossman , Christos Kozyrakis , and Edouard Bugnion . 2014 . IX: A Protected Dataplane Operating System for High Throughput and Low Latency. In USENIX OSDI. Adam Belay, George Prekas, Ana Klimovic, Samuel Grossman, Christos Kozyrakis, and Edouard Bugnion. 2014. IX: A Protected Dataplane Operating System for High Throughput and Low Latency. In USENIX OSDI."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"crossref","unstructured":"Theophilus Benson Aditya Akella and David A Maltz. 2010. Network traffic characteristics of data centers in the wild. In IMC.  Theophilus Benson Aditya Akella and David A Maltz. 2010. Network traffic characteristics of data centers in the wild. In IMC .","DOI":"10.1145\/1879141.1879175"},{"key":"e_1_3_2_2_6_1","unstructured":"Zhan Bokai Yu Chengye and Chen Zhonghe. 2005. TCP\/IP Offload Engine (TOE) for an SOC System. https:\/\/www.intel.com\/content\/dam\/www\/programmable\/us\/en\/pdfs\/literature\/dc\/_3_3-2005_taiwan_3rd_chengkungu-web.pdf. (2005).  Zhan Bokai Yu Chengye and Chen Zhonghe. 2005. TCP\/IP Offload Engine (TOE) for an SOC System. https:\/\/www.intel.com\/content\/dam\/www\/programmable\/us\/en\/pdfs\/literature\/dc\/_3_3-2005_taiwan_3rd_chengkungu-web.pdf. (2005)."},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"crossref","unstructured":"Qizhe Cai Shubham Chaudhary Midhul Vuppalapati Jaehyun Hwang and Rachit Agarwal. 2021. Understanding Host Network Stack Overheads. https:\/\/github.com\/Terabit-Ethernet\/terabit-network-stack-profiling. (2021).  Qizhe Cai Shubham Chaudhary Midhul Vuppalapati Jaehyun Hwang and Rachit Agarwal. 2021. Understanding Host Network Stack Overheads. https:\/\/github.com\/Terabit-Ethernet\/terabit-network-stack-profiling. (2021).","DOI":"10.1145\/3452296.3472888"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3012426.3022184"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"crossref","unstructured":"Adrian M Caulfield Eric S Chung Andrew Putnam Hari Angepat Jeremy Fowers Michael Haselman Stephen Heil Matt Humphrey Puneet Kaur Joo-Young Kim etal 2016. A cloud-scale acceleration architecture. In IEEE\/ACM MICRO.  Adrian M Caulfield Eric S Chung Andrew Putnam Hari Angepat Jeremy Fowers Michael Haselman Stephen Heil Matt Humphrey Puneet Kaur Joo-Young Kim et al. 2016. A cloud-scale acceleration architecture. In IEEE\/ACM MICRO .","DOI":"10.1109\/MICRO.2016.7783710"},{"key":"e_1_3_2_2_10_1","volume-title":"JLS2009: Generic receive offload. https:\/\/lwn.net\/Articles\/358910\/.","author":"Corbet Jonathan","year":"2009","unstructured":"Jonathan Corbet . 2009 . JLS2009: Generic receive offload. https:\/\/lwn.net\/Articles\/358910\/. (2009). Jonathan Corbet. 2009. JLS2009: Generic receive offload. https:\/\/lwn.net\/Articles\/358910\/. (2009)."},{"key":"e_1_3_2_2_11_1","unstructured":"Jonathan Corbet. 2017. Zero-copy networking. https:\/\/lwn.net\/Articles\/726917\/. (2017).  Jonathan Corbet. 2017. Zero-copy networking. https:\/\/lwn.net\/Articles\/726917\/. (2017)."},{"key":"e_1_3_2_2_12_1","unstructured":"Jonathan Corbet. 2018. Zero-copy TCP receive. https:\/\/lwn.net\/Articles\/752188\/. (2018).  Jonathan Corbet. 2018. Zero-copy TCP receive. https:\/\/lwn.net\/Articles\/752188\/. (2018)."},{"key":"e_1_3_2_2_13_1","unstructured":"Patrick Dehkord. 2019. NVMe over TCP Storage with SPDK. https:\/\/ci.spdk.io\/download\/events\/2019-summit\/(Solareflare)+NVMe+over+TCP+Storage+with+SPDK.pdf. (2019).  Patrick Dehkord. 2019. NVMe over TCP Storage with SPDK. https:\/\/ci.spdk.io\/download\/events\/2019-summit\/(Solareflare)+NVMe+over+TCP+Storage+with+SPDK.pdf. (2019)."},{"key":"e_1_3_2_2_14_1","volume-title":"Ajay Tirumala, and Alex Warshavsky.","author":"Dugan Jon","year":"2021","unstructured":"Jon Dugan , John Estabrook , Jim Ferbuson , Andrew Gallatin , Mark Gates , Kevin Gibbs , Stephen Hemminger , Nathan Jones , Gerrit Renker Feng Qin , Ajay Tirumala, and Alex Warshavsky. 2021 . iPerf - The ultimate speed test tool for TCP, UDP and SCTP. https:\/\/iperf.fr\/. (2021). Jon Dugan, John Estabrook, Jim Ferbuson, Andrew Gallatin, Mark Gates, Kevin Gibbs, Stephen Hemminger, Nathan Jones, Gerrit Renker Feng Qin, Ajay Tirumala, and Alex Warshavsky. 2021. iPerf - The ultimate speed test tool for TCP, UDP and SCTP. https:\/\/iperf.fr\/. (2021)."},{"key":"e_1_3_2_2_15_1","unstructured":"Alireza Farshin Amir Roozbeh Gerald Q. Maguire  Jr. and Dejan Kosti\u0107. 2020. Reexamining Direct Cache Access to Optimize I\/O Intensive Applications for Multi-hundred-gigabit Networks. In USENIX ATC.  Alireza Farshin Amir Roozbeh Gerald Q. Maguire Jr. and Dejan Kosti\u0107. 2020. Reexamining Direct Cache Access to Optimize I\/O Intensive Applications for Multi-hundred-gigabit Networks. In USENIX ATC ."},{"key":"e_1_3_2_2_16_1","unstructured":"Daniel Firestone Andrew Putnam Sambhrama Mundkur Derek Chiou Alireza Dabagh Mike Andrewartha Hari Angepat Vivek Bhanu Adrian Caulfield Eric Chung etal 2018. Azure accelerated networking: SmartNICs in the public cloud. In USENIX NSDI.  Daniel Firestone Andrew Putnam Sambhrama Mundkur Derek Chiou Alireza Dabagh Mike Andrewartha Hari Angepat Vivek Bhanu Adrian Caulfield Eric Chung et al. 2018. Azure accelerated networking: SmartNICs in the public cloud. In USENIX NSDI ."},{"key":"e_1_3_2_2_17_1","unstructured":"The Linux Foundation. 2016. Linux Foundation DocuWiki: napi. https:\/\/wiki.linuxfoundation.org\/networking\/napi. (2016).  The Linux Foundation. 2016. Linux Foundation DocuWiki: napi. https:\/\/wiki.linuxfoundation.org\/networking\/napi. (2016)."},{"key":"e_1_3_2_2_18_1","unstructured":"Peter X Gao Akshay Narayan Gautam Kumar Rachit Agarwal Sylvia Ratnasamy and Scott Shenker. 2015. phost: Distributed near-optimal datacenter transport over commodity network fabric. In ACM CoNEXT.  Peter X Gao Akshay Narayan Gautam Kumar Rachit Agarwal Sylvia Ratnasamy and Scott Shenker. 2015. phost: Distributed near-optimal datacenter transport over commodity network fabric. In ACM CoNEXT ."},{"key":"e_1_3_2_2_19_1","unstructured":"Sebastien Godard. 2021. Performance monitoring tools for Linux. https:\/\/github.com\/sysstat\/sysstat. (2021).  Sebastien Godard. 2021. Performance monitoring tools for Linux. https:\/\/github.com\/sysstat\/sysstat. (2021)."},{"key":"e_1_3_2_2_20_1","unstructured":"Brendan Gregg. 2020. Linux perf Examples. http:\/\/www.brendangregg.com\/perf.html. (2020).  Brendan Gregg. 2020. Linux perf Examples. http:\/\/www.brendangregg.com\/perf.html. (2020)."},{"key":"e_1_3_2_2_21_1","unstructured":"Sangjin Han Scott Marshall Byung-Gon Chun and Sylvia Ratnasamy. 2012. MegaPipe: A New Programming Interface for Scalable Network I\/O. In USENIX OSDI.  Sangjin Han Scott Marshall Byung-Gon Chun and Sylvia Ratnasamy. 2012. MegaPipe: A New Programming Interface for Scalable Network I\/O. In USENIX OSDI ."},{"key":"e_1_3_2_2_22_1","unstructured":"HewlettPackard. 2021. Netperf. https:\/\/github.com\/HewlettPackard\/netperf. (2021).  HewlettPackard. 2021. Netperf. https:\/\/github.com\/HewlettPackard\/netperf. (2021)."},{"key":"e_1_3_2_2_23_1","volume-title":"Daniel Borkmann, John Fastabend, Tom Herbert, David Ahern, and David Miller.","author":"H\u00f8iland-J\u00f8rgensen Toke","year":"2018","unstructured":"Toke H\u00f8iland-J\u00f8rgensen , Jesper Dangaard Brouer , Daniel Borkmann, John Fastabend, Tom Herbert, David Ahern, and David Miller. 2018 . The eXpress Data Path : Fast Programmable Packet Processing in the Operating System Kernel. In ACM CoNEXT. Toke H\u00f8iland-J\u00f8rgensen, Jesper Dangaard Brouer, Daniel Borkmann, John Fastabend, Tom Herbert, David Ahern, and David Miller. 2018. The eXpress Data Path: Fast Programmable Packet Processing in the Operating System Kernel. In ACM CoNEXT."},{"key":"e_1_3_2_2_24_1","unstructured":"Jaehyun Hwang Qizhe Cai Ao Tang and Rachit Agarwal. 2020. TCP &asymp; RDMA: CPU-efficient Remote Storage Access with i10. In USENIX NSDI.  Jaehyun Hwang Qizhe Cai Ao Tang and Rachit Agarwal. 2020. TCP &asymp; RDMA: CPU-efficient Remote Storage Access with i10. In USENIX NSDI ."},{"key":"e_1_3_2_2_25_1","unstructured":"Intel. 2012. Intel\u00ae Data Direct I\/O Technology. https:\/\/www.intel.com\/content\/dam\/www\/public\/us\/en\/documents\/technology-briefs\/data-direct-i-o-technology-brief.pdf. (2012).  Intel. 2012. Intel\u00ae Data Direct I\/O Technology. https:\/\/www.intel.com\/content\/dam\/www\/public\/us\/en\/documents\/technology-briefs\/data-direct-i-o-technology-brief.pdf. (2012)."},{"key":"e_1_3_2_2_26_1","unstructured":"Intel. 2020. SPDK NVMe-oF TCP Performance Report. https:\/\/ci.spdk.io\/download\/performance-reports\/SPDK_tcp_perf_report_2010.pdf. (2020).  Intel. 2020. SPDK NVMe-oF TCP Performance Report. https:\/\/ci.spdk.io\/download\/performance-reports\/SPDK_tcp_perf_report_2010.pdf. (2020)."},{"key":"e_1_3_2_2_27_1","unstructured":"EunYoung Jeong Shinae Woo Muhammad Jamshed Haewon Jeong Sunghwan Ihm Dongsu Han and KyoungSoo Park. 2014. mTCP: a Highly Scalable User-level TCP Stack for Multicore Systems. In USENIX NSDI.  EunYoung Jeong Shinae Woo Muhammad Jamshed Haewon Jeong Sunghwan Ihm Dongsu Han and KyoungSoo Park. 2014. mTCP: a Highly Scalable User-level TCP Stack for Multicore Systems. In USENIX NSDI ."},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"crossref","unstructured":"Srikanth Kandula Sudipta Sengupta Albert Greenberg Parveen Patel and Ronnie Chaiken. 2009. The nature of data center traffic: measurements & analysis. In IMC.  Srikanth Kandula Sudipta Sengupta Albert Greenberg Parveen Patel and Ronnie Chaiken. 2009. The nature of data center traffic: measurements & analysis. In IMC .","DOI":"10.1145\/1644893.1644918"},{"key":"e_1_3_2_2_29_1","volume-title":"Linux Plumbers Conference.","author":"Karlsson Magnus","year":"2018","unstructured":"Magnus Karlsson and Bj\u00f6rn T\u00f6pel . 2018 . The Path to DPDK Speeds for AF XDP . In Linux Plumbers Conference. Magnus Karlsson and Bj\u00f6rn T\u00f6pel. 2018. The Path to DPDK Speeds for AF XDP. In Linux Plumbers Conference."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3302424.3303985"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341302.3342085"},{"key":"e_1_3_2_2_32_1","unstructured":"Xiaofeng Lin Yu Chen Xiaodong Li Junjie Mao Jiaquan He Wei Xu and Yuanchun Shi. 2016. Scalable Kernel TCP Design and Implementation for Short-Lived Connections. In ACM ASPLOS.  Xiaofeng Lin Yu Chen Xiaodong Li Junjie Mao Jiaquan He Wei Xu and Yuanchun Shi. 2016. Scalable Kernel TCP Design and Implementation for Short-Lived Connections. In ACM ASPLOS ."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2740070.2626311"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"crossref","unstructured":"Radhika Mittal Alexander Shpiner Aurojit Panda Eitan Zahavi Arvind Krishnamurthy Sylvia Ratnasamy and Scott Shenker. 2018. Revisiting Network Support for RDMA. In ACM SIGCOMM.  Radhika Mittal Alexander Shpiner Aurojit Panda Eitan Zahavi Arvind Krishnamurthy Sylvia Ratnasamy and Scott Shenker. 2018. Revisiting Network Support for RDMA. In ACM SIGCOMM .","DOI":"10.1145\/3230543.3230557"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3230543.3230564"},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"crossref","unstructured":"George Prekas Marios Kogias and Edouard Bugnion. 2017. ZygOS: Achieving Low Tail Latency for Microsecond-scale Networked Tasks. In ACM SOSP.  George Prekas Marios Kogias and Edouard Bugnion. 2017. ZygOS: Achieving Low Tail Latency for Microsecond-scale Networked Tasks. In ACM SOSP .","DOI":"10.1145\/3132747.3132780"},{"key":"e_1_3_2_2_37_1","unstructured":"Quoc-Thai V Le Jonathan Stern and Stephen M Brenner. 2017. Fast memcpy with SPDK and Intel I\/OAT DMA Engine. rmhttps:\/\/software.intel.com\/content\/www\/us\/en\/develop\/articles\/fast-memcpy-using-spdk-and-ioat-dma-engine.html. (2017).  Quoc-Thai V Le Jonathan Stern and Stephen M Brenner. 2017. Fast memcpy with SPDK and Intel I\/OAT DMA Engine. rmhttps:\/\/software.intel.com\/content\/www\/us\/en\/develop\/articles\/fast-memcpy-using-spdk-and-ioat-dma-engine.html. (2017)."},{"key":"e_1_3_2_2_38_1","unstructured":"Livio Soares and Michael Stumm. 2010. FlexSC: Flexible System Call Scheduling with Exception-Less System Calls. In USENIX OSDI.  Livio Soares and Michael Stumm. 2010. FlexSC: Flexible System Call Scheduling with Exception-Less System Calls. In USENIX OSDI ."},{"key":"e_1_3_2_2_39_1","unstructured":"Amin Tootoonchian Aurojit Panda Chang Lan Melvin Walls Katerina Argyraki Sylvia Ratnasamy and Scott Shenker. 2018. ResQ: Enabling SLOs in Network Function Virtualization. In USENIX NSDI.  Amin Tootoonchian Aurojit Panda Chang Lan Melvin Walls Katerina Argyraki Sylvia Ratnasamy and Scott Shenker. 2018. ResQ: Enabling SLOs in Network Function Virtualization. In USENIX NSDI ."},{"key":"e_1_3_2_2_40_1","unstructured":"Vijay Vasudevan David G. Andersen and Michael Kaminsky. 2011. The Case for VOS: The Vector Operating System. In USENIX HotOS.  Vijay Vasudevan David G. Andersen and Michael Kaminsky. 2011. The Case for VOS: The Vector Operating System. In USENIX HotOS ."},{"key":"e_1_3_2_2_41_1","unstructured":"Kenichi Yasukata Michio Honda Douglas Santry and Lars Eggert. 2016. StackMap: Low-Latency Networking with the OS Stack and Dedicated NICs. In USENIX ATC.  Kenichi Yasukata Michio Honda Douglas Santry and Lars Eggert. 2016. StackMap: Low-Latency Networking with the OS Stack and Dedicated NICs. In USENIX ATC ."},{"key":"e_1_3_2_2_42_1","unstructured":"Neal Cardwell Yuchung Cheng. [n. d.]. Making Linux TCP Fast. \"https:\/\/netdevconf.info\/1.2\/papers\/bbr-netdev-1.2.new.new.pdf\". ([n. d.]).  Neal Cardwell Yuchung Cheng. [n. d.]. Making Linux TCP Fast. \"https:\/\/netdevconf.info\/1.2\/papers\/bbr-netdev-1.2.new.new.pdf\". ([n. d.])."},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2829988.2787484"}],"event":{"name":"SIGCOMM '21: ACM SIGCOMM 2021 Conference","location":"Virtual Event USA","acronym":"SIGCOMM '21","sponsor":["SIGCOMM ACM Special Interest Group on Data Communication"]},"container-title":["Proceedings of the 2021 ACM SIGCOMM 2021 Conference"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3452296.3472888","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3452296.3472888","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3452296.3472888","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:01:13Z","timestamp":1750197673000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3452296.3472888"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,9]]},"references-count":43,"alternative-id":["10.1145\/3452296.3472888","10.1145\/3452296"],"URL":"https:\/\/doi.org\/10.1145\/3452296.3472888","relation":{},"subject":[],"published":{"date-parts":[[2021,8,9]]},"assertion":[{"value":"2021-08-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}