{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T01:05:49Z","timestamp":1773277549475,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":107,"publisher":"ACM","funder":[{"name":"NSF","award":["2339755"],"award-info":[{"award-number":["2339755"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,11,17]]},"DOI":"10.1145\/3772356.3772389","type":"proceedings-article","created":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T12:02:48Z","timestamp":1763380968000},"page":"289-299","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Server Chiplet Networking"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-6928-6895","authenticated-orcid":false,"given":"Seunghyun","family":"An","sequence":"first","affiliation":[{"name":"University of Wisconsin-Madison, Madison, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3084-092X","authenticated-orcid":false,"given":"Joontaek","family":"Oh","sequence":"additional","affiliation":[{"name":"University of Wisconsin-Madison, Madison, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6509-9449","authenticated-orcid":false,"given":"Ming","family":"Liu","sequence":"additional","affiliation":[{"name":"University of Wisconsin-Madison, Madison, USA"}]}],"member":"320","published-online":{"date-parts":[[2025,11,17]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation (NSDI'12)","author":"Alizadeh Mohammad","year":"2012","unstructured":"Mohammad Alizadeh, Abdul Kabbani, Tom Edsall, Balaji Prabhakar, Amin Vahdat, and Masato Yasuda. 2012. Less is more: trading a little bandwidth for ultra-low latency in the data center. In Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation (NSDI'12)."},{"key":"e_1_3_2_1_2_1","volume-title":"Proceedings of the 44th Annual International Symposium on Computer Architecture.","author":"Bin Altaf Muhammad Shoaib","unstructured":"Muhammad Shoaib Bin Altaf and David A. Wood. 2017. LogCA: A High-Level Performance Model for Hardware Accelerators. In Proceedings of the 44th Annual International Symposium on Computer Architecture."},{"key":"e_1_3_2_1_3_1","unstructured":"AMD. 2025. AMD Infinity Fabric. https:\/\/www.amd.com\/content\/dam\/amd\/en\/documents\/instinct-tech-docs\/other\/56978.pdf."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3015146"},{"key":"e_1_3_2_1_5_1","volume-title":"Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (SOSP'09)","author":"Baumann Andrew","year":"2009","unstructured":"Andrew Baumann, Paul Barham, Pierre-Evariste Dagand, Tim Harris, Rebecca Isaacs, Simon Peter, Timothy Roscoe, Adrian Sch\u00fcpbach, and Akhilesh Singhania. 2009. The multikernel: a new OS architecture for scalable multicore systems. In Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (SOSP'09). 29\u201344."},{"key":"e_1_3_2_1_6_1","unstructured":"Timo Bingmann. 2013. PMBW. http:\/\/panthema.net\/2013\/pmbw\/."},{"key":"e_1_3_2_1_7_1","volume-title":"Proceedings of the ACM SIGCOMM 2013 Conference on SIGCOMM (SIGCOMM'13)","author":"Bosshart Pat","year":"2013","unstructured":"Pat Bosshart, Glen Gibb, Hun-Seok Kim, George Varghese, Nick McKeown, Martin Izzard, Fernando Mujica, and Mark Horowitz. 2013. Forwarding metamorphosis: fast programmable match-action processing in hardware for SDN. In Proceedings of the ACM SIGCOMM 2013 Conference on SIGCOMM (SIGCOMM'13). 99\u2013110."},{"key":"e_1_3_2_1_8_1","volume-title":"Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation (OSDI'08)","author":"Boyd-Wickizer Silas","year":"2008","unstructured":"Silas Boyd-Wickizer, Haibo Chen, Rong Chen, Yandong Mao, Frans Kaashoek, Robert Morris, Aleksey Pesterev, Lex Stein, Ming Wu, Yuehua Dai, Yang Zhang, and Zheng Zhang. 2008. Corey: an operating system for many cores. In Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation (OSDI'08). 43\u201357."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/1924943.1924944"},{"key":"e_1_3_2_1_10_1","volume-title":"2022 IEEE International Solid-State Circuits Conference (ISSCC)","volume":"65","author":"Burd Thomas","year":"2022","unstructured":"Thomas Burd, Wilson Li, James Pistole, Srividhya Venkataraman, Michael McCabe, Timothy Johnson, James Vinh, Thomas Yiu, Mark Wasio, Hon-Hin Wong, Daryl Lieu, Jonathan White, Benjamin Munger, Joshua Lindner, Javin Olson, Steven Bakke, Jeshuah Sniderman, Carson Henrion, Russell Schreiber, Eric Busta, Brett Johnson, Tim Jackson, Aron Miller, Ryan Miller, Matthew Pickett, Aaron Horiuchi, Josef Dvorak, Sabeesh Balagangadharan, Sajeesh Ammikkallingal, and Pankaj Kumar. 2022. Zen3: The AMD 2nd-Generation 7nm x86-64 Microprocessor Core. In 2022 IEEE International Solid-State Circuits Conference (ISSCC), Vol. 65. 1\u20133."},{"key":"e_1_3_2_1_11_1","volume-title":"2024 IEEE International Solid-State Circuits Conference (ISSCC)","volume":"67","author":"Burd Thomas","year":"2024","unstructured":"Thomas Burd, Srividhya Venkataraman, Wilson Li, Timothy Johnson, Jerry Lee, Srikirti Velaga, Mark Wasio, Thomas Yiu, Franklin Bodine, Michael McCabe, Udin Salim, Santosh Kumar Thouta, Michael Golden, Sowmya Ramachandran, Gokul Subramani Lakshmi Devi, John Wuu, Yarek Kuszczak, Gaurav Singla, Carson Henrion, Andy Robison, Sabeesh Balagangadharan, Umesh Nair, Naveen Srivastava, Hari Prasad, Mohini Polimetla, Phaneendra Chennupati, Eshwar Gupta, Mahesh Vykuntam, Sumantra Sarkar, Praveen Kumar Duvvuru, Theja Mardi, and G Swetha. 2024. 2.2 \"Zen 4c\": The AMD 5nm Area-Optimized x86-64 Microprocessor Core. In 2024 IEEE International Solid-State Circuits Conference (ISSCC), Vol. 67. 38\u201340."},{"key":"e_1_3_2_1_12_1","unstructured":"Cadence. 2025. The Cadence Allegro X Design Platform. https:\/\/www.cadence.com\/en_US\/home\/tools\/pcb-design-and-analysis\/allegro-x-design-platform.html."},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the ACM SIGCOMM 2022 Conference (SIGCOMM'22)","author":"Cai Qizhe","year":"2022","unstructured":"Qizhe Cai, Midhul Vuppalapati, Jaehyun Hwang, Christos Kozyrakis, and Rachit Agarwal. 2022. Towards us tail latency and terabit ethernet: disaggregating the host network stack. In Proceedings of the ACM SIGCOMM 2022 Conference (SIGCOMM'22). 767\u2013779."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICNP61940.2024.10858560"},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of 2022 IEEE Hot Chips 34 Symposium (HCS). IEEE Computer Society, 1\u201346","author":"Choquette Jack","year":"2022","unstructured":"Jack Choquette. 2022. Nvidia hopper gpu: Scaling performance. In Proceedings of 2022 IEEE Hot Chips 34 Symposium (HCS). IEEE Computer Society, 1\u201346."},{"key":"e_1_3_2_1_16_1","volume-title":"Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (SOSP'13)","author":"Clements Austin T.","year":"2013","unstructured":"Austin T. Clements, M. Frans Kaashoek, Nickolai Zeldovich, Robert T. Morris, and Eddie Kohler. 2013. The scalable commutativity rule: designing scalable software for multicore processors. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (SOSP'13). 1\u201317."},{"key":"e_1_3_2_1_17_1","volume-title":"Breaking Boundaries: AmpereOne's Disaggregation Strategy for the Next-Gen Cloud. https:\/\/amperecomputing.com\/blogs\/next-gen-cloud.","author":"Computing Ampere","year":"2024","unstructured":"Ampere Computing. 2024. Breaking Boundaries: AmpereOne's Disaggregation Strategy for the Next-Gen Cloud. https:\/\/amperecomputing.com\/blogs\/next-gen-cloud."},{"key":"e_1_3_2_1_18_1","unstructured":"The UCIe Consortium. 2025. Universal Chiplet Interconnect Express (UCIe). https:\/\/www.uciexpress.org."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/155332.155333"},{"key":"e_1_3_2_1_20_1","volume-title":"Slides from Linux Kongress","author":"De Melo Arnaldo Carvalho","unstructured":"Arnaldo Carvalho De Melo. 2010. The new linux'perf'tools. In Slides from Linux Kongress, Vol. 18. 1\u201342."},{"key":"e_1_3_2_1_21_1","unstructured":"The devicetree.org community. 2025. The Devicetree Specification. https:\/\/www.devicetree.org."},{"key":"e_1_3_2_1_22_1","volume-title":"15th USENIX Symposium on Networked Systems Design and Implementation (NSDI'18)","author":"Firestone Daniel","year":"2018","unstructured":"Daniel Firestone, Andrew Putnam, Sambhrama Mundkur, Derek Chiou, Alireza Dabagh, Mike Andrewartha, Hari Angepat, Vivek Bhanu, Adrian Caulfield, Eric Chung, Harish Kumar Chandrappa, Somesh Chaturmohta, Matt Humphrey, Jack Lavier, Norman Lam, Fengfen Liu, Kalin Ovtcharov, Jitu Padhye, Gautham Popuri, Shachar Raindel, Tejas Sapre, Mark Shaw, Gabriel Silva, Madhan Sivakumar, Nisheeth Srivastava, Anshuman Verma, Qasim Zuhair, Deepak Bansal, Doug Burger, Kushagra Vaid, David A. Maltz, and Albert Greenberg. 2018. Azure accelerated networking: SmartNICs in the public cloud. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI'18). 51\u201366."},{"key":"e_1_3_2_1_23_1","unstructured":"GigaIO. 2025. GigaIO SuperNode. https:\/\/gigaio.com\/supernode\/."},{"key":"e_1_3_2_1_24_1","unstructured":"Groq. 2025. GroqRack Compute Cluster. https:\/\/groq.com\/groqrack\/."},{"key":"e_1_3_2_1_25_1","volume-title":"Express Cube Topologies for on-Chip Interconnects. In 2009 IEEE 15th International Symposium on High Performance Computer Architecture. 163\u2013174","author":"Grot Boris","year":"2009","unstructured":"Boris Grot, Joel Hestness, Stephen W. Keckler, and Onur Mutlu. 2009. Express Cube Topologies for on-Chip Interconnects. In 2009 IEEE 15th International Symposium on High Performance Computer Architecture. 163\u2013174."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613424.3614291"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3603269.3604880"},{"key":"e_1_3_2_1_28_1","unstructured":"NVIDIA H100. 2024. NVIDIA H100 Tensor Core GPU. https:\/\/www.nvidia.com\/en-us\/data-center\/h100\/."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3575693.3575708"},{"key":"e_1_3_2_1_30_1","volume-title":"Understanding Routable PCIe Performance for Composable Infrastructures. In 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI'24)","author":"Hou Wentao","year":"2024","unstructured":"Wentao Hou, Jie Zhang, Zeke Wang, and Ming Liu. 2024. Understanding Routable PCIe Performance for Composable Infrastructures. In 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI'24). 297\u2013312."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1119772.1119818"},{"key":"e_1_3_2_1_32_1","volume-title":"15th USENIX Symposium on Operating Systems Design and Implementation (OSDI'21)","author":"Hwang Jaehyun","year":"2021","unstructured":"Jaehyun Hwang, Midhul Vuppalapati, Simon Peter, and Rachit Agarwal. 2021. Rearchitecting linux storage stack for \u03bcs latency and high throughput. In 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI'21). 113\u2013128."},{"key":"e_1_3_2_1_33_1","unstructured":"IDTechEx. 2025. Chiplet Technology 2025-2035: Technology Opportunities Applications. https:\/\/www.idtechex.com\/en\/research-report\/chiplet-technology-2025-2035-technology-opportunities-applications\/1041."},{"key":"e_1_3_2_1_34_1","unstructured":"Intel. 2024. Software-Defined Vehicle Transformation Starts with Intel. https:\/\/download.intel.com\/newsroom\/2024\/automotive\/Intel-SDV-Demo-Fact-Sheet.pdf."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.5555\/3767955.3768016"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.5555\/3767955.3768015"},{"key":"e_1_3_2_1_37_1","unstructured":"Keysight. 2024. What is a Chiplet and Why Should You Care? https:\/\/www.keysight.com\/blogs\/en\/tech\/sim-des\/2024\/2\/8\/what-is-a-chiplet-and-why-should-you-care."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2008.19"},{"key":"e_1_3_2_1_39_1","volume-title":"Proceedings of the ACM Special Interest Group on Data Communication (SIGCOMM'19)","author":"Kumar Praveen","year":"2019","unstructured":"Praveen Kumar, Nandita Dukkipati, Nathan Lewis, Yi Cui, Yaogong Wang, Chonggang Li, Valas Valancius, Jake Adriaens, Steve Gribble, Nate Foster, and Amin Vahdat. 2019. Picnic: predictable virtualized nic. In Proceedings of the ACM Special Interest Group on Data Communication (SIGCOMM'19). 351\u2013366."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3317550.3321447"},{"key":"e_1_3_2_1_41_1","volume-title":"Proceedings of the ACM SIGCOMM 2025 Conference (SIGCOMM'25)","author":"Li Xiao","year":"2025","unstructured":"Xiao Li, Zerui Guo, Yuebin Bai, Mehash Ketkar, Hugh Willkinson, and Ming Liu. 2025. Understanding and Profiling CXL.mem Using PathFinder. In Proceedings of the ACM SIGCOMM 2025 Conference (SIGCOMM'25)."},{"key":"e_1_3_2_1_42_1","volume-title":"Building Distributed Systems Using Programmable Networks","author":"Liu Ming","unstructured":"Ming Liu. 2020. Building Distributed Systems Using Programmable Networks. University of Washington."},{"key":"e_1_3_2_1_43_1","volume-title":"Fabric-Centric Computing. In Proceedings of the 19th Workshop on Hot Topics in Operating Systems (HotOS'23)","author":"Liu Ming","year":"2023","unstructured":"Ming Liu. 2023. Fabric-Centric Computing. In Proceedings of the 19th Workshop on Hot Topics in Operating Systems (HotOS'23). 118\u2013126."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341302.3342079"},{"key":"e_1_3_2_1_45_1","volume-title":"17th USENIX Symposium on Networked Systems Design and Implementation (NSDI'20)","author":"Liu Ming","year":"2020","unstructured":"Ming Liu, Arvind Krishnamurthy, Harsha V. Madhyastha, Rishi Bhardwaj, Karan Gupta, Chinmay Kamat, Huapeng Yuan, Aditya Jaltade, Roger Liao, Pavan Konka, and Anoop Jawahar. 2020. Fine-Grained Replicated State Machines for a Cluster Storage System. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI'20). 305\u2013323."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037731"},{"key":"e_1_3_2_1_47_1","volume-title":"2019 USENIX Annual Technical Conference (USENIX ATC'19)","author":"Liu Ming","year":"2019","unstructured":"Ming Liu, Simon Peter, Arvind Krishnamurthy, and Phitchaya Mangpo Phothilimthana. 2019. E3: Energy-Efficient Microservices on SmartNIC-Accelerated Servers. In 2019 USENIX Annual Technical Conference (USENIX ATC'19). 363\u2013378."},{"key":"e_1_3_2_1_48_1","volume-title":"Workshop on Approximate Computing Across the Stack.","author":"Luo Liang","year":"2017","unstructured":"Liang Luo, Ming Liu, Jacob Nelson, Luis Ceze, Amar Phanishayee, and Arvind Krishnamurthy. 2017. Motivating in-network aggregation for distributed deep neural network training. In Workshop on Approximate Computing Across the Stack."},{"key":"e_1_3_2_1_49_1","volume-title":"Proceedings of 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI'24)","author":"Luo Zhihong","year":"2024","unstructured":"Zhihong Luo, Sam Son, Sylvia Ratnasamy, and Scott Shenker. 2024. Harvesting Memory-bound CPU Stall Cycles in Software with MSH. In Proceedings of 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI'24). 57\u201375."},{"key":"e_1_3_2_1_50_1","volume-title":"Yoav Shoham, Russell Wald, Tobi Walsh, Armin Hamrah, Lapo Santarlasci, Julia Betts Lotufo, Alexandra Rome, Andrew Shi, and Sukrut Oak.","author":"Maslej Nestor","year":"2025","unstructured":"Nestor Maslej, Loredana Fattorini, Raymond Perrault, Yolanda Gil, Vanessa Parli, Njenga Kariuki, Emily Capstick, Anka Reuel, Erik Brynjolfsson, John Etchemendy, Katrina Ligett, Terah Lyons, James Manyika, Juan Carlos Niebles, Yoav Shoham, Russell Wald, Tobi Walsh, Armin Hamrah, Lapo Santarlasci, Julia Betts Lotufo, Alexandra Rome, Andrew Shi, and Sukrut Oak. 2025. Artificial Intelligence Index Report 2025. arXiv:2504.07139 [cs.AI] https:\/\/arxiv.org\/abs\/2504.07139"},{"key":"e_1_3_2_1_51_1","volume-title":"Proceedings of the 2002 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (SIGCOMM'02)","author":"Medina A.","unstructured":"A. Medina, N. Taft, K. Salamatian, S. Bhattacharyya, and C. Diot. 2002. Traffic matrix estimation: existing techniques and new directions. In Proceedings of the 2002 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (SIGCOMM'02). 161\u2013174."},{"key":"e_1_3_2_1_52_1","volume-title":"Hwan Doh, and Arvind Krishnamurthy. 2021. Gimbal: enabling multi-tenant storage disaggregation on SmartNIC JBOFs. In Proceedings of the 2021 ACM SIGCOMM 2021 Conference (SIGCOMM'21)","author":"Min Jaehong","unstructured":"Jaehong Min, Ming Liu, Tapan Chugh, Chenxingyu Zhao, Andrew Wei, In Hwan Doh, and Arvind Krishnamurthy. 2021. Gimbal: enabling multi-tenant storage disaggregation on SmartNIC JBOFs. In Proceedings of the 2021 ACM SIGCOMM 2021 Conference (SIGCOMM'21). 106\u2013122."},{"key":"e_1_3_2_1_53_1","volume-title":"17th USENIX Symposium on Operating Systems Design and Implementation (OSDI'23)","author":"Min Jaehong","year":"2023","unstructured":"Jaehong Min, Chenxingyu Zhao, Ming Liu, and Arvind Krishnamurthy. 2023. eZNS: An Elastic Zoned Namespace for Commodity ZNS SSDs. In 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI'23). 461\u2013477."},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/3653716"},{"key":"e_1_3_2_1_55_1","volume-title":"Proceedings of the 36th Annual International Symposium on Computer Architecture. 196\u2013207","author":"Mutlu Thomas","year":"2009","unstructured":"Moscibroda, Thomas and Mutlu, Onur. 2009. A case for bufferless routing in on-chip networks. In Proceedings of the 36th Annual International Symposium on Computer Architecture. 196\u2013207."},{"key":"e_1_3_2_1_56_1","volume-title":"2024 IEEE International Solid-State Circuits Conference (ISSCC)","volume":"67","author":"Munch Ashley O.","year":"2024","unstructured":"Ashley O. Munch, Nevine Nassif, Carleton L. Molnar, Jason Crop, Rich Gammack, Chinmay P. Joshi, Goran Zelic, Kambiz Munshi, Min Huang, Charles R. Morganti, Sireesha Kandula, and Arijit Biswas. 2024. 2.3 Emerald Rapids: 5th-Generation Intel\u00ae Xeon\u00ae Scalable Processors. In 2024 IEEE International Solid-State Circuits Conference (ISSCC), Vol. 67. 40\u201342."},{"key":"e_1_3_2_1_57_1","volume-title":"2023 IEEE International Solid-State Circuits Conference (ISSCC). 38\u201339","author":"Munger Benjamin","year":"2023","unstructured":"Benjamin Munger, Kathy Wilcox, Jeshuah Sniderman, Chuck Tung, Brett Johnson, Russell Schreiber, Carson Henrion, Kevin Gillespie, Tom Burd, Harry Fair, David Johnson, Jonathan White, Scott McLelland, Steven Bakke, Javin Olson, Ryan McCracken, Matthew Pickett, Aaron Horiuchi, Hien Nguyen, and Tim H Jackson. 2023. \"Zen 4\": The AMD 5nm 5.7GHz x86-64 Microprocessor Core. In 2023 IEEE International Solid-State Circuits Conference (ISSCC). 38\u201339."},{"key":"e_1_3_2_1_58_1","volume-title":"2021 ACM\/IEEE 48th Annual International Symposium on Computer Architecture (ISCA). 57\u201370","author":"Naffziger Samuel","year":"2021","unstructured":"Samuel Naffziger, Noah Beck, Thomas Burd, Kevin Lepak, Gabriel H. Loh, Mahesh Subramony, and Sean White. 2021. Pioneering Chiplet Technology and Design for the AMD EPYC\u2122 and Ryzen\u2122 Processor Families : Industrial Product. In 2021 ACM\/IEEE 48th Annual International Symposium on Computer Architecture (ISCA). 57\u201370."},{"key":"e_1_3_2_1_59_1","volume-title":"Proceedings of 2022 IEEE International Solid-State Circuits Conference (ISSCC)","volume":"65","author":"Nassif Nevine","year":"2022","unstructured":"Nevine Nassif, Ashley O. Munch, Carleton L. Molnar, Gerald Pasdast, Sitaraman V. Lyer, Zibing Yang, Oscar Mendoza, Mark Huddart, Srikrishnan Venkataraman, Sireesha Kandula, Rafi Marom, Alexandra M. Kern, Bill Bowhill, David R. Mulvihill, Srikanth Nimmagadda, Varma Kalidindi, Jonathan Krause, Mohammad M. Haq, Roopali Sharma, and Kevin Duda. 2022. Sapphire Rapids: The Next-Generation Intel Xeon Scalable Processor. In Proceedings of 2022 IEEE International Solid-State Circuits Conference (ISSCC), Vol. 65. 44\u201346."},{"key":"e_1_3_2_1_60_1","volume-title":"Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication (SIGCOMM'18)","author":"Neugebauer Rolf","unstructured":"Rolf Neugebauer, Gianni Antichi, Jos\u00e9 Fernando Zazo, Yury Audzevich, Sergio L\u00f3pez-Buedo, and Andrew W. Moore. 2018. Understanding PCIe performance for end host networking. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication (SIGCOMM'18). 327\u2013341."},{"key":"e_1_3_2_1_61_1","unstructured":"The NextPlatform. 2023. AWS ADOPTS ARM V2 CORES FOR EXPANSIVE GRAVITON4 SERVER CPU. https:\/\/www.nextplatform.com\/2023\/11\/28\/aws-adopts-arm-v2-cores-for-expansive-graviton4-server-cpu\/."},{"key":"e_1_3_2_1_62_1","unstructured":"NVIDIA. 2025. NVIDIA GB200 NVL72. https:\/\/www.nvidia.com\/en-us\/data-center\/gb200-nvl72\/."},{"key":"e_1_3_2_1_63_1","unstructured":"NVIDIA. 2025. NVIDIA HGX Platform. https:\/\/www.nvidia.com\/en-us\/data-center\/hgx\/."},{"key":"e_1_3_2_1_64_1","volume-title":"Proceedings of 2006 IEEE International Symposium on Performance Analysis of Systems and Software.","author":"Patterson D.A.","year":"2006","unstructured":"D.A. Patterson. 2006. RAMP: research accelerator for multiple processors - a community vision for a shared experimental parallel HW\/SW platform. In Proceedings of 2006 IEEE International Symposium on Performance Analysis of Systems and Software."},{"key":"e_1_3_2_1_65_1","volume-title":"Proceedings of 1997 IEEE International Solids-State Circuits Conference. Digest of Technical Papers. 224\u2013225","author":"Patterson David","year":"1997","unstructured":"David Patterson, Thomas Anderson, Neal Cardwell, Richard Fromm, Kimberley Keeton, Christoforos Kozyrakis, Randi Thomas, and Katherine Yelick. 1997. Intelligent RAM (IRAM): Chips that remember and compute. In Proceedings of 1997 IEEE International Solids-State Circuits Conference. Digest of Technical Papers. 224\u2013225."},{"key":"e_1_3_2_1_66_1","volume-title":"A case for intelligent RAM","author":"Patterson David","year":"2002","unstructured":"David Patterson, Thomas Anderson, Neal Cardwell, Richard Fromm, Kimberly Keeton, Christoforos Kozyrakis, Randi Thomas, and Katherine Yelick. 2002. A case for intelligent RAM. IEEE micro 17, 2 (2002), 34\u201344."},{"key":"e_1_3_2_1_67_1","volume-title":"Floem: A Programming System for NIC-Accelerated Network Applications. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI'18)","author":"Phothilimthana Phitchaya Mangpo","year":"2018","unstructured":"Phitchaya Mangpo Phothilimthana, Ming Liu, Antoine Kaufmann, Simon Peter, Rastislav Bodik, and Thomas Anderson. 2018. Floem: A Programming System for NIC-Accelerated Network Applications. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI'18). 663\u2013679."},{"key":"e_1_3_2_1_68_1","unstructured":"The Next Platform. 2025. SILICON ONE G200 FINALLY DRIVES CISCO'S AI NETWORKING BUSINESS. https:\/\/www.nextplatform.com\/2025\/05\/21\/silicon-one-g200-finally-drives-ciscos-ai-networking-business\/."},{"key":"e_1_3_2_1_69_1","unstructured":"The Next Platform. 2025. THE AI DATACENTER IS RAVENOUS FOR 102.4 TB\/SEC ETHERNET SWITCH ASICS. https:\/\/www.nextplatform.com\/2025\/06\/03\/the-ai-datacenter-is-ravenous-for-102-4-tb-sec-ethernet\/."},{"key":"e_1_3_2_1_70_1","volume-title":"2021 IEEE Hot Chips 33 Symposium (HCS). 1\u201337","author":"Prabhakar Raghu","year":"2021","unstructured":"Raghu Prabhakar and Sumti Jairath. 2021. SambaNova SN10 RDU: Accelerating software 2.0 with dataflow. In 2021 IEEE Hot Chips 33 Symposium (HCS). 1\u201337."},{"key":"e_1_3_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISSCC42614.2022.9731612"},{"key":"e_1_3_2_1_72_1","volume-title":"2024 57th IEEE\/ACM International Symposium on Microarchitecture (MICRO). 1353\u20131366","author":"Prabhakar Raghu","year":"2024","unstructured":"Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Xiaoyan Li, et al. 2024. Sambanova sn40l: Scaling the ai memory wall with dataflow and composition of experts. In 2024 57th IEEE\/ACM International Symposium on Microarchitecture (MICRO). 1353\u20131366."},{"key":"e_1_3_2_1_73_1","unstructured":"The Open Compute Project. 2021. OpenHBI Specification Version 1.0. https:\/\/www.opencompute.org\/documents\/odsa-openhbi-v1-0-spec-rc-final-1-pdf."},{"key":"e_1_3_2_1_74_1","unstructured":"The Open Compute Project. 2022. Bunch of Wires PHY Specification. https:\/\/www.opencompute.org\/documents\/bunch-of-wires-phy-specification-pdf."},{"key":"e_1_3_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1145\/3422604.3425929"},{"key":"e_1_3_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/3477132.3483583"},{"key":"e_1_3_2_1_77_1","volume-title":"Proceedings of 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI'14)","author":"Radhakrishnan Sivasankar","year":"2014","unstructured":"Sivasankar Radhakrishnan, Yilong Geng, Vimalkumar Jeyakumar, Abdul Kabbani, George Porter, and Amin Vahdat. 2014. SENIC: Scalable NIC for End-Host rate limiting. In Proceedings of 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI'14). 475\u2013488."},{"key":"e_1_3_2_1_78_1","volume-title":"2022 IEEE High Performance Extreme Computing Conference (HPEC). 1\u201310","author":"Reuther Albert","year":"2022","unstructured":"Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, and Jeremy Kepner. 2022. AI and ML accelerator survey and trends. In 2022 IEEE High Performance Extreme Computing Conference (HPEC). 1\u201310."},{"key":"e_1_3_2_1_79_1","volume-title":"Proceedings of 2019 USENIX Annual Technical Conference (USENIX ATC'19). 379\u2013394","author":"Ruan Zhenyuan","year":"2019","unstructured":"Zhenyuan Ruan, Tong He, and Jason Cong. 2019. INSIDER: Designing In-Storage computing system for emerging High-Performance drive. In Proceedings of 2019 USENIX Annual Technical Conference (USENIX ATC'19). 379\u2013394."},{"key":"e_1_3_2_1_80_1","volume-title":"Proceedings of the Conference of the ACM Special Interest Group on Data Communication (SIGCOMM'17)","author":"Saeed Ahmed","year":"2017","unstructured":"Ahmed Saeed, Nandita Dukkipati, Vytautas Valancius, Vinh The Lam, Carlo Contavalli, and Amin Vahdat. 2017. Carousel: Scalable traffic shaping at end hosts. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication (SIGCOMM'17). 404\u2013417."},{"key":"e_1_3_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/3477132.3483555"},{"key":"e_1_3_2_1_82_1","volume-title":"Riduan Khaddam-Aljameh, and Evangelos Eleftheriou.","author":"Sebastian Abu","year":"2020","unstructured":"Abu Sebastian, Manuel Le Gallo, Riduan Khaddam-Aljameh, and Evangelos Eleftheriou. 2020. Memory devices and applications for in-memory computing. Nature nanotechnology 15, 7 (2020), 529\u2013544."},{"key":"e_1_3_2_1_83_1","volume-title":"Approximating Fair Queueing on Reconfigurable Switches. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI'18)","author":"Sharma Naveen Kr.","year":"2018","unstructured":"Naveen Kr. Sharma, Ming Liu, Kishore Atreya, and Arvind Krishnamurthy. 2018. Approximating Fair Queueing on Reconfigurable Switches. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI'18). 1\u201316."},{"key":"e_1_3_2_1_84_1","volume-title":"Programmable Calendar Queues for High-speed Packet Scheduling. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI'20)","author":"Sharma Naveen Kr.","year":"2020","unstructured":"Naveen Kr. Sharma, Chenxingyu Zhao, Ming Liu, Pravein G Kannan, Changhoon Kim, Arvind Krishnamurthy, and Anirudh Sivaraman. 2020. Programmable Calendar Queues for High-speed Packet Scheduling. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI'20). 685\u2013699."},{"key":"e_1_3_2_1_85_1","volume-title":"Proceedings of 2025 IEEE International Solid-State Circuits Conference (ISSCC)","volume":"68","author":"Singh Teja","year":"2025","unstructured":"Teja Singh, Spence Oliver, Sundar Rangarajan, Shane Southard, Carson Henrion, Alex Schaefer, Brett Johnson, Sarah Bartaszewicz Tower, Kathy Hoover, Deepesh John, Ted Antoniadis, Shravan Lakshman, Vibhor Mittal, Brian Kasprzyk, Ross McCoy, Kurt Mohlman, Anitha Mohan, Hon-Hin Wong, Daryl Lieu, Russell Schreiber, Sahilpreet Singh, Nick Lance, Darryl Prudich, Justin Coppin, Tim Jackson, Anita Karegar, Ryan Miller, Sabeesh Balagangadharan, James Pistole, Wilson Li, and Michael McCabe. 2025. \"Zen 5\": The AMD High-Performance 4nm x86-64 Microprocessor Core. In Proceedings of 2025 IEEE International Solid-State Circuits Conference (ISSCC), Vol. 68. 1\u20133."},{"key":"e_1_3_2_1_86_1","volume-title":"2020 IEEE International Solid-State Circuits Conference - (ISSCC). 42\u201344","author":"Singh Teja","year":"2020","unstructured":"Teja Singh, Sundar Rangarajan, Deepesh John, Russell Schreiber, Spence Oliver, Rajit Seahra, and Alex Schaefer. 2020. 2.1 Zen 2: The AMD 7nm Energy-Efficient High-Performance x86-64 Microprocessor Core. In 2020 IEEE International Solid-State Circuits Conference - (ISSCC). 42\u201344."},{"key":"e_1_3_2_1_87_1","volume-title":"Proceedings of 2024 IEEE International Solid-State Circuits Conference (ISSCC)","volume":"67","author":"Smith Alan","year":"2024","unstructured":"Alan Smith, Eric Chapman, Chintan Patel, Raja Swaminathan, John Wuu, Tyrone Huang, Wonjun Jung, Alexander Kaganov, Hugh McIntyre, and Ramon Mangaser. 2024. 11.1 AMD InstinctTM MI300 Series Modular Chiplet Package - HPC and AI Accelerator for Exa-Class Systems. In Proceedings of 2024 IEEE International Solid-State Circuits Conference (ISSCC), Vol. 67. 490\u2013492."},{"key":"e_1_3_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1145\/1375581.1375599"},{"key":"e_1_3_2_1_89_1","unstructured":"Synopsys. 2025. What are Chiplets. https:\/\/www.synopsys.com\/glossary\/what-are-chiplets.html."},{"key":"e_1_3_2_1_90_1","unstructured":"Microchip USA. 2023. What is a Chiplet? https:\/\/www.microchipusa.com\/industry-news\/what-is-a-chiplet."},{"key":"e_1_3_2_1_91_1","volume-title":"Proceedings of 2025 IEEE International Solid-State Circuits Conference (ISSCC)","volume":"68","author":"Varada Raj R.","year":"2025","unstructured":"Raj R. Varada, Rohini Krishnan, Ajith Subramonia, Rathish Chandran, Kalyana Chakravarthy, Uttpal D. Desai, Sumedha Limaye, Puneesh Puri, David R. Mulvihill, Mike Bichan, Martin Koolhaas, Vijayalakshmi Ramachandran, and Srinivasu Kendle. 2025. 2.3 Granite Rapids-D: Intel Xeon 6 SoC for vRAN, Edge, Networking, and Storage. In Proceedings of 2025 IEEE International Solid-State Circuits Conference (ISSCC), Vol. 68. 48\u201350."},{"key":"e_1_3_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1996.10476697"},{"key":"e_1_3_2_1_93_1","volume-title":"2024 IEEE Hot Chips 36 Symposium (HCS). 1\u201330","author":"Vasiljevic Jasmina","year":"2024","unstructured":"Jasmina Vasiljevic and Davor Capalija. 2024. Blackhole & tt-metalium: The standalone ai computer and its programming model. In 2024 IEEE Hot Chips 36 Symposium (HCS). 1\u201330."},{"key":"e_1_3_2_1_94_1","volume-title":"In-memory computing: Advances and prospects","author":"Verma Naveen","year":"2019","unstructured":"Naveen Verma, Hongyang Jia, Hossein Valavi, Yinqi Tang, Murat Ozatay, Lung-Yen Chen, Bonan Zhang, and Peter Deaville. 2019. In-memory computing: Advances and prospects. IEEE solid-state circuits magazine 11, 3 (2019), 43\u201355."},{"key":"e_1_3_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1145\/3651890.3672271"},{"key":"e_1_3_2_1_96_1","doi-asserted-by":"crossref","unstructured":"John Wawrzynek. 2015. Accelerating Science Driven System Design With RAMP. Technical Report. Univ. of California Berkeley CA (United States).","DOI":"10.2172\/1186854"},{"key":"e_1_3_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2749443"},{"key":"e_1_3_2_1_98_1","unstructured":"WikiChip. 2025. Chiplet. https:\/\/en.wikichip.org\/wiki\/chiplet."},{"key":"e_1_3_2_1_99_1","unstructured":"Wikipedia. 2025. 2.5D integrated circuit. https:\/\/en.wikipedia.org\/wiki\/2.5D_integrated_circuit."},{"key":"e_1_3_2_1_100_1","unstructured":"Wikipedia. 2025. Advanced packaging (semiconductors). https:\/\/en.wikipedia.org\/wiki\/Advanced_packaging_(semiconductors)."},{"key":"e_1_3_2_1_101_1","doi-asserted-by":"publisher","DOI":"10.1145\/1498765.1498785"},{"key":"e_1_3_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.5555\/3767955.3768021"},{"key":"e_1_3_2_1_103_1","volume-title":"2017 IEEE 33rd International Conference on Data Engineering (ICDE). 103\u2013106","author":"Yang Tong","year":"2017","unstructured":"Tong Yang, Lingtong Liu, Yibo Yan, Muhammad Shahzad, Yulong Shen, Xiaoming Li, Bin Cui, and Gaogang Xie. 2017. Sf-sketch: A fast, accurate, and memory efficient data structure to store frequencies of data items. In 2017 IEEE 33rd International Conference on Data Engineering (ICDE). 103\u2013106."},{"key":"e_1_3_2_1_104_1","volume-title":"RpcNIC: Enabling Efficient Datacenter RPC Offloading on PCIe-attached SmartNICs. In 2025 IEEE International Symposium on High Performance Computer Architecture (HPCA'25)","author":"Zhang Jie","year":"2025","unstructured":"Jie Zhang, Hongjing Huang, Xuzheng Chen, Xiang Li, Jieru Zhao, Ming Liu, and Zeke Wang. 2025. RpcNIC: Enabling Efficient Datacenter RPC Offloading on PCIe-attached SmartNICs. In 2025 IEEE International Symposium on High Performance Computer Architecture (HPCA'25). 1379\u20131394."},{"key":"e_1_3_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1145\/3489048.3530970"},{"key":"e_1_3_2_1_106_1","volume-title":"White-Boxing RDMA with Packet-Granular Software Control. In 22nd USENIX Symposium on Networked Systems Design and Implementation (NSDI'25)","author":"Zhao Chenxingyu","year":"2025","unstructured":"Chenxingyu Zhao, Jaehong Min, Ming Liu, and Arvind Krishnamurthy. 2025. White-Boxing RDMA with Packet-Granular Software Control. In 22nd USENIX Symposium on Networked Systems Design and Implementation (NSDI'25). 427\u2013449."},{"key":"e_1_3_2_1_107_1","unstructured":"Pengfei Zuo Huimin Lin Junbo Deng Nan Zou Xingkun Yang Yingyu Diao Weifeng Gao Ke Xu Zhangyu Chen Shirui Lu et al. 2025. Serving Large Language Models on Huawei CloudMatrix384. arXiv preprint arXiv:2506.12708 (2025)."}],"event":{"name":"HotNets '25: 24th ACM Workshop on Hot Topics in Networks","location":"UMD Campus College Park MD USA","acronym":"HotNets '25","sponsor":["SIGCOMM ACM Special Interest Group on Data Communication"]},"container-title":["Proceedings of the 24th ACM Workshop on Hot Topics in Networks"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3772356.3772389","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T12:05:14Z","timestamp":1763381114000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3772356.3772389"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,17]]},"references-count":107,"alternative-id":["10.1145\/3772356.3772389","10.1145\/3772356"],"URL":"https:\/\/doi.org\/10.1145\/3772356.3772389","relation":{},"subject":[],"published":{"date-parts":[[2025,11,17]]},"assertion":[{"value":"2025-11-17","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}