{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T14:11:29Z","timestamp":1773843089049,"version":"3.50.1"},"reference-count":67,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2024,1,19]],"date-time":"2024-01-19T00:00:00Z","timestamp":1705622400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62072196"],"award-info":[{"award-number":["62072196"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100015956","name":"Key Research and Development Program of Guangdong Province","doi-asserted-by":"crossref","award":["2021B0101400003"],"award-info":[{"award-number":["2021B0101400003"]}],"id":[{"id":"10.13039\/501100015956","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Creative Research Group Project of NSFC","award":["61821003"],"award-info":[{"award-number":["61821003"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2024,3,31]]},"abstract":"<jats:p>Memory disaggregation is a promising architecture for modern datacenters that separates compute and memory resources into independent pools connected by ultra-fast networks, which can improve memory utilization, reduce cost, and enable elastic scaling of compute and memory resources. However, existing memory disaggregation solutions based on remote direct memory access (RDMA) suffer from high latency and additional overheads including page faults and code refactoring. Emerging cache-coherent interconnects such as CXL offer opportunities to reconstruct high-performance memory disaggregation. 
However, existing CXL-based approaches have physical distance limitation and cannot be deployed across racks.<\/jats:p><jats:p>In this article, we propose Rcmp, a novel low-latency and highly scalable memory disaggregation system based on RDMA and CXL. The significant feature is that Rcmp improves the performance of RDMA-based systems via CXL, and leverages RDMA to overcome CXL\u2019s distance limitation. To address the challenges of the mismatch between RDMA and CXL in terms of granularity, communication, and performance, Rcmp (1) provides a global page-based memory space management and enables fine-grained data access, (2) designs an efficient communication mechanism to avoid communication blocking issues, (3)\u00a0proposes a hot-page identification and swapping strategy to reduce RDMA communications, and (4) designs an RDMA-optimized RPC framework to accelerate RDMA transfers. We implement a prototype of Rcmp and evaluate its performance by using micro-benchmarks and running a key-value store with YCSB benchmarks. The results show that Rcmp can achieve 5.2\u00d7 lower latency and 3.8\u00d7 higher throughput than RDMA-based systems. 
We also demonstrate that Rcmp can scale well with the increasing number of nodes without compromising performance.<\/jats:p>","DOI":"10.1145\/3634916","type":"journal-article","created":{"date-parts":[[2024,1,19]],"date-time":"2024-01-19T12:39:45Z","timestamp":1705667985000},"page":"1-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Rcmp: Reconstructing RDMA-Based Memory Disaggregation via CXL"],"prefix":"10.1145","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1370-482X","authenticated-orcid":false,"given":"Zhonghua","family":"Wang","sequence":"first","affiliation":[{"name":"Huazhong University of Science and Technology, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-1666-5820","authenticated-orcid":false,"given":"Yixing","family":"Guo","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7757-4083","authenticated-orcid":false,"given":"Kai","family":"Lu","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3440-4460","authenticated-orcid":false,"given":"Jiguang","family":"Wan","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-8169-294X","authenticated-orcid":false,"given":"Daohui","family":"Wang","sequence":"additional","affiliation":[{"name":"Huawei Cloud, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9358-9373","authenticated-orcid":false,"given":"Ting","family":"Yao","sequence":"additional","affiliation":[{"name":"Huawei Cloud, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-7971-5014","authenticated-orcid":false,"given":"Huatao","family":"Wu","sequence":"additional","affiliation":[{"name":"Huawei Cloud, 
China"}]}],"member":"320","published-online":{"date-parts":[[2024,1,19]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"GitHub. 2023. FUSE (Filesystem in Userspace). Retrieved December 8 2023 from http:\/\/libfuse.github.io\/"},{"issue":"1","key":"e_1_3_1_3_2","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1145\/3606557.3606563","article-title":"Memory disaggregation: Why now and what are the challenges","volume":"57","author":"Aguilera Marcos K.","year":"2023","unstructured":"Marcos K. Aguilera, Emmanuel Amaro, Nadav Amit, Erika Hunhoff, Anil Yelam, and Gerd Zellweger. 2023. Memory disaggregation: Why now and what are the challenges. ACM SIGOPS Operating Systems Review 57, 1 (2023), 38\u201346.","journal-title":"ACM SIGOPS Operating Systems Review"},{"key":"e_1_3_1_4_2","first-page":"1","volume-title":"Proceedings of the 15th European Conference on Computer Systems","author":"Amaro Emmanuel","year":"2020","unstructured":"Emmanuel Amaro, Christopher Branner-Augmon, Zhihong Luo, Amy Ousterhout, Marcos K. Aguilera, Aurojit Panda, Sylvia Ratnasamy, and Scott Shenker. 2020. Can far memory improve job throughput? In Proceedings of the 15th European Conference on Computer Systems. 1\u201316."},{"key":"e_1_3_1_5_2","first-page":"387","volume-title":"Proceedings of the 19th USENIX Conference on File and Storage Technologies (FAST\u201921)","author":"Bae Jonghyun","year":"2021","unstructured":"Jonghyun Bae, Jongsung Lee, Yunho Jin, Sam Son, Shine Kim, Hakbeom Jang, Tae Jun Ham, and Jae W. Lee. 2021. FlashNeuron: SSD-enabled large-batch training of very deep neural networks. In Proceedings of the 19th USENIX Conference on File and Storage Technologies (FAST\u201921). 387\u2013401."},{"key":"e_1_3_1_6_2","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-031-01741-4","author":"Barroso Luiz Andr\u00e9","year":"2013","unstructured":"Luiz Andr\u00e9 Barroso, Jimmy Clidaras, and Urs H\u00f6lzle. 2013. 
The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines (2nd ed.). Synthesis Lectures on Computer Architecture. Morgan & Claypool.","journal-title":"Synthesis Lectures on Computer Architecture."},{"key":"e_1_3_1_7_2","volume-title":"Proceedings of the OpenFabrics Workshop","author":"Benton Brad","year":"2017","unstructured":"Brad Benton. 2017. CCIX, GEN-Z, OpenCAPI: Overview & comparison. In Proceedings of the OpenFabrics Workshop."},{"issue":"5","key":"e_1_3_1_8_2","doi-asserted-by":"crossref","first-page":"868","DOI":"10.1007\/s10439-023-03172-7","article-title":"Role of Chat GPT in public health","volume":"51","author":"Biswas Som S.","year":"2023","unstructured":"Som S. Biswas. 2023. Role of Chat GPT in public health. Annals of Biomedical Engineering 51, 5 (2023), 868\u2013869.","journal-title":"Annals of Biomedical Engineering"},{"key":"e_1_3_1_9_2","volume-title":"Proceedings of the USENIX Summer 1994 Technical Conference","volume":"16","year":"1994","unstructured":"Jeff Bonwick. 1994. The slab allocator: An object-caching kernel memory allocator. In Proceedings of the USENIX Summer 1994 Technical Conference, Vol. 16. 1\u201312."},{"issue":"11","key":"e_1_3_1_10_2","doi-asserted-by":"crossref","first-page":"1604","DOI":"10.14778\/3236187.3236209","article-title":"Efficient distributed memory management with RDMA and caching","volume":"11","author":"Cai Qingchao","year":"2018","unstructured":"Qingchao Cai, Wentian Guo, Hao Zhang, Divyakant Agrawal, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, Yong Meng Teo, and Sheng Wang. 2018. Efficient distributed memory management with RDMA and caching. Proceedings of the VLDB Endowment 11, 11 (2018), 1604\u20131617.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_3_1_11_2","first-page":"2","volume-title":"Proceedings of the 18th ACM SIGPLAN\/SIGOPS International Conference on Virtual Execution Environments","author":"Calciu Irina","year":"2021","unstructured":"Irina Calciu, M. 
Talha Imran, Ivan Puddu, Sanidhya Kashyap, and Zviad Metreveli. 2021. Rethinking software runtimes for disaggregated memory. In Proceedings of the 18th ACM SIGPLAN\/SIGOPS International Conference on Virtual Execution Environments. 2\u201316."},{"key":"e_1_3_1_12_2","doi-asserted-by":"crossref","first-page":"2477","DOI":"10.1145\/3448016.3457560","volume-title":"Proceedings of the 2021 International Conference on Management of Data","author":"Cao Wei","year":"2021","unstructured":"Wei Cao, Yingqiang Zhang, Xinjun Yang, Feifei Li, Sheng Wang, Qingda Hu, Xuntao Cheng, Zongzhi Chen, Zhenjun Liu, Jing Fang, et\u00a0al. 2021. PolarDB Serverless: A cloud native database for disaggregated data centers. In Proceedings of the 2021 International Conference on Management of Data. 2477\u20132489."},{"key":"e_1_3_1_13_2","volume-title":"Proceedings of the 18th USENIX Conference on File and Storage Technologies (FAST\u201920)","author":"Cao Zhichao","year":"2020","unstructured":"Zhichao Cao and Siying Dong. 2020. Characterizing, modeling, and benchmarking RocksDB key-value workloads at Facebook. In Proceedings of the 18th USENIX Conference on File and Storage Technologies (FAST\u201920)."},{"key":"e_1_3_1_14_2","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1109\/BigData.2018.8622518","volume-title":"Proceedings of the 2018 IEEE International Conference on Big Data (Big Data\u201918)","author":"Cheng Yue","year":"2018","unstructured":"Yue Cheng, Ali Anwar, and Xuejing Duan. 2018. Analyzing Alibaba\u2019s co-located datacenter workloads. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data\u201918). IEEE, Los Alamitos, CA, 292\u2013297."},{"key":"e_1_3_1_15_2","unstructured":"Adrian Cockcroft. 2023. Supercomputing Predictions: Custom CPUs CXL3.0 and Petalith Architectures. 
Retrieved December 8 2023 from https:\/\/adrianco.medium.com\/supercomputing-predictions-custom-cpus-cxl3-0-and-petalith-architectures-b67cc324588f\/"},{"key":"e_1_3_1_16_2","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1145\/1807128.1807152","volume-title":"Proceedings of the 1st ACM Symposium on Cloud Computing","author":"Cooper Brian F.","year":"2010","unstructured":"Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, and Russell Sears. 2010. Benchmarking cloud serving systems with YCSB. In Proceedings of the 1st ACM Symposium on Cloud Computing. 143\u2013154."},{"key":"e_1_3_1_17_2","unstructured":"Anritsu Corporation and KYOCERA Corporation. 2023. PCI Express\u00ae5.0 Optical Signal Transmission Test. Retrieved December 8 2023 from https:\/\/global.kyocera.com\/newsroom\/news\/2023\/000694.html"},{"key":"e_1_3_1_18_2","first-page":"401","volume-title":"Proceedings of the 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201914)","author":"Dragojevi\u0107 Aleksandar","year":"2014","unstructured":"Aleksandar Dragojevi\u0107, Dushyanth Narayanan, Miguel Castro, and Orion Hodson. 2014. FaRM: Fast remote memory. In Proceedings of the 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201914). 401\u2013414."},{"key":"e_1_3_1_19_2","first-page":"92","volume-title":"Proceedings of the 2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS\u201921)","author":"Duan Zhuohui","year":"2021","unstructured":"Zhuohui Duan, Haikun Liu, Haodi Lu, Xiaofei Liao, Hai Jin, Yu Zhang, and Bingsheng He. 2021. Gengar: An RDMA-based distributed hybrid memory pool. In Proceedings of the 2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS\u201921). 
IEEE, Los Alamitos, CA, 92\u2013103."},{"key":"e_1_3_1_20_2","doi-asserted-by":"crossref","first-page":"681","DOI":"10.1007\/s11023-020-09548-1","article-title":"GPT-3: Its nature, scope, limits, and consequences","volume":"30","author":"Floridi Luciano","year":"2020","unstructured":"Luciano Floridi and Massimo Chiriatti. 2020. GPT-3: Its nature, scope, limits, and consequences. Minds and Machines 30 (2020), 681\u2013694.","journal-title":"Minds and Machines"},{"key":"e_1_3_1_21_2","volume-title":"Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI\u201916)","author":"Gao Peter Xiang","year":"2016","unstructured":"Peter Xiang Gao, Akshay Narayan, Sagar Karandikar, Jo\u00e3o Carreira, Sangjin Han, Rachit Agarwal, Sylvia Ratnasamy, and Scott Shenker. 2016. Network requirements for resource disaggregation. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI\u201916). 249\u2013264. https:\/\/www.usenix.org\/conference\/osdi16\/technical-sessions\/presentation\/gao"},{"key":"e_1_3_1_22_2","first-page":"287","volume-title":"Proceedings of the 2022 USENIX Annual Technical Conference (USENIX ATC\u201922)","author":"Gouk Donghyun","year":"2022","unstructured":"Donghyun Gouk, Sangwon Lee, Miryeong Kwon, and Myoungsoo Jung. 2022. Direct access, high-performance memory disaggregation with DirectCXL. In Proceedings of the 2022 USENIX Annual Technical Conference (USENIX ATC\u201922). 287\u2013294."},{"key":"e_1_3_1_23_2","first-page":"649","volume-title":"Proceedings of the 14th USENIX Conference on Networked Systems Design and Implementation (NSDI\u201917)","author":"Gu Juncheng","year":"2017","unstructured":"Juncheng Gu, Youngmoon Lee, Yiwen Zhang, Mosharaf Chowdhury, and Kang G. Shin. 2017. Efficient memory disaggregation with INFINISWAP. In Proceedings of the 14th USENIX Conference on Networked Systems Design and Implementation (NSDI\u201917). 
649\u2013667."},{"key":"e_1_3_1_24_2","volume-title":"Proceedings of the USENIX Annual Technical Conference","volume":"8","author":"Hunt Patrick","year":"2010","unstructured":"Patrick Hunt, Mahadev Konar, Flavio Paiva Junqueira, and Benjamin Reed. 2010. ZooKeeper: Wait-free coordination for internet-scale systems. In Proceedings of the USENIX Annual Technical Conference, Vol. 8."},{"key":"e_1_3_1_25_2","first-page":"1","volume-title":"Proceedings of the 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201919)","author":"Kalia Anuj","year":"2019","unstructured":"Anuj Kalia, Michael Kaminsky, and David Andersen. 2019. Datacenter RPCs can be general and fast. In Proceedings of the 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201919). 1\u201316."},{"key":"e_1_3_1_26_2","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1145\/2619239.2626299","volume-title":"Proceedings of the 2014 ACM Conference on SIGCOMM","author":"Kalia Anuj","year":"2014","unstructured":"Anuj Kalia, Michael Kaminsky, and David G. Andersen. 2014. Using RDMA efficiently for key-value services. In Proceedings of the 2014 ACM Conference on SIGCOMM. 295\u2013306."},{"key":"e_1_3_1_27_2","volume-title":"Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201916)","author":"Kalia Anuj","year":"2016","unstructured":"Anuj Kalia, Michael Kaminsky, and David G. Andersen. 2016. FaSST: Fast, scalable and simple distributed transactions with two-sided (RDMA) datagram RPCs. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation (OSDI\u201916). 
185\u2013201."},{"key":"e_1_3_1_28_2","doi-asserted-by":"crossref","first-page":"690","DOI":"10.3850\/9783981537079_1014","volume-title":"Proceedings of the 2016 Design, Automation, and Test in Europe Conference and Exhibition (DATE\u201916)","author":"Katrinis Kostas","year":"2016","unstructured":"Kostas Katrinis, Dimitris Syrivelis, Dionisios Pnevmatikatos, Georgios Zervas, Dimitris Theodoropoulos, Iordanis Koutsopoulos, Kobi Hasharoni, Daniel Raho, Christian Pinto, F. Espina, et\u00a0al. 2016. Rack-scale disaggregated cloud data centers: The dReDBox project vision. In Proceedings of the 2016 Design, Automation, and Test in Europe Conference and Exhibition (DATE\u201916). IEEE, Los Alamitos, CA, 690\u2013695."},{"key":"e_1_3_1_29_2","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1109\/MICRO.2018.00021","volume-title":"Proceedings of the 2018 51st Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201918)","author":"Kwon Youngeun","year":"2018","unstructured":"Youngeun Kwon and Minsoo Rhu. 2018. Beyond the memory wall: A case for memory-centric HPC system for deep learning. In Proceedings of the 2018 51st Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO\u201918). IEEE, Los Alamitos, CA, 148\u2013161."},{"key":"e_1_3_1_30_2","first-page":"488","volume-title":"Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles","author":"Lee Seung-Seob","year":"2021","unstructured":"Seung-Seob Lee, Yanpeng Yu, Yupeng Tang, Anurag Khandelwal, Lin Zhong, and Abhishek Bhattacharjee. 2021. Mind: In-network memory management for disaggregated data centers. In Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles. 488\u2013504."},{"key":"e_1_3_1_31_2","first-page":"574","volume-title":"Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Vol. 
2","author":"Li Huaicheng","year":"2023","unstructured":"Huaicheng Li, Daniel S. Berger, Lisa Hsu, Daniel Ernst, Pantea Zardoshti, Stanko Novakovic, Monish Shah, Samir Rajadnya, Scott Lee, Ishwar Agarwal, et\u00a0al. 2023. Pond: CXL-based memory pooling systems for cloud platforms. In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Vol. 2. 574\u2013587."},{"key":"e_1_3_1_32_2","first-page":"653","volume-title":"Proceedings of the 2018 18th IEEE\/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGRID\u201918)","author":"Makrani Hosein Mohammadi","year":"2018","unstructured":"Hosein Mohammadi Makrani, Setareh Rafatirad, Amir Houmansadr, and Houman Homayoun. 2018. Main-memory requirements of big data applications on commodity server platform. In Proceedings of the 2018 18th IEEE\/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGRID\u201918). IEEE, Los Alamitos, CA, 653\u2013660."},{"key":"e_1_3_1_33_2","volume-title":"Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Vol. 3","author":"Maruf Hasan Al","year":"2023","unstructured":"Hasan Al Maruf, Hao Wang, Abhishek Dhanotia, Johannes Weiner, Niket Agarwal, Pallab Bhattacharya, Chris Petersen, Mosharaf Chowdhury, Shobhit Kanaujia, and Prakash Chauhan. 2023. TPP: Transparent page placement for CXL-enabled tiered-memory. In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Vol. 3. 742\u2013755."},{"key":"e_1_3_1_34_2","article-title":"Memtrade: A disaggregated-memory marketplace for public clouds","author":"Maruf Hasan Al","year":"2021","unstructured":"Hasan Al Maruf, Yuhong Zhong, Hongyi Wang, Mosharaf Chowdhury, Asaf Cidon, and Carl Waldspurger. 2021. Memtrade: A disaggregated-memory marketplace for public clouds. 
arXiv preprint arXiv:2108.06893 (2021).","journal-title":"arXiv preprint arXiv:2108.06893"},{"key":"e_1_3_1_35_2","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1177\/10943420231166608","article-title":"Myths and legends in high-performance computing","author":"Matsuoka Satoshi","year":"2023","unstructured":"Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, and Torsten Hoefler. 2023. Myths and legends in high-performance computing. International Journal of High Performance Computing Applications 37, 3-4 (2023), 245\u2013259.","journal-title":"International Journal of High Performance Computing Applications"},{"issue":"2","key":"e_1_3_1_36_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3514245","article-title":"A case for intra-rack resource disaggregation in HPC","volume":"19","author":"Michelogiannakis George","year":"2022","unstructured":"George Michelogiannakis, Benjamin Klenk, Brandon Cook, Min Yee Teh, Madeleine Glick, Larry Dennison, Keren Bergman, and John Shalf. 2022. A case for intra-rack resource disaggregation in HPC. ACM Transactions on Architecture and Code Optimization 19, 2 (2022), 1\u201326.","journal-title":"ACM Transactions on Architecture and Code Optimization"},{"key":"e_1_3_1_37_2","doi-asserted-by":"crossref","first-page":"212","DOI":"10.1145\/3477132.3483576","volume-title":"Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles","author":"Monga Sumit Kumar","year":"2021","unstructured":"Sumit Kumar Monga, Sanidhya Kashyap, and Changwoo Min. 2021. Birds of a feather flock together: Scaling RDMA RPCs with Flock. In Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles. 
212\u2013227."},{"key":"e_1_3_1_38_2","first-page":"183","volume-title":"Proceedings of the 2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD\u201920)","author":"Peng Ivy","year":"2020","unstructured":"Ivy Peng, Roger Pearce, and Maya Gokhale. 2020. On the memory underutilization: Exploring disaggregated memory on HPC systems. In Proceedings of the 2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD\u201920). IEEE, Los Alamitos, CA, 183\u2013190."},{"key":"e_1_3_1_39_2","unstructured":"The Next Platform. 2022. Just How Bad Is CXL Memory Latency? Retrieved December 8 2023 from https:\/\/www.nextplatform.com\/2022\/12\/05\/just-how-bad-is-cxl-memory-latency\/"},{"key":"e_1_3_1_40_2","doi-asserted-by":"crossref","first-page":"392","DOI":"10.1145\/3477132.3483550","volume-title":"Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles","author":"Raybuck Amanda","year":"2021","unstructured":"Amanda Raybuck, Tim Stamler, Wei Zhang, Mattan Erez, and Simon Peter. 2021. HeMem: Scalable tiered memory management for big data applications and real NVM. In Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles. 392\u2013407."},{"key":"e_1_3_1_41_2","first-page":"1","volume-title":"Proceedings of the 3rd ACM Symposium on Cloud Computing","author":"Reiss Charles","year":"2012","unstructured":"Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, and Michael A. Kozuch. 2012. Heterogeneity and dynamicity of clouds at scale: Google trace analysis. In Proceedings of the 3rd ACM Symposium on Cloud Computing. 1\u201313."},{"key":"e_1_3_1_42_2","first-page":"315","volume-title":"Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation","author":"Ruan Zhenyuan","year":"2020","unstructured":"Zhenyuan Ruan, Malte Schwarzkopf, Marcos K. Aguilera, and Adam Belay. 2020. 
AIFM: High-performance, application-integrated far memory. In Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation. 315\u2013332."},{"key":"e_1_3_1_43_2","unstructured":"Rick Salmonson Troy Oxby Larry Briski Robert Normand Russell Stacy and Jeffrey Glanzman. 2019. PCIe Riser Extension Assembly. Technical Disclosure Commons (January 11 2019). https:\/\/www.tdcommons.org\/dpubs_series\/1878"},{"key":"e_1_3_1_44_2","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1145\/3299869.3300069","volume-title":"Proceedings of the 2019 International Conference on Management of Data","author":"Shamis Alex","year":"2019","unstructured":"Alex Shamis, Matthew Renzelmann, Stanko Novakovic, Georgios Chatzopoulos, Aleksandar Dragojevi\u0107, Dushyanth Narayanan, and Miguel Castro. 2019. Fast general distributed transactions with opacity. In Proceedings of the 2019 International Conference on Management of Data. 433\u2013448."},{"key":"e_1_3_1_45_2","first-page":"69","volume-title":"Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI\u201918)","author":"Shan Yizhou","year":"2018","unstructured":"Yizhou Shan, Yutong Huang, Yilun Chen, and Yiying Zhang. 2018. LegoOS: A disseminated, distributed OS for hardware resource disaggregation. In Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI\u201918). 69\u201387."},{"key":"e_1_3_1_46_2","unstructured":"Debendra Das Sharma and Ishwar Agarwal. 2022. Compute Express Link. Retrieved December 8 2023 from https:\/\/www.computeexpresslink.org\/_files\/ugd\/0c1418_a8713008916044ae9604405d10a7773b.pdf\/"},{"key":"e_1_3_1_47_2","unstructured":"Navin Shenoy. 2023. A Milestone in Moving Data. 
Retrieved December 8 2023 from https:\/\/www.intel.com\/content\/www\/us\/en\/newsroom\/home.html"},{"key":"e_1_3_1_48_2","volume-title":"Proceedings of the 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201919)","author":"Shrivastav Vishal","year":"2019","unstructured":"Vishal Shrivastav, Asaf Valadarsky, Hitesh Ballani, Paolo Costa, Ki Suh Lee, Han Wang, Rachit Agarwal, and Hakim Weatherspoon. 2019. Shoal: A network architecture for disaggregated racks. In Proceedings of the 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201919). 255\u2013270. https:\/\/www.usenix.org\/conference\/nsdi19\/presentation\/shrivastav"},{"key":"e_1_3_1_49_2","unstructured":"Intel. 2019. Intel\u00ae Rack Scale Design (Intel\u00ae RSD) Storage Services. API Specification. Intel."},{"key":"e_1_3_1_50_2","article-title":"Demystifying CXL memory with genuine CXL-ready systems and devices","author":"Sun Yan","year":"2023","unstructured":"Yan Sun, Yifan Yuan, Zeduo Yu, Reese Kuper, Ipoom Jeong, Ren Wang, and Nam Sung Kim. 2023. Demystifying CXL memory with genuine CXL-ready systems and devices. arXiv preprint arXiv:2303.15375 (2023).","journal-title":"arXiv preprint arXiv:2303.15375"},{"key":"e_1_3_1_51_2","unstructured":"Torvalds. 2023. Linux Kernel Source Tree. Retrieved December 8 2023 from https:\/\/github.com\/torvalds\/linux\/blob\/master\/lib\/kfifo.c"},{"key":"e_1_3_1_52_2","first-page":"33","volume-title":"Proceedings of the 2020 USENIX Annual Technical Conference","author":"Tsai Shin-Yeh","year":"2020","unstructured":"Shin-Yeh Tsai, Yizhou Shan, and Yiying Zhang. 2020. Disaggregating persistent memory and controlling them remotely: An exploration of passive disaggregated key-value stores. In Proceedings of the 2020 USENIX Annual Technical Conference. 
33\u201348."},{"key":"e_1_3_1_53_2","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1109\/HOTI.2019.00017","volume-title":"Proceedings of the 2019 IEEE Symposium on High-Performance Interconnects (HOTI\u201919)","author":"Doren Stephen Van","year":"2019","unstructured":"Stephen Van Doren. 2019. HOTI 2019: Compute express link. In Proceedings of the 2019 IEEE Symposium on High-Performance Interconnects (HOTI\u201919). IEEE, Los Alamitos, CA, 18\u201318."},{"key":"e_1_3_1_54_2","doi-asserted-by":"crossref","first-page":"1041","DOI":"10.1145\/3035918.3056101","volume-title":"Proceedings of the 2017 ACM International Conference on Management of Data","author":"Verbitski Alexandre","year":"2017","unstructured":"Alexandre Verbitski, Anurag Gupta, Debanjan Saha, Murali Brahmadesam, Kamal Gupta, Raman Mittal, Sailesh Krishnamurthy, Sandor Maurice, Tengiz Kharatishvili, and Xiaofeng Bao. 2017. Amazon Aurora: Design considerations for high throughput cloud-native relational databases. In Proceedings of the 2017 ACM International Conference on Management of Data. 1041\u20131052."},{"key":"e_1_3_1_55_2","first-page":"449","volume-title":"Proceedings of the 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201920)","author":"Vuppalapati Midhul","year":"2020","unstructured":"Midhul Vuppalapati, Justin Miron, Rachit Agarwal, Dan Truong, Ashish Motivala, and Thierry Cruanes. 2020. Building an elastic query engine on disaggregated storage. In Proceedings of the 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI\u201920). 449\u2013462."},{"key":"e_1_3_1_56_2","article-title":"Evaluating emerging CXL-enabled memory pooling for HPC systems","author":"Wahlgren Jacob","year":"2022","unstructured":"Jacob Wahlgren, Maya Gokhale, and Ivy B. Peng. 2022. Evaluating emerging CXL-enabled memory pooling for HPC systems. 
arXiv preprint arXiv:2211.02682 (2022).","journal-title":"arXiv preprint arXiv:2211.02682"},{"key":"e_1_3_1_57_2","first-page":"75","volume-title":"Proceedings of the 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM\u201923)","author":"Wang Chenjiu","year":"2023","unstructured":"Chenjiu Wang, Ke He, Ruiqi Fan, Xiaonan Wang, Wei Wang, and Qinfen Hao. 2023. CXL over Ethernet: A novel FPGA-based memory disaggregation design in data centers. In Proceedings of the 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM\u201923). IEEE, Los Alamitos, CA, 75\u201382."},{"key":"e_1_3_1_58_2","first-page":"117","volume-title":"Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation","author":"Wei Xingda","year":"2020","unstructured":"Xingda Wei, Rong Chen, and Haibo Chen. 2020. Fast RDMA-based ordered key-value store using remote learned cache. In Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation. 117\u2013135."},{"key":"e_1_3_1_59_2","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1109\/MASS.2003.1194851","volume-title":"Proceedings of the 2003 20th IEEE\/11th NASA Goddard Conference on Mass Storage Systems and Technologies (MSST\u201903)","author":"Xin Qin","year":"2003","unstructured":"Qin Xin, Ethan L. Miller, Thomas Schwarz, Darrell D. E. Long, Scott A. Brandt, and Witold Litwin. 2003. Reliability mechanisms for very large storage systems. In Proceedings of the 2003 20th IEEE\/11th NASA Goddard Conference on Mass Storage Systems and Technologies (MSST\u201903). IEEE, Los Alamitos, CA, 146\u2013156."},{"key":"e_1_3_1_60_2","first-page":"191","volume-title":"Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation","author":"Yang Juncheng","year":"2020","unstructured":"Juncheng Yang, Yao Yue, and K. V. Rashmi. 2020. 
A large scale analysis of hundreds of in-memory cache clusters at Twitter. In Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation. 191\u2013208."},{"key":"e_1_3_1_61_2","first-page":"1","volume-title":"Proceedings of the 2022 IEEE International Conference on Networking, Architecture, and Storage (NAS\u201922)","author":"Yang Qirui","year":"2022","unstructured":"Qirui Yang, Runyu Jin, Bridget Davis, Devasena Inupakutika, and Ming Zhao. 2022. Performance evaluation on CXL-enabled hybrid memory pool. In Proceedings of the 2022 IEEE International Conference on Networking, Architecture, and Storage (NAS\u201922). IEEE, Los Alamitos, CA, 1\u20135."},{"key":"e_1_3_1_62_2","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1109\/HPCA56546.2023.10071127","volume-title":"Proceedings of the 2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA\u201923)","author":"Yuan Yifan","year":"2023","unstructured":"Yifan Yuan, Jinghan Huang, Yan Sun, Tianchen Wang, Jacob Nelson, Dan R. K. Ports, Yipeng Wang, Ren Wang, Charlie Tai, and Nam Sung Kim. 2023. RAMBDA: RDMA-driven acceleration framework for memory-intensive \\(\\mu\\) s-scale datacenter applications. In Proceedings of the 2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA\u201923). IEEE, Los Alamitos, CA, 499\u2013515."},{"key":"e_1_3_1_63_2","article-title":"The end of a myth: Distributed transactions can scale","volume":"1607","author":"Zamanian Erfan","year":"2016","unstructured":"Erfan Zamanian, Carsten Binnig, Tim Kraska, and Tim Harris. 2016. The end of a myth: Distributed transactions can scale. CoRR abs\/1607.00655 (2016). 
http:\/\/arxiv.org\/abs\/1607.00655","journal-title":"CoRR"},{"key":"e_1_3_1_64_2","first-page":"51","volume-title":"Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST\u201922)","author":"Zhang Ming","year":"2022","unstructured":"Ming Zhang, Yu Hua, Pengfei Zuo, and Lurong Liu. 2022. FORD: Fast one-sided RDMA-based distributed transactions for disaggregated persistent memory. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST\u201922). 51\u201368."},{"key":"e_1_3_1_65_2","article-title":"Sherman: A write-optimized distributed B+Tree index on disaggregated memory","author":"Zhang Yifan","year":"2021","unstructured":"Yifan Zhang, Zhihao Liang, Jianguo Wang, and Stratos Idreos. 2021. Sherman: A write-optimized distributed B+Tree index on disaggregated memory. arXiv preprint arXiv:2112.07320 (2021).","journal-title":"arXiv preprint arXiv:2112.07320"},{"issue":"10","key":"e_1_3_1_66_2","doi-asserted-by":"crossref","first-page":"1900","DOI":"10.14778\/3467861.3467877","article-title":"Towards cost-effective and elastic cloud database deployment via memory disaggregation","volume":"14","author":"Zhang Yingqiang","year":"2021","unstructured":"Yingqiang Zhang, Chaoyi Ruan, Cheng Li, Xinjun Yang, Wei Cao, Feifei Li, Bo Wang, Jing Fang, Yuhui Wang, Jingze Huo, et\u00a0al. 2021. Towards cost-effective and elastic cloud database deployment via memory disaggregation. Proceedings of the VLDB Endowment 14, 10 (2021), 1900\u20131912.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_3_1_67_2","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1145\/3514221.3526187","volume-title":"Proceedings of the 2022 International Conference on Management of Data","author":"Ziegler Tobias","year":"2022","unstructured":"Tobias Ziegler, Carsten Binnig, and Viktor Leis. 2022. ScaleStore: A fast and cost-efficient storage engine using DRAM, NVMe, and RDMA. 
In Proceedings of the 2022 International Conference on Management of Data. 685\u2013699."},{"key":"e_1_3_1_68_2","first-page":"15","volume-title":"Proceedings of the USENIX Annual Technical Conference","author":"Zuo Pengfei","year":"2021","unstructured":"Pengfei Zuo, Jiazhao Sun, Liu Yang, Shuangwu Zhang, and Yu Hua. 2021. One-sided RDMA-conscious extendible hashing for disaggregated memory. In Proceedings of the USENIX Annual Technical Conference. 15\u201329."}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3634916","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3634916","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:51:07Z","timestamp":1750287067000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3634916"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,19]]},"references-count":67,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,3,31]]}},"alternative-id":["10.1145\/3634916"],"URL":"https:\/\/doi.org\/10.1145\/3634916","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"value":"1544-3566","type":"print"},{"value":"1544-3973","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,19]]},"assertion":[{"value":"2023-07-09","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-11-26","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-01-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}