{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,12]],"date-time":"2026-02-12T10:29:48Z","timestamp":1770892188096,"version":"3.50.1"},"reference-count":82,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2023,6,26]],"date-time":"2023-06-26T00:00:00Z","timestamp":1687737600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGOPS Oper. Syst. Rev."],"published-print":{"date-parts":[[2023,6,26]]},"abstract":"<jats:p>Disaggregated memory provides many cost savings and resource provisioning benefits for current datacenters, but software systems enabling disaggregated memory access result in high performance penalties. These systems require intrusive code changes to port applications for disaggregated memory or employ slow virtual memory mechanisms to avoid code changes. Such mechanisms result in high overhead page faults to access remote data and high dirty data amplification when tracking changes to cached data at page-granularity. In this paper, we propose a fundamentally new approach for disaggregated memory systems, based on the observation that we can use local cache coherence to track applications' memory accesses transparently, without code changes, at cache-line granularity. This simple idea (1) eliminates page faults from the application critical path when accessing remote data, and (2) decouples the application memory access tracking from the virtual memory page size, enabling cache-line granularity dirty data tracking and eviction. Using this observation, we implemented a new software runtime for disaggregated memory that improves average memory access time and reduces dirty data amplification1.<\/jats:p>","DOI":"10.1145\/3606557.3606561","type":"journal-article","created":{"date-parts":[[2023,6,28]],"date-time":"2023-06-28T16:25:16Z","timestamp":1687969516000},"page":"21-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Using Local Cache Coherence for Disaggregated Memory Systems"],"prefix":"10.1145","volume":"57","author":[{"given":"Irina","family":"Calciu","sequence":"first","affiliation":[{"name":"Graft"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"M. Talha","family":"Imran","sequence":"additional","affiliation":[{"name":"Google"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ivan","family":"Puddu","sequence":"additional","affiliation":[{"name":"ETH Z\u00fcrich"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sanidhya","family":"Kashyap","sequence":"additional","affiliation":[{"name":"EPFL"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hasan","family":"Al Maruf","sequence":"additional","affiliation":[{"name":"University of Michigan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Onur","family":"Mutlu","sequence":"additional","affiliation":[{"name":"ETH Z\u00fcrich"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Aasheesh","family":"Kolli","sequence":"additional","affiliation":[{"name":"Google"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,6,28]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Balance LRU lists based on relative thrashing. https:\/\/lwn.net\/Articles\/690069\/.  Balance LRU lists based on relative thrashing. https:\/\/lwn.net\/Articles\/690069\/."},{"key":"e_1_2_1_2_1","unstructured":"CCIX. https:\/\/www.ccixconsortium.com.  CCIX. https:\/\/www.ccixconsortium.com."},{"key":"e_1_2_1_3_1","unstructured":"Enzian a research computer built by the Systems Group at ETH Z\u00fcrich. http: \/\/www.enzian.systems\/index.html.  Enzian a research computer built by the Systems Group at ETH Z\u00fcrich. http: \/\/www.enzian.systems\/index.html."},{"key":"e_1_2_1_4_1","unstructured":"memtier benchmark: A high-throughput benchmarking tool for redis and memcached. https:\/\/redislabs.com\/blog\/memtier_benchmark-a-high-throughputbenchmarking- tool-for-redis-memcached\/.  memtier benchmark: A high-throughput benchmarking tool for redis and memcached. https:\/\/redislabs.com\/blog\/memtier_benchmark-a-high-throughputbenchmarking- tool-for-redis-memcached\/."},{"key":"e_1_2_1_5_1","unstructured":"Pin - a dynamic binary instrumentation tool. https:\/\/software.intel.com\/enus\/ articles\/pin-a-dynamic-binary-instrumentation-tool.  Pin - a dynamic binary instrumentation tool. https:\/\/software.intel.com\/enus\/ articles\/pin-a-dynamic-binary-instrumentation-tool."},{"key":"e_1_2_1_6_1","unstructured":"Reconsidering swapping. https:\/\/lwn.net\/Articles\/690079\/.  Reconsidering swapping. https:\/\/lwn.net\/Articles\/690079\/."},{"key":"e_1_2_1_7_1","unstructured":"Redis: open-source in-memory data structure store. https:\/\/redis.io.  Redis: open-source in-memory data structure store. https:\/\/redis.io."},{"key":"e_1_2_1_8_1","unstructured":"VOLTDB. https:\/\/www.voltdb.com\/.  VOLTDB. https:\/\/www.voltdb.com\/."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3317550.3321434"},{"key":"e_1_2_1_10_1","volume-title":"USENIX Annual Technical Conference (ATC)","author":"Aguilera Marcos K.","year":"2018","unstructured":"Marcos K. Aguilera , Nadav Amit , Irina Calciu , Xavier Deguillard , Jayneel Gandhi , Stanko Novakovic , Arun Ramanathan , Pratap Subrahmanyam , Lalith Suresh , Kiran Tati , Rajesh Venkatasubramanian , and Michael Wei . Remote regions: a simple abstraction for remote memory . In USENIX Annual Technical Conference (ATC) , 2018 . Marcos K. Aguilera, Nadav Amit, Irina Calciu, Xavier Deguillard, Jayneel Gandhi, Stanko Novakovic, Arun Ramanathan, Pratap Subrahmanyam, Lalith Suresh, Kiran Tati, Rajesh Venkatasubramanian, and Michael Wei. Remote regions: a simple abstraction for remote memory. In USENIX Annual Technical Conference (ATC), 2018."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3127479.3131612"},{"key":"e_1_2_1_12_1","volume-title":"Shouji: a fast and efficient pre-alignment filter for sequence alignment. Bioinformatics, 35(21)","author":"Alser Mohammed","year":"2019","unstructured":"Mohammed Alser , Hasan Hassan , Akash Kumar , Onur Mutlu , and Can Alkan . Shouji: a fast and efficient pre-alignment filter for sequence alignment. Bioinformatics, 35(21) , 2019 . Mohammed Alser, Hasan Hassan, Akash Kumar, Onur Mutlu, and Can Alkan. Shouji: a fast and efficient pre-alignment filter for sequence alignment. Bioinformatics, 35(21), 2019."},{"key":"e_1_2_1_13_1","volume-title":"GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping. Bioinformatics, 33(21)","author":"Alser Mohammed","year":"2017","unstructured":"Mohammed Alser , Hasan Hassan , Hongyi Xin , O?uz Ergin, Onur Mutlu , and Can Alkan . GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping. Bioinformatics, 33(21) , 2017 . Mohammed Alser, Hasan Hassan, Hongyi Xin, O?uz Ergin, Onur Mutlu, and Can Alkan. GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping. Bioinformatics, 33(21), 2017."},{"key":"e_1_2_1_14_1","volume-title":"GPUs and FPGAs. Bioinformatics","author":"Alser Mohammed","year":"2020","unstructured":"Mohammed Alser , Taha Shahroodi , Juan G\u00f3mez-Luna , Can Alkan , and Onur Mutlu . SneakySnake : a fast and accurate universal genome pre-alignment filter for CPUs , GPUs and FPGAs. Bioinformatics , 2020 . Mohammed Alser, Taha Shahroodi, Juan G\u00f3mez-Luna, Can Alkan, and Onur Mutlu. SneakySnake: a fast and accurate universal genome pre-alignment filter for CPUs, GPUs and FPGAs. Bioinformatics, 2020."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3342195.3387522"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.485843"},{"key":"e_1_2_1_17_1","unstructured":"Apple. How We Ported Linux to the M1. https:\/\/corellium.com\/blog\/linux-m1.  Apple. How We Ported Linux to the M1. https:\/\/corellium.com\/blog\/linux-m1."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2007.443"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/99163.99182"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037705"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2024716.2024718"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTCHIPS.2013.7478311"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2151024.2151043"},{"key":"e_1_2_1_24_1","volume-title":"Onur Mutlu, and Aasheesh Kolli. Rethinking Software Runtimes for Disaggregated Memory","author":"Calciu Irina","year":"2021","unstructured":"Irina Calciu , Talha Imran , Ivan Puddu , Sanidhya Kashyap , Hasan Al Maruf , Onur Mutlu, and Aasheesh Kolli. Rethinking Software Runtimes for Disaggregated Memory , February 2021 . https:\/\/github.com\/project- kona\/asplos21-ae. Irina Calciu, Talha Imran, Ivan Puddu, Sanidhya Kashyap, Hasan Al Maruf, Onur Mutlu, and Aasheesh Kolli. Rethinking Software Runtimes for Disaggregated Memory, February 2021. https:\/\/github.com\/project- kona\/asplos21-ae."},{"key":"e_1_2_1_25_1","volume-title":"Pratap Subrahmanyam. Project PBerry: FPGA Acceleration for Remote Memory. In Workshop on Hot Topics in Operating Systems (HotOS)","author":"Calciu Irina","year":"2019","unstructured":"Irina Calciu , Ivan Puddu , Aasheesh Kolli , Andreas Nowatzyk , Jayneel Gandhi , Onur Mutlu , and Pratap Subrahmanyam. Project PBerry: FPGA Acceleration for Remote Memory. In Workshop on Hot Topics in Operating Systems (HotOS) , 2019 . Irina Calciu, Ivan Puddu, Aasheesh Kolli, Andreas Nowatzyk, Jayneel Gandhi, Onur Mutlu, and Pratap Subrahmanyam. Project PBerry: FPGA Acceleration for Remote Memory. In Workshop on Hot Topics in Operating Systems (HotOS), 2019."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037721"},{"key":"e_1_2_1_27_1","volume-title":"Doug Burger. A Cloud-Scale Acceleration Architecture. In International Symposium on Microarchitecture (MICRO)","author":"Caulfield Adrian","year":"2016","unstructured":"Adrian Caulfield , Eric Chung , Andrew Putnam , Hari Angepat , Jeremy Fowers , Michael Haselman , Stephen Heil , Matt Humphrey , Puneet Kaur , Joo-Young Kim , Daniel Lo , Todd Massengill , Kalin Ovtcharov , Michael Papamichael , Lisa Woods , Sitaram Lanka , Derek Chiou , and Doug Burger. A Cloud-Scale Acceleration Architecture. In International Symposium on Microarchitecture (MICRO) , 2016 . Adrian Caulfield, Eric Chung, Andrew Putnam, Hari Angepat, Jeremy Fowers, Michael Haselman, Stephen Heil, Matt Humphrey, Puneet Kaur, Joo-Young Kim, Daniel Lo, Todd Massengill, Kalin Ovtcharov, Michael Papamichael, Lisa Woods, Sitaram Lanka, Derek Chiou, and Doug Burger. A Cloud-Scale Acceleration Architecture. In International Symposium on Microarchitecture (MICRO), 2016."},{"key":"e_1_2_1_28_1","volume-title":"The Convey HC-2 Computer. Architectural Overview. https:\/\/www.micron.com\/~\/media\/documents\/products\/whitepaper\/ wp_convey_hc2_architectual_overview.pdf","author":"Computer Convey","year":"2012","unstructured":"Convey Computer . The Convey HC-2 Computer. Architectural Overview. https:\/\/www.micron.com\/~\/media\/documents\/products\/whitepaper\/ wp_convey_hc2_architectual_overview.pdf , 2012 . Convey Computer. The Convey HC-2 Computer. Architectural Overview. https:\/\/www.micron.com\/~\/media\/documents\/products\/whitepaper\/ wp_convey_hc2_architectual_overview.pdf, 2012."},{"key":"e_1_2_1_29_1","volume-title":"Symposium on Networked Systems Design and Implementation (NSDI)","author":"Aleksandar","year":"2014","unstructured":"Aleksandar Dragojevi?, Dushyanth Narayanan , Miguel Castro , and Orion Hodson . FaRM : Fast remote memory . In Symposium on Networked Systems Design and Implementation (NSDI) , April 2014 . Aleksandar Dragojevi?, Dushyanth Narayanan, Miguel Castro, and Orion Hodson. FaRM: Fast remote memory. In Symposium on Networked Systems Design and Implementation (NSDI), April 2014."},{"key":"e_1_2_1_30_1","volume-title":"ACM Symposium on Operating Systems Principles (SOSP)","author":"Aleksandar","year":"2015","unstructured":"Aleksandar Dragojevi?, Dushyanth Narayanan , Ed Nightingale , Matthew Renzelmann , Alex Shamis , Anirudh Badam , and Miguel Castro . No compromises: distributed transactions with consistency, availability, and performance . In ACM Symposium on Operating Systems Principles (SOSP) , October 2015 . Aleksandar Dragojevi?, Dushyanth Narayanan, Ed Nightingale, Matthew Renzelmann, Alex Shamis, Anirudh Badam, and Miguel Castro. No compromises: distributed transactions with consistency, availability, and performance. In ACM Symposium on Operating Systems Principles (SOSP), October 2015."},{"key":"e_1_2_1_31_1","volume-title":"USENIX Annual Technical Conference (ATC)","author":"Duplyakin Dmitry","year":"2019","unstructured":"Dmitry Duplyakin , Robert Ricci , Aleksander Maricq , Gary Wong , Jonathon Duerig , Eric Eide , Leigh Stoller , Mike Hibler , David Johnson , Kirk Webb , Aditya Akella , Kuangching Wang , Glenn Ricart , Larry Landweber , Chip Elliott , Michael Zink , Emmanuel Cecchet , Snigdhaswin Kar , and Prabodh Mishra . The design and operation of CloudLab . In USENIX Annual Technical Conference (ATC) , 2019 . Dmitry Duplyakin, Robert Ricci, Aleksander Maricq, Gary Wong, Jonathon Duerig, Eric Eide, Leigh Stoller, Mike Hibler, David Johnson, Kirk Webb, Aditya Akella, Kuangching Wang, Glenn Ricart, Larry Landweber, Chip Elliott, Michael Zink, Emmanuel Cecchet, Snigdhaswin Kar, and Prabodh Mishra. The design and operation of CloudLab. In USENIX Annual Technical Conference (ATC), 2019."},{"key":"e_1_2_1_32_1","volume-title":"Symposium on Operating Systems Design and Implementation (OSDI)","author":"Gao Peter X.","year":"2016","unstructured":"Peter X. Gao , Akshay Narayan , Sagar Karandikar , Joao Carreira , Sangjin Han , Rachit Agarwal , Sylvia Ratnasamy , and Scott Shenker . Network requirements for resource disaggregation . In Symposium on Operating Systems Design and Implementation (OSDI) , October 2016 . Peter X. Gao, Akshay Narayan, Sagar Karandikar, Joao Carreira, Sangjin Han, Rachit Agarwal, Sylvia Ratnasamy, and Scott Shenker. Network requirements for resource disaggregation. In Symposium on Operating Systems Design and Implementation (OSDI), October 2016."},{"key":"e_1_2_1_33_1","unstructured":"Gen-Z draft core specification-december 2016. http:\/\/genzconsortium.org\/draftcore- specification-december-2016.  Gen-Z draft core specification-december 2016. http:\/\/genzconsortium.org\/draftcore- specification-december-2016."},{"key":"e_1_2_1_34_1","article-title":"NetFPGA: An open platform for teaching how to build Gigabit-rate network switches and routers","author":"Gibb G.","year":"2008","unstructured":"G. Gibb , J. W. Lockwood , J. Naous , P. Hartke , and N. McKeown . NetFPGA: An open platform for teaching how to build Gigabit-rate network switches and routers . IEEE Transactions on Education , 2008 . G. Gibb, J. W. Lockwood, J. Naous, P. Hartke, and N. McKeown. NetFPGA: An open platform for teaching how to build Gigabit-rate network switches and routers. IEEE Transactions on Education, 2008.","journal-title":"IEEE Transactions on Education"},{"key":"e_1_2_1_35_1","volume-title":"Automation & Test in Europe (DATE)","author":"Giefers Heiner","year":"2015","unstructured":"Heiner Giefers , Raphael Polig , and Christoph Hagleitner . Accelerating Arithmetic Kernels with Coherent Attached FPGA Coprocessors. In Design , Automation & Test in Europe (DATE) , 2015 . Heiner Giefers, Raphael Polig, and Christoph Hagleitner. Accelerating Arithmetic Kernels with Coherent Attached FPGA Coprocessors. In Design, Automation & Test in Europe (DATE), 2015."},{"key":"e_1_2_1_36_1","volume-title":"Kang G Shin. Efficient Memory Disaggregation with Infiniswap. In Symposium on Networked Systems Design and Implementation (NSDI)","author":"Gu Juncheng","year":"2017","unstructured":"Juncheng Gu , Youngmoon Lee , Yiwen Zhang , Mosharaf Chowdhury , and Kang G Shin. Efficient Memory Disaggregation with Infiniswap. In Symposium on Networked Systems Design and Implementation (NSDI) , 2017 . Juncheng Gu, Youngmoon Lee, Yiwen Zhang, Mosharaf Chowdhury, and Kang G Shin. Efficient Memory Disaggregation with Infiniswap. In Symposium on Networked Systems Design and Implementation (NSDI), 2017."},{"key":"e_1_2_1_37_1","volume-title":"ACM","author":"Guo Chuanxiong","unstructured":"Chuanxiong Guo , Haitao Wu , Zhong Deng , Gaurav Soni , Jianxi Ye , Jitu Padhye , and Marina Lipshteyn . RDMA over Commodity Ethernet at Scale . In ACM Chuanxiong Guo, Haitao Wu, Zhong Deng, Gaurav Soni, Jianxi Ye, Jitu Padhye, and Marina Lipshteyn. RDMA over Commodity Ethernet at Scale. In ACM"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/FPL.2018.00069"},{"key":"e_1_2_1_39_1","unstructured":"Intel. Cachegrind. https:\/\/valgrind.org\/docs\/manual\/cg-manual.html.  Intel. Cachegrind. https:\/\/valgrind.org\/docs\/manual\/cg-manual.html."},{"key":"e_1_2_1_40_1","unstructured":"Intel. EPT-based Sub-Page Permissions. https:\/\/software.intel.com\/sites\/default\/ files\/managed\/c5\/15\/architecture-instruction-set-extensions-programmingreference. pdf.  Intel. EPT-based Sub-Page Permissions. https:\/\/software.intel.com\/sites\/default\/ files\/managed\/c5\/15\/architecture-instruction-set-extensions-programmingreference. pdf."},{"key":"e_1_2_1_41_1","unstructured":"Intel. Intel Xeon+FPGA Platform for the Data Center. http:\/\/ reconfigurablecomputing4themasses.net\/files\/2.2%20PK.pdf.  Intel. Intel Xeon+FPGA Platform for the Data Center. http:\/\/ reconfigurablecomputing4themasses.net\/files\/2.2%20PK.pdf."},{"key":"e_1_2_1_42_1","unstructured":"Intel. Page Modification Logging for Virtual Machine Monitor White Paper. https:\/\/www.intel.com\/content\/dam\/www\/public\/us\/en\/documents\/whitepapers\/ page-modification-logging-vmm-white-paper.pdf.  Intel. Page Modification Logging for Virtual Machine Monitor White Paper. https:\/\/www.intel.com\/content\/dam\/www\/public\/us\/en\/documents\/whitepapers\/ page-modification-logging-vmm-white-paper.pdf."},{"key":"e_1_2_1_43_1","volume-title":"Intel\u00ae 64 and IA-32 Architectures Software Developer's Manual","year":"2020","unstructured":"Intel. Intel\u00ae 64 and IA-32 Architectures Software Developer's Manual . November 2020 . Intel. Intel\u00ae 64 and IA-32 Architectures Software Developer's Manual. November 2020."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/512429.512445"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749246.2749250"},{"key":"e_1_2_1_46_1","volume-title":"Symposium on Operating Systems Design and Implementation (OSDI)","author":"Khawaja Ahmed","year":"2018","unstructured":"Ahmed Khawaja , Joshua Landgraf , Rohith Prakash , Michael Wei , Eric Schkufza , and Christopher J. Rossbach . Sharing, Protection, and Compatibility for Reconfigurable Fabric with AmorphOS . In Symposium on Operating Systems Design and Implementation (OSDI) , Carlsbad, CA , 2018 . Ahmed Khawaja, Joshua Landgraf, Rohith Prakash, Michael Wei, Eric Schkufza, and Christopher J. Rossbach. Sharing, Protection, and Compatibility for Reconfigurable Fabric with AmorphOS. In Symposium on Operating Systems Design and Implementation (OSDI), Carlsbad, CA, 2018."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/L-CA.2013.17"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/75104.75105"},{"key":"e_1_2_1_49_1","unstructured":"libibverbs. http:\/\/www.rdmamojo.com\/2012\/05\/18\/libibverbs.  libibverbs. http:\/\/www.rdmamojo.com\/2012\/05\/18\/libibverbs."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2012.6168955"},{"key":"e_1_2_1_51_1","volume-title":"Energy-efficient Platforms Using In-socket FPGA Accelerators. In International Symposium on Field Programmable Gate Arrays (FPGA)","author":"Ling Liu","year":"2009","unstructured":"Liu Ling , Neal Oliver , Chitlur Bhushan , Wang Qigang , Alvin Chen , Shen Wenbo , Yu Zhihong , Arthur Sheiman , Ian McCallum , Joseph Grecco , Henry Mitchel , Liu Dong , and Prabhat Gupta . High-performance , Energy-efficient Platforms Using In-socket FPGA Accelerators. In International Symposium on Field Programmable Gate Arrays (FPGA) , 2009 . Liu Ling, Neal Oliver, Chitlur Bhushan, Wang Qigang, Alvin Chen, Shen Wenbo, Yu Zhihong, Arthur Sheiman, Ian McCallum, Joseph Grecco, Henry Mitchel, Liu Dong, and Prabhat Gupta. High-performance, Energy-efficient Platforms Using In-socket FPGA Accelerators. In International Symposium on Field Programmable Gate Arrays (FPGA), 2009."},{"key":"e_1_2_1_52_1","volume-title":"Joseph Hellerstein. GraphLab: A New Framework for Parallel Machine Learning. In Conference on Uncertainty in Artificial Intelligence (UAI)","author":"Low Yucheng","year":"2010","unstructured":"Yucheng Low , Joseph Gonzalez , Aapo Kyrola , Danny Bickson , Carlos Guestrin , and Joseph Hellerstein. GraphLab: A New Framework for Parallel Machine Learning. In Conference on Uncertainty in Artificial Intelligence (UAI) , 2010 . Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, and Joseph Hellerstein. GraphLab: A New Framework for Parallel Machine Learning. In Conference on Uncertainty in Artificial Intelligence (UAI), 2010."},{"key":"e_1_2_1_53_1","volume-title":"Kim Hazelwood. Pin: Building Customized Program Analysis Tools with Dynamic Instrumentation. In International Conference on Programming Language Design and Implementation (PLDI)","author":"Luk Chi-Keung","year":"2005","unstructured":"Chi-Keung Luk , Robert Cohn , Robert Muth , Harish Patil , Artur Klauser , Geoff Lowney , Steven Wallace , Vijay Janapa Reddi , and Kim Hazelwood. Pin: Building Customized Program Analysis Tools with Dynamic Instrumentation. In International Conference on Programming Language Design and Implementation (PLDI) , 2005 . Chi-Keung Luk, Robert Cohn, Robert Muth, Harish Patil, Artur Klauser, Geoff Lowney, Steven Wallace, Vijay Janapa Reddi, and Kim Hazelwood. Pin: Building Customized Program Analysis Tools with Dynamic Instrumentation. In International Conference on Programming Language Design and Implementation (PLDI), 2005."},{"key":"e_1_2_1_54_1","volume-title":"Baris Kasikci. A Hypervisor for Shared-Memory FPGA Platforms. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)","author":"Ma Jiacheng","year":"2020","unstructured":"Jiacheng Ma , Gefei Zuo , Kevin Loughlin , Xiaohe Cheng , Yanqiang Liu , Abel Mulugeta Eneyew , Zhengwei Qi , and Baris Kasikci. A Hypervisor for Shared-Memory FPGA Platforms. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) , 2020 . Jiacheng Ma, Gefei Zuo, Kevin Loughlin, Xiaohe Cheng, Yanqiang Liu, Abel Mulugeta Eneyew, Zhengwei Qi, and Baris Kasikci. A Hypervisor for Shared-Memory FPGA Platforms. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2020."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2016.7446050"},{"key":"e_1_2_1_57_1","volume-title":"Al Maruf and Mosharaf Chowdhury. Effectively Prefetching Remote Memory with Leap. In USENIX Annual Technical Conference (ATC)","author":"Hasan","year":"2020","unstructured":"Hasan Al Maruf and Mosharaf Chowdhury. Effectively Prefetching Remote Memory with Leap. In USENIX Annual Technical Conference (ATC) , 2020 . Hasan Al Maruf and Mosharaf Chowdhury. Effectively Prefetching Remote Memory with Leap. In USENIX Annual Technical Conference (ATC), 2020."},{"key":"e_1_2_1_58_1","unstructured":"Mellanox. Mellanox Innova? IPsec 4 Lx Ethernet Adapter Card User Manual. http:\/\/www.mellanox.com\/related-docs\/prod_software\/Mellanox_Innova_ IPsec_4_Lx_Ethernet_Adapter_Card_User_Manual_rev_1_3.pdf.  Mellanox. Mellanox Innova? IPsec 4 Lx Ethernet Adapter Card User Manual. http:\/\/www.mellanox.com\/related-docs\/prod_software\/Mellanox_Innova_ IPsec_4_Lx_Ethernet_Adapter_Card_User_Manual_rev_1_3.pdf."},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3278532.3278566"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.2200\/S00962ED2V01Y201910CAC049"},{"key":"e_1_2_1_61_1","volume-title":"USENIX Annual Technical Conference (ATC)","author":"Nelson Jacob","year":"2015","unstructured":"Jacob Nelson , Brandon Holt , Brandon Myers , Preston Briggs , Luis Ceze , Simon Kahan , and Mark Oskin . Latency-tolerant software distributed shared memory . In USENIX Annual Technical Conference (ATC) , July 2015 . Jacob Nelson, Brandon Holt, Brandon Myers, Preston Briggs, Luis Ceze, Simon Kahan, and Mark Oskin. Latency-tolerant software distributed shared memory. In USENIX Annual Technical Conference (ATC), July 2015."},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/FCCM.2017.37"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358267"},{"key":"e_1_2_1_64_1","volume-title":"International Symposium on Computer Architecture (ISCA)","author":"Mark","year":"1984","unstructured":"Mark S. Papamarcos and Janak H. Patel. A low-overhead coherence solution for multiprocessors with private cache memories . In International Symposium on Computer Architecture (ISCA) , 1984 . Mark S. Papamarcos and Janak H. Patel. A low-overhead coherence solution for multiprocessors with private cache memories. In International Symposium on Computer Architecture (ISCA), 1984."},{"key":"e_1_2_1_65_1","volume-title":"Doug Burger. A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services. In International Symposium on Computer Architecture (ISCA)","author":"Putnam Andrew","year":"2014","unstructured":"Andrew Putnam , Adrian M. Caulfield , Eric S. Chung , Derek Chiou , Kypros Constantinides , John Demme , Hadi Esmaeilzadeh , Jeremy Fowers , Gopi Prashanth Gopal , Jan Gray , Michael Haselman , Scott Hauck , Stephen Heil , Amir Hormati , Joo-Young Kim , Sitaram Lanka , James Larus , Eric Peterson , Simon Pope , Aaron Smith , Jason Thong , Phillip Yi Xiao , and Doug Burger. A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services. In International Symposium on Computer Architecture (ISCA) , 2014 . Andrew Putnam, Adrian M. Caulfield, Eric S. Chung, Derek Chiou, Kypros Constantinides, John Demme, Hadi Esmaeilzadeh, Jeremy Fowers, Gopi Prashanth Gopal, Jan Gray, Michael Haselman, Scott Hauck, Stephen Heil, Amir Hormati, Joo-Young Kim, Sitaram Lanka, James Larus, Eric Peterson, Simon Pope, Aaron Smith, Jason Thong, Phillip Yi Xiao, and Doug Burger. A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services. In International Symposium on Computer Architecture (ISCA), 2014."},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1145\/2391229.2391236"},{"key":"e_1_2_1_67_1","volume-title":"Symposium on Operating Systems Design and Implementation (OSDI)","author":"Ruan Zhenyuan","year":"2020","unstructured":"Zhenyuan Ruan , Malte Schwarzkopf , Marcos K. Aguilera , and Adam Belay . AIFM : High-performance, application-integrated far memory . In Symposium on Operating Systems Design and Implementation (OSDI) , November 2020 . Zhenyuan Ruan, Malte Schwarzkopf, Marcos K. Aguilera, and Adam Belay. AIFM: High-performance, application-integrated far memory. In Symposium on Operating Systems Design and Implementation (OSDI), November 2020."},{"key":"e_1_2_1_68_1","volume-title":"Sanchez and Christos Kozyrakis. ZSim: Fast and Accurate Microarchitectural Simulation of Thousand-Core Systems. In International Symposium on Computer Architecture (ISCA)","author":"Daniel","year":"2013","unstructured":"Daniel Sanchez and Christos Kozyrakis. ZSim: Fast and Accurate Microarchitectural Simulation of Thousand-Core Systems. In International Symposium on Computer Architecture (ISCA) , 2013 . Daniel Sanchez and Christos Kozyrakis. ZSim: Fast and Accurate Microarchitectural Simulation of Thousand-Core Systems. In International Symposium on Computer Architecture (ISCA), 2013."},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/237090.237179"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1145\/195473.195575"},{"key":"e_1_2_1_71_1","volume-title":"Symposium on Operating Systems Design and Implementation (OSDI)","author":"Shan Yizhou","year":"2018","unstructured":"Yizhou Shan , Yutong Huang , Yilun Chen , and Yiying Zhang . LegoOS : A disseminated, distributed OS for hardware resource disaggregation . In Symposium on Operating Systems Design and Implementation (OSDI) , Carlsbad, CA , 2018 . Yizhou Shan, Yutong Huang, Yilun Chen, and Yiying Zhang. LegoOS: A disseminated, distributed OS for hardware resource disaggregation. In Symposium on Operating Systems Design and Implementation (OSDI), Carlsbad, CA, 2018."},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/3127479.3128610"},{"key":"e_1_2_1_73_1","volume-title":"International Symposium on Computer Architecture (ISCA)","author":"Shen Yongming","year":"2017","unstructured":"Yongming Shen , Michael Ferdman , and Peter Milder . Maximizing CNN accelerator efficiency through resource partitioning . In International Symposium on Computer Architecture (ISCA) , 2017 . Yongming Shen, Michael Ferdman, and Peter Milder. Maximizing CNN accelerator efficiency through resource partitioning. In International Symposium on Computer Architecture (ISCA), 2017."},{"key":"e_1_2_1_74_1","unstructured":"Navin Shenoy. A Milestone in Moving Data. https:\/\/newsroom.intel.com\/ editorials\/milestone-moving-data.  Navin Shenoy. A Milestone in Moving Data. https:\/\/newsroom.intel.com\/ editorials\/milestone-moving-data."},{"key":"e_1_2_1_75_1","volume-title":"International Conference on Management of Data (SIGMOD)","author":"Sidler David","year":"2017","unstructured":"David Sidler , Zsolt Istv\u00e1n , Muhsen Owaida , Kaan Kara , and Gustavo Alonso . doppioDB : A hardware accelerated database . In International Conference on Management of Data (SIGMOD) , 2017 . David Sidler, Zsolt Istv\u00e1n, Muhsen Owaida, Kaan Kara, and Gustavo Alonso. doppioDB: A hardware accelerated database. In International Conference on Management of Data (SIGMOD), 2017."},{"key":"e_1_2_1_76_1","volume-title":"International Conference on Field Programmable Logic and Applications (FPL)","author":"Singh Gagandeep","year":"2020","unstructured":"Gagandeep Singh , Dionysios Diamantopoulos , Christoph Hagleitner , Juan G\u00f3mez- Luna , Sander Stuijk , Onur Mutlu , and Henk Corp oraal. NERO : A near highbandwidth memory stencil accelerator for weather prediction modeling . In International Conference on Field Programmable Logic and Applications (FPL) , 2020 . Gagandeep Singh, Dionysios Diamantopoulos, Christoph Hagleitner, Juan G\u00f3mez- Luna, Sander Stuijk, Onur Mutlu, and Henk Corporaal. NERO: A near highbandwidth memory stencil accelerator for weather prediction modeling. In International Conference on Field Programmable Logic and Applications (FPL), 2020."},{"key":"e_1_2_1_77_1","unstructured":"Mario Smarduch. Enhanced Live Migration For Intensive Memory Loads. https:\/\/events.static.linuxfound.org\/sites\/events\/files\/slides\/CloudOpen- Japan-2015.pdf.  Mario Smarduch. Enhanced Live Migration For Intensive Memory Loads. https:\/\/events.static.linuxfound.org\/sites\/events\/files\/slides\/CloudOpen- Japan-2015.pdf."},{"key":"e_1_2_1_78_1","volume-title":"European Conference on Computer Systems (EuroSys)","author":"Tirmazi Muhammad","year":"2020","unstructured":"Muhammad Tirmazi , Adam Barker , Nan Deng , Md E. Haque , Zhijing Gene Qin , Steven Hand , Mor Harchol-Balter , and John Wilkes . Borg : The next generation . In European Conference on Computer Systems (EuroSys) , 2020 . Muhammad Tirmazi, Adam Barker, Nan Deng, Md E. Haque, Zhijing Gene Qin, Steven Hand, Mor Harchol-Balter, and John Wilkes. Borg: The next generation. In European Conference on Computer Systems (EuroSys), 2020."},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1145\/3132747.3132762"},{"key":"e_1_2_1_80_1","unstructured":"Userfaultfd. https:\/\/www.kernel.org\/doc\/Documentation\/vm\/userfaultfd.txt.  Userfaultfd. https:\/\/www.kernel.org\/doc\/Documentation\/vm\/userfaultfd.txt."},{"key":"e_1_2_1_81_1","first-page":"261","volume-title":"Symposium on Operating Systems Design and Implementation (OSDI)","author":"Wang Chenxi","year":"2020","unstructured":"Chenxi Wang , Haoran Ma , Shi Liu , Yuanqi Li , Zhenyuan Ruan , Khanh Nguyen , Michael D. Bond , Ravi Netravali , Miryung Kim , and Guoqing Harry Xu. Semeru : A memory-disaggregated managed runtime . In Symposium on Operating Systems Design and Implementation (OSDI) , pages 261 -- 280 , November 2020 . Chenxi Wang, Haoran Ma, Shi Liu, Yuanqi Li, Zhenyuan Ruan, Khanh Nguyen, Michael D. Bond, Ravi Netravali, Miryung Kim, and Guoqing Harry Xu. Semeru: A memory-disaggregated managed runtime. In Symposium on Operating Systems Design and Implementation (OSDI), pages 261--280, November 2020."},{"key":"e_1_2_1_82_1","volume-title":"International Conference on Very Large Data Bases (VLDB), 10(6)","author":"Zamanian Erfan","year":"2017","unstructured":"Erfan Zamanian , Carsten Binnig , Tim Harris , and Tim Kraska . The End of a Myth: Distributed Transactions Can Scale . International Conference on Very Large Data Bases (VLDB), 10(6) , February 2017 . Erfan Zamanian, Carsten Binnig, Tim Harris, and Tim Kraska. The End of a Myth: Distributed Transactions Can Scale. International Conference on Very Large Data Bases (VLDB), 10(6), February 2017."},{"key":"e_1_2_1_83_1","volume-title":"Zha and Jing Li. Virtualizing FPGAs in the Cloud. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)","author":"Yue","year":"2020","unstructured":"Yue Zha and Jing Li. Virtualizing FPGAs in the Cloud. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) , 2020 . Yue Zha and Jing Li. Virtualizing FPGAs in the Cloud. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2020."}],"container-title":["ACM SIGOPS Operating Systems Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3606557.3606561","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3606557.3606561","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:48:51Z","timestamp":1750182531000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3606557.3606561"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,26]]},"references-count":82,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,6,26]]}},"alternative-id":["10.1145\/3606557.3606561"],"URL":"https:\/\/doi.org\/10.1145\/3606557.3606561","relation":{},"ISSN":["0163-5980"],"issn-type":[{"value":"0163-5980","type":"print"}],"subject":[],"published":{"date-parts":[[2023,6,26]]},"assertion":[{"value":"2023-06-28","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}