{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,6]],"date-time":"2026-06-06T01:13:31Z","timestamp":1780708411542,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":36,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,11,7]],"date-time":"2022-11-07T00:00:00Z","timestamp":1667779200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100017693","name":"HiPEAC Network","doi-asserted-by":"publisher","award":["871174"],"award-info":[{"award-number":["871174"]}],"id":[{"id":"10.13039\/100017693","id-type":"DOI","asserted-by":"publisher"}]},{"name":"EuroHPC","award":["955606"],"award-info":[{"award-number":["955606"]}]},{"DOI":"10.13039\/100010661","name":"Horizon 2020 Framework Programme","doi-asserted-by":"publisher","award":["101034126"],"award-info":[{"award-number":["101034126"]}],"id":[{"id":"10.13039\/100010661","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,11,7]]},"DOI":"10.1145\/3542929.3563467","type":"proceedings-article","created":{"date-parts":[[2022,11,7]],"date-time":"2022-11-07T20:19:18Z","timestamp":1667852358000},"page":"1-15","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Arax"],"prefix":"10.1145","author":[{"given":"Manos","family":"Pavlidakis","sequence":"first","affiliation":[{"name":"University of Crete, Greece"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Stelios","family":"Mavridis","sequence":"additional","affiliation":[{"name":"Institute of Computer Science (ICS), Greece"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Antony","family":"Chazapis","sequence":"additional","affiliation":[{"name":"Institute of Computer Science (ICS), Greece"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Giorgos","family":"Vasiliadis","sequence":"additional","affiliation":[{"name":"Institute of Computer Science (ICS), Greece"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Angelos","family":"Bilas","sequence":"additional","affiliation":[{"name":"University of Crete, Greece"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,11,7]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. In Euro-Par '09","author":"Augonnet C\u00e9dric","year":"2009","unstructured":"C\u00e9dric Augonnet , Samuel Thibault , Raymond Namyst , and Pierre-Andr\u00e9 Wacrenier . 2009 . StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. In Euro-Par '09 . C\u00e9dric Augonnet, Samuel Thibault, Raymond Namyst, and Pierre-Andr\u00e9 Wacrenier. 2009. StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. In Euro-Par '09."},{"key":"e_1_3_2_1_2_1","volume-title":"Artificial-intelligence hardware: New opportunities for semiconductor companies","author":"Batra Gaurav","unstructured":"Gaurav Batra , Zach Jacobson , Siddarth Madhav , Andrea Queirolo , and Nick Santhanam . 2018. Artificial-intelligence hardware: New opportunities for semiconductor companies . In McKinsey & Company , New York, NY, USA , Tech. Rep. Gaurav Batra, Zach Jacobson, Siddarth Madhav, Andrea Queirolo, and Nick Santhanam. 2018. Artificial-intelligence hardware: New opportunities for semiconductor companies. In McKinsey & Company, New York, NY, USA, Tech. Rep."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2742060.2743766"},{"key":"e_1_3_2_1_4_1","volume-title":"Balancing Efficiency and Fairness in Heterogeneous GPU Clusters for Deep Learning. In EuroSys '20","author":"Chaudhary Shubham","unstructured":"Shubham Chaudhary , Ramachandran Ramjee , Muthian Sivathanu , N. Kwatra , and S. Viswanatha . 2020 . Balancing Efficiency and Fairness in Heterogeneous GPU Clusters for Deep Learning. In EuroSys '20 . Shubham Chaudhary, Ramachandran Ramjee, Muthian Sivathanu, N. Kwatra, and S. Viswanatha. 2020. Balancing Efficiency and Fairness in Heterogeneous GPU Clusters for Deep Learning. In EuroSys '20."},{"key":"e_1_3_2_1_5_1","volume-title":"Rodinia: A Benchmark Suite for Heterogeneous Computing. In IISWC '09","author":"Che Shuai","year":"2009","unstructured":"Shuai Che , Michael Boyer , Jiayuan Meng , David Tarjan , Jeremy W. Sheaffer , Sang-Ha Lee , and Kevin Skadron . 2009 . Rodinia: A Benchmark Suite for Heterogeneous Computing. In IISWC '09 . Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, Sang-Ha Lee, and Kevin Skadron. 2009. Rodinia: A Benchmark Suite for Heterogeneous Computing. In IISWC '09."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2996864"},{"key":"e_1_3_2_1_7_1","volume-title":"HiPC '11","author":"Duato Jose","unstructured":"Jose Duato , Antonio J. Pena , Federico Silla , Juan C. Fernandez , Rafael Mayo , and Enrique S . Quintana-Orti. 2011. Enabling CUDA acceleration within virtual machines using rCUDA . In HiPC '11 . Jose Duato, Antonio J. Pena, Federico Silla, Juan C. Fernandez, Rafael Mayo, and Enrique S. Quintana-Orti. 2011. Enabling CUDA acceleration within virtual machines using rCUDA. In HiPC '11."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080246"},{"key":"e_1_3_2_1_9_1","unstructured":"Mart\u00edn Abadi et. al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. https:\/\/www.tensorflow.org\/ Software available from tensorflow.org.  Mart\u00edn Abadi et. al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. https:\/\/www.tensorflow.org\/ Software available from tensorflow.org."},{"key":"e_1_3_2_1_10_1","volume-title":"ISCA '18","author":"Fowers Jeremy","unstructured":"Jeremy Fowers , Kalin Ovtcharov , Michael Papamichael , Todd Massengill , Ming Liu , Daniel Lo , Shlomi Alkalay , Michael Haselman , Logan Adams , Mahdi Ghandi , Stephen Heil , Prerak Patel , Adam Sapek , Gabriel Weisz , Lisa Woods , Sitaram Lanka , Steven K. Reinhardt , Adrian M. Caulfield , E. S. Chung , and D. Burger . 2018. A Configurable Cloud-Scale DNN Processor for Real-Time AI . In ISCA '18 . Jeremy Fowers, Kalin Ovtcharov, Michael Papamichael, Todd Massengill, Ming Liu, Daniel Lo, Shlomi Alkalay, Michael Haselman, Logan Adams, Mahdi Ghandi, Stephen Heil, Prerak Patel, Adam Sapek, Gabriel Weisz, Lisa Woods, Sitaram Lanka, Steven K. Reinhardt, Adrian M. Caulfield, E. S. Chung, and D. Burger. 2018. A Configurable Cloud-Scale DNN Processor for Real-Time AI. In ISCA '18."},{"key":"e_1_3_2_1_11_1","volume-title":"SYCL2020","author":"Kronos Group","year":"2022","unstructured":"Kronos Group . 2022 . SYCL2020 . Retrieved September 2022 from https:\/\/www.khronos.org\/sycl\/ Kronos Group. 2022. SYCL2020. Retrieved September 2022 from https:\/\/www.khronos.org\/sycl\/"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357223.3362714"},{"key":"e_1_3_2_1_13_1","volume-title":"oneAPI. Retrieved","year":"2022","unstructured":"Intel. 2020. oneAPI. Retrieved September 2022 from https:\/\/software.intel.com\/content\/www\/us\/en\/develop\/tools\/oneapi.html#gs.4ac4fz Intel. 2020. oneAPI. Retrieved September 2022 from https:\/\/software.intel.com\/content\/www\/us\/en\/develop\/tools\/oneapi.html#gs.4ac4fz"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654889"},{"key":"e_1_3_2_1_15_1","volume-title":"Keras Code Examples. Retrieved","year":"2022","unstructured":"Keras. 2014. Keras Code Examples. Retrieved September 2022 from https:\/\/keras.io\/examples\/ Keras. 2014. Keras Code Examples. Retrieved September 2022 from https:\/\/keras.io\/examples\/"},{"key":"e_1_3_2_1_16_1","volume-title":"The mnist database of handwritten digits. Retrieved","author":"Lecun Cortes","year":"2022","unstructured":"Cortes Lecun . 2022. The mnist database of handwritten digits. Retrieved September 2022 from http:\/\/yann.lecun.com\/exdb\/mnist Cortes Lecun. 2022. The mnist database of handwritten digits. Retrieved September 2022 from http:\/\/yann.lecun.com\/exdb\/mnist"},{"key":"e_1_3_2_1_17_1","volume-title":"Proc. IEEE.","author":"Lecun Y.","unstructured":"Y. Lecun , L. Bottou , Y. Bengio , and P. Haffner . 1998. Gradient-based learning applied to document recognition . Proc. IEEE. Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE."},{"key":"e_1_3_2_1_18_1","volume-title":"PuDianNao: A Polyvalent Machine Learning Accelerator. In ASPLOS '15","author":"Liu Daofu","unstructured":"Daofu Liu , Tianshi Chen , Shaoli Liu , Jinhong Zhou , Shengyuan Zhou , Olivier Teman , Xiaobing Feng , X. Zhou , and Y. Chen . 2015 . PuDianNao: A Polyvalent Machine Learning Accelerator. In ASPLOS '15 . Daofu Liu, Tianshi Chen, Shaoli Liu, Jinhong Zhou, Shengyuan Zhou, Olivier Teman, Xiaobing Feng, X. Zhou, and Y. Chen. 2015. PuDianNao: A Polyvalent Machine Learning Accelerator. In ASPLOS '15."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Stelios Mavridis Manolis Pavlidakis Ioannis Stamoulias Christos Kozanitis Nikolaos Chrysos Christoforos Kachris Dimitrios Soudris and Angelos Bilas. 2017. VineTalk: Simplifying software access and sharing of FPGAs in datacenters. In FPL' 17.  Stelios Mavridis Manolis Pavlidakis Ioannis Stamoulias Christos Kozanitis Nikolaos Chrysos Christoforos Kachris Dimitrios Soudris and Angelos Bilas. 2017. VineTalk: Simplifying software access and sharing of FPGAs in datacenters. In FPL' 17.","DOI":"10.23919\/FPL.2017.8056788"},{"key":"e_1_3_2_1_20_1","volume-title":"Memory Bandwidth and Machine Balance in Current High Performance Computers. In TCCA '95","author":"McCalpin John D.","year":"1995","unstructured":"John D. McCalpin . 1995 . Memory Bandwidth and Machine Balance in Current High Performance Computers. In TCCA '95 . John D. McCalpin. 1995. Memory Bandwidth and Machine Balance in Current High Performance Computers. In TCCA '95."},{"key":"e_1_3_2_1_21_1","volume-title":"CUDA Binary Utilities. Retrieved","author":"NVIDIA.","year":"2022","unstructured":"NVIDIA. 2021. CUDA Binary Utilities. Retrieved September 2022 from https:\/\/docs.nvidia.com\/cuda\/parallel-thread-execution\/index.html NVIDIA. 2021. CUDA Binary Utilities. Retrieved September 2022 from https:\/\/docs.nvidia.com\/cuda\/parallel-thread-execution\/index.html"},{"key":"e_1_3_2_1_22_1","volume-title":"CUDA: Compute Unified Device Architecture. Retrieved","author":"NVIDIA.","year":"2022","unstructured":"NVIDIA. 2022 . CUDA: Compute Unified Device Architecture. Retrieved Sep. 2022 from https:\/\/developer.nvidia.com\/cuda-toolkit NVIDIA. 2022. CUDA: Compute Unified Device Architecture. Retrieved Sep. 2022 from https:\/\/developer.nvidia.com\/cuda-toolkit"},{"key":"e_1_3_2_1_23_1","volume-title":"Retrieved","author":"Multi-Process Service NVIDIA.","year":"2022","unstructured":"NVIDIA. 2022. Multi-Process Service . Retrieved September 2022 from https:\/\/docs.nvidia.com\/deploy\/pdf\/CUDA_Multi_Process_Service_Overview.pdf NVIDIA. 2022. Multi-Process Service. Retrieved September 2022 from https:\/\/docs.nvidia.com\/deploy\/pdf\/CUDA_Multi_Process_Service_Overview.pdf"},{"key":"e_1_3_2_1_24_1","volume-title":"Retrieved","author":"Direct NVIDIA.","year":"2022","unstructured":"NVIDIA. 2022. NVIDIA GPU Direct . Retrieved September 2022 from https:\/\/developer.nvidia.com\/gpudirect NVIDIA. 2022. NVIDIA GPUDirect. Retrieved September 2022 from https:\/\/developer.nvidia.com\/gpudirect"},{"key":"e_1_3_2_1_25_1","volume-title":"Parallel Thread Execution ISA. Retrieved","author":"NVIDIA.","year":"2022","unstructured":"NVIDIA. 2022. Parallel Thread Execution ISA. Retrieved September 2022 from https:\/\/docs.nvidia.com\/cuda\/parallel-thread-execution\/index.html NVIDIA. 2022. Parallel Thread Execution ISA. Retrieved September 2022 from https:\/\/docs.nvidia.com\/cuda\/parallel-thread-execution\/index.html"},{"key":"e_1_3_2_1_26_1","volume-title":"TReM: A Task Revocation Mechanism for GPUs. In HPCC'20","author":"Pavlidakis Manos","year":"2020","unstructured":"Manos Pavlidakis , Stelios Mavridis , Nikos Chrysos , and Angelos Bilas . 2020 . TReM: A Task Revocation Mechanism for GPUs. In HPCC'20 . Manos Pavlidakis, Stelios Mavridis, Nikos Chrysos, and Angelos Bilas. 2020. TReM: A Task Revocation Mechanism for GPUs. In HPCC'20."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3319423"},{"key":"e_1_3_2_1_28_1","volume-title":"ImageNet Large Scale Visual Recognition Challenge. IJCV '15","author":"Russakovsky Olga","year":"2015","unstructured":"Olga Russakovsky , Jia Deng , Hao Su , Jonathan Krause , Sanjeev Satheesh , Sean Ma , Zhiheng Huang , Andrej Karpathy , Aditya Khosla , Michael Bernstein , Alexander C. Berg , and Li Fei-Fei . 2015 . ImageNet Large Scale Visual Recognition Challenge. IJCV '15 . Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet Large Scale Visual Recognition Challenge. IJCV '15."},{"key":"e_1_3_2_1_29_1","volume-title":"Keckler","author":"Shao Yakun Sophia","year":"2021","unstructured":"Yakun Sophia Shao , Jason Cemons , Rangharajan Venkatesan , Brian Zimmer , Matthew Fojtik , Nan Jiang , Ben Keller , Alicia Klinefelter , Nathaniel Pinckney , Priyanka Raina , Stephen G. Tell , Yanqing Zhang , William J. Dally , Joel Emer , C. Thomas Gray , Brucek Khailany , and Stephen W . Keckler . 2021 . Simba : Scaling Deep-Learning Inference with Chiplet-Based Architecture. In MICRO '21. Yakun Sophia Shao, Jason Cemons, Rangharajan Venkatesan, Brian Zimmer, Matthew Fojtik, Nan Jiang, Ben Keller, Alicia Klinefelter, Nathaniel Pinckney, Priyanka Raina, Stephen G. Tell, Yanqing Zhang, William J. Dally, Joel Emer, C. Thomas Gray, Brucek Khailany, and Stephen W. Keckler. 2021. Simba: Scaling Deep-Learning Inference with Chiplet-Based Architecture. In MICRO '21."},{"key":"e_1_3_2_1_30_1","volume-title":"IPDPS'09","author":"Shi Lin","year":"2009","unstructured":"Lin Shi , Hao Chen , and Jianhua Sun . 2009 . vCUDA: GPU accelerated high performance computing in virtual machines . In IPDPS'09 . Lin Shi, Hao Chen, and Jianhua Sun. 2009. vCUDA: GPU accelerated high performance computing in virtual machines. In IPDPS'09."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTR.2009.5289193"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1723112.1723134"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.2172\/1473756"},{"key":"e_1_3_2_1_34_1","volume-title":"Gandiva: Introspective Cluster Scheduling for Deep Learning. In OSDI '18","author":"Xiao Wencong","unstructured":"Wencong Xiao , Romil Bhardwaj , Ramachandran Ramjee , Muthian Sivathanu , Nipun Kwatra , Zhenhua Han , Pratyush Patel , Xuan Peng , Hanyu Zhao , Quanlu Zhang , F. Yang , and L. Zhou . 2018 . Gandiva: Introspective Cluster Scheduling for Deep Learning. In OSDI '18 . Wencong Xiao, Romil Bhardwaj, Ramachandran Ramjee, Muthian Sivathanu, Nipun Kwatra, Zhenhua Han, Pratyush Patel, Xuan Peng, Hanyu Zhao, Quanlu Zhang, F. Yang, and L. Zhou. 2018. Gandiva: Introspective Cluster Scheduling for Deep Learning. In OSDI '18."},{"key":"e_1_3_2_1_35_1","volume-title":"AntMan: Dynamic Scaling on GPU Clusters for Deep Learning. In OSDI '20","author":"Xiao Wencong","year":"2020","unstructured":"Wencong Xiao , Shiru Ren , Yong Li , Yang Zhang , Pengyang Hou , Zhi Li , Yihui Feng , Wei Lin , and Yangqing Jia . 2020 . AntMan: Dynamic Scaling on GPU Clusters for Deep Learning. In OSDI '20 . Wencong Xiao, Shiru Ren, Yong Li, Yang Zhang, Pengyang Hou, Zhi Li, Yihui Feng, Wei Lin, and Yangqing Jia. 2020. AntMan: Dynamic Scaling on GPU Clusters for Deep Learning. In OSDI '20."},{"key":"e_1_3_2_1_36_1","volume":"202","author":"Yu Hangchen","unstructured":"Hangchen Yu , Arthur Michener Peters , Amogh Akshintala , and Christopher J. Rossbach. 202 0. AvA: Accelerated Virtualization of Accelerators. In ASPLOS '20. Hangchen Yu, Arthur Michener Peters, Amogh Akshintala, and Christopher J. Rossbach. 2020. AvA: Accelerated Virtualization of Accelerators. In ASPLOS '20.","journal-title":"Christopher J. Rossbach."}],"event":{"name":"SoCC '22: ACM Symposium on Cloud Computing","location":"San Francisco California","acronym":"SoCC '22","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGOPS ACM Special Interest Group on Operating Systems"]},"container-title":["Proceedings of the 13th Symposium on Cloud Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3542929.3563467","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3542929.3563467","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:23Z","timestamp":1750186943000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3542929.3563467"}},"subtitle":["a runtime framework for decoupling applications from heterogeneous accelerators"],"short-title":[],"issued":{"date-parts":[[2022,11,7]]},"references-count":36,"alternative-id":["10.1145\/3542929.3563467","10.1145\/3542929"],"URL":"https:\/\/doi.org\/10.1145\/3542929.3563467","relation":{},"subject":[],"published":{"date-parts":[[2022,11,7]]},"assertion":[{"value":"2022-11-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}