{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,23]],"date-time":"2025-08-23T00:08:50Z","timestamp":1755907730746,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":81,"publisher":"ACM","license":[{"start":{"date-parts":[[2024,11,20]],"date-time":"2024-11-20T00:00:00Z","timestamp":1732060800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Huawei","award":["2989"],"award-info":[{"award-number":["2989"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2024,11,20]]},"DOI":"10.1145\/3698038.3698549","type":"proceedings-article","created":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T06:32:43Z","timestamp":1731565963000},"page":"995-1011","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0006-2229-235X","authenticated-orcid":false,"given":"Diana","family":"Petrescu","sequence":"first","affiliation":[{"name":"EPFL, Lausanne, CH"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0898-0387","authenticated-orcid":false,"given":"Arsany","family":"Guirguis","sequence":"additional","affiliation":[{"name":"EPFL, Lausanne, CH"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1433-0217","authenticated-orcid":false,"given":"Do Le","family":"Quoc","sequence":"additional","affiliation":[{"name":"Huawei Munich Research Center, Munich, DE"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-6984-1303","authenticated-orcid":false,"given":"Javier","family":"Picorel","sequence":"additional","affiliation":[{"name":"Huawei Munich Research Center, Munich, DE"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4794-8902","authenticated-orcid":false,"given":"Rachid","family":"Guerraoui","sequence":"additional","affiliation":[{"name":"EPFL, Lausanne, CH"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-1514-2997","authenticated-orcid":false,"given":"Florin","family":"Dinu","sequence":"additional","affiliation":[{"name":"Huawei Munich Research Center, Munich, DE"}]}],"member":"320","published-online":{"date-parts":[[2024,11,20]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Amazon. 2021. AQUA (Advanced Query Accelerator) - A Speed Boost for Your Amazon Redshift Queries. https:\/\/aws.amazon.com\/blogs\/a ws\/new-aqua-advanced-query-accelerator-for-amazon-redshift\/"},{"key":"e_1_3_2_1_2_1","unstructured":"Amazon. 2024. Amazon S3. https:\/\/aws.amazon.com\/s3\/"},{"key":"e_1_3_2_1_3_1","volume-title":"Amazon Web Services","author":"Inc.","year":"2024","unstructured":"Inc. Amazon Web Services. 2024. Build Generative AI Applications with Foundation Models - Amazon Bedrock. https:\/\/aws.amazon.com\/bedrock\/"},{"key":"e_1_3_2_1_4_1","unstructured":"Amazon AWS. 2021. Use pre-trained financial language models for transfer learning in Amazon SageMaker JumpStart. https:\/\/aws.amazon.com\/blogs\/machine-learning\/use-pre-trained-financial-language-models-for-transfer-learning-in-amazon-sagemaker-jumpstart\/"},{"key":"e_1_3_2_1_5_1","volume-title":"PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications. In OSDI","author":"Bai Zhihao","year":"2020","unstructured":"Zhihao Bai, Zhen Zhang, Yibo Zhu, and Xin Jin. 2020. PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications. In OSDI 2020."},{"key":"e_1_3_2_1_6_1","unstructured":"Gedas Bertasius Heng Wang and Lorenzo Torresani. 2021. Is space-time attention all you need for video understanding. (2021). https:\/\/arxiv.org\/abs\/2102.05095"},{"key":"e_1_3_2_1_7_1","unstructured":"AWS News Blog. 2017. S3 Select and Glacier Select - Retrieving Subsets of Objects. https:\/\/aws.amazon.com\/blogs\/aws\/s3- glacier-select\/"},{"key":"e_1_3_2_1_8_1","volume-title":"NeurIPS","author":"Brown Tom","year":"2020","unstructured":"Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel Ziegler, Jeffrey Wu, Clemens Winter, Chris Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. In NeurIPS 2020."},{"key":"e_1_3_2_1_9_1","unstructured":"Tom B Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et al. 2020. Language models are few-shot learners. (2020). https:\/\/arxiv.org\/abs\/2005.14165"},{"key":"e_1_3_2_1_10_1","unstructured":"Curcial by Micron. 2024. Crucial T705 Gen5 NVMe M.2 SSD. https:\/\/www.crucial.in\/content\/dam\/crucial\/ssd-products\/t705\/flyers\/b2c\/crucial-t705-b2c-product-flyer-en.pdf"},{"key":"e_1_3_2_1_11_1","volume-title":"Muhammad Ikram ul Haq, Deepali Bhardwaj, Sowmya Dayanand, Anitha Adusumilli, Marvin McNett, Sriram Sankaran, Kavitha Manivannan, and Leonidas Rigas.","author":"Calder Brad","year":"2011","unstructured":"Brad Calder, Ju Wang, Aaron Ogus, Niranjan Nilakantan, Arild Skjolsvold, Sam McKelvie, Yikang Xu, Shashwat Srivastav, Jiesheng Wu, Huseyin Simitci, Jaidev Haridas, Chakravarthy Uddaraju, Hemal Khatri, Andrew Edwards, Vaman Bedekar, Shane Mainali, Rafay Abbasi, Arpit Agarwal, Mian Fahim ul Haq, Muhammad Ikram ul Haq, Deepali Bhardwaj, Sowmya Dayanand, Anitha Adusumilli, Marvin McNett, Sriram Sankaran, Kavitha Manivannan, and Leonidas Rigas. 2011. Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency. In SOSP 2011."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER49012.2020.00045"},{"key":"e_1_3_2_1_13_1","unstructured":"Alibaba Cloud. 2021. Real-Time Image Processing by Object Storage Service. https:\/\/www.alibabacloud.com\/blog\/real-time-image-processing-by-object-storage-service_597996"},{"key":"e_1_3_2_1_14_1","unstructured":"Google Cloud. 2024. Vertex AI. https:\/\/cloud.google.com\/vertex-ai"},{"key":"e_1_3_2_1_15_1","unstructured":"McKinsey & Company. 2019. Edlich Alex et al. Driving impact at scale from automation and AI. https:\/\/mck.co\/3IQIBVi"},{"key":"e_1_3_2_1_16_1","unstructured":"Microsoft Azure Databricks. 2024. Featurization for transfer learning. https:\/\/docs.microsoft.com\/en-us\/azure\/databricks\/applications\/machine-learning\/preprocess-data\/transfer-learning-tensorflow"},{"key":"e_1_3_2_1_17_1","unstructured":"Mostafa Dehghani Josip Djolonga Basil Mustafa Piotr Padlewski Jonathan Heek Justin Gilmer Andreas Steiner Mathilde Caron Robert Geirhos Ibrahim Alabdulmohsin Rodolphe Jenatton Lucas Beyer Michael Tschannen Anurag Arnab Xiao Wang Carlos Riquelme Matthias Minderer Joan Puigcerver Utku Evci Manoj Kumar Sjoerd van Steenkiste Gamaleldin F. Elsayed Aravindh Mahendran Fisher Yu Avital Oliver Fantine Huot Jasmijn Bastings Mark Patrick Collier Alexey Gritsenko Vighnesh Birodkar Cristina Vasconcelos Yi Tay Thomas Mensink Alexander Kolesnikov Filip Paveti\u0107 Dustin Tran Thomas Kipf Mario Lu\u010di\u0107 Xiaohua Zhai Daniel Keysers Jeremiah Harmsen and Neil Houlsby. 2023. Scaling Vision Transformers to 22 Billion Parameters. https:\/\/arxiv.org\/abs\/2302.05442"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_1_19_1","volume-title":"NeurIPS","author":"Dettmers Tim","year":"2023","unstructured":"Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, and Luke Zettlemoyer. 2023. QLORA: efficient finetuning of quantized LLMs. In NeurIPS 2023."},{"key":"e_1_3_2_1_20_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT 2019."},{"key":"e_1_3_2_1_21_1","volume-title":"Exploiting Cloud Object Storage for High-Performance Analytics. In VLDB","author":"Durner Dominik","year":"2023","unstructured":"Dominik Durner, Viktor Leis, and Thomas Neumann. 2023. Exploiting Cloud Object Storage for High-Performance Analytics. In VLDB 2023."},{"key":"e_1_3_2_1_22_1","volume-title":"KONVENS","author":"Eberhard Onno","year":"2021","unstructured":"Onno Eberhard and Torsten Zesch. 2021. Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages. In KONVENS 2021."},{"key":"e_1_3_2_1_23_1","volume-title":"When Cloud Storage Meets RDMA. In NSDI","author":"Gao Yixiao","year":"2021","unstructured":"Yixiao Gao, Qiang Li, Lingbo Tang, Yongqing Xi, Pengcheng Zhang, Wenwen Peng, Bo Li, Yaohui Wu, Shaozong Liu, Lei Yan, Fei Feng, Yan Zhuang, Fan Liu, Pan Liu, Xingkui Liu, Zhongjie Wu, Junping Wu, Zheng Cao, Chen Tian, Jinbo Wu, Jiaji Zhu, Haiyong Wang, Dennis Cai, and Jiesheng Wu. 2021. When Cloud Storage Meets RDMA. In NSDI 2021."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3368089.3417050"},{"key":"e_1_3_2_1_25_1","volume-title":"NSDI","author":"Gkantsidis Christos","year":"2013","unstructured":"Christos Gkantsidis, Dimitrios Vytiniotis, Orion Hodson, Dushyanth Narayanan, Florin Dinu, and Ant Rowstron. 2013. Rhea: automatic filtering for unstructured cloud storage. In NSDI 2013."},{"volume-title":"Deep learning","author":"Goodfellow Ian","key":"e_1_3_2_1_26_1","unstructured":"Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep learning. MIT press."},{"key":"e_1_3_2_1_27_1","unstructured":"Google. 2024. Google Cloud Storage. https:\/\/cloud.google.com\/storage"},{"key":"e_1_3_2_1_28_1","volume-title":"Mun Wai Lee, and Tianfu Wu","author":"Grainger Ryan","year":"2022","unstructured":"Ryan Grainger, Thomas Paniagua, Xi Song, Naresh P. Cuntoor, Mun Wai Lee, and Tianfu Wu. 2022. PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers. In CVPR 2022."},{"key":"e_1_3_2_1_29_1","volume-title":"Pecan: Cost-Efficient ML Data Preprocessing with Automatic Transformation Ordering and Hybrid Placement. In USENIX ATC","author":"Graur Dan","year":"2024","unstructured":"Dan Graur, Oto Mraz, Muyu Li, Sepehr Pourghannad, Chandramohan A. Thekkath, and Ana Klimovic. 2024. Pecan: Cost-Efficient ML Data Preprocessing with Automatic Transformation Ordering and Hybrid Placement. In USENIX ATC 2024."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2023.3314659"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-67630-2_13"},{"key":"e_1_3_2_1_32_1","unstructured":"Awni Hannun Carl Case Jared Casper Bryan Catanzaro Greg Diamos Erich Elsen Ryan Prenger Sanjeev Satheesh Shubho Sengupta Adam Coates et al. 2014. Deep speech: Scaling up end-to-end speech recognition. (2014). https:\/\/arxiv.org\/abs\/1412.5567"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_34_1","unstructured":"Edward J. Hu Yelong Shen Phillip Wallis Zeyuan Allen-Zhu Yuanzhi Li Shean Wang Lu Wang and Weizhu Chen. 2021. LoRA: Low-Rank Adaptation of Large Language Models. https:\/\/arxiv.org\/abs\/2106.09685"},{"key":"e_1_3_2_1_35_1","volume-title":"AutoPipe: Automatic Configuration of Pipeline Parallelism in Shared GPU Cluster. In ICPP","author":"Hu Jinbin","year":"2024","unstructured":"Jinbin Hu, Ying Liu, Hao Wang, and Jin Wang. 2024. AutoPipe: Automatic Configuration of Pipeline Parallelism in Shared GPU Cluster. In ICPP 2024."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.243"},{"key":"e_1_3_2_1_37_1","volume-title":"Dehao Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V. Le, Yonghui Wu, and Zhifeng Chen.","author":"Huang Yanping","year":"2019","unstructured":"Yanping Huang, Youlong Cheng, Ankur Bapna, Orhan Firat, Mia Xu Chen, Dehao Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V. Le, Yonghui Wu, and Zhifeng Chen. 2019. GPipe: Efficient Training of Giant Neural Networks Using Pipeline Parallelism. In NeurIPS 2019."},{"key":"e_1_3_2_1_38_1","unstructured":"Huawei. 2024. Huawei ECS. https:\/\/www.huaweicloud.com\/intl\/en-us\/product\/ecs.html"},{"key":"e_1_3_2_1_39_1","unstructured":"IBM. 2024. IBM Storage Scale System 6000. https:\/\/www.ibm.com\/products\/storage-scale-system"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517848"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3552326.3567508"},{"key":"e_1_3_2_1_42_1","volume-title":"OSDl","author":"Jiang Yimin","year":"2020","unstructured":"Yimin Jiang, Yibo Zhu, Chang Lan, Bairen Yi, Yong Cui, and Chuanxiong Guo. 2020. A Unified Architecture for Accelerating Distributed DNN Training in Heterogeneous GPU\/CPU Clusters. In OSDl 2020."},{"key":"e_1_3_2_1_43_1","volume-title":"Trevor Mudge, Jason Mars, and Lingjia Tang.","author":"Kang Yiping","year":"2017","unstructured":"Yiping Kang, Johann Hauswald, Cao Gao, Austin Rovinski, Trevor Mudge, Jason Mars, and Lingjia Tang. 2017. Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge. In ASPLOS 2017."},{"key":"e_1_3_2_1_44_1","volume-title":"Fahad Shahbaz Khan, and Mubarak Shah.","author":"Khan Salman","year":"2021","unstructured":"Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah. 2021. Transformers in vision: A survey. ACM Computing Surveys (CSUR) (2021)."},{"key":"e_1_3_2_1_45_1","volume-title":"NDPipe: Exploiting Near-data Processing for Scalable Inference and Continuous Training in Photo Storage. In ASPLOS","author":"Kim Jungwoo","year":"2024","unstructured":"Jungwoo Kim, Seonggyun Oh, Jaeha Kung, Yeseong Kim, and Sungjin Lee. 2024. NDPipe: Exploiting Near-data Processing for Scalable Inference and Continuous Training in Photo Storage. In ASPLOS 2024."},{"key":"e_1_3_2_1_46_1","volume-title":"Understanding Ephemeral Storage for Serverless Analytics. In USENIX ATC","author":"Klimovic Ana","year":"2018","unstructured":"Ana Klimovic, Yawen Wang, Christos Kozyrakis, Patrick Stuedi, Jonas Pfefferle, and Animesh Trivedi. 2018. Understanding Ephemeral Storage for Serverless Analytics. In USENIX ATC 2018."},{"key":"e_1_3_2_1_47_1","volume-title":"Te I., H.V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng, Steven Swanson, and Murali Annavaram.","author":"Koo Gunjae","year":"2017","unstructured":"Gunjae Koo, Kiran Kumar Matam, Te I., H.V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng, Steven Swanson, and Murali Annavaram. 2017. Summarizer: Trading Communication with Computing Near Storage. In MICRO 2017."},{"key":"e_1_3_2_1_48_1","volume-title":"SIGMOD","author":"Kuschewski Maximilian","year":"2025","unstructured":"Maximilian Kuschewski, Jana Giceva, Thomas Neumann, and Viktor Leis. 2025. High-Performance Query Processing with NVMe Arrays: Spilling without Killing Performance. In SIGMOD 2025."},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.14778\/3007328.3007331"},{"key":"e_1_3_2_1_50_1","volume-title":"LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models. In ICLR","author":"Li Yixiao","year":"2024","unstructured":"Yixiao Li, Yifan Yu, Chen Liang, Nikos Karampatziakis, Pengcheng He, Weizhu Chen, and Tuo Zhao. 2024. LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models. In ICLR 2024."},{"key":"e_1_3_2_1_51_1","volume-title":"HotStorage","author":"Lillaney Kunal","year":"2019","unstructured":"Kunal Lillaney, Vasily Tarasov, David Pease, and Randal Burns. 2019. The Case for Dual-access File Systems over Object Storage. In HotStorage 2019."},{"key":"e_1_3_2_1_52_1","volume-title":"Aceso: Efficient Parallel DNN Training through Iterative Bottleneck Alleviation. In EuroSys","author":"Liu Guodong","year":"2024","unstructured":"Guodong Liu, Youshan Miao, Zhiqi Lin, Xiaoxiang Shi, Saeed Maleki, Fan Yang, Yungang Bao, and Sa Wang. 2024. Aceso: Efficient Parallel DNN Training through Iterative Bottleneck Alleviation. In EuroSys 2024."},{"key":"e_1_3_2_1_53_1","volume-title":"SOSP","author":"Luo Mai Peter Pietzuch Bo Zhao","year":"2024","unstructured":"Bo Zhao Luo Mai Peter Pietzuch Marcel Wagenl\u00e4nder, Guo Li. 2024. Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections. In SOSP 2024."},{"key":"e_1_3_2_1_54_1","volume-title":"Tf.Data: A Machine Learning Data Processing Framework. In VLDB","author":"Murray Derek G.","year":"2021","unstructured":"Derek G. Murray, Ji\u0159\u00ed \u0160im\u0161a, Ana Klimovic, and Ihor Indyk. 2021. Tf.Data: A Machine Learning Data Processing Framework. In VLDB 2021."},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/3388440.3412467"},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341301.3359646"},{"key":"e_1_3_2_1_57_1","unstructured":"NVIDIA. 2024. GPUDirect Storage. https:\/\/docs.nvidia.com\/gpudirect-storage\/index.html"},{"key":"e_1_3_2_1_58_1","volume-title":"Pytorch: An imperative style, high-performance deep learning library. In NeurIPS","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. Pytorch: An imperative style, high-performance deep learning library. In NeurIPS 2019."},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3599691.3603404"},{"key":"e_1_3_2_1_60_1","volume-title":"Fast and Slow: Scalable Analytics on Serverless Infrastructure. In NSDI","author":"Pu Qifan","year":"2019","unstructured":"Qifan Pu, Shivaram Venkataraman, and Ion Stoica. 2019. Shuffling, Fast and Slow: Scalable Analytics on Serverless Infrastructure. In NSDI 2019."},{"key":"e_1_3_2_1_61_1","volume-title":"Alibaba HPN: A Data Center Network for Large Language Model Training. In SIGCOMM","author":"Qian Kun","year":"2024","unstructured":"Kun Qian, Yongqing Xi, Jiamin Cao, Jiaqi Gao, Yichi Xu, Yu Guan, Binzhang Fu, Xuemei Shi, Fangbo Zhu, Rui Miao, Chao Wang, Peng Wang, Pengcheng Zhang, Xianlong Zeng, Eddie Ruan, Zhiping Yao, Ennan Zhai, and Dennis Cai. 2024. Alibaba HPN: A Data Center Network for Large Language Model Training. In SIGCOMM 2024."},{"key":"e_1_3_2_1_62_1","volume-title":"Glow: Graph Lowering Compiler Techniques for Neural Networks. https:\/\/arxiv.org\/abs\/1805.00907","author":"Rotem Nadav","year":"2019","unstructured":"Nadav Rotem, Jordan Fix, Saleem Abdulrasool, Garret Catron, Summer Deng, Roman Dzhabarov, Nick Gibson, James Hegeman, Meghan Lele, Roman Levenstein, Jack Montgomery, Bert Maher, Satish Nadathur, Jakob Olesen, Jongsoo Park, Artem Rakhov, Misha Smelyanskiy, and Man Wang. 2019. Glow: Graph Lowering Compiler Techniques for Neural Networks. https:\/\/arxiv.org\/abs\/1805.00907"},{"key":"e_1_3_2_1_63_1","volume-title":"Effective Elastic Scaling of Deep Learning Workloads. In MASCOTS","author":"Saxena V.","year":"2020","unstructured":"V. Saxena, K. R. Jayaram, S. Basu, Y. Sabharwal, and A. Verma. 2020. Effective Elastic Scaling of Deep Learning Workloads. In MASCOTS 2020."},{"key":"e_1_3_2_1_64_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. (2014). https:\/\/arxiv.org\/abs\/1409.1556"},{"key":"e_1_3_2_1_65_1","volume-title":"Increase the Batch Size. In ICLR","author":"Smith Samuel L.","year":"2018","unstructured":"Samuel L. Smith, Pieter-Jan Kindermans, and Quoc V. Le. 2018. Don't Decay the Learning Rate, Increase the Batch Size. In ICLR 2018."},{"key":"e_1_3_2_1_66_1","volume-title":"NeurIPS","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In NeurIPS 2017."},{"key":"e_1_3_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/3655038.3665947"},{"key":"e_1_3_2_1_68_1","unstructured":"Wenxiao Wang Wei Chen Qibo Qiu Long Chen Boxi Wu Binbin Lin Xiaofei He and Wei Liu. 2023. CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention. https:\/\/arxiv.org\/abs\/2303.06908"},{"key":"e_1_3_2_1_69_1","volume-title":"A survey of transfer learning. Journal of Big data","author":"Weiss Karl","year":"2016","unstructured":"Karl Weiss, Taghi M Khoshgoftaar, and DingDing Wang. 2016. A survey of transfer learning. Journal of Big data (2016)."},{"key":"e_1_3_2_1_70_1","volume-title":"AQUOMAN: An Analytic-Query Offloading Machine. In MICRO","author":"Xu Shuotao","year":"2020","unstructured":"Shuotao Xu, Thomas Bourgeat, Tianhao Huang, Hojun Kim, Sungjin Lee, and Arvind Arvind. 2020. AQUOMAN: An Analytic-Query Offloading Machine. In MICRO 2020."},{"key":"e_1_3_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/3452296.3472927"},{"key":"e_1_3_2_1_72_1","volume-title":"FAST","author":"Yang Jian","year":"2020","unstructured":"Jian Yang, Juno Kim, Morteza Hoseinzadeh, Joseph Izraelevitz, and Steve Swanson. 2020. An Empirical Guide to the Behavior and Use of Scalable Persistent Memory. In FAST 2020."},{"key":"e_1_3_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.14778\/3476249.3476265"},{"key":"e_1_3_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/3097983.3098147"},{"key":"e_1_3_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE48307.2020.00174"},{"key":"e_1_3_2_1_76_1","volume-title":"DDS: DPU-Optimized Disaggregated Storage. In VLDB","author":"Zhang Qizhen","year":"2024","unstructured":"Qizhen Zhang, Philip A. Bernstein, Badrish Chandramouli, Jiasheng Hu, and Yiming Zheng. 2024. DDS: DPU-Optimized Disaggregated Storage. In VLDB 2024."},{"key":"e_1_3_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/3552326.3567499"},{"key":"e_1_3_2_1_78_1","volume-title":"ISCA","author":"Zhao Mark","year":"2022","unstructured":"Mark Zhao, Niket Agarwal, Aarti Basant, Bugra Gedik, Satadru Pan, Mustafa Ozdal, Rakesh Komuravelli, Jerry Pan, Tianshu Bao, Haowei Lu, et al. 2022. Understanding and co-designing the data ingestion pipeline for industry-scale recsys training. In ISCA 2022."},{"key":"e_1_3_2_1_79_1","volume-title":"ISCA","author":"Zhao Mark","year":"2022","unstructured":"Mark Zhao, Niket Agarwal, Aarti Basant, Bu\u011fra Gedik, Satadru Pan, Mustafa Ozdal, Rakesh Komuravelli, Jerry Pan, Tianshu Bao, Haowei Lu, Sundaram Narayanan, Jack Langman, Kevin Wilfong, Harsha Rastogi, Carole-Jean Wu, Christos Kozyrakis, and Parik Pol. 2022. Understanding data storage and ingestion for large-scale deep recommendation model training: industrial product. In ISCA 2022."},{"key":"e_1_3_2_1_80_1","unstructured":"Fuzhen Zhuang Zhiyuan Qi Keyu Duan Dongbo Xi Yongchun Zhu Hengshu Zhu Hui Xiong and Qing He. 2019. A Comprehensive Survey on Transfer Learning. (2019). http:\/\/arxiv.org\/abs\/1911.02685"},{"key":"e_1_3_2_1_81_1","volume-title":"Intel OpenVINO Toolkit for Computer Vision: Object Detection and Semantic Segmentation. In 2021 International Russian Automation Conference (RusAutoCon).","author":"Zunin V. V.","year":"2021","unstructured":"V. V. Zunin. 2021. Intel OpenVINO Toolkit for Computer Vision: Object Detection and Semantic Segmentation. In 2021 International Russian Automation Conference (RusAutoCon)."}],"event":{"name":"SoCC '24: ACM Symposium on Cloud Computing","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGOPS ACM Special Interest Group on Operating Systems"],"location":"Redmond WA USA","acronym":"SoCC '24"},"container-title":["Proceedings of the ACM Symposium on Cloud Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3698038.3698549","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3698038.3698549","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T19:02:46Z","timestamp":1755889366000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3698038.3698549"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,20]]},"references-count":81,"alternative-id":["10.1145\/3698038.3698549","10.1145\/3698038"],"URL":"https:\/\/doi.org\/10.1145\/3698038.3698549","relation":{},"subject":[],"published":{"date-parts":[[2024,11,20]]},"assertion":[{"value":"2024-11-20","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}