{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T15:19:17Z","timestamp":1774019957500,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":78,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,11,1]],"date-time":"2021-11-01T00:00:00Z","timestamp":1635724800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,11]]},"DOI":"10.1145\/3472883.3486972","type":"proceedings-article","created":{"date-parts":[[2021,10,27]],"date-time":"2021-10-27T10:48:16Z","timestamp":1635331696000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":80,"title":["Llama"],"prefix":"10.1145","author":[{"given":"Francisco","family":"Romero","sequence":"first","affiliation":[{"name":"Stanford University"}]},{"given":"Mark","family":"Zhao","sequence":"additional","affiliation":[{"name":"Stanford University"}]},{"given":"Neeraja J.","family":"Yadwadkar","sequence":"additional","affiliation":[{"name":"Stanford University"}]},{"given":"Christos","family":"Kozyrakis","sequence":"additional","affiliation":[{"name":"Stanford University"}]}],"member":"320","published-online":{"date-parts":[[2021,11]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"2021. Amazon ECU. https:\/\/aws.amazon.com\/ec2\/faqs\/#What_is_an_EC2_Compute_Unit_and_why_did_you_introduce_it.  2021. Amazon ECU. https:\/\/aws.amazon.com\/ec2\/faqs\/#What_is_an_EC2_Compute_Unit_and_why_did_you_introduce_it."},{"key":"e_1_3_2_2_2_1","unstructured":"2021. Ambarella CVFlow Architecture. https:\/\/www.ambarella.com\/teehnology\/#evflow.  2021. Ambarella CVFlow Architecture. 
https:\/\/www.ambarella.com\/teehnology\/#evflow."},{"key":"e_1_3_2_2_3_1","unstructured":"2021. AWS Lambda. https:\/\/aws.amazon.com\/lambda\/."},{"key":"e_1_3_2_2_4_1","unstructured":"2021. AWS Step Functions. https:\/\/docs.aws.amazon.com\/step-functions\/latest\/dg\/welcome.html."},{"key":"e_1_3_2_2_5_1","unstructured":"2021. Azure Functions. https:\/\/azure.microsoft.com\/en-us\/services\/functions\/."},{"key":"e_1_3_2_2_6_1","unstructured":"2021. Cisco Annual Internet Report (2018-2023). https:\/\/www.cisco.com\/c\/en\/us\/solutions\/collateral\/executive-perspectives\/annual-internet-report\/white-paper-c11-741490.html."},{"key":"e_1_3_2_2_7_1","unstructured":"2021. CNN - Futuristic cop cars may identify suspects. https:\/\/money.cnn.com\/2017\/10\/19\/technology\/future\/police-ai-dashcam\/index.html."},{"key":"e_1_3_2_2_8_1","unstructured":"2021. Google Cloud. https:\/\/cloud.google.com\/."},{"key":"e_1_3_2_2_9_1","unstructured":"2021. Google Cloud Functions. https:\/\/cloud.google.com\/functions."},{"key":"e_1_3_2_2_10_1","unstructured":"2021. Multi-Process Service. 
https:\/\/docs.nvidia.com\/deploy\/pdf\/CUDA_Multi_Process_Service_Overview.pdf."},{"key":"e_1_3_2_2_11_1","unstructured":"2021. NVIDIA A100 GPU. https:\/\/www.nvidia.com\/en-us\/data-center\/a100\/."},{"key":"e_1_3_2_2_12_1","unstructured":"2021. Political Rally Video. https:\/\/www.youtube.com\/watch?v=FGDFAD3Jkuc."},{"key":"e_1_3_2_2_13_1","unstructured":"2021. Scanner. http:\/\/scanner.run\/."},{"key":"e_1_3_2_2_14_1","unstructured":"2021. Tears of Steel. https:\/\/www.youtube.com\/watch?v=tjgM6ckoz88."},{"key":"e_1_3_2_2_15_1","unstructured":"2021. Traffic Footage. https:\/\/www.youtube.com\/watch?v=MNn9qKG2UFI."},{"key":"e_1_3_2_2_16_1","volume-title":"Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation","author":"Abadi Mart\u00edn","unstructured":"Mart\u00edn Abadi , Paul Barham , Jianmin Chen , Zhifeng Chen , Andy Davis , Jeffrey Dean , Matthieu Devin , Sanjay Ghemawat , Geoffrey Irving , Michael Isard , and et al. 2016. TensorFlow: A System for Large-Scale Machine Learning . In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation ( Savannah, GA, USA) (OSDI'16). USENIX Association, USA, 265--283. Mart\u00edn Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, and et al. 2016. TensorFlow: A System for Large-Scale Machine Learning. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation (Savannah, GA, USA) (OSDI'16). 
USENIX Association, USA, 265--283."},{"key":"e_1_3_2_2_17_1","first-page":"10","article-title":"DBToaster: Higher-Order Delta Processing for Dynamic","volume":"5","author":"Ahmad Yanif","year":"2012","unstructured":"Yanif Ahmad , Oliver Kennedy , Christoph Koch , and Milos Nikolic . 2012 . DBToaster: Higher-Order Delta Processing for Dynamic , Frequently Fresh Views. Proc. VLDB Endow. 5 , 10 (June 2012), 968--979. https:\/\/doi.org\/10.14778\/2336664.2336670 10.14778\/2336664.2336670 Yanif Ahmad, Oliver Kennedy, Christoph Koch, and Milos Nikolic. 2012. DBToaster: Higher-Order Delta Processing for Dynamic, Frequently Fresh Views. Proc. VLDB Endow. 5, 10 (June 2012), 968--979. https:\/\/doi.org\/10.14778\/2336664.2336670","journal-title":"Frequently Fresh Views. Proc. VLDB Endow."},{"key":"e_1_3_2_2_18_1","volume-title":"CherryPick: Adaptively Unearthing the Best Cloud Configurations for Big Data Analytics. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17)","author":"Alipourfard Omid","year":"2017","unstructured":"Omid Alipourfard , Hongqiang Harry Liu , Jianshu Chen , Shivaram Venkataraman , Minlan Yu , and Ming Zhang . 2017 . CherryPick: Adaptively Unearthing the Best Cloud Configurations for Big Data Analytics. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17) . USENIX Association, Boston, MA, 469--482. https:\/\/www.usenix.org\/conference\/nsdi17\/technical-sessions\/presentation\/alipourfard Omid Alipourfard, Hongqiang Harry Liu, Jianshu Chen, Shivaram Venkataraman, Minlan Yu, and Ming Zhang. 2017. CherryPick: Adaptively Unearthing the Best Cloud Configurations for Big Data Analytics. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). USENIX Association, Boston, MA, 469--482. https:\/\/www.usenix.org\/conference\/nsdi17\/technical-sessions\/presentation\/alipourfard"},{"key":"e_1_3_2_2_19_1","unstructured":"Amazon Go 2021. Amazon Go. 
https:\/\/www.amazon.com\/b?ie=UTF8&node=16008589011.  Amazon Go 2021. Amazon Go. https:\/\/www.amazon.com\/b?ie=UTF8&node=16008589011."},{"key":"e_1_3_2_2_20_1","first-page":"58","article-title":"Real-Time Video Analytics","volume":"50","author":"Ananthanarayanan G.","year":"2017","unstructured":"G. Ananthanarayanan , P. Bahl , P. Bod\u00edk , K. Chintalapudi , M. Philipose , L. Ravindranath , and S. Sinha . 2017 . Real-Time Video Analytics : The Killer App for Edge Computing. Computer 50 , 10 (2017), 58 -- 67 . https:\/\/doi.org\/10.1109\/MC.2017.3641638 10.1109\/MC.2017.3641638 G. Ananthanarayanan, P. Bahl, P. Bod\u00edk, K. Chintalapudi, M. Philipose, L. Ravindranath, and S. Sinha. 2017. Real-Time Video Analytics: The Killer App for Edge Computing. Computer 50, 10 (2017), 58--67. https:\/\/doi.org\/10.1109\/MC.2017.3641638","journal-title":"The Killer App for Edge Computing. Computer"},{"key":"e_1_3_2_2_21_1","volume-title":"Effective Straggler Mitigation: Attack of the Clones. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13)","author":"Ananthanarayanan Ganesh","year":"2013","unstructured":"Ganesh Ananthanarayanan , Ali Ghodsi , Scott Shenker , and Ion Stoica . 2013 . Effective Straggler Mitigation: Attack of the Clones. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13) . USENIX Association, Lombard, IL, 185--198. https:\/\/www.usenix.org\/conference\/nsdi13\/technical-sessions\/presentation\/ananthanarayanan Ganesh Ananthanarayanan, Ali Ghodsi, Scott Shenker, and Ion Stoica. 2013. Effective Straggler Mitigation: Attack of the Clones. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13). USENIX Association, Lombard, IL, 185--198. 
https:\/\/www.usenix.org\/conference\/nsdi13\/technical-sessions\/presentation\/ananthanarayanan"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3267809.3267815"},{"key":"e_1_3_2_2_23_1","unstructured":"Artificial Intelligence Security Surveillance Cameras 2018. Artificial Intelligence Security Surveillance Cameras. https:\/\/www.theverge.com\/2018\/1\/23\/16907238\/artificial-intelligence-surveillance-cameras-security.  Artificial Intelligence Security Surveillance Cameras 2018. Artificial Intelligence Security Surveillance Cameras. https:\/\/www.theverge.com\/2018\/1\/23\/16907238\/artificial-intelligence-surveillance-cameras-security."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1122971.1122990"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3190508.3190532"},{"key":"e_1_3_2_2_26_1","volume-title":"The OpenCV Library. Dr. Dobb's Journal of Software Tools","author":"Bradski G.","year":"2000","unstructured":"G. Bradski . 2000. The OpenCV Library. Dr. Dobb's Journal of Software Tools ( 2000 ). G. Bradski. 2000. The OpenCV Library. Dr. Dobb's Journal of Software Tools (2000)."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/HCS49909.2020.9220622"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3419111.3421285"},{"key":"e_1_3_2_2_29_1","volume-title":"Clipper: A Low-Latency Online Prediction Serving System. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17)","author":"Crankshaw Daniel","year":"2017","unstructured":"Daniel Crankshaw , Xin Wang , Guilio Zhou , Michael J. Franklin , Joseph E. Gonzalez , and Ion Stoica . 2017 . Clipper: A Low-Latency Online Prediction Serving System. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17) . USENIX Association, Boston, MA, 613--627. 
https:\/\/www.usenix.org\/conference\/nsdi17\/technical-sessions\/presentation\/crankshaw Daniel Crankshaw, Xin Wang, Guilio Zhou, Michael J. Franklin, Joseph E. Gonzalez, and Ion Stoica. 2017. Clipper: A Low-Latency Online Prediction Serving System. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). USENIX Association, Boston, MA, 613--627. https:\/\/www.usenix.org\/conference\/nsdi17\/technical-sessions\/presentation\/crankshaw"},{"key":"e_1_3_2_2_30_1","volume-title":"Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation -","volume":"6","author":"Dean Jeffrey","year":"2004","unstructured":"Jeffrey Dean and Sanjay Ghemawat . 2004 . MapReduce: Simplified Data Processing on Large Clusters . In Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation - Volume 6 (San Francisco, CA) (OSDI'04). USENIX Association, USA, 10. Jeffrey Dean and Sanjay Ghemawat. 2004. MapReduce: Simplified Data Processing on Large Clusters. In Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation - Volume 6 (San Francisco, CA) (OSDI'04). USENIX Association, USA, 10."},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1561\/1900000001"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/SEC.2018.00029"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2168836.2168847"},{"key":"e_1_3_2_2_34_1","unstructured":"FFmpeg 2021. FFmpeg. https:\/\/ffmpeg.org\/.  FFmpeg 2021. FFmpeg. https:\/\/ffmpeg.org\/."},{"key":"e_1_3_2_2_35_1","volume-title":"Proceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference (Renton, WA, USA) (USENIX ATC '19). USENIX Association, USA, 475--488","author":"Fouladi Sadjad","year":"2019","unstructured":"Sadjad Fouladi , Francisco Romero , Dan Iter , Qian Li , Shuvo Chatterjee , Christos Kozyrakis , Matei Zaharia , and Keith Winstein . 2019 . 
From Laptop to Lambda: Outsourcing Everyday Jobs to Thousands of Transient Functional Containers . In Proceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference (Renton, WA, USA) (USENIX ATC '19). USENIX Association, USA, 475--488 . Sadjad Fouladi, Francisco Romero, Dan Iter, Qian Li, Shuvo Chatterjee, Christos Kozyrakis, Matei Zaharia, and Keith Winstein. 2019. From Laptop to Lambda: Outsourcing Everyday Jobs to Thousands of Transient Functional Containers. In Proceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference (Renton, WA, USA) (USENIX ATC '19). USENIX Association, USA, 475--488."},{"key":"e_1_3_2_2_36_1","volume-title":"Proceedings of the 14th USENIX Conference on Networked Systems Design and Implementation","author":"Fouladi Sadjad","year":"2017","unstructured":"Sadjad Fouladi , Riad S. Wahby , Brennan Shacklett , Karthikeyan Vasuki Balasubramaniam , William Zeng , Rahul Bhalerao , Anirudh Sivaraman , George Porter , and Keith Winstein . 2017 . Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads . In Proceedings of the 14th USENIX Conference on Networked Systems Design and Implementation ( Boston, MA, USA) (NSDI'17). USENIX Association, USA, 363--376. Sadjad Fouladi, Riad S. Wahby, Brennan Shacklett, Karthikeyan Vasuki Balasubramaniam, William Zeng, Rahul Bhalerao, Anirudh Sivaraman, George Porter, and Keith Winstein. 2017. Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads. In Proceedings of the 14th USENIX Conference on Networked Systems Design and Implementation (Boston, MA, USA) (NSDI'17). USENIX Association, USA, 363--376."},{"key":"e_1_3_2_2_37_1","volume-title":"Agilex Generation of Intel FPGAs. In 2020 IEEE Hot Chips 32 Symposium (HCS)","author":"Ganusov Ilya","year":"2020","unstructured":"Ilya Ganusov and Mahesh Iyer . 2020 . Agilex Generation of Intel FPGAs. In 2020 IEEE Hot Chips 32 Symposium (HCS) , Virtual , August 16-18, 2020. 
IEEE. Ilya Ganusov and Mahesh Iyer. 2020. Agilex Generation of Intel FPGAs. In 2020 IEEE Hot Chips 32 Symposium (HCS), Virtual, August 16-18, 2020. IEEE."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2019.2952113"},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2741948.2741968"},{"key":"e_1_3_2_2_40_1","volume-title":"Proceedings of the 21st International Middleware Conference","author":"Gunasekaran Jashwant Raj","unstructured":"Jashwant Raj Gunasekaran , Prashanth Thinakaran , Nachiappan C. Nachiappan , Mahmut Taylan Kandemir , and Chita R. Das . 2020. Fifer: Tackling Resource Underutilization in the Serverless Era . In Proceedings of the 21st International Middleware Conference ( Delft, Netherlands) (Middleware '20). Association for Computing Machinery, New York, NY, USA, 280--295. https:\/\/doi.org\/10.1145\/3423211.3425683 10.1145\/3423211.3425683 Jashwant Raj Gunasekaran, Prashanth Thinakaran, Nachiappan C. Nachiappan, Mahmut Taylan Kandemir, and Chita R. Das. 2020. Fifer: Tackling Resource Underutilization in the Serverless Era. In Proceedings of the 21st International Middleware Conference (Delft, Netherlands) (Middleware '20). Association for Computing Machinery, New York, NY, USA, 280--295. https:\/\/doi.org\/10.1145\/3423211.3425683"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.14778\/3402707.3402746"},{"key":"e_1_3_2_2_42_1","volume-title":"Focus: Querying Large Video Datasets with Low Latency and Low Cost. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18)","author":"Hsieh Kevin","year":"2018","unstructured":"Kevin Hsieh , Ganesh Ananthanarayanan , Peter Bodik , Shivaram Venkataraman , Paramvir Bahl , Matthai Philipose , Phillip B. Gibbons , and Onur Mutlu . 2018 . Focus: Querying Large Video Datasets with Low Latency and Low Cost. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18) . 
USENIX Association, Carlsbad, CA, 269--286. https:\/\/www.usenix.org\/conference\/osdi18\/presentation\/hsieh Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, and Onur Mutlu. 2018. Focus: Querying Large Video Datasets with Low Latency and Low Cost. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). USENIX Association, Carlsbad, CA, 269--286. https:\/\/www.usenix.org\/conference\/osdi18\/presentation\/hsieh"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/1272996.1273005"},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1136\/svn-2017-000101"},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3230543.3230574"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3127479.3128601"},{"key":"e_1_3_2_2_47_1","volume-title":"2017 ACM\/IEEE 44th Annual International Symposium on Computer Architecture (ISCA). 1--12","author":"Jouppi N. P.","unstructured":"N. P. Jouppi , C. Young , N. Patil , D. Patterson , G. Agrawal , R. Bajwa , S. Bates , S. Bhatia , N. Boden , A. Borchers , R. Boyle , P. Cantin , C. Chao , C. Clark , J. Coriell , M. Daley , M. Dau , J. Dean , B. Gelb , T. V. Ghaemmaghami , R. Gottipati , W. Gulland , R. Hagmann , C. R. Ho , D. Hogberg , J. Hu , R. Hundt , D. Hurt , J. Ibarz , A. Jaffey , A. Jaworski , A. Kaplan , H. Khaitan , D. Killebrew , A. Koch , N. Kumar , S. Lacy , J. Laudon , J. Law , D. Le , C. Leary , Z. Liu , K. Lucke , A. Lundin , G. MacKean , A. Maggiore , M. Mahony , K. Miller , R. Nagarajan , R. Narayanaswami , R. Ni , K. Nix , T. Norrie , M. Omernick , N. Penukonda , A. Phelps , J. Ross , M. Ross , A. Salek , E. Samadiani , C. Severn , G. Sizikov , M. Snelham , J. Souter , D. Steinberg , A. Swing , M. Tan , G. Thorson , B. Tian , H. Toma , E. Tuttle , V. Vasudevan , R. Walter , W. Wang , E. Wilcox , and D. H. Yoon . 2017. 
In-datacenter performance analysis of a tensor processing unit . In 2017 ACM\/IEEE 44th Annual International Symposium on Computer Architecture (ISCA). 1--12 . https:\/\/doi.org\/10.1145\/3079856.3080246 10.1145\/3079856.3080246 N. P. Jouppi, C. Young, N. Patil, D. Patterson, G. Agrawal, R. Bajwa, S. Bates, S. Bhatia, N. Boden, A. Borchers, R. Boyle, P. Cantin, C. Chao, C. Clark, J. Coriell, M. Daley, M. Dau, J. Dean, B. Gelb, T. V. Ghaemmaghami, R. Gottipati, W. Gulland, R. Hagmann, C. R. Ho, D. Hogberg, J. Hu, R. Hundt, D. Hurt, J. Ibarz, A. Jaffey, A. Jaworski, A. Kaplan, H. Khaitan, D. Killebrew, A. Koch, N. Kumar, S. Lacy, J. Laudon, J. Law, D. Le, C. Leary, Z. Liu, K. Lucke, A. Lundin, G. MacKean, A. Maggiore, M. Mahony, K. Miller, R. Nagarajan, R. Narayanaswami, R. Ni, K. Nix, T. Norrie, M. Omernick, N. Penukonda, A. Phelps, J. Ross, M. Ross, A. Salek, E. Samadiani, C. Severn, G. Sizikov, M. Snelham, J. Souter, D. Steinberg, A. Swing, M. Tan, G. Thorson, B. Tian, H. Toma, E. Tuttle, V. Vasudevan, R. Walter, W. Wang, E. Wilcox, and D. H. Yoon. 2017. In-datacenter performance analysis of a tensor processing unit. In 2017 ACM\/IEEE 44th Annual International Symposium on Computer Architecture (ISCA). 1--12. https:\/\/doi.org\/10.1145\/3079856.3080246"},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3302424.3303958"},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.14778\/3275536.3275537"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.5555\/1577069.1755843"},{"key":"e_1_3_2_2_51_1","volume-title":"Selecta: Heterogeneous Cloud Storage Configuration for Data Analytics. In 2018 USENIX Annual Technical Conference (USENIX ATC 18)","author":"Klimovic Ana","year":"2018","unstructured":"Ana Klimovic , Heiner Litz , and Christos Kozyrakis . 2018 . Selecta: Heterogeneous Cloud Storage Configuration for Data Analytics. In 2018 USENIX Annual Technical Conference (USENIX ATC 18) . 
USENIX Association, Boston, MA, 759--773. https:\/\/www.usenix.org\/conference\/atc18\/presentation\/klimovic-selecta Ana Klimovic, Heiner Litz, and Christos Kozyrakis. 2018. Selecta: Heterogeneous Cloud Storage Configuration for Data Analytics. In 2018 USENIX Annual Technical Conference (USENIX ATC 18). USENIX Association, Boston, MA, 759--773. https:\/\/www.usenix.org\/conference\/atc18\/presentation\/klimovic-selecta"},{"key":"e_1_3_2_2_52_1","volume-title":"Sol: Fast Distributed Computation Over Slow Networks. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20)","author":"Lai Fan","year":"2020","unstructured":"Fan Lai , Jie You , Xiangfeng Zhu , Harsha V. Madhyastha , and Mosharaf Chowdhury . 2020 . Sol: Fast Distributed Computation Over Slow Networks. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20) . USENIX Association, Santa Clara, CA, 273--288. https:\/\/www.usenix.org\/conference\/nsdi20\/presentation\/lai Fan Lai, Jie You, Xiangfeng Zhu, Harsha V. Madhyastha, and Mosharaf Chowdhury. 2020. Sol: Fast Distributed Computation Over Slow Networks. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20). USENIX Association, Santa Clara, CA, 273--288. https:\/\/www.usenix.org\/conference\/nsdi20\/presentation\/lai"},{"key":"e_1_3_2_2_53_1","volume-title":"Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation","author":"Mahajan Kshiteej","year":"2018","unstructured":"Kshiteej Mahajan , Mosharaf Chowdhury , Aditya Akella , and Shuchi Chawla . 2018 . Dynamic Query Re-Planning Using QOOP . In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation ( Carlsbad, CA, USA) (OSDI'18). USENIX Association, USA, 253--267. Kshiteej Mahajan, Mosharaf Chowdhury, Aditya Akella, and Shuchi Chawla. 2018. Dynamic Query Re-Planning Using QOOP. 
In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation (Carlsbad, CA, USA) (OSDI'18). USENIX Association, USA, 253--267."},{"key":"e_1_3_2_2_54_1","volume-title":"OPTIMUSCLOUD: Heterogeneous Configuration Optimization for Distributed Databases in the Cloud. In 2020 USENIX Annual Technical Conference (USENIX ATC 20)","author":"Mahgoub Ashraf","year":"2020","unstructured":"Ashraf Mahgoub , Alexander Michaelson Medoff , Rakesh Kumar , Subrata Mitra , Ana Klimovic , Somali Chaterji , and Saurabh Bagchi . 2020 . OPTIMUSCLOUD: Heterogeneous Configuration Optimization for Distributed Databases in the Cloud. In 2020 USENIX Annual Technical Conference (USENIX ATC 20) . USENIX Association, 189--203. https:\/\/www.usenix.org\/conference\/atc20\/presentation\/mahgoub Ashraf Mahgoub, Alexander Michaelson Medoff, Rakesh Kumar, Subrata Mitra, Ana Klimovic, Somali Chaterji, and Saurabh Bagchi. 2020. OPTIMUSCLOUD: Heterogeneous Configuration Optimization for Distributed Databases in the Cloud. In 2020 USENIX Annual Technical Conference (USENIX ATC 20). USENIX Association, 189--203. https:\/\/www.usenix.org\/conference\/atc20\/presentation\/mahgoub"},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807184"},{"key":"e_1_3_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.14778\/3415478.3415568"},{"key":"e_1_3_2_2_57_1","volume-title":"Proceedings of the 2nd International Conference on Supercomputing (St","author":"Mirchandaney R.","unstructured":"R. Mirchandaney , J. H. Saltz , R. M. Smith , D. M. Nico , and K. Crowley . 1988. Principles of Runtime Support for Parallel Processors . In Proceedings of the 2nd International Conference on Supercomputing (St . Malo, France) (ICS '88). Association for Computing Machinery, New York, NY, USA, 140--152. https:\/\/doi.org\/10.1145\/55364.55378 10.1145\/55364.55378 R. Mirchandaney, J. H. Saltz, R. M. Smith, D. M. Nico, and K. Crowley. 1988. 
Principles of Runtime Support for Parallel Processors. In Proceedings of the 2nd International Conference on Supercomputing (St. Malo, France) (ICS '88). Association for Computing Machinery, New York, NY, USA, 140--152. https:\/\/doi.org\/10.1145\/55364.55378"},{"key":"e_1_3_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522738"},{"key":"e_1_3_2_2_59_1","volume-title":"Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation","author":"Murray Derek G.","year":"2011","unstructured":"Derek G. Murray , Malte Schwarzkopf , Christopher Smowton , Steven Smith , Anil Madhavapeddy , and Steven Hand . 2011 . CIEL: A Universal Execution Engine for Distributed Data-Flow Computing . In Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation ( Boston, MA) (NSDI'11). USENIX Association, USA, 113--126. Derek G. Murray, Malte Schwarzkopf, Christopher Smowton, Steven Smith, Anil Madhavapeddy, and Steven Hand. 2011. CIEL: A Universal Execution Engine for Distributed Data-Flow Computing. In Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation (Boston, MA) (NSDI'11). USENIX Association, USA, 113--126."},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522716"},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201394"},{"key":"e_1_3_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522715"},{"key":"e_1_3_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341301.3359658"},{"key":"e_1_3_2_2_64_1","doi-asserted-by":"publisher","DOI":"10.14778\/3368289.3368296"},{"key":"e_1_3_2_2_65_1","doi-asserted-by":"publisher","DOI":"10.14778\/3339490.3339503"},{"key":"e_1_3_2_2_66_1","volume-title":"Michael A. Kozuch, Mor Harchol-Balter, and Gregory R. Ganger.","author":"Tumanov Alexey","year":"2016","unstructured":"Alexey Tumanov , Timothy Zhu , Jun Woo Park , Michael A. 
Kozuch, Mor Harchol-Balter, and Gregory R. Ganger. 2016 . TetriSched: Global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters. In Proceedings of the 11th European Conference on Computer Systems, EuroSys 2016 (Proceedings of the 11th European Conference on Computer Systems, EuroSys 2016). Association for Computing Machinery , Inc. https:\/\/doi.org\/10.1145\/2901318.2901355 11th European Conference on Computer Systems, EuroSys 2016; Conference date: 18-04-2016 Through 21-04-2016. 10.1145\/2901318.2901355 Alexey Tumanov, Timothy Zhu, Jun Woo Park, Michael A. Kozuch, Mor Harchol-Balter, and Gregory R. Ganger. 2016. TetriSched: Global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters. In Proceedings of the 11th European Conference on Computer Systems, EuroSys 2016 (Proceedings of the 11th European Conference on Computer Systems, EuroSys 2016). Association for Computing Machinery, Inc. https:\/\/doi.org\/10.1145\/2901318.2901355 11th European Conference on Computer Systems, EuroSys 2016; Conference date: 18-04-2016 Through 21-04-2016."},{"key":"e_1_3_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.5555\/2685048.2685072"},{"key":"e_1_3_2_2_68_1","volume-title":"Ernest: Efficient Performance Prediction for Large-Scale Advanced Analytics. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16)","author":"Venkataraman Shivaram","year":"2016","unstructured":"Shivaram Venkataraman , Zongheng Yang , Michael Franklin , Benjamin Recht , and Ion Stoica . 2016 . Ernest: Efficient Performance Prediction for Large-Scale Advanced Analytics. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16) . USENIX Association, Santa Clara, CA, 363--378. https:\/\/www.usenix.org\/conference\/nsdi16\/technical-sessions\/presentation\/venkataraman Shivaram Venkataraman, Zongheng Yang, Michael Franklin, Benjamin Recht, and Ion Stoica. 2016. 
Ernest: Efficient Performance Prediction for Large-Scale Advanced Analytics. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16). USENIX Association, Santa Clara, CA, 363--378. https:\/\/www.usenix.org\/conference\/nsdi16\/technical-sessions\/presentation\/venkataraman"},{"key":"e_1_3_2_2_69_1","volume-title":"Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data","author":"Stratis","unstructured":"Stratis D. Viglas and Jeffrey F. Naughton. 2002. Rate-Based Query Optimization for Streaming Information Sources. In Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data (Madison, Wisconsin) (SIGMOD '02). Association for Computing Machinery, New York, NY, USA, 37--48. https:\/\/doi.org\/10.1145\/564691.564697"},{"key":"e_1_3_2_2_70_1","volume-title":"Xilinx Versal Premium Series. In 2020 IEEE Hot Chips 32 Symposium (HCS)","author":"Voogel Martin","year":"2020","unstructured":"Martin Voogel, Yohan Frans, and Matt Ouellette. 2020. Xilinx Versal Premium Series. In 2020 IEEE Hot Chips 32 Symposium (HCS), Virtual, August 16-18, 2020. IEEE."},
{"key":"e_1_3_2_2_71_1","volume-title":"Peeking Behind the Curtains of Serverless Platforms. In 2018 USENIX Annual Technical Conference (USENIX ATC 18)","author":"Wang Liang","year":"2018","unstructured":"Liang Wang, Mengyuan Li, Yinqian Zhang, Thomas Ristenpart, and Michael Swift. 2018. Peeking Behind the Curtains of Serverless Platforms. In 2018 USENIX Annual Technical Conference (USENIX ATC 18). USENIX Association, Boston, MA, 133--146. https:\/\/www.usenix.org\/conference\/atc18\/presentation\/wang-liang"},{"key":"e_1_3_2_2_72_1","doi-asserted-by":"publisher","DOI":"10.14778\/3421424.3421427"},{"key":"e_1_3_2_2_73_1","volume-title":"VideoChef: Efficient Approximation for Streaming Video Processing Pipelines. In 2018 USENIX Annual Technical Conference (USENIX ATC 18)","author":"Xu Ran","year":"2018","unstructured":"Ran Xu, Jinkyu Koo, Rakesh Kumar, Peter Bai, Subrata Mitra, Sasa Misailovic, and Saurabh Bagchi. 2018. VideoChef: Efficient Approximation for Streaming Video Processing Pipelines. In 2018 USENIX Annual Technical Conference (USENIX ATC 18). USENIX Association, Boston, MA, 43--56. https:\/\/www.usenix.org\/conference\/atc18\/presentation\/xu-ran"},{"key":"e_1_3_2_2_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/2670979.2671005"},
{"key":"e_1_3_2_2_75_1","volume-title":"Proceedings of the 2017 Symposium on Cloud Computing","author":"Yadwadkar Neeraja J.","unstructured":"Neeraja J. Yadwadkar, Bharath Hariharan, Joseph E. Gonzalez, Burton Smith, and Randy H. Katz. 2017. Selecting the Best VM Across Multiple Public Clouds: A Data-driven Performance Modeling Approach. In Proceedings of the 2017 Symposium on Cloud Computing (Santa Clara, California) (SoCC '17). ACM, 452--465. https:\/\/doi.org\/10.1145\/3127479.3131614"},{"key":"e_1_3_2_2_76_1","volume-title":"Efficient Algorithms for Web Services Selection with End-to-End QoS Constraints. ACM Trans. Web","author":"Yu Tao","year":"2007","unstructured":"Tao Yu, Yue Zhang, and Kwei-Jay Lin. 2007. Efficient Algorithms for Web Services Selection with End-to-End QoS Constraints. ACM Trans. Web (2007)."},{"key":"e_1_3_2_2_77_1","volume-title":"Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing","author":"Zaharia Matei","year":"2010","unstructured":"Matei Zaharia, Mosharaf Chowdhury, Michael J. Franklin, Scott Shenker, and Ion Stoica. 2010. Spark: Cluster Computing with Working Sets. In Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing (Boston, MA) (HotCloud'10). USENIX Association, USA, 10."},
{"key":"e_1_3_2_2_78_1","volume-title":"14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17)","author":"Zhang Haoyu","unstructured":"Haoyu Zhang, Ganesh Ananthanarayanan, Peter Bodik, Matthai Philipose, Paramvir Bahl, and Michael J. Freedman. 2017. Live Video Analytics at Scale with Approximation and Delay-Tolerance. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). USENIX Association, Boston, MA, 377--392. https:\/\/www.usenix.org\/conference\/nsdi17\/technical-sessions\/presentation\/zhang"}],"event":{"name":"SoCC '21: ACM Symposium on Cloud Computing","location":"Seattle WA USA","acronym":"SoCC '21","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGOPS ACM Special Interest Group on Operating Systems"]},"container-title":["Proceedings of the ACM Symposium on Cloud Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472883.3486972","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472883.3486972","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:11:57Z","timestamp":1750191117000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472883.3486972"}},"subtitle":["A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics 
Pipelines"],"short-title":[],"issued":{"date-parts":[[2021,11]]},"references-count":78,"alternative-id":["10.1145\/3472883.3486972","10.1145\/3472883"],"URL":"https:\/\/doi.org\/10.1145\/3472883.3486972","relation":{},"subject":[],"published":{"date-parts":[[2021,11]]},"assertion":[{"value":"2021-11-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}