{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T11:31:17Z","timestamp":1763724677533,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":84,"publisher":"ACM","license":[{"start":{"date-parts":[[2024,11,27]],"date-time":"2024-11-27T00:00:00Z","timestamp":1732665600000},"content-version":"vor","delay-in-days":366,"URL":"http:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"NSF (National Science Foundation)","doi-asserted-by":"publisher","award":["2153422"],"award-info":[{"award-number":["2153422"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,11,27]]},"DOI":"10.1145\/3590140.3592850","type":"proceedings-article","created":{"date-parts":[[2023,11,24]],"date-time":"2023-11-24T18:06:33Z","timestamp":1700849193000},"page":"29-42","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Smartpick"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5578-7109","authenticated-orcid":false,"given":"Anshuman Das","family":"Mohapatra","sequence":"first","affiliation":[{"name":"University of Nebraska at Omaha, Omaha, Nebraska, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3281-7325","authenticated-orcid":false,"given":"Kwangsung","family":"Oh","sequence":"additional","affiliation":[{"name":"University of Nebraska at Omaha, Omaha, Nebraska, USA"}]}],"member":"320","published-online":{"date-parts":[[2023,11,27]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517892"},{"key":"e_1_3_2_1_2_1","volume-title":"CherryPick: Adaptively Unearthing the Best Cloud Configurations for Big Data Analytics. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17)","author":"Alipourfard Omid","year":"2017","unstructured":"Omid Alipourfard, Hongqiang Harry Liu, Jianshu Chen, Shivaram Venkataraman, Minlan Yu, and Ming Zhang. 2017. CherryPick: Adaptively Unearthing the Best Cloud Configurations for Big Data Analytics. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). USENIX Association, Boston, MA, 469--482. https:\/\/www.usenix.org\/conference\/nsdi17\/technical-sessions\/presentation\/alipourfard"},{"key":"e_1_3_2_1_3_1","unstructured":"Amazon AWS. [n.d.]. https:\/\/aws.amazon.com\/lambda\/."},{"key":"e_1_3_2_1_4_1","unstructured":"Amazon AWS. [n.d.]. https:\/\/aws.amazon.com\/."},{"key":"e_1_3_2_1_5_1","unstructured":"Amazon AWS. [n.d.]. https:\/\/aws.amazon.com\/ec2\/instance-types\/t3\/."},{"key":"e_1_3_2_1_6_1","unstructured":"Amazon AWS. [n.d.]. https:\/\/aws.amazon.com\/lambda\/pricing\/."},{"key":"e_1_3_2_1_7_1","unstructured":"Amazon AWS. [n.d.]. AWS SDK for Java. https:\/\/aws.amazon.com\/sdk-for-java\/."},{"key":"e_1_3_2_1_8_1","unstructured":"Amazon Simple Storage Service. [n.d.]. https:\/\/aws.amazon.com\/s3\/."},{"key":"e_1_3_2_1_9_1","first-page":"12","article-title":"Disk-locality in datacenter computing considered irrelevant","volume":"13","author":"Ananthanarayanan Ganesh","year":"2011","unstructured":"Ganesh Ananthanarayanan, Ali Ghodsi, Scott Shenker, and Ion Stoica. 2011. Disk-locality in datacenter computing considered irrelevant.. In HotOS, Vol. 13. 12--12.","journal-title":"HotOS"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","unstructured":"Omer Anisfeld Erez Biton Ruven Milshtein Mark Shifrin and Omer Gurewitz. 2018. Scaling of Cloud Resources-Principal Component Analysis and Random Forest Approach. In 2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE). 1--5. https:\/\/doi.org\/10.1109\/ICSEE.2018.8646134","DOI":"10.1109\/ICSEE.2018.8646134"},{"key":"e_1_3_2_1_11_1","unstructured":"Apache Hadoop. [n.d.]. https:\/\/hadoop.apache.org\/."},{"key":"e_1_3_2_1_12_1","unstructured":"Apache OpenWhisk. [n.d.]. http:\/\/https:\/\/openwhisk.apache.org\/."},{"key":"e_1_3_2_1_13_1","unstructured":"Apache Spark. [n.d.]. https:\/\/spark.apache.org\/docs\/2.2.1\/."},{"key":"e_1_3_2_1_14_1","unstructured":"Apache Thrift. [n.d.]. https:\/\/thrift.apache.org\/."},{"key":"e_1_3_2_1_15_1","unstructured":"Avantika Monnappa. 2022. https:\/\/www.simplilearn.com\/how-facebook-isusing-big-data-article."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","unstructured":"Janki Bhimani Ningfang Mi Miriam Leeser and Zhengyu Yang. 2017. FiM: Performance Prediction Model for Parallel Computation in Iterative Data Processing Applications. https:\/\/doi.org\/10.1109\/CLOUD.2017.53","DOI":"10.1109\/CLOUD.2017.53"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3419111.3421305"},{"key":"e_1_3_2_1_18_1","volume-title":"FMI: The FaaS Message Interface. Master's thesis. ETH Zurich.","author":"B\u00f6hringer Roman","year":"2022","unstructured":"Roman B\u00f6hringer. 2022. FMI: The FaaS Message Interface. Master's thesis. ETH Zurich."},{"key":"e_1_3_2_1_19_1","unstructured":"Branka Vuleta. 2021. https:\/\/seedscientific.com\/how-much-data-is-created-every-day\/."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2644865.2541941"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/SCC.2012.47"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2168836.2168847"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCE50343.2020.9290700"},{"key":"e_1_3_2_1_24_1","unstructured":"Google Cloud. [n.d.]. https:\/\/cloud.google.com\/functions."},{"key":"e_1_3_2_1_25_1","unstructured":"Google Cloud. [n.d.]. https:\/\/cloud.google.com\/."},{"key":"e_1_3_2_1_26_1","unstructured":"Google Cloud. [n.d.]. https:\/\/cloud.google.com\/functions\/docs\/concepts\/execution-environment#file_system."},{"key":"e_1_3_2_1_27_1","unstructured":"Google Cloud. [n.d.]. Java Cloud Client Libraries. https:\/\/cloud.google.com\/java\/docs\/reference\/."},{"volume-title":"Spock: Exploiting Serverless Functions for SLO and Cost Aware Resource Procurement in Public Cloud. In 2019 IEEE 12th International Conference on Cloud Computing (CLOUD). 199--208","author":"Gunasekaran J. R.","key":"e_1_3_2_1_28_1","unstructured":"J. R. Gunasekaran, P. Thinakaran, M. T. Kandemir, B. Urgaonkar, G. Kesidis, and C. Das. 2019. Spock: Exploiting Serverless Functions for SLO and Cost Aware Resource Procurement in Public Cloud. In 2019 IEEE 12th International Conference on Cloud Computing (CLOUD). 199--208."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2019.00043"},{"key":"e_1_3_2_1_30_1","unstructured":"Jianwei Hao Ting Jiang Wei Wang and In Kee Kim. 2021. An Empirical Analysis of VM Startup Times in Public IaaS Clouds: An Extended Report."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3423211.3425695"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.beth.2020.05.002"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3492323.3495628"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3127479.3128601"},{"key":"e_1_3_2_1_35_1","volume-title":"Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation","author":"Jyothi Sangeetha Abdu","year":"2016","unstructured":"Sangeetha Abdu Jyothi, Carlo Curino, Ishai Menache, Shravan Matthur Narayanamurthy, Alexey Tumanov, Jonathan Yaniv, Ruslan Mavlyutov, \u00cd\u00f1igo Goiri, Subru Krishnan, Janardhan Kulkarni, and Sriram Rao. 2016. Morpheus: Towards Automated SLOs for Enterprise Clusters. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation (Savannah, GA, USA) (OSDI'16). USENIX Association, USA, 117--134."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","unstructured":"Takuya Kanazawa. 2021. One-parameter family of acquisition functions for efficient global optimization. https:\/\/doi.org\/10.48550\/ARXIV.2104.12363","DOI":"10.48550\/ARXIV.2104.12363"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCC.2017.2780159"},{"volume-title":"Serverless Data Analytics with Flint. In 2018 IEEE 11th International Conference on Cloud Computing (CLOUD). 451--455","author":"Kim Y.","key":"e_1_3_2_1_38_1","unstructured":"Y. Kim and J. Lin. 2018. Serverless Data Analytics with Flint. In 2018 IEEE 11th International Conference on Cloud Computing (CLOUD). 451--455."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.15547\/tjs.2015.s.01.067"},{"key":"e_1_3_2_1_40_1","volume-title":"Proceedings of the 2018 USENIX Conference on Usenix Annual Technical Conference (Boston, MA, USA) (USENIX ATC '18). USENIX Association","author":"Klimovic Ana","year":"2018","unstructured":"Ana Klimovic, Heiner Litz, and Christos Kozyrakis. 2018. Selecta: Heterogeneous Cloud Storage Configuration for Data Analytics. In Proceedings of the 2018 USENIX Conference on Usenix Annual Technical Conference (Boston, MA, USA) (USENIX ATC '18). USENIX Association, Berkeley, CA, USA, 759--773. http:\/\/dl.acm.org\/citation.cfm?id=3277355.3277429"},{"key":"e_1_3_2_1_41_1","volume-title":"Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation","author":"Klimovic Ana","year":"2018","unstructured":"Ana Klimovic, Yawen Wang, Patrick Stuedi, Animesh Trivedi, Jonas Pfefferle, and Christos Kozyrakis. 2018. Pocket: Elastic Ephemeral Storage for Serverless Analytics. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation (Carlsbad, CA, USA) (OSDI'18). USENIX Association, Berkeley, CA, USA, 427--444. http:\/\/dl.acm.org\/citation.cfm?id=3291168.3291200"},{"key":"e_1_3_2_1_42_1","unstructured":"Alexey Kopytov. 2021. sysbench. https:\/\/github.com\/akopytov\/sysbench."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/IC2E48712.2020.00023"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2018.00062"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1155\/2016\/5635673"},{"key":"e_1_3_2_1_46_1","unstructured":"Lu Zhang and Diuto Malife. [n.d.]. https:\/\/blog.twitter.com\/engineering\/en_us\/\\topics\/infrastructure\/2021\/processing-billions-of-events-in-real-time-at-twitter-."},{"key":"e_1_3_2_1_47_1","volume-title":"OPTIMUSCLOUD: Heterogeneous Configuration Optimization for Distributed Databases in the Cloud. In 2020 USENIX Annual Technical Conference (USENIX ATC 20)","author":"Mahgoub Ashraf","year":"2020","unstructured":"Ashraf Mahgoub, Alexander Michaelson Medoff, Rakesh Kumar, Subrata Mitra, Ana Klimovic, Somali Chaterji, and Saurabh Bagchi. 2020. OPTIMUSCLOUD: Heterogeneous Configuration Optimization for Distributed Databases in the Cloud. In 2020 USENIX Annual Technical Conference (USENIX ATC 20). USENIX Association, 189--203. https:\/\/www.usenix.org\/conference\/atc20\/presentation\/mahgoub"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2012.103"},{"key":"e_1_3_2_1_49_1","unstructured":"MathWorks. 2022. Bayesian Optimization Algorithm. https:\/\/www.mathworks.com\/help\/stats\/bayesian-optimization-algorithm.html."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/SYSCON.2011.5929050"},{"key":"e_1_3_2_1_51_1","unstructured":"Microsoft Azure. [n.d.]. https:\/\/azure.microsoft.com\/en-us\/services\/functions\/."},{"key":"e_1_3_2_1_52_1","unstructured":"Daniel William Moyer. 2021. Punching Holes in the Cloud: Direct Communication between Serverless Functions Using NAT Traversal. Ph.D. Dissertation. Virginia Tech."},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.5121\/ijcsit.2019.11404"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.5555\/1182635.1164217"},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS53621.2022.00109"},{"key":"e_1_3_2_1_56_1","unstructured":"Rodolphe Le Riche Nicolas Durrande. 2017. Introduction to Gaussian Process Surrogate Models."},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/RAEE.2019.8887075"},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMSNETS.2019.8711058"},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/IC2E52221.2021.00025"},{"key":"e_1_3_2_1_60_1","unstructured":"OpenJDK. [n.d.]. JDK 8. https:\/\/openjdk.org\/projects\/jdk8\/."},{"key":"e_1_3_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3190508.3190517"},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.5555\/1325851.1325979"},{"key":"e_1_3_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3127479.3128603"},{"key":"e_1_3_2_1_64_1","volume-title":"Fast and Slow: Scalable Analytics on Serverless Infrastructure. In 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI 19)","author":"Pu Qifan","year":"2019","unstructured":"Qifan Pu, Shivaram Venkataraman, and Ion Stoica. 2019. Shuffling, Fast and Slow: Scalable Analytics on Serverless Infrastructure. In 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI 19). USENIX Association, Boston, MA, 193--206. https:\/\/www.usenix.org\/conference\/nsdi19\/presentation\/pu"},{"key":"e_1_3_2_1_65_1","unstructured":"PyPI. [n.d.]. sql-metadata. https:\/\/pypi.org\/project\/sql-metadata\/."},{"key":"e_1_3_2_1_66_1","unstructured":"Python. [n.d.]. https:\/\/www.python.org\/."},{"key":"e_1_3_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/2987550.2987566"},{"key":"e_1_3_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/IC2E52221.2021.00028"},{"key":"e_1_3_2_1_69_1","unstructured":"Redis. [n.d.]. https:\/\/redis.io\/."},{"key":"e_1_3_2_1_70_1","volume-title":"Workload time series prediction in storage systems: a deep learning based approach. Cluster Computing","author":"Ruan Li","year":"2021","unstructured":"Li Ruan, Yu Bai, Shaoning Li, Shuibing He, and Limin Xiao. 2021. Workload time series prediction in storage systems: a deep learning based approach. Cluster Computing (2021), 1--11."},{"key":"e_1_3_2_1_71_1","unstructured":"Vaishaal Shankar Karl Krauth Qifan Pu Eric Jonas Shivaram Venkataraman Ion Stoica Benjamin Recht and Jonathan Ragan-Kelley. 2018. numpywren: serverless linear algebra. arXiv:1810.09679 [cs.DC]"},{"key":"e_1_3_2_1_72_1","unstructured":"Spark on Lambda. [n.d.]. https:\/\/github.com\/qubole\/spark-on-lambda\/."},{"key":"e_1_3_2_1_73_1","unstructured":"TPC. [n.d.]. TPC-H Vesion 2 and Version 3."},{"key":"e_1_3_2_1_74_1","unstructured":"RIP Tutorial. [n.d.]. Word Count Example in Hive."},{"key":"e_1_3_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNSM.2016.2569000"},{"key":"e_1_3_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIT.2017.58"},{"key":"e_1_3_2_1_77_1","volume-title":"Ernest: Efficient Performance Prediction for Large-Scale Advanced Analytics. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16)","author":"Venkataraman Shivaram","year":"2016","unstructured":"Shivaram Venkataraman, Zongheng Yang, Michael Franklin, Benjamin Recht, and Ion Stoica. 2016. Ernest: Efficient Performance Prediction for Large-Scale Advanced Analytics. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16). USENIX Association, Santa Clara, CA, 363--378. https:\/\/www.usenix.org\/conference\/nsdi16\/technical-sessions\/presentation\/venkataraman"},{"key":"e_1_3_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1145\/1998582.1998637"},{"key":"e_1_3_2_1_79_1","doi-asserted-by":"publisher","unstructured":"Michael Wawrzoniak Ingo M\u00fcller Rodrigo Bruno Ana Klimovic and Gustavo Alonso. 2022. Short-lived Datacenter. https:\/\/doi.org\/10.48550\/ARXIV.2202.06646","DOI":"10.48550\/ARXIV.2202.06646"},{"key":"e_1_3_2_1_80_1","doi-asserted-by":"publisher","unstructured":"Mike Wawrzoniak Ingo M\u00fcller Rodrigo Fraga Barcelos Paulus Bruno and Gustavo Alonso. 2021-01. Boxer: Data Analytics on Network-enabled Serverless Platforms. https:\/\/doi.org\/10.3929\/ethz-b-000456492 11th Annual Conference on Innovative Data Systems Research (CIDR 2021); Conference Location: online; Conference Date: January 11-15 2021; The conference lecture was held on January 12 2021. Due to the Coronavirus (COVID-19) the conference was conducted virtually.","DOI":"10.3929\/ethz-b-000456492"},{"key":"e_1_3_2_1_81_1","doi-asserted-by":"publisher","unstructured":"James T. Wilson Frank Hutter and Marc Peter Deisenroth. 2018. Maximizing acquisition functions for Bayesian optimization. https:\/\/doi.org\/10.48550\/ARXIV.1805.10196","DOI":"10.48550\/ARXIV.1805.10196"},{"key":"e_1_3_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1145\/3127479.3131614"},{"key":"e_1_3_2_1_83_1","unstructured":"Zach. 2019. Understanding the Standard Error of the Regression. https:\/\/www.statology.org\/standard-error-regression\/."},{"key":"e_1_3_2_1_84_1","volume-title":"SLO-Aware Machine Learning Inference Serving. In 2019 USENIX Annual Technical Conference (USENIX ATC 19)","author":"Zhang Chengliang","year":"2019","unstructured":"Chengliang Zhang, Minchen Yu, Wei Wang, and Feng Yan. 2019. MArk: Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving. In 2019 USENIX Annual Technical Conference (USENIX ATC 19). USENIX Association, Renton, WA, 1049--1062. https:\/\/www.usenix.org\/conference\/atc19\/presentation\/zhang-chengliang"}],"event":{"name":"Middleware '23: 24th International Middleware Conference","sponsor":["ACM Association for Computing Machinery","IFIP International Federation for Information Processing"],"location":"Bologna Italy","acronym":"Middleware '23"},"container-title":["Proceedings of the 24th International Middleware Conference on ZZZ"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3590140.3592850","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/abs\/10.1145\/3590140.3592850","content-type":"text\/html","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3590140.3592850","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3590140.3592850","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,29]],"date-time":"2025-08-29T16:52:29Z","timestamp":1756486349000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3590140.3592850"}},"subtitle":["Workload Prediction for Serverless-enabled Scalable Data Analytics Systems"],"short-title":[],"issued":{"date-parts":[[2023,11,27]]},"references-count":84,"alternative-id":["10.1145\/3590140.3592850","10.1145\/3590140"],"URL":"https:\/\/doi.org\/10.1145\/3590140.3592850","relation":{},"subject":[],"published":{"date-parts":[[2023,11,27]]},"assertion":[{"value":"2023-11-27","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}