{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,14]],"date-time":"2026-04-14T16:03:46Z","timestamp":1776182626284,"version":"3.50.1"},"reference-count":90,"publisher":"Association for Computing Machinery (ACM)","issue":"8","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62325201, 62425203, and 62032003"],"award-info":[{"award-number":["62325201, 62425203, and 62032003"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Beijing Natural Science Foundation","award":["4244076"],"award-info":[{"award-number":["4244076"]}]},{"DOI":"10.13039\/501100005024","name":"Beijing Postdoctoral Research Foundation","doi-asserted-by":"crossref","award":["2024-ZZ-20"],"award-info":[{"award-number":["2024-ZZ-20"]}],"id":[{"id":"10.13039\/501100005024","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Softw. Eng. Methodol."],"published-print":{"date-parts":[[2025,11,30]]},"abstract":"<jats:p>\n            Serverless computing is a popular cloud computing paradigm that has found widespread adoption across various online workloads. It allows software engineers to develop cloud applications as a set of functions (called\n            <jats:italic toggle=\"yes\">serverless functions<\/jats:italic>\n            ). However, accurately measuring the performance (i.e., end-to-end response latency) of serverless functions is challenging due to the highly dynamic nature of the environment in which they run. To tackle this problem, a potential solution is to apply checks of performance testing techniques to determine how many repetitions of a given serverless function across a range of inputs are needed to cater to the performance fluctuation. However, the available literature lacks performance testing approaches designed explicitly for serverless computing. In this article, we propose the first serverless computing-oriented performance testing (SCOPE) approach.\n            <jats:italic toggle=\"yes\">SCOPE<\/jats:italic>\n            takes into account the unique performance characteristics of serverless functions, such as their short execution durations and on-demand triggering. As such,\n            <jats:italic toggle=\"yes\">SCOPE<\/jats:italic>\n            is designed as a fine-grained analysis approach.\n            <jats:italic toggle=\"yes\">SCOPE<\/jats:italic>\n            incorporates the accuracy check and the consistency check to obtain the accurate and reliable performance of serverless functions. The evaluation shows that\n            <jats:italic toggle=\"yes\">SCOPE<\/jats:italic>\n            provides testing results with 97.25% accuracy, 33.83 percentage points higher than the best currently available technique. Moreover, the superiority of\n            <jats:italic toggle=\"yes\">SCOPE<\/jats:italic>\n            over the state-of-the-art holds on all functions that we study.\n          <\/jats:p>","DOI":"10.1145\/3717609","type":"journal-article","created":{"date-parts":[[2025,2,14]],"date-time":"2025-02-14T09:07:46Z","timestamp":1739524066000},"page":"1-30","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["<i>SCOPE<\/i>\n            : Performance Testing for Serverless Computing"],"prefix":"10.1145","volume":"34","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3023-1005","authenticated-orcid":false,"given":"Jinfeng","family":"Wen","sequence":"first","affiliation":[{"name":"Beijing University of Posts and Telecommunications, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4765-1893","authenticated-orcid":false,"given":"Zhenpeng","family":"Chen","sequence":"additional","affiliation":[{"name":"Nanyang Technological University, Singapore, Singapore"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-8208-7304","authenticated-orcid":false,"given":"Jianshu","family":"Zhao","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9146-442X","authenticated-orcid":false,"given":"Federica","family":"Sarro","sequence":"additional","affiliation":[{"name":"University College London, London, United Kingdom of Great Britain and Northern Ireland"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7947-7826","authenticated-orcid":false,"given":"Haodi","family":"Ping","sequence":"additional","affiliation":[{"name":"Beijing University of Technology, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-6924-2319","authenticated-orcid":false,"given":"Ying","family":"Zhang","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7245-1298","authenticated-orcid":false,"given":"Shangguang","family":"Wang","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunications, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7908-8484","authenticated-orcid":false,"given":"Xuanzhe","family":"Liu","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2025,10,4]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"2024. 2018 serverless community survey: Huge growth in serverless usage. Retrieved from https:\/\/www.serverless.com\/blog\/2018-serverless-community-survey-huge-growth-usage"},{"key":"e_1_3_1_3_2","unstructured":"2024. AWS Lambda. Retrieved from https:\/\/docs.aws.amazon.com\/lambda"},{"key":"e_1_3_1_4_2","unstructured":"2024. AWS serverless application repository. Retrieved from https:\/\/serverlessrepo.aws.amazon.com\/applications"},{"key":"e_1_3_1_5_2","unstructured":"2024. Block bootstrapping. Retrieved from https:\/\/medium.com\/@jcatankard_76170\/block-bootstrapping-with-time-series-and-spatial-data-bd7d7830681e"},{"key":"e_1_3_1_6_2","unstructured":"2024. Bootstrapping (statistics). Retrieved from https:\/\/en.wikipedia.org\/wiki\/Bootstrapping_(statistics)"},{"key":"e_1_3_1_7_2","unstructured":"2024. Comparison of cold starts in serverless functions across AWS Azure and GCP. Retrieved from https:\/\/mikhail.io\/serverless\/coldstarts\/big3\/"},{"key":"e_1_3_1_8_2","unstructured":"2024. Create a serverless workflow with AWS Step Functions and AWS Lambda. Retrieved from https:\/\/aws.amazon.com\/tutorials\/create-a-serverless-workflow-step-functions-lambda"},{"key":"e_1_3_1_9_2","unstructured":"2024. Default memory size of AWS Lambda. Retrieved from https:\/\/docs.aws.amazon.com\/lambda\/latest\/operatorguide\/computing-power.html"},{"key":"e_1_3_1_10_2","unstructured":"2024. Default memory size of Google Cloud Functions. Retrieved from https:\/\/cloud.google.com\/functions\/docs\/configuring\/memory"},{"key":"e_1_3_1_11_2","unstructured":"2024. Default timeout of AWS Lambda. Retrieved from https:\/\/docs.aws.amazon.com\/lambda\/latest\/dg\/configuration-function-common.html"},{"key":"e_1_3_1_12_2","unstructured":"2024. Default timeout of Google Cloud Functions. Retrieved from https:\/\/cloud.google.com\/functions\/docs\/configuring\/timeout"},{"key":"e_1_3_1_13_2","unstructured":"2024. FaaSDom. Retrieved from https:\/\/github.com\/faas-benchmarking\/faasdom"},{"key":"e_1_3_1_14_2","unstructured":"2024. FunctionBench. Retrieved from https:\/\/github.com\/ddps-lab\/serverless-faas-workbench"},{"key":"e_1_3_1_15_2","unstructured":"2024. Google Cloud Functions. Retrieved from https:\/\/cloud.google.com\/functions"},{"key":"e_1_3_1_16_2","unstructured":"2024. ServerlessBench. Retrieved from https:\/\/github.com\/SJTU-IPADS\/ServerlessBench"},{"key":"e_1_3_1_17_2","unstructured":"2024. The state of serverless. Retrieved from https:\/\/www.datadoghq.com\/state-of-serverless\/"},{"key":"e_1_3_1_18_2","unstructured":"2024. Supplemental material. Retrieved from https:\/\/github.com\/WenJinfeng\/SCOPE_PerformanceTesting"},{"key":"e_1_3_1_19_2","unstructured":"2024. Use workflows with Cloud Functions tutorial. Retrieved from https:\/\/cloud.google.com\/workflows\/docs\/tutorials\/run\/cloud-run"},{"key":"e_1_3_1_20_2","unstructured":"2025. A research and markets report. Retrieved from https:\/\/omdia.tech.informa.com\/pr\/2024\/jun\/omdia-serverless-computing-valued-at-19-billion-dollars-is-the-fastest-growing-cloud-service"},{"key":"e_1_3_1_21_2","first-page":"923","volume-title":"Proceedings of the 2018 USENIX Annual Technical Conference","author":"Akkus Istemi Ekin","year":"2018","unstructured":"Istemi Ekin Akkus, Ruichuan Chen, Ivica Rimac, Manuel Stein, Klaus Satzke, Andre Beck, Paarijaat Aditya, and Volker Hilt. 2018. SAND: Towards high-performance serverless computing. In Proceedings of the 2018 USENIX Annual Technical Conference, 923\u2013935."},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSME.2016.46"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/3492321.3524270"},{"issue":"1","key":"e_1_3_1_24_2","first-page":"60","article-title":"Cloud computing: A study of infrastructure as a service (IAAS)","volume":"2","author":"Bhardwaj Sushil","year":"2010","unstructured":"Sushil Bhardwaj, Leena Jain, and Sandeep Jain. 2010. Cloud computing: A study of infrastructure as a service (IAAS). International Journal of Engineering and Information Technology 2, 1 (2010), 60\u201363.","journal-title":"International Journal of Engineering and Information Technology"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/3297663.3309670"},{"key":"e_1_3_1_26_2","first-page":"2443","volume-title":"Proceedings of the 31st USENIX Security Symposium","author":"Datta Pubali","year":"2022","unstructured":"Pubali Datta, Isaac Polinsky, Muhammad Adil Inam, Adam Bates, and William Enck. 2022. ALASTOR: Reconstructing the provenance of serverless intrusions. In Proceedings of the 31st USENIX Security Symposium, 2443\u20132460."},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/3358960.3379124"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3464298.3493398"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2022.111294"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2021.3113940"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSR52588.2021.00075"},{"key":"e_1_3_1_32_2","first-page":"363","volume-title":"Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation","author":"Fouladi Sadjad","year":"2017","unstructured":"Sadjad Fouladi, Riad S. Wahby, Brennan Shacklett, Karthikeyan Vasuki Balasubramaniam, William Zeng, Rahul Bhalerao, Anirudh Sivaraman, George Porter, and Keith Winstein. 2017. Encoding, fast and slow: Low-latency video processing using thousands of tiny threads. In Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation, 363\u2013376."},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/3502181.3531459"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1145\/3575693.3575721"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1186\/s13677-021-00253-7"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/ASE51524.2021.9678687"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1145\/3338906.3338912"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS54959.2023.00092"},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1145\/3360575"},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1145\/3603269.3604816"},{"key":"e_1_3_1_41_2","unstructured":"Eric Jonas Johann Schleier-Smith Vikram Sreekanti Chia-Che Tsai Anurag Khandelwal Qifan Pu Vaishaal Shankar Joao Carreira Karl Krauth Neeraja Yadwadkar et al. 2019. Cloud programming simplified: A Berkeley view on serverless computing. arXiv:1902.03383. Retrieved from https:\/\/arxiv.org\/abs\/1902.03383"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2019.00091"},{"key":"e_1_3_1_43_2","first-page":"805","volume-title":"Proceedings of the 2021 USENIX Annual Technical Conference","author":"Kotni Swaroop","year":"2021","unstructured":"Swaroop Kotni, Ajay Nayak, Vinod Ganapathy, and Arkaprava Basu. 2021. Faastlane: Accelerating function-as-a-service workflows. In Proceedings of the 2021 USENIX Annual Technical Conference, 805\u2013820."},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-019-09681-1"},{"key":"e_1_3_1_45_2","volume-title":"Performance Evaluation of Computer and Communication Systems","author":"Boudec Jean-Yves Le","year":"2010","unstructured":"Jean-Yves Le Boudec. 2010. Performance Evaluation of Computer and Communication Systems. Vol. 2. Epfl Press Lausanne."},{"key":"e_1_3_1_46_2","first-page":"1522","article-title":"Serverless computing: State-of-the-art, challenges and opportunities","author":"Li Yongkang","year":"2022","unstructured":"Yongkang Li, Yanying Lin, Yang Wang, Kejiang Ye, and Cheng-Zhong Xu. 2022. Serverless computing: State-of-the-art, challenges and opportunities. IEEE Transactions on Services Computing 16, 2 (2022), 1522\u20131539.","journal-title":"IEEE Transactions on Services Computing"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1145\/3503222.3507717"},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/3623278.3624755"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2020.3028841"},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3585007"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11432-024-4227-2"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11432-021-3528-7"},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/3620665.3640361"},{"key":"e_1_3_1_54_2","first-page":"285","volume-title":"Proceedings of the 2021 USENIX Annual Technical Conference","author":"Mahgoub Ashraf","year":"2021","unstructured":"Ashraf Mahgoub, Li Wang, Karthick Shankar, Yiming Zhang, Huangshi Tian, Subrata Mitra, Yuxing Peng, Hongqi Wang, Ana Klimovic, Haoran Yang, et al. 2021. SONIC: Application-aware data passing for chained serverless applications. In Proceedings of the 2021 USENIX Annual Technical Conference, 285\u2013301."},{"key":"e_1_3_1_55_2","first-page":"303","volume-title":"Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation","author":"Mahgoub Ashraf","year":"2022","unstructured":"Ashraf Mahgoub, Edgardo Barsallo Yi, Karthick Shankar, Sameh Elnikety, Somali Chaterji, and Saurabh Bagchi. 2022. ORION and the three rights: Sizing, bundling, and prewarming for serverless DAGs. In Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 303\u2013320."},{"key":"e_1_3_1_56_2","doi-asserted-by":"publisher","DOI":"10.1145\/3401025.3401738"},{"key":"e_1_3_1_57_2","first-page":"1","article-title":"A holistic view on resource management in serverless computing environments: Taxonomy and future directions","author":"Mampage Anupama","year":"2021","unstructured":"Anupama Mampage, Shanika Karunasekera, and Rajkumar Buyya. 2021. A holistic view on resource management in serverless computing environments: Taxonomy and future directions. ACM Computing Surveys 54, 11s (2021), 1\u201336.","journal-title":"ACM Computing Surveys"},{"key":"e_1_3_1_58_2","first-page":"409","volume-title":"Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation","author":"Maricq Aleksander","year":"2018","unstructured":"Aleksander Maricq, Dmitry Duplyakin, Ivo Jimenez, Carlos Maltzahn, Ryan Stutsman, and Robert Ricci. 2018. Taming performance variability. In Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 409\u2013425."},{"key":"e_1_3_1_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDCSW.2017.36"},{"key":"e_1_3_1_60_2","doi-asserted-by":"publisher","DOI":"10.1145\/3092703.3092725"},{"key":"e_1_3_1_61_2","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389758"},{"key":"e_1_3_1_62_2","doi-asserted-by":"publisher","DOI":"10.1145\/3447786.3456239"},{"key":"e_1_3_1_63_2","first-page":"57","volume-title":"Proceedings of the 2018 USENIX Annual Technical Conference","author":"Oakes Edward","year":"2018","unstructured":"Edward Oakes, Leon Yang, Dennis Zhou, Kevin Houck, Tyler Harter, Andrea Arpaci-Dusseau, and Remzi Arpaci-Dusseau. 2018. SOCK: Rapid task provisioning with serverless-optimized containers. In Proceedings of the 2018 USENIX Annual Technical Conference. USENIX Association, 57\u201370."},{"key":"e_1_3_1_64_2","doi-asserted-by":"publisher","DOI":"10.1145\/3470496.3527407"},{"key":"e_1_3_1_65_2","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3380609"},{"key":"e_1_3_1_66_2","doi-asserted-by":"publisher","DOI":"10.1145\/3600006.3613154"},{"key":"e_1_3_1_67_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2020.110708"},{"key":"e_1_3_1_68_2","first-page":"205","volume-title":"Proceedings of the 2020 USENIX Annual Technical Conference","author":"Shahrad Mohammad","year":"2020","unstructured":"Mohammad Shahrad, Rodrigo Fonseca, \u00cd\u00f1igo Goiri, Gohar Chaudhry, Paul Batum, Jason Cooke, Eduardo Laureano, Colby Tresness, Mark Russinovich, and Ricardo Bianchini. 2020. Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider. In Proceedings of the 2020 USENIX Annual Technical Conference, 205\u2013218."},{"key":"e_1_3_1_69_2","doi-asserted-by":"publisher","DOI":"10.1145\/3472883.3486981"},{"key":"e_1_3_1_70_2","doi-asserted-by":"publisher","DOI":"10.1145\/3445814.3446714"},{"key":"e_1_3_1_71_2","first-page":"513","volume-title":"Proceedings of the 17th USENIX Symposium on Networked Systems Design and Implementation","author":"Uta Alexandru","year":"2020","unstructured":"Alexandru Uta, Alexandru Custura, Dmitry Duplyakin, Ivo Jimenez, Jan Rellermeyer, Carlos Maltzahn, Robert Ricci, and Alexandru Iosup. 2020. Is big data performance reproducible in modern cloud networks?. In Proceedings of the 17th USENIX Symposium on Networked Systems Design and Implementation, 513\u2013527."},{"key":"e_1_3_1_72_2","doi-asserted-by":"publisher","DOI":"10.1145\/3302424.3303978"},{"key":"e_1_3_1_73_2","doi-asserted-by":"publisher","DOI":"10.1145\/3190645"},{"key":"e_1_3_1_74_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICST.2018.00018"},{"key":"e_1_3_1_75_2","doi-asserted-by":"publisher","DOI":"10.1145\/3579643"},{"key":"e_1_3_1_76_2","first-page":"416","volume-title":"Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering","author":"Wen Jinfeng","year":"2021","unstructured":"Jinfeng Wen, Zhenpeng Chen, Yi Liu, Yiling Lou, Yun Ma, Gang Huang, Xin Jin, and Xuanzhe Liu. 2021. An empirical study on challenges of application development in serverless computing. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 416\u2013428."},{"key":"e_1_3_1_77_2","unstructured":"Jinfeng Wen Zhenpeng Chen Federica Sarro and Xuanzhe Liu. 2023. Revisiting the performance of serverless computing: An analysis of variance. arXiv:2305.04309v1. Retrieved from https:\/\/arxiv.org\/abs\/2305.04309v1"},{"key":"e_1_3_1_78_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-025-10615-3"},{"key":"e_1_3_1_79_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICWS53863.2021.00102"},{"key":"e_1_3_1_80_2","first-page":"e2394:1\u2013e2394:2","article-title":"Characterizing commodity serverless computing platforms","author":"Wen Jinfeng","year":"2021","unstructured":"Jinfeng Wen, Yi Liu, Zhenpeng Chen, Junkai Chen, and Yun Ma. 2021. Characterizing commodity serverless computing platforms. Journal of Software: Evolution and Process 35, 10 \u00a0(2021), e2394:1\u2013e2394:23.","journal-title":"Journal of Software: Evolution and Process"},{"key":"e_1_3_1_81_2","doi-asserted-by":"publisher","DOI":"10.1145\/3694715.3695948"},{"key":"e_1_3_1_82_2","first-page":"911","volume-title":"Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation","author":"Wu Bingyang","year":"2024","unstructured":"Bingyang Wu, Ruidong Zhu, Zili Zhang, Peng Sun, Xuanzhe Liu, and Xin Jin. 2024. dLoRA: Dynamically orchestrating requests and adapters for LoRA LLM serving. In Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 911\u2013927."},{"key":"e_1_3_1_83_2","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS54959.2023.00093"},{"key":"e_1_3_1_84_2","doi-asserted-by":"publisher","DOI":"10.1002\/spe.3016"},{"key":"e_1_3_1_85_2","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517905"},{"key":"e_1_3_1_86_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICPADS47876.2019.00011"},{"key":"e_1_3_1_87_2","doi-asserted-by":"publisher","DOI":"10.1145\/3419111.3421280"},{"key":"e_1_3_1_88_2","doi-asserted-by":"publisher","DOI":"10.1145\/3304112.3325608"},{"key":"e_1_3_1_89_2","doi-asserted-by":"publisher","DOI":"10.1145\/3458817.3476215"},{"key":"e_1_3_1_90_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE-Companion.2019.00132"},{"key":"e_1_3_1_91_2","first-page":"193","volume-title":"Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation","author":"Zhong Yinmin","year":"2024","unstructured":"Yinmin Zhong, Shengyu Liu, Junda Chen, Jianbo Hu, Yibo Zhu, Xuanzhe Liu, Xin Jin, and Hao Zhang. 2024. DistServe: Disaggregating prefill and decoding for goodput-optimized Large Language Model serving. In Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 193\u2013210."}],"container-title":["ACM Transactions on Software Engineering and Methodology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3717609","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,4]],"date-time":"2025-10-04T11:08:57Z","timestamp":1759576137000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3717609"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,4]]},"references-count":90,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2025,11,30]]}},"alternative-id":["10.1145\/3717609"],"URL":"https:\/\/doi.org\/10.1145\/3717609","relation":{},"ISSN":["1049-331X","1557-7392"],"issn-type":[{"value":"1049-331X","type":"print"},{"value":"1557-7392","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,4]]},"assertion":[{"value":"2024-07-10","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-02-06","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-10-04","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}