{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T14:40:45Z","timestamp":1765291245280,"version":"3.46.0"},"publisher-location":"New York, NY, USA","reference-count":38,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,12,15]]},"DOI":"10.1145\/3774899.3775016","type":"proceedings-article","created":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T14:35:38Z","timestamp":1765290938000},"page":"31-37","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Large Language Models for Serverless Function Generation: An Investigation on FaaS Performance"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0008-8935-7263","authenticated-orcid":false,"given":"Xinghan","family":"Chen","sequence":"first","affiliation":[{"name":"University of Washington Tacoma, Tacoma, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9199-4973","authenticated-orcid":false,"given":"Robert","family":"Cordingly","sequence":"additional","affiliation":[{"name":"University of Washington Tacoma, Tacoma, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5209-2248","authenticated-orcid":false,"given":"Ling-Hong","family":"Hung","sequence":"additional","affiliation":[{"name":"University of Washington Tacoma, Tacoma, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2021-8501","authenticated-orcid":false,"given":"Wes","family":"Lloyd","sequence":"additional","affiliation":[{"name":"University of Washington Tacoma, Tacoma, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,12,14]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Alibaba. 2024. Qwen2.5-coder-7b-instruct. https:\/\/huggingface.co\/Qwen\/Qwen2.5-Coder-7B-Instruct. Accessed: 2025-04-16. (2024)."},{"key":"e_1_3_2_1_2_1","unstructured":"Amazon Web Services. 2025. Aws lambda documentation. https:\/\/docs.aws.amazon.com\/lambda\/. Accessed: 2025-05-05. (2025)."},{"key":"e_1_3_2_1_3_1","volume-title":"Cursor: the ai code editor","author":"Anysphere Inc. 2025.","year":"2025","unstructured":"Anysphere Inc. 2025. Cursor: the ai code editor. Accessed: 2025-05-05. (2025). https:\/\/www.cursor.com\/."},{"key":"e_1_3_2_1_4_1","volume-title":"LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World. In 2025 IEEE 22nd Intl. Conf. on Software Architecture (ICSA), 25\u201336","author":"Arun Shrikara","year":"2025","unstructured":"Shrikara Arun, Meghana Tedla, and Karthik Vaidhyanathan. 2025. LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World. In 2025 IEEE 22nd Intl. Conf. on Software Architecture (ICSA), 25\u201336."},{"key":"e_1_3_2_1_5_1","volume-title":"IBM","author":"Bergmann Dave","year":"2025","unstructured":"Dave Bergmann. 2025. What is a reasoning model? https:\/\/www.ibm.com\/think\/topics\/reasoning-model. Accessed: 2025-10-10. IBM, (2025)."},{"key":"e_1_3_2_1_6_1","article-title":"LLMaaS: Serving Large Language Models on Trusted Serverless Computing Platforms","author":"Cai Zinuo","year":"2024","unstructured":"Zinuo Cai and et al. 2024. LLMaaS: Serving Large Language Models on Trusted Serverless Computing Platforms. IEEE Transactions on Artificial Intelligence.","journal-title":"IEEE Transactions on Artificial Intelligence."},{"key":"e_1_3_2_1_7_1","unstructured":"Liguo Chen and et al. 2024. A survey on evaluating large language models in code generation tasks. arXiv preprint arXiv:2408.16498."},{"volume-title":"2024 IEEE\/ACM 17th Intl. Conf. on Utility and Cloud Computing (UCC), 63\u201372","author":"Xinghan","key":"e_1_3_2_1_8_1","unstructured":"Xinghan Chen and et al. 2024. Predicting ARM64 Serverless Functions Runtime: Leveraging function profiling for generalized performance models. In 2024 IEEE\/ACM 17th Intl. Conf. on Utility and Cloud Computing (UCC), 63\u201372."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3661167.3661221"},{"volume-title":"Companion of the ACM\/SPEC Int. Conf. on Perf. Engineering, 161\u2013164","author":"Robert","key":"e_1_3_2_1_10_1","unstructured":"Robert Cordingly and et al. 2021. Enhancing observability of serverless computing with the serverless application analytics framework. In Companion of the ACM\/SPEC Int. Conf. on Perf. Engineering, 161\u2013164."},{"key":"e_1_3_2_1_11_1","volume-title":"2020 IEEE Intl Conf on Cloud and Big Data Computing, 640\u2013649","author":"Cordingly Robert","year":"2020","unstructured":"Robert Cordingly, Wen Shu, and Wes J Lloyd. 2020. Predicting performance and cost of serverless computing functions with SAAF. In 2020 IEEE Intl Conf on Cloud and Big Data Computing, 640\u2013649."},{"key":"e_1_3_2_1_12_1","unstructured":"DeepSeek. 2024. Deepseek reasoner r1 series: 8b 32b 671b. https:\/\/huggingface.co\/deepseek-ai\/DeepSeek-R1. Accessed: 2025-04-16. (2024)."},{"key":"e_1_3_2_1_13_1","volume-title":"Mercury: A code efficiency benchmark for code large language models. arXiv preprint arXiv:2402.07844.","author":"Du Mingzhe","year":"2024","unstructured":"Mingzhe Du and et al. 2024. Mercury: A code efficiency benchmark for code large language models. arXiv preprint arXiv:2402.07844."},{"volume-title":"Proceedings of the IEEE\/ACM 46th Intl. Conf. on Software Engineering, 1\u201313","author":"Xueying","key":"e_1_3_2_1_14_1","unstructured":"Xueying Du and et al. 2024. Evaluating large language models in class-level code generation. In Proceedings of the IEEE\/ACM 46th Intl. Conf. on Software Engineering, 1\u201313."},{"volume-title":"ServerlessLLM: Low-Latency Serverless Inference for Large Language Models. In 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24)","author":"Yao","key":"e_1_3_2_1_15_1","unstructured":"Yao Fu and et al. 2024. ServerlessLLM: Low-Latency Serverless Inference for Large Language Models. In 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24), 135\u2013153."},{"key":"e_1_3_2_1_16_1","unstructured":"Google. 2024. Gemini 2.0 flash and gemini 2.5 series. https:\/\/blog.google\/technology\/google-deepmind\/. Accessed: 2025-04-16. (2024)."},{"key":"e_1_3_2_1_17_1","unstructured":"Michael Hassid and et al. 2024. The larger the better? improved llm code-generation via budget reallocation. arXiv preprint arXiv:2404.00725."},{"key":"e_1_3_2_1_18_1","first-page":"11506","article-title":"Effibench: Benchmarking the efficiency of automatically generated code","volume":"37","author":"Huang Dong","year":"2024","unstructured":"Dong Huang and et al. 2024. Effibench: Benchmarking the efficiency of automatically generated code. Advances in Neural Information Processing Systems, 37, 11506\u201311544.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Thomas Kluyver et al. 2016. Jupyter Notebooks-a publishing format for reproducible computational workflows. In Positioning and power in academic publishing: Players agents and agendas. IOS press 87\u201390.","DOI":"10.3233\/978-1-61499-649-1-87"},{"key":"e_1_3_2_1_20_1","unstructured":"LeetCode. 2023. Count collisions on a road. https:\/\/leetcode.com\/problems\/count-collisions-on-a-road\/. Accessed: 2025-04-16. (2023)."},{"key":"e_1_3_2_1_21_1","unstructured":"LeetCode. 2023. Median of two sorted arrays. https:\/\/leetcode.com\/problems\/median-of-two-sorted-arrays\/. Accessed: 2025-04-16. (2023)."},{"key":"e_1_3_2_1_22_1","unstructured":"LeetCode. 2023. Minimum cost to split an array. https:\/\/leetcode.com\/problems\/minimum-cost-to-split-an-array\/. Accessed: 2025-04-16. (2023)."},{"key":"e_1_3_2_1_23_1","unstructured":"Yujia Li and et al. 2022. Competition-level code generation with alphacode. Science."},{"key":"e_1_3_2_1_24_1","unstructured":"Jiawei Liu and et al. 2024. Evaluating language models for efficient code generation. arXiv preprint arXiv:2408.06450."},{"key":"e_1_3_2_1_25_1","unstructured":"Meta. 2023. Llama 3.3 70b instruct. https:\/\/huggingface.co\/meta-llama\/Llama-3.3-70B-Instruct. Accessed: 2025-04-16. (2023)."},{"key":"e_1_3_2_1_26_1","unstructured":"Ollama Inc. 2024. Ollama: run large language models locally. https:\/\/ollama.com. Accessed: 2025-05-05. (2024)."},{"key":"e_1_3_2_1_27_1","unstructured":"OpenAI. 2023. Gpt-4o and gpt-4o mini. https:\/\/openai.com\/index\/hello-gpt-4o\/. Accessed: 2025-04-16. (2023)."},{"key":"e_1_3_2_1_28_1","unstructured":"OpenAI. 2023. Introducing openai o1 and o1 mini. https:\/\/openai.com\/o1\/. Accessed: 2025-04-16. (2023)."},{"key":"e_1_3_2_1_29_1","unstructured":"OpenAI. 2023. Introducing openai o3 and o4 mini. https:\/\/openai.com\/index\/introducing-o3-and-o4-mini\/. Accessed: 2025-04-16. (2023)."},{"key":"e_1_3_2_1_30_1","unstructured":"OpenAI. 2025. Openai python api. https:\/\/github.com\/openai\/openai-python. Accessed: 2025-05-05. (2025)."},{"key":"e_1_3_2_1_31_1","unstructured":"OpenAI. [n. d.] Tokenizer - openai api. https:\/\/platform.openai.com\/tokenizer. Accessed: 2025-09-04. ()."},{"volume-title":"get up and running with openai gpt-oss, deepseek-r1, gemma 3 and other models. https:\/\/github.com\/ollama\/ollama. Accessed: 2025-10-10","year":"2025","key":"e_1_3_2_1_32_1","unstructured":"ollama organization. 2025. Ollama\/ollama: get up and running with openai gpt-oss, deepseek-r1, gemma 3 and other models. https:\/\/github.com\/ollama\/ollama. Accessed: 2025-10-10; latest release version 0.12.3. (2025)."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"crossref","unstructured":"Yun Peng and et al. 2024. PerfCodeGen: Improving Performance of LLM Generated Code with Execution Feedback. arXiv preprint arXiv:2412.03578.","DOI":"10.1109\/Forge66646.2025.00008"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2020.110708"},{"key":"e_1_3_2_1_35_1","unstructured":"SeBS Project. 2020. Serverless benchmark suite (sebs). https:\/\/github.com\/parasj\/sebs. Accessed: 2025-04-16. (2020)."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/3702634.3702950"},{"key":"e_1_3_2_1_37_1","unstructured":"Jinfeng Wen and et al. 2025. LLM-Based Misconfiguration Detection for AWS Serverless Computing. ACM Transactions on Software Engineering and Methodology."},{"key":"e_1_3_2_1_38_1","unstructured":"xAI. 2024. Grok-3 and grok-3 mini beta. https:\/\/x.ai\/news\/grok-3. Accessed: 2025-04-16. (2024)."}],"event":{"name":"WoSC11 '25: 11th International Workshop on Serverless Computing","sponsor":["IFIP","Usenix"],"location":"Vanderbilt University Nashville TN USA","acronym":"WoSC11 '25"},"container-title":["Proceedings of the 11th International Workshop on Serverless Computing"],"original-title":[],"deposited":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T14:35:48Z","timestamp":1765290948000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3774899.3775016"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,14]]},"references-count":38,"alternative-id":["10.1145\/3774899.3775016","10.1145\/3774899"],"URL":"https:\/\/doi.org\/10.1145\/3774899.3775016","relation":{},"subject":[],"published":{"date-parts":[[2025,12,14]]},"assertion":[{"value":"2025-12-14","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}