{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T19:25:14Z","timestamp":1774639514773,"version":"3.50.1"},"reference-count":68,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2025,4,28]],"date-time":"2025-04-28T00:00:00Z","timestamp":1745798400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,4,28]],"date-time":"2025-04-28T00:00:00Z","timestamp":1745798400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Mach. Intell. Res."],"published-print":{"date-parts":[[2025,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>In the field of natural language processing, the rapid development of large language model (LLM) has attracted increasing attention. LLMs have shown a high level of creativity in various tasks, but the methods for assessing such creativity are inadequate. Assessment of LLM creativity needs to consider differences from humans, requiring multiple dimensional measurement while balancing accuracy and efficiency. This paper aims to establish an efficient framework for assessing the level of creativity in LLMs. By adapting the modified Torrance tests of creative thinking, the research evaluates the creative performance of various LLMs across 7 tasks, emphasizing 4 criteria including fluency, flexibility, originality, and elaboration. In this context, we develop a comprehensive dataset of 700 questions for testing and an LLM-based evaluation method. In addition, this study presents a novel analysis of LLMs\u2019 responses to diverse prompts and role-play situations. We found that the creativity of LLMs primarily falls short in originality, while excelling in elaboration. In addition, the use of prompts and role-play settings of the model significantly influence creativity. Additionally, the experimental results also indicate that collaboration among multiple LLMs can enhance originality. Notably, our findings reveal a consensus between human evaluations and LLMs regarding the personality traits that influence creativity. The findings underscore the significant impact of LLM design on creativity and bridge artificial intelligence and human creativity, offering insights into LLMs\u2019 creativity and potential applications.<\/jats:p>","DOI":"10.1007\/s11633-025-1546-4","type":"journal-article","created":{"date-parts":[[2025,4,28]],"date-time":"2025-04-28T11:12:29Z","timestamp":1745838749000},"page":"417-436","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["Assessing and Understanding Creativity in Large Language Models"],"prefix":"10.1007","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7747-7040","authenticated-orcid":false,"given":"Yunpu","family":"Zhao","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8691-8549","authenticated-orcid":false,"given":"Rui","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Wenyi","family":"Li","sequence":"additional","affiliation":[]},{"given":"Ling","family":"Li","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,4,28]]},"reference":[{"key":"1546_CR1","unstructured":"S. Bubeck, V. Chandrasekaran, R. Eldan, J. Gehrke, E. Horvitz, E. Kamar, P. Lee, Y. T. Lee, Y. Z. Li, S. Lundberg, H. Nori, H. Palangi, M. T. Ribeiro, Y. Zhang. Sparks of artificial general intelligence: Early experiments with GPT-4, [Online], Available: https:\/\/arxiv.org\/abs\/2303.12712, 2023."},{"key":"1546_CR2","unstructured":"H. Touvron, L. Martin, K. Stone, P. Albert, A. Almahairi, Y. Babaei, N. Bashlykov, S. Batra, P. Bhargava, S. Bhosale, D. Bikel, L. Blecher, C. C. Ferrer, M. Y. Chen, G. Cucurull, D. Esiobu, J. Fernandes, J. Fu, W. Y. Fu, B. Fuller, C. Gao, V. Goswami, N. Goyal, A. Hartshorn, S. Hosseini, R. Hou, H. Inan, M. Kardas, V. Kerkez, M. Khabsa, I. Kloumann, A. Korenev, P. S. Koura, M. A. Lachaux, T. Lavril, J. Lee, D. Liskovich, Y. H. Lu, Y. N. Mao, X. Martinet, T. Mihaylov, P. Mishra, I. Molybog, Y. X. Nie, A. Poulton, J. Reizenstein, R. Rungta, K. Saladi, A. Schelten, R. Silva, E. M. Smith, R. Subramanian, X. E. Tan, B. Tang, R. Taylor, A. Williams, J. X. Kuan, P. X. Xu, Z. Yan, I. Zarov, Y. C. Zhang, A. Fan, M. Kambadur, S. Narang, A. Rodriguez, R. Stojnic, S. Edunov, T. Scialom. Llama 2: Open foundation and fine-tuned chat models, [Online], Available: https:\/\/arxiv.org\/abs\/2307.09288, 2023."},{"key":"1546_CR3","volume-title":"Proceedings of the 36th International Conference on Neural Information Processing Systems","author":"Y H Wu","year":"2022","unstructured":"Y. H. Wu, A. Q. Jiang, W. D. Li, M. N. Rabe, C. Staats, M. Jamnik, C. Szegedy. Autoformalization with large language models. In Proceedings of the 36th International Conference on Neural Information Processing Systems, New Orleans, USA, Article number 2344, 2022."},{"key":"1546_CR4","doi-asserted-by":"publisher","first-page":"431","DOI":"10.18653\/v1\/2023.findings-acl.29","volume-title":"Proceedings of Findings of the Association for Computational Linguistics","author":"T R Laskar","year":"2023","unstructured":"T. R. Laskar, M. S. Bari, M. Rahman, A. H. Bhuiyan, S. Joty, J. Huang. A systematic study and comprehensive evaluation of ChatGPT on benchmark datasets. In Proceedings of Findings of the Association for Computational Linguistics, Toronto, Canada, pp. 431\u2013469, 2023. DOI: https:\/\/doi.org\/10.18653\/v1\/2023.findings-acl.29."},{"key":"1546_CR5","doi-asserted-by":"publisher","first-page":"4171","DOI":"10.18653\/v1\/N19-1423","volume-title":"Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"J Devlin","year":"2019","unstructured":"J. Devlin, M. W. Chang, L. Kenton, K. Toutanova. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, USA, pp. 4171\u20134186, 2019. DOI: https:\/\/doi.org\/10.18653\/v1\/N19-1423."},{"key":"1546_CR6","doi-asserted-by":"publisher","first-page":"623","DOI":"10.1145\/3527927.3535197","volume-title":"Proceedings of the 14th Conference on Creativity and Cognition","author":"G Di Fede","year":"2022","unstructured":"G. Di Fede, D. Rocchesso, S. P. Dow, S. Andolina. The idea machine: LLM-based expansion, rewriting, combination, and suggestion of ideas. In Proceedings of the 14th Conference on Creativity and Cognition, Venice, Italy, pp. 623\u2013627, 2022. DOI: https:\/\/doi.org\/10.1145\/3527927.3535197."},{"key":"1546_CR7","first-page":"29","volume-title":"Proceedings of the 14th International Conference on Computational Creativity","author":"M Elzohbi","year":"2023","unstructured":"M. Elzohbi, R. Zhao. Creative data generation: A review focusing on text and poetry. In Proceedings of the 14th International Conference on Computational Creativity, Ontario, Canada, pp. 29\u201338, 2023."},{"key":"1546_CR8","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1145\/3591196.3596612","volume-title":"Proceedings of the 15th Conference on Creativity and Cognition","author":"Z Zhao","year":"2023","unstructured":"Z. Zhao, S. Song, B. Duah, J. Macbeth, S. Carter, M. P. Van, N. S. Bravo, M. Klenk, K. Sick, A. L. S. Filipowicz. More human than human: LLM-generated narratives out-perform human-LLM interleaved narratives. In Proceedings of the 15th Conference on Creativity and Cognition, NewYork, USA, pp. 368\u2013370, 2023. DOI: https:\/\/doi.org\/10.1145\/3591196.3596612."},{"issue":"6624","key":"1546_CR9","doi-asserted-by":"publisher","first-page":"1092","DOI":"10.1126\/science.abq1158","volume":"378","author":"Y J Li","year":"2022","unstructured":"Y. J. Li, D. Choi, J. Chung, N. Kushman, J. Schrittwieser, R. Leblond, T. Eccles, J. Keeling, F. Gimeno, A. Dal Lago, T. Hubert, P. Choy, C. De Masson d\u2019Autume, I. Babuschkin, X. Y. Chen, P. S. Huang, J. Welbl, S. Gowal, A. Cherepanov, J. Molloy, D. J. Mankowitz, E. Sutherland Robson, P. Kohli, N. De Freitas, K. Kavukcuoglu, O. Vinyals. Competition-level code generation with AlphaCode. Science, vol. 378, no. 6624, pp. 1092\u20131097, 2022. DOI: https:\/\/doi.org\/10.1126\/science.abq1158.","journal-title":"Science"},{"key":"1546_CR10","doi-asserted-by":"publisher","unstructured":"E. Kasneci, K. Sessler, S. K\u00fcchemann, M. Bannert, D. Dementieva, F. Fischer, U. Gasser, G. Groh, S. G\u00fcnnemann, E. H\u00fcllermeier, S. Krusche, G. Kutyniok, T. Michaeli, C. Nerdel, J. Pfeffer, O. Poquet, M. Sailer, A. Schmidt, T. Seidel, M. Stadler, J. Weller, J. Kuhn, G. Kasneci. ChatGPT for good? On opportunities and challenges of large language models for education. Learning and Individual Differences, vol. 103, Article number 102274, 2023. DOI: https:\/\/doi.org\/10.1016\/j.lindif.2023.102274.","DOI":"10.1016\/j.lindif.2023.102274"},{"key":"1546_CR11","doi-asserted-by":"publisher","first-page":"11050","DOI":"10.1109\/CVPR52688.2022.01077","volume-title":"Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"S Y Li","year":"2022","unstructured":"S. Y. Li, W. J. Yu, T. P. Gu, C. Z. Lin, Q. Wang, C. Qian, C. C. Loy, Z. W. Liu. Bailando: 3D dance generation by actor-critic GPT with choreographic memory. In Proceedings of IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, USA, pp. 11050\u201311059, 2022. DOI: https:\/\/doi.org\/10.1109\/CVPR52688.2022.01077."},{"key":"1546_CR12","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1007\/978-3-031-03789-4_2","volume-title":"Proceedings of the 11th International Conference on Artificial Intelligence in Music, Sound, Art and Design","author":"B Banar","year":"2022","unstructured":"B. Banar, S. Colton. A systematic evaluation of GPT-2-based music generation. In Proceedings of the 11th International Conference on Artificial Intelligence in Music, Sound, Art and Design, Madrid, Spain, pp. 19\u201335, 2022. DOI: https:\/\/doi.org\/10.1007\/978-3-031-03789-4_2."},{"key":"1546_CR13","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642698","volume-title":"Proceedings of CHI Conference on Human Factors in Computing Systems","author":"Y R Liu","year":"2024","unstructured":"Y. R. Liu, S. Chen, H. C. Cheng, M. X. Yu, X. Ran, A. Mo, Y. L. Tang, Y. Huang. How AI processing delays foster creativity: Exploring research question Co-creation with an LLM-based agent. In Proceedings of CHI Conference on Human Factors in Computing Systems, Honolulu, USA, Article number 17, 2024. DOI: https:\/\/doi.org\/10.1145\/3613904.3642698."},{"key":"1546_CR14","volume-title":"Proceedings of the 1st HEAL Workshop at CHI Conference on Human Factors in Computing Systems","author":"H Shin","year":"2024","unstructured":"H. Shin, S. Choi, J. Y. Cho, S. Admoni, H. Lim, T. Kim, H. Hong, M. Lee, Kim, J. Towards an evaluation of LLM-generated inspiration by developing and validating inspiration scale. In Proceedings of the 1st HEAL Workshop at CHI Conference on Human Factors in Computing Systems, Honolulu, USA, 2024."},{"key":"1546_CR15","doi-asserted-by":"publisher","first-page":"413","DOI":"10.1145\/3635636.3656204","volume-title":"Proceedings of the 16th Conference on Creativity & Cognition","author":"B R Anderson","year":"2024","unstructured":"B. R. Anderson, J. H. Shah, M. Kreminski. Homogenization effects of large language models on human creative ideation. In Proceedings of the 16th Conference on Creativity & Cognition, Chicago, USA, pp.413\u2013425, 2024. DOI: https:\/\/doi.org\/10.1145\/3635636.3656204."},{"issue":"1","key":"1546_CR16","doi-asserted-by":"publisher","first-page":"92","DOI":"10.1080\/10400419.2012.650092","volume":"24","author":"M A Runco","year":"2012","unstructured":"M. A. Runco, G. J. Jaeger. The standard definition of creativity. Creativity Research Journal, vol. 24, no. 1, pp. 92\u201396, 2012. DOI: https:\/\/doi.org\/10.1080\/10400419.2012.650092.","journal-title":"Creativity Research Journal"},{"key":"1546_CR17","doi-asserted-by":"publisher","unstructured":"T. Chakraborty, S. Masud. Judging the creative prowess of AI. Nature Machine Intelligence, vol. 5, no. 6, Article number 558, 2023. DOI: https:\/\/doi.org\/10.1038\/s42256-023-00664-y.","DOI":"10.1038\/s42256-023-00664-y"},{"key":"1546_CR18","volume-title":"Torrance Test of Creative Thinking: Directions Manual and Scoring Guide","author":"E P Torrance","year":"1966","unstructured":"E. P. Torrance. Torrance Test of Creative Thinking: Directions Manual and Scoring Guide, Lexington, USA: Personnel Press, 1966."},{"issue":"2","key":"1546_CR19","doi-asserted-by":"publisher","first-page":"131","DOI":"10.1037\/aca0000251","volume":"13","author":"B Barbot","year":"2019","unstructured":"B. Barbot, R. Reiter-Palmon. Creativity assessment: Pitfalls, solutions, and standards. Psychology of Aesthetics, Creativity, and the Arts, vol. 13, no. 2, pp. 131\u2013132, 2019. DOI: https:\/\/doi.org\/10.1037\/aca0000251.","journal-title":"Psychology of Aesthetics, Creativity, and the Arts"},{"issue":"3\u20134","key":"1546_CR20","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1207\/S15326934CRJ1334_08","volume":"13","author":"R J Sternberg","year":"2001","unstructured":"R. J. Sternberg, E. L. Grigorenko. Guilford\u2019s structure of intellect model and model of creativity: Contributions and limitations. Creativity Research Journal, vol. 13, no. 3\u20134, pp. 309\u2013316, 2001. DOI: https:\/\/doi.org\/10.1207\/S15326934CRJ1334_08.","journal-title":"Creativity Research Journal"},{"issue":"2","key":"1546_CR21","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1037\/aca0000231","volume":"13","author":"S Acar","year":"2019","unstructured":"S. Acar, M. A. Runco. Divergent thinking: New methods, recent research, and extended theory. Psychology of Aesthetics, Creativity, and the Arts, vol. 13, no. 2, pp. 153\u2013158, 2019. DOI: https:\/\/doi.org\/10.1037\/aca0000231.","journal-title":"Psychology of Aesthetics, Creativity, and the Arts"},{"issue":"2","key":"1546_CR22","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1037\/aca0000220","volume":"13","author":"G M Cseh","year":"2019","unstructured":"G. M. Cseh, K. K. Jeffries. A scattered CAT: A critical evaluation of the consensual assessment technique for creativity research. Psychology of Aesthetics, Creativity, and the Arts, vol. 13, no. 2, pp. 159\u2013166, 2019. DOI: https:\/\/doi.org\/10.1037\/aca0000220.","journal-title":"Psychology of Aesthetics, Creativity, and the Arts"},{"issue":"2","key":"1546_CR23","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1037\/aca0000217","volume":"13","author":"J C Kaufman","year":"2019","unstructured":"J. C. Kaufman. Self-assessments of creativity: Not ideal, but better than you think. Psychology of Aesthetics, Creativity, and the Arts, vol. 13, no. 2, pp. 187\u2013192, 2019. DOI: https:\/\/doi.org\/10.1037\/aca0000217.","journal-title":"Psychology of Aesthetics, Creativity, and the Arts"},{"issue":"2","key":"1546_CR24","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1037\/aca0000233","volume":"13","author":"B Barbot","year":"2019","unstructured":"B. Barbot, R. W. Hass, R. Reiter-Palmon. Creativity assessment in psychological research: (Re) setting the standards. Psychology of Aesthetics, Creativity, and the Arts, vol. 13, no. 2, pp. 233\u2013240, 2019. DOI: https:\/\/doi.org\/10.1037\/aca0000233.","journal-title":"Psychology of Aesthetics, Creativity, and the Arts"},{"issue":"4","key":"1546_CR25","doi-asserted-by":"publisher","first-page":"302","DOI":"10.1037\/a0021917","volume":"5","author":"K H Kim","year":"2011","unstructured":"K. H. Kim. The APA 2009 division 10 debate: Are the Torrance tests of creative thinking still relevant in the 21st century? Psychology of Aesthetics, Creativity, and the Arts, vol. 5, no. 4, pp.302\u2013308, 2011. DOI: https:\/\/doi.org\/10.1037\/a0021917.","journal-title":"Psychology of Aesthetics, Creativity, and the Arts"},{"key":"1546_CR26","doi-asserted-by":"publisher","first-page":"103","DOI":"10.4324\/9780203063330","volume-title":"Longitudinal Studies of Creativity","author":"J A Plucker","year":"1999","unstructured":"J. A. Plucker. Is the proof in the pudding? Reanalyses of torrance\u2019s (1958 to present) longitudinal data. Longitudinal Studies of Creativity, M. A. Runco, Ed., New York, USA: Routledge, pp. 103\u2013114, 1999. DOI: https:\/\/doi.org\/10.4324\/9780203063330."},{"issue":"1","key":"1546_CR27","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1207\/s15326934crj1801_2","volume":"18","author":"K H Kim","year":"2006","unstructured":"K. H. Kim. Can we trust creativity tests? A review of the Torrance tests of creative thinking (TTCT). Creativity Research Journal, vol. 18, no. 1, pp. 3\u201314, 2006. DOI: https:\/\/doi.org\/10.1207\/s15326934crj1801_2.","journal-title":"Creativity Research Journal"},{"issue":"3","key":"1546_CR28","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1016\/j.rpto.2015.06.002","volume":"31","author":"S da Costa","year":"2015","unstructured":"S. da Costa, D. P\u00e1ez, F. S\u00e1nchez, M. Garaigordobil, S. Gondim. Personal factors of creativity: A second order meta-analysis. Journal of Work and Organizational Psychology, vol. 31, no. 3, pp. 165\u2013173, 2015. DOI: https:\/\/doi.org\/10.1016\/j.rpto.2015.06.002.","journal-title":"Journal of Work and Organizational Psychology"},{"issue":"1","key":"1546_CR29","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1080\/10400410802633400","volume":"21","author":"H H Ma","year":"2009","unstructured":"H. H. Ma. The effect size of variables associated with creativity: A meta-analysis. Creativity Research Journal, vol. 21, no. 1, pp.30\u201342, 2009. DOI: https:\/\/doi.org\/10.1080\/10400410802633400.","journal-title":"Creativity Research Journal"},{"key":"1546_CR30","doi-asserted-by":"publisher","unstructured":"Y. P. Chang, X. Wang, J. D. Wang, Y. Wu, L. Y. Yang, K. J. Zhu, H. Chen, X. Y. Yi, C. X. Wang, Y. D. Wang, W. Ye, Y. Zhang, Y. Chang, P. S. Yu, Q. Yang, X. Xie. A survey on evaluation of large language models. ACM Transactions on Intelligent Systems and Technology, vol. 15, no. 3, Article number 39, 2024. DOI: https:\/\/doi.org\/10.1145\/3641289.","DOI":"10.1145\/3641289"},{"key":"1546_CR31","doi-asserted-by":"publisher","unstructured":"R. Shiffrin, M. Mitchell. Probing the psychology of AI models. Proceedings of the National Academy of Sciences of the United States of America, vol. 120, no. 10, Article number e2300963120, 2023. DOI: https:\/\/doi.org\/10.1073\/pnas.2300963120.","DOI":"10.1073\/pnas.2300963120"},{"key":"1546_CR32","doi-asserted-by":"crossref","unstructured":"G. Franceschelli, M. Musolesi. On the creativity of large language models, [Online], Available: https:\/\/arxiv.org\/abs\/2304.00008, 2023.","DOI":"10.1007\/s00146-024-02127-3"},{"key":"1546_CR33","volume-title":"Proceedings of AAAI Workshop on Creative AI Across Modalities","author":"D Summers-Stay","year":"2023","unstructured":"D. Summers-Stay, S. Lukin, C. Voss. Brainstorm, then select: A generative language model improves its creativity score. In Proceedings of AAAI Workshop on Creative AI Across Modalities, 2023."},{"key":"1546_CR34","first-page":"164","volume-title":"Proceedings of the 13th International Conference on Computational Creativity","author":"C Stevenson","year":"2022","unstructured":"C. Stevenson, I. Smal, M. Baas, R. P. P. P. Grasman, H. L. J. van der Maas. Putting GPT-3\u2019s creativity to the (alternative uses) test. In Proceedings of the 13th International Conference on Computational Creativity, Bozen-Bolzano, Italy, pp. 164\u2013168, 2022."},{"key":"1546_CR35","volume-title":"Proceedings of the 37th International Conference on Neural Information Processing Systems","author":"S A Naeini","year":"2023","unstructured":"S. A. Naeini, R. Saqur, M. Saeidi, J. Giorgi, B. Taati. Large language models are fixated by red herrings: Exploring creative problem solving and einstellung effect using the only connect wall dataset. In Proceedings of the 37th International Conference on Neural Information Processing Systems, New Orleans, USA, Article number 246, 2023."},{"key":"1546_CR36","doi-asserted-by":"publisher","unstructured":"E. E. Guzik, C. Byrge, C. Gilde. The originality of machines: AI takes the Torrance test. Journal of Creativity, vol. 33, no. 3, Article number 100065, 2023. DOI: https:\/\/doi.org\/10.1016\/j.yjoc.2023.100065.","DOI":"10.1016\/j.yjoc.2023.100065"},{"key":"1546_CR37","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642731","volume-title":"Proceedings of CHI Conference on Human Factors in Computing Systems","author":"T Chakrabarty","year":"2024","unstructured":"T. Chakrabarty, P. Laban, D. Agarwal, S. Muresan, C. S. Wu. Art or artifice? Large language models and the false promise of creativity. In Proceedings of CHI Conference on Human Factors in Computing Systems, Honolulu, USA, Article number 30, 2024. DOI: https:\/\/doi.org\/10.1145\/3613904.3642731."},{"issue":"3\u20134","key":"1546_CR38","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1207\/S15326934CRJ1334_07","volume":"13","author":"T I Lubart","year":"2001","unstructured":"T. I. Lubart. Models of the creative process: Past, present and future. Creativity Research Journal, vol. 13, no. 3\u20134, pp. 295\u2013308, 2001. DOI: https:\/\/doi.org\/10.1207\/S15326934CRJ1334_07.","journal-title":"Creativity Research Journal"},{"issue":"1","key":"1546_CR39","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1080\/10400419.2012.649181","volume":"24","author":"M Batey","year":"2012","unstructured":"M. Batey. The measurement of creativity: From definitional consensus to the introduction of a new heuristic framework. Creativity Research Journal, vol. 24, no. 1, pp.55\u201365, 2012. DOI: https:\/\/doi.org\/10.1080\/10400419.2012.649181.","journal-title":"Creativity Research Journal"},{"issue":"3","key":"1546_CR40","doi-asserted-by":"publisher","first-page":"258","DOI":"10.1016\/j.tsc.2012.04.009","volume":"7","author":"D Piffer","year":"2012","unstructured":"D. Piffer. Can creativity be measured? An attempt to clarify the notion of creativity and general directions for future research. Thinking Skills and Creativity, vol. 7, no. 3, pp. 258\u2013264, 2012. DOI: https:\/\/doi.org\/10.1016\/j.tsc.2012.04.009.","journal-title":"Thinking Skills and Creativity"},{"issue":"2","key":"1546_CR41","doi-asserted-by":"publisher","first-page":"757","DOI":"10.3758\/s13428-020-01453-w","volume":"53","author":"R E Beaty","year":"2021","unstructured":"R. E. Beaty, D. R. Johnson. Automating creativity assessment with SemDis: An open platform for computing semantic distance. Behavior Research Methods, vol. 53, no. 2, pp. 757\u2013780, 2021. DOI: https:\/\/doi.org\/10.3758\/s13428-020-01453-w.","journal-title":"Behavior Research Methods"},{"issue":"1","key":"1546_CR42","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1177\/00169862211061874","volume":"67","author":"S Acar","year":"2023","unstructured":"S. Acar, K. Berthiaume, K. Grajzel, D. Dumas, C. Flemister, P. Organisciak. Applying automated originality scoring to the verbal form of Torrance tests of creative thinking. Gifted Child Quarterly, vol. 67, no. 1, pp.3\u201317, 2023. DOI: https:\/\/doi.org\/10.1177\/00169862211061874.","journal-title":"Gifted Child Quarterly"},{"key":"1546_CR43","volume-title":"Proceedings of the 37th International Conference on Neural Information Processing Systems","author":"Y S Bai","year":"2023","unstructured":"Y. S. Bai, J. H. Ying, Y. X. Cao, X. Lv, Y. Z. He, X. Z. Wang, J. F. Yu, K. S. Zeng, Y. J. Xiao, H. Z. Lyu, J. Y. Zhang, J. Z. Li, L. Hou. Benchmarking foundation models with language-model-as-an-examiner. In Proceedings of the 37th International Conference on Neural Information Processing Systems, New Orleans, USA, 2023."},{"key":"1546_CR44","volume-title":"Proceedings of the 37th International Conference on Neural Information Processing Systems","author":"L M Zheng","year":"2023","unstructured":"L. M. Zheng, W. L. Chiang, Y. Sheng, S. Y. Zhuang, Z. H. Wu, Y. H. Zhuang, Z. Lin, Z. H. Li, D. C. Li, E. P. Xing, H. Zhang, J. E. Gonzalez, I. Stoica. Judging LLM-as-a-judge with MT-bench and chatbot arena. In Proceedings of the 37th International Conference on Neural Information Processing Systems, New Orleans, USA, Article number 2020, 2023."},{"key":"1546_CR45","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.ijcnlp-main.45","volume-title":"Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics","author":"Y J Bang","year":"2023","unstructured":"Y. J. Bang, S. Cahyawijaya, N. Lee, W. L. Dai, D. Su, B. Wilie, H. Lovenia, Z. W. Ji, T. Z. Yu, W. Chung, Q. V. Do, Y. Xu, P. Fung. A multitask, multilingual, multimodal evaluation of ChatGPT on reasoning, hallucination, and interactivity. In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, Nusa Dua, Indonesia, 2023. DOI: https:\/\/doi.org\/10.18653\/v1\/2023.ijcnlp-main.45."},{"key":"1546_CR46","volume-title":"Proceedings of the 12th International Conference on Learning Representations","author":"Y Wang","year":"2024","unstructured":"Y. Wang, Z. Yu, Z. Zeng, L. Yang, C. Wang, H. Chen, C. Jiang, R. Xie, J. Wang, X. Xie, et al. PandaLM: An automatic evaluation benchmark for LLM instruction tuning optimization. In Proceedings of the 12th International Conference on Learning Representations, Vienna, Austria, 2024."},{"key":"1546_CR47","volume-title":"Proceedings of the 12th International Conference on Learning Representations","author":"C M Chan","year":"2024","unstructured":"C. M. Chan, W. Z. Chen, Y. S. Su, J. X. Yu, W. Xue, S. H. Zhang, J. Fu, Z. Y. Liu. ChatEval: Towards better LLM-based evaluators through multi-agent debate. In Proceedings of the 12th International Conference on Learning Representations, Vienna, Austria, 2024."},{"key":"1546_CR48","doi-asserted-by":"publisher","first-page":"2299","DOI":"10.18653\/v1\/2024.findings-naacl.149","volume-title":"Proceedings of Findings of the Association for Computational Linguistics","author":"W J Zhong","year":"2024","unstructured":"W. J. Zhong, R. X. Cui, Y. D. Guo, Y. B. Liang, S. Lu, Y. L. Wang, A. Saied, W. Z. Chen, N. Duan. AGIEval: A human-centric benchmark for evaluating foundation models. In Proceedings of Findings of the Association for Computational Linguistics, Mexico City, Mexico, pp. 2299\u20132314, 2024. DOI: https:\/\/doi.org\/10.18653\/v1\/2024.findings-naacl.149."},{"key":"1546_CR49","volume-title":"Proceedings of the 37th International Conference on Neural Information Processing Systems","author":"Y Dubois","year":"2024","unstructured":"Y. Dubois, X. C. Li, R. Taori, T. Y. Zhang, I. Gulrajani, J. Ba, C. Guestrin, P. Liang, T. B. Hashimoto. AlpacaFarm: A simulation framework for methods that learn from human feedback. In Proceedings of the 37th International Conference on Neural Information Processing Systems, New Orleans, USA, Article number 1308, 2024."},{"key":"1546_CR50","doi-asserted-by":"publisher","first-page":"2511","DOI":"10.18653\/v1\/2023.emnlp-main.153","volume-title":"Proceedings of Conference on Empirical Methods in Natural Language Processing","author":"Y Liu","year":"2023","unstructured":"Y. Liu, D. Iter, Y. C. Xu, S. H. Wang, R. C. Xu, C. G. Zhu. G-Eval: NLG evaluation using Gpt-4 with better human alignment. In Proceedings of Conference on Empirical Methods in Natural Language Processing, Singapore, pp. 2511\u20132522, 2023. DOI: https:\/\/doi.org\/10.18653\/v1\/2023.emnlp-main.153."},{"issue":"11","key":"1546_CR51","doi-asserted-by":"publisher","first-page":"688","DOI":"10.1038\/S44159-023-00241-5","volume":"2","author":"D Demszky","year":"2023","unstructured":"D. Demszky, D. Y. Yang, D. S. Yeager, C. J. Bryan, M. Clapper, S. Chandhok, J. C. Eichstaedt, C. Hecht, J. Jamieson, M. Johnson, M. Jones, D. Krettek-Cobb, L. Lai, N. Jonesmitchell, D. C. Ong, C. S. Dweck, J. J. Gross, J. W. Pennebaker. Using large language models in psychology. Nature Reviews Psychology, vol. 2, no. 11, pp. 688\u2013701, 2023. DOI: https:\/\/doi.org\/10.1038\/S44159-023-00241-5.","journal-title":"Nature Reviews Psychology"},{"key":"1546_CR52","doi-asserted-by":"publisher","first-page":"38","DOI":"10.18653\/v1\/2020.emnlp-demos.6","volume-title":"Proceedings of Conference on Empirical Methods in Natural Language Processing: System Demonstrations","author":"T Wolf","year":"2020","unstructured":"T. Wolf, L. Debut, V. Sanh, J. Chaumond, C. Delangue, A. Moi, P. Cistac, T. Rault, R. Louf, M. Funtowicz, J. Davison, S. Shleifer, P. von Platen, C. Ma, Y. Jernite, J. Plu, C. W. Xu, T. Le Scao, S. Gugger, M. Drame, Q. Lhoest, A. Rush. Transformers: State-of-the-art natural language processing. In Proceedings of Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 38\u201345, 2020. DOI: https:\/\/doi.org\/10.18653\/v1\/2020.emnlp-demos.6."},{"key":"1546_CR53","unstructured":"J. Z. Bai, S. Bai, Y. F. Chu, Z. Y. Cui, K. Dang, X. D. Deng, Y. Fan, W. B. Ge, Y. Han, F. Huang, B. Y. Hui, L. Ji, M. Li, J. Y. Lin, R. J. Lin, D. H. Liu, G. Liu, C. Q. Lu, K. M. Lu, J. X. Ma, R. Men, X. Z. Ren, X. C. Ren, C. Q. Tan, S. N. Tan, J. H. Tu, P. Wang, S. J. Wang, W. Wang, S. G. Wu, B. F. Xu, J. Xu, A. Yang, H. Yang, J. Yang, S. S. Yang, Y. Yao, B. W. Yu, H. Y. Yuan, Z. Yuan, J. W. Zhang, X. X. Zhang, Y. C. Zhang, Z. R. Zhang, C. Zhou, J. R. Zhou, X. H. Zhou, T. H. Zhu. Qwen technical report, [Online], Available: https:\/\/arxiv.org\/abs\/2309.16609, 2023."},{"key":"1546_CR54","doi-asserted-by":"publisher","first-page":"3214","DOI":"10.18653\/v1\/2022.acl-long.229","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"S Lin","year":"2022","unstructured":"S. Lin, J. Hilton, O. Evans. TruthfulQA: Measuring how models mimic human falsehoods. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, Dublin, Ireland, pp.3214\u20133252, 2022. DOI: https:\/\/doi.org\/10.18653\/v1\/2022.acl-long.229."},{"key":"1546_CR55","volume-title":"Proceedings of the 37th Conference on Neural Information Processing Systems","author":"T Xiang","year":"2023","unstructured":"T. Xiang, L. Z. Li, W. Y. Li, M. B. Bai, L. Wei, B. W. Wang, N. Garcia. CARE-MI: Chinese benchmark for misinformation evaluation in maternity and infant care. In Proceedings of the 37th Conference on Neural Information Processing Systems, 2023."},{"key":"1546_CR56","volume-title":"Proceedings of the 37th Conference on Neural Information Processing Systems","author":"Q T Xu","year":"2023","unstructured":"Q. T. Xu, F. L. Hong, B. Li, C. R. Hu, Z. Y. Chen, J. Zhang. On the tool manipulation capability of open-source large language models. In Proceedings of the 37th Conference on Neural Information Processing Systems, 2023."},{"key":"1546_CR57","first-page":"22199","volume-title":"Proceedings of the 36th International Conference on Neural Information Processing Systems","author":"T Kojima","year":"2022","unstructured":"T. Kojima, S. S. Gu, M. Reid, Y. Matsuo, Y. Iwasawa. Large language models are zero-shot reasoners. In Proceedings of the 36th International Conference on Neural Information Processing Systems, New Orleans, USA, pp.22199\u201322213, 2022."},{"key":"1546_CR58","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1016\/B978-0-12-374714-3.00014-8","volume-title":"Handbook of Organizational Creativity","author":"P B Paulus","year":"2012","unstructured":"P. B. Paulus, M. Dzindolet, N. W. Kohn. Collaborative creativity-group creativity and team innovation. Handbook of Organizational Creativity, M. D. Mumford, Ed., Amsterdam, The Netherlands: Elsevier, pp. 327\u2013357, 2012. DOI: https:\/\/doi.org\/10.1016\/B978-0-12-374714-3.00014-8."},{"key":"1546_CR59","doi-asserted-by":"publisher","unstructured":"M. S. Barrett, A. Creech, K. Zhukov. Creative collaboration and collaborative creativity: A systematic literature review. Frontiers in Psychology, vol. 12, Article number 713445, 2021. DOI: https:\/\/doi.org\/10.3389\/fpsyg.2021.713445.","DOI":"10.3389\/fpsyg.2021.713445"},{"issue":"4","key":"1546_CR60","doi-asserted-by":"publisher","first-page":"540","DOI":"10.1037\/a0012746","volume":"8","author":"C MacCann","year":"2008","unstructured":"C. MacCann, R. D. Roberts. New paradigms for assessing emotional intelligence: Theory and data. Emotion, vol. 8, no. 4, pp. 540\u2013551, 2008. DOI: https:\/\/doi.org\/10.1037\/a0012746.","journal-title":"Emotion"},{"issue":"1","key":"1546_CR61","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1080\/00223890802484381","volume":"91","author":"R N Spreng","year":"2009","unstructured":"R. N. Spreng, M. C. McKinnon, R. A. Mar, B. Levine. The Toronto empathy questionnaire: Scale development and initial validation of a factor-analytic solution to multiple empathy measures. Journal of Personality Assessment, vol. 91, no. 1, pp.62\u201371, 2009. DOI: https:\/\/doi.org\/10.1080\/00223890802484381.","journal-title":"Journal of Personality Assessment"},{"key":"1546_CR62","first-page":"35","volume-title":"Measures in Health Psychology: A User\u2019s Portfolio","author":"R Schwarzer","year":"1995","unstructured":"R. Schwarzer, M. Jerusalem. Generalized self-efficacy scale. Measures in Health Psychology: A User\u2019s Portfolio, J. Weinman, S. Wright, M. Johnston, Eds., Windsor, UK: NFER-NELSON, pp. 35\u201337, 1995."},{"key":"1546_CR63","volume-title":"Handbook of Personality: Theory and Research","author":"O P John","year":"1999","unstructured":"O. P. John, S. Srivastava. The big five trait taxonomy: History, measurement, and theoretical perspectives. Handbook of Personality: Theory and Research, 2nd ed., L. A. Pervin, O. P. John, Eds., New York, USA: Guilford Press, 1999.","edition":"2nd ed."},{"key":"1546_CR64","doi-asserted-by":"publisher","unstructured":"M. H. Qian, J. A. Plucker, X. D. Yang. Is creativity domain specific or domain general? Evidence from multilevel explanatory item response theory models. Thinking Skills and Creativity, vol. 33, Article number 100571, 2019. DOI: https:\/\/doi.org\/10.1016\/j.tsc.2019.100571.","DOI":"10.1016\/j.tsc.2019.100571"},{"key":"1546_CR65","unstructured":"D. Driess, F. Xia, M. S. Sajjadi, C. Lynch, A. Chowdhery, B. Ichter, A. Wahid, J. Tompson, Q. Vuong, T. H. Yu, W. L. Huang, Y. Chebotar, P. Sermanet, D. Duckworth, S. Levine, V. Vanhoucke, K. Hausman, M. Toussaint, K. Greff, A. Zeng, I. Mordatch, P. Florence. PaLM-E: An embodied multimodal language model, [Online], Available: https:\/\/arxiv.org\/abs\/2303.03378, 2023."},{"issue":"9","key":"1546_CR66","doi-asserted-by":"publisher","first-page":"10850","DOI":"10.1109\/TPAMI.2023.3261988","volume":"45","author":"F A Croitoru","year":"2023","unstructured":"F. A. Croitoru, V. Hondru, R. T. Ionescu, M. Shah. Diffusion models in vision: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 9, pp. 10850\u201310869, 2023. DOI: https:\/\/doi.org\/10.1109\/TPAMI.2023.3261988.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"1546_CR67","doi-asserted-by":"publisher","unstructured":"F. Carnovalini, A. Rod\u00e0. Computational creativity and music generation systems: An introduction to the state of the art. Frontiers in Artificial Intelligence, vol. 3, Article number 14, 2020. DOI: https:\/\/doi.org\/10.3389\/frai.2020.00014.","DOI":"10.3389\/frai.2020.00014"},{"key":"1546_CR68","doi-asserted-by":"publisher","unstructured":"L. Wang, C. Ma, X. Y. Feng, Z. Y. Zhang, H. Yang, J. S. Zhang, Z. Y. Chen, J. K. Tang, X. Chen, Y. K. Lin, W. X. Zhao, Z. W. Wei, J. R. Wen. A survey on large language model based autonomous agents. Frontiers of Computer Science, vol. 18, no. 6, Article number 186345, 2024. DOI: https:\/\/doi.org\/10.1007\/s11704-024-40231-1.","DOI":"10.1007\/s11704-024-40231-1"}],"container-title":["Machine Intelligence Research"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11633-025-1546-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11633-025-1546-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11633-025-1546-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,27]],"date-time":"2025-05-27T07:02:50Z","timestamp":1748329370000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11633-025-1546-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4,28]]},"references-count":68,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2025,6]]}},"alternative-id":["1546"],"URL":"https:\/\/doi.org\/10.1007\/s11633-025-1546-4","relation":{},"ISSN":["2731-538X","2731-5398"],"issn-type":[{"value":"2731-538X","type":"print"},{"value":"2731-5398","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,4,28]]},"assertion":[{"value":"24 August 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 January 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 April 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declared that they have no conflicts of interest to this work.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations of conflict of interest"}}]}}