{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T09:09:29Z","timestamp":1779354569707,"version":"3.51.4"},"reference-count":35,"publisher":"Wiley","issue":"3","license":[{"start":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T00:00:00Z","timestamp":1779321600000},"content-version":"vor","delay-in-days":20,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"},{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/doi.wiley.com\/10.1002\/tdm_license_1.1"}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Computer Animation & Virtual"],"published-print":{"date-parts":[[2026,5]]},"abstract":"<jats:title>ABSTRACT<\/jats:title>\n                  <jats:p>City\u2010scale 3D urban generation requires planning\u2010level semantic grounding from user intent and scalable geometric synthesis with structural validity and editability. Procedural content generation (PCG) offers controllability and scalability, but is hard to author due to high\u2010dimensional parameters and nonintuitive workflows. Meanwhile, directly generating city geometry or scripts from text with LLMs can suffer from weak large\u2010scale consistency and limited geometric validity, hindering downstream editing and engine deployment. We present Text\u2010to\u20103D City, a plan\u2010then\u2010execute framework that couples an LLM\u2010based City Planner with a PCG\u2010based Implementer. Given a natural language description, the Planner grounds textual intent into a structured city plan by composing PCG parameters via a schema and in\u2010context exemplars. 
The Implementer deterministically executes road generation, block extraction, lot subdivision, and asset placement with validity checks and reproducible seeding to synthesize an engine\u2010ready 3D city. Experiments on multi\u2010view renderings evaluate text\u2010scene alignment, diversity, realism, and runtime, demonstrating rapid generation and scalability to large urban scenes.<\/jats:p>","DOI":"10.1002\/cav.70124","type":"journal-article","created":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T08:31:28Z","timestamp":1779352288000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Text\u2010to\u20103D City: Plan\u2010then\u2010Execute Urban Generation With\u00a0LLM Planners and Procedural Synthesis"],"prefix":"10.1002","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7969-3134","authenticated-orcid":false,"given":"Xiaohang","family":"Dong","sequence":"first","affiliation":[{"name":"College of Computer Science Nankai University  Tianjin China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-2373-6833","authenticated-orcid":false,"given":"Hualong","family":"Yu","sequence":"additional","affiliation":[{"name":"College of Computer Science Nankai University  Tianjin China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-2676-0576","authenticated-orcid":false,"given":"Xu","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Computer Science Nankai University  Tianjin China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-0905-1197","authenticated-orcid":false,"given":"Jianye","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Computer Science Nankai University  Tianjin 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1592-2783","authenticated-orcid":false,"given":"Qicheng","family":"Li","sequence":"additional","affiliation":[{"name":"College of Computer Science Nankai University  Tianjin China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2026,5,21]]},"reference":[{"issue":"1","key":"e_1_2_10_2_1","doi-asserted-by":"crossref","DOI":"10.1111\/cgf.14989","article-title":"A Survey of Procedural Modelling Methods for Layout Generation of Virtual Scenes","volume":"43","author":"Cogo E.","year":"2024","journal-title":"Computer Graphics Forum"},{"issue":"2","key":"e_1_2_10_3_1","doi-asserted-by":"crossref","DOI":"10.3390\/computers8020038","article-title":"Procedural Modeling of Buildings Composed of Arbitrarily\u2010Shaped Floor\u2010Plans: Background, Progress, Contributions and Challenges of a Methodology Oriented to Cultural Heritage","volume":"8","author":"Ad\u00e3o T.","year":"2019","journal-title":"Computers"},{"key":"e_1_2_10_4_1","first-page":"43447","volume-title":"Advances in Neural Information Processing Systems","author":"Lu P.","year":"2023"},{"issue":"18","key":"e_1_2_10_5_1","doi-asserted-by":"crossref","first-page":"20256","DOI":"10.1609\/aaai.v38i18.30006","article-title":"Generalized Planning in PDDL Domains With Pretrained Large Language Models","volume":"38","author":"Silver T.","year":"2024","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"key":"e_1_2_10_6_1","volume-title":"Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology","author":"Park J. 
S.","year":"2023"},{"key":"e_1_2_10_7_1","first-page":"82","volume-title":"Advances in Neural Information Processing Systems","author":"Wu J.","year":"2016"},{"key":"e_1_2_10_8_1","doi-asserted-by":"crossref","first-page":"10674","DOI":"10.1109\/CVPR52688.2022.01042","volume-title":"Proceedings of the 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Rombach R.","year":"2022"},{"key":"e_1_2_10_9_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Jain A.","year":"2022"},{"key":"e_1_2_10_10_1","volume-title":"International Conference on Learning Representations (ICLR)","author":"Poole B.","year":"2023"},{"key":"e_1_2_10_11_1","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Lin C.\u2010H.","year":"2023"},{"key":"e_1_2_10_12_1","volume-title":"Advances in Neural Information Processing Systems","author":"Yang X.","year":"2024"},{"key":"e_1_2_10_13_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Yu H.\u2010X.","year":"2024"},{"key":"e_1_2_10_14_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Xie H.","year":"2024"},{"key":"e_1_2_10_15_1","first-page":"22751","volume-title":"Proceedings of the 2023 IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"Lin C. 
H.","year":"2023"},{"issue":"12","key":"e_1_2_10_16_1","doi-asserted-by":"crossref","first-page":"15562","DOI":"10.1109\/TPAMI.2023.3321857","article-title":"SceneDreamer: Unbounded 3D Scene Generation From 2D Image Collections","volume":"45","author":"Chen Z.","year":"2023","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_2_10_17_1","doi-asserted-by":"crossref","first-page":"20863","DOI":"10.1109\/CVPR52729.2023.01999","volume-title":"Proceedings of the 2023 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Chai L.","year":"2023"},{"key":"e_1_2_10_18_1","first-page":"473","volume-title":"Proceedings of the 2023 IEEE 17th International Symposium on Applied Computational Intelligence and Informatics (SACI)","author":"Erdei B.","year":"2023"},{"key":"e_1_2_10_19_1","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1145\/1185657.1185716","volume-title":"ACM SIGGRAPH 2006 Courses","author":"Muller P.","year":"2006"},{"key":"e_1_2_10_20_1","first-page":"4","volume-title":"ACM Transactions on Graphics","author":"Merrell P."},{"key":"e_1_2_10_21_1","first-page":"12630","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Raistrick A.","year":"2023"},{"issue":"5","key":"e_1_2_10_22_1","doi-asserted-by":"crossref","first-page":"911","DOI":"10.1007\/s00371-019-01701-x","article-title":"CityCraft: 3D Virtual City Creation From a Single Image","volume":"36","author":"Kim S.","year":"2020","journal-title":"Visual Computer"},{"key":"e_1_2_10_23_1","unstructured":"J. Deng, W. Chai, J. Guo, et al., \u201cCityGen: Infinite and Controllable 3D City Layout Generation\u201d (2023). arXiv preprint arXiv:2312.01508."},{"key":"e_1_2_10_24_1","unstructured":"C. Sun, J. Han, W. Deng, X. Wang, Z. Qin, and S. Gould, \u201c3D\u2010GPT: Procedural 3D Modeling With Large Language Models\u201d (2023). 
arXiv preprint arXiv:2310.12945."},{"key":"e_1_2_10_25_1","unstructured":"Q. Dong, L. Li, D. Dai, et al., \u201cA Survey for in\u2010Context Learning\u201d (2023). arXiv preprint arXiv:2301.00234."},{"key":"e_1_2_10_26_1","doi-asserted-by":"crossref","first-page":"681","DOI":"10.1111\/j.1467-8659.2012.03047.x","article-title":"Procedural Generation of Parcels in Urban Modeling","volume":"31","author":"Vanegas C.","year":"2012","journal-title":"Computer Graphics Forum"},{"key":"e_1_2_10_27_1","first-page":"8748","volume-title":"Proceedings of the 38th International Conference on Machine Learning","author":"Radford A.","year":"2021"},{"key":"e_1_2_10_28_1","first-page":"2818","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Cherti M.","year":"2023"},{"key":"e_1_2_10_29_1","first-page":"770","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"He K.","year":"2016"},{"key":"e_1_2_10_30_1","doi-asserted-by":"crossref","unstructured":"Y. Hirakawa, T. Wada, K. Morishita, et al., \u201cAn Empirical Analysis of GPT\u20104V's Performance on Fashion Aesthetic Evaluation\u201d (2024). arXiv preprint arXiv:2410.23730.","DOI":"10.1145\/3681758.3698022"},{"issue":"3","key":"e_1_2_10_31_1","doi-asserted-by":"crossref","first-page":"862","DOI":"10.1109\/TPAMI.2020.3019967","article-title":"Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero\u2010Shot Cross\u2010Dataset Transfer","volume":"44","author":"Ranftl R.","year":"2022","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_2_10_32_1","first-page":"6629","volume-title":"Proceedings of the 31st International Conference on Neural Information Processing Systems","author":"Heusel M.","year":"2017"},{"key":"e_1_2_10_33_1","unstructured":"M. Bi\u0144kowski, D. J. Sutherland, M. Arbel, and A. Gretton, \u201cDemystifying MMD GANs\u201d (2018). 
arXiv preprint arXiv:1801.01401."},{"key":"e_1_2_10_34_1","first-page":"10806","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence, Vol 39, 10806\u201310814","author":"Zhou M.","year":"2025"},{"key":"e_1_2_10_35_1","volume-title":"Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), 7249\u20137255","author":"Prakash A.","year":"2019"},{"key":"e_1_2_10_36_1","volume-title":"Neural Information Processing Systems","author":"Shen Y.\u2010C.","year":"2022"}],"container-title":["Computer Animation and Virtual Worlds"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/cav.70124","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/full-xml\/10.1002\/cav.70124","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/cav.70124","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T08:31:37Z","timestamp":1779352297000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/cav.70124"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,5]]},"references-count":35,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2026,5]]}},"alternative-id":["10.1002\/cav.70124"],"URL":"https:\/\/doi.org\/10.1002\/cav.70124","archive":["Portico"],"relation":{},"ISSN":["1546-4261","1546-427X"],"issn-type":[{"value":"1546-4261","type":"print"},{"value":"1546-427X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,5]]},"assertion":[{"value":"2026-04-19","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication 
History"}},{"value":"2026-05-12","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-05-21","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"e70124"}}