{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,29]],"date-time":"2025-08-29T00:05:14Z","timestamp":1756425914004,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":53,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,8,31]]},"DOI":"10.1145\/3743049.3748579","type":"proceedings-article","created":{"date-parts":[[2025,8,28]],"date-time":"2025-08-28T14:03:01Z","timestamp":1756389781000},"page":"610-616","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["From Text to Immersion: A Modular Software Pipeline to Generate Audiovisual Environments from Text Prompts"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-2506-8891","authenticated-orcid":false,"given":"Jimmy","family":"Orawetz","sequence":"first","affiliation":[{"name":"Faculty of Informatics \/ Mathematics, University of Applied Sciences Dresden, Dresden, Saxony, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3822-6043","authenticated-orcid":false,"given":"Dietrich","family":"Kammer","sequence":"additional","affiliation":[{"name":"Faculty of Informatics \/ Mathematics, University of Applied Sciences Dresden, Dresden, Saxony, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3620-3331","authenticated-orcid":false,"given":"Georg","family":"Freitag","sequence":"additional","affiliation":[{"name":"Faculty of Informatics \/ Mathematics, University of Applied Sciences Dresden, Dresden, Saxony, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,8,30]]},"reference":[{"key":"e_1_3_3_2_2_2","doi-asserted-by":"publisher","unstructured":"Tadas Baltru\u0161aitis Chaitanya 
Ahuja and Louis-Philippe Morency. 2017. Multimodal Machine Learning: A Survey and Taxonomy. 10.48550\/arXiv.1705.09406 arxiv:https:\/\/arXiv.org\/abs\/1705.09406","DOI":"10.48550\/arXiv.1705.09406"},{"key":"e_1_3_3_2_3_2","unstructured":"James Betker Gabriel Goh Li Jing Tim Brooks Jianfeng Wang Linjie Li Long Ouyang Juntang Zhuang Joyce Lee Yufei Guo Wesam Manassra Prafulla Dhariwal Casey Chu Yunxin Jiao and Aditya Ramesh. 2024. Improving Image Generation with Better Captions. https:\/\/www.semanticscholar.org\/paper\/Improving-Image-Generation-with-Better-Captions-Betker-Goh\/cfee1826dd4743eab44c6e27a0cc5970effa4d80"},{"key":"e_1_3_3_2_4_2","doi-asserted-by":"publisher","unstructured":"J. Bieniek M. Rahouti and D.\u00a0C. Verma. 2024. Generative AI in Multimodal User Interfaces: Trends Challenges and Cross-Platform Adaptability. 10.48550\/arXiv.2411.10234","DOI":"10.48550\/arXiv.2411.10234"},{"key":"e_1_3_3_2_5_2","doi-asserted-by":"publisher","unstructured":"Rishi Bommasani Drew\u00a0A. Hudson Ehsan Adeli Russ Altman Simran Arora Sydney\u00a0von Arx Michael\u00a0S. Bernstein Jeannette Bohg Antoine Bosselut Emma Brunskill Erik Brynjolfsson Shyamal Buch Dallas Card Rodrigo Castellon Niladri Chatterji Annie Chen Kathleen Creel Jared\u00a0Quincy Davis Dora Demszky Chris Donahue Moussa Doumbouya Esin Durmus Stefano Ermon John Etchemendy Kawin Ethayarajh Li Fei-Fei Chelsea Finn Trevor Gale Lauren Gillespie Karan Goel Noah Goodman Shelby Grossman Neel Guha Tatsunori Hashimoto Peter Henderson John Hewitt Daniel\u00a0E. Ho Jenny Hong Kyle Hsu Jing Huang Thomas Icard Saahil Jain Dan Jurafsky Pratyusha Kalluri Siddharth Karamcheti Geoff Keeling Fereshte Khani Omar Khattab Pang\u00a0Wei Koh Mark Krass Ranjay Krishna Rohith Kuditipudi Ananya Kumar Faisal Ladhak Mina Lee Tony Lee Jure Leskovec Isabelle Levent Xiang\u00a0Lisa Li Xuechen Li Tengyu Ma Ali Malik Christopher\u00a0D. 
Manning Suvir Mirchandani Eric Mitchell Zanele Munyikwa Suraj Nair Avanika Narayan Deepak Narayanan Ben Newman Allen Nie Juan\u00a0Carlos Niebles Hamed Nilforoshan Julian Nyarko Giray Ogut Laurel Orr Isabel Papadimitriou Joon\u00a0Sung Park Chris Piech Eva Portelance Christopher Potts Aditi Raghunathan Rob Reich Hongyu Ren Frieda Rong Yusuf Roohani Camilo Ruiz Jack Ryan Christopher R\u00e9 Dorsa Sadigh Shiori Sagawa Keshav Santhanam Andy Shih Krishnan Srinivasan Alex Tamkin Rohan Taori Armin\u00a0W. Thomas Florian Tram\u00e8r Rose\u00a0E. Wang William Wang Bohan Wu Jiajun Wu Yuhuai Wu Sang\u00a0Michael Xie Michihiro Yasunaga Jiaxuan You Matei Zaharia Michael Zhang Tianyi Zhang Xikun Zhang Yuhui Zhang Lucia Zheng Kaitlyn Zhou and Percy Liang. 2022. On the Opportunities and Risks of Foundation Models. arXiv:2108.07258 (2022). 10.48550\/arXiv.2108.07258 arxiv:https:\/\/arXiv.org\/abs\/2108.07258","DOI":"10.48550\/arXiv.2108.07258"},{"key":"e_1_3_3_2_6_2","volume-title":"The Shallows: What the Internet is Doing to Our Brains","author":"Carr Nicholas","year":"2010","unstructured":"Nicholas Carr. 2010. The Shallows: What the Internet is Doing to Our Brains. W. W. Norton & Company, New York. 174\u2013177 pages."},{"key":"e_1_3_3_2_7_2","unstructured":"Muxi Chen Yi Liu Jian Yi Changran Xu Qiuxia Lai Hongliang Wang Tsung-Yi Ho and Qiang Xu. 2024. Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis. arxiv:https:\/\/arXiv.org\/abs\/2403.05125\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2403.05125"},{"key":"e_1_3_3_2_8_2","unstructured":"comfyanonymous. 2023. ComfyUI \u2013 A Node-Based Stable Diffusion GUI. https:\/\/github.com\/comfyanonymous\/ComfyUI\/releases. https:\/\/github.com\/comfyanonymous\/ComfyUI\/releases Accessed: 2025-06-10. 
Windows Portable Version v0.3.40."},{"key":"e_1_3_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/166117.166134"},{"key":"e_1_3_3_2_10_2","doi-asserted-by":"publisher","unstructured":"Carolina Cruz-Neira Daniel\u00a0J. Sandin Thomas\u00a0A. DeFanti Robert\u00a0V. Kenyon and John\u00a0C. Hart. 1992. The CAVE: Audio Visual Experience Automatic Virtual Environment. Commun. ACM 35 6 (1992) 64\u201372. 10.1145\/129888.129892","DOI":"10.1145\/129888.129892"},{"key":"e_1_3_3_2_11_2","volume-title":"Using LoRA for Efficient Stable Diffusion Fine-Tuning","author":"Cuenca Pedro","year":"2023","unstructured":"Pedro Cuenca and Paul Sayak. 2023. Using LoRA for Efficient Stable Diffusion Fine-Tuning. https:\/\/huggingface.co\/blog\/lora"},{"key":"e_1_3_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP40776.2020.9052990"},{"key":"e_1_3_3_2_13_2","doi-asserted-by":"publisher","unstructured":"Zach Evans Julian\u00a0D. Parker C.\u00a0J. Carr Zack Zukowski Josiah Taylor and Jordi Pons. 2024. Stable Audio Open. 10.48550\/arXiv.2407.14358 arxiv:https:\/\/arXiv.org\/abs\/2407.14358","DOI":"10.48550\/arXiv.2407.14358"},{"key":"e_1_3_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/IVMSPW.2016.7528221"},{"key":"e_1_3_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1117\/12.2004790"},{"key":"e_1_3_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952261"},{"key":"e_1_3_3_2_17_2","doi-asserted-by":"publisher","unstructured":"Ian\u00a0J. Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative Adversarial Networks. 10.48550\/arXiv.1406.2661 arxiv:https:\/\/arXiv.org\/abs\/1406.2661","DOI":"10.48550\/arXiv.1406.2661"},{"key":"e_1_3_3_2_18_2","unstructured":"Google DeepMind. 2025. Veo3: Text\u2011to\u2011Video Generation with Synchronized Audio. Blog post and model page on VertexAI. 
https:\/\/deepmind.google\/models\/veo\/ Announced on VertexAI and DeepMind websites; no formal paper available."},{"key":"e_1_3_3_2_19_2","doi-asserted-by":"publisher","unstructured":"Yuwei Guo Ceyuan Yang Anyi Rao Zhengyang Liang Yaohui Wang Yu Qiao Maneesh Agrawala Dahua Lin and Bo Dai. 2024. AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning. 10.48550\/arXiv.2307.04725 arxiv:https:\/\/arXiv.org\/abs\/2307.04725","DOI":"10.48550\/arXiv.2307.04725"},{"key":"e_1_3_3_2_20_2","doi-asserted-by":"publisher","unstructured":"Jonathan Ho Ajay Jain and Pieter Abbeel. 2020. Denoising Diffusion Probabilistic Models. 10.48550\/arXiv.2006.11239 arxiv:https:\/\/arXiv.org\/abs\/2006.11239","DOI":"10.48550\/arXiv.2006.11239"},{"key":"e_1_3_3_2_21_2","doi-asserted-by":"publisher","unstructured":"Edward\u00a0J. Hu Yelong Shen Phillip Wallis Zeyuan Allen-Zhu Yuanzhi Li Shean Wang Lu Wang and Weizhu Chen. 2021. LoRA: Low-Rank Adaptation of Large Language Models. 10.48550\/arXiv.2106.09685 arxiv:https:\/\/arXiv.org\/abs\/2106.09685","DOI":"10.48550\/arXiv.2106.09685"},{"key":"e_1_3_3_2_22_2","doi-asserted-by":"publisher","unstructured":"Rongjie Huang Jiawei Huang Dongchao Yang Yi Ren Luping Liu Mingze Li Zhenhui Ye Jinglin Liu Xiang Yin and Zhou Zhao. 2023. Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. 10.48550\/arXiv.2301.12661","DOI":"10.48550\/arXiv.2301.12661"},{"key":"e_1_3_3_2_23_2","doi-asserted-by":"publisher","unstructured":"Muhammet\u00a0Furkan Ilaslan Ali Koksal Kevin\u00a0Qinhong Lin Burak Satar Mike\u00a0Zheng Shou and Qianli Xu. 2024. VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting. 10.48550\/arXiv.2412.11621 arxiv:https:\/\/arXiv.org\/abs\/2412.11621","DOI":"10.48550\/arXiv.2412.11621"},{"key":"e_1_3_3_2_24_2","unstructured":"Dietrich\u00a0Kammer Jimmy\u00a0Orawetz and Georg Freitag. 2025. Github Repository - From Text to Immersion. 
https:\/\/github.com\/OraJim\/Text2Atmosphere"},{"key":"e_1_3_3_2_25_2","doi-asserted-by":"publisher","unstructured":"Nikolai Kalischek Michael Oechsle Fabian Manhardt Philipp Henzler Konrad Schindler and Federico Tombari. 2025. CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation. 10.48550\/arXiv.2501.17162 arxiv:https:\/\/arXiv.org\/abs\/2501.17162","DOI":"10.48550\/arXiv.2501.17162"},{"key":"e_1_3_3_2_26_2","doi-asserted-by":"publisher","unstructured":"Chunyuan Li Zhe Gan Zhengyuan Yang Jianwei Yang Linjie Li Lijuan Wang and Jianfeng Gao. 2023. Multimodal Foundation Models: From Specialists to General-Purpose Assistants. arXiv:2309.10020 (2023). 10.48550\/arXiv.2309.10020 arxiv:https:\/\/arXiv.org\/abs\/2309.10020","DOI":"10.48550\/arXiv.2309.10020"},{"key":"e_1_3_3_2_27_2","doi-asserted-by":"publisher","unstructured":"Renjie Li Panwang Pan Bangbang Yang Dejia Xu Shijie Zhou Xuanyang Zhang Zeming Li Achuta Kadambi Zhangyang Wang Zhengzhong Tu and Zhiwen Fan. 2024. 4K4DGen: Panoramic 4D Generation at 4K Resolution. 10.48550\/arXiv.2406.13527","DOI":"10.48550\/arXiv.2406.13527"},{"key":"e_1_3_3_2_28_2","doi-asserted-by":"publisher","unstructured":"Wei Li Xue Xu Jiachen Liu and Xinyan Xiao. 2024. UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion. 10.48550\/arXiv.2401.13388 arxiv:https:\/\/arXiv.org\/abs\/2401.13388","DOI":"10.48550\/arXiv.2401.13388"},{"key":"e_1_3_3_2_29_2","doi-asserted-by":"publisher","unstructured":"Haotian Liu Chunyuan Li Qingyang Wu and Yong\u00a0Jae Lee. 2023. Visual Instruction Tuning. 10.48550\/arXiv.2304.08485 arxiv:https:\/\/arXiv.org\/abs\/2304.08485","DOI":"10.48550\/arXiv.2304.08485"},{"key":"e_1_3_3_2_30_2","doi-asserted-by":"publisher","unstructured":"Nan Liu Shuang Li Yilun Du Antonio Torralba and Joshua\u00a0B. Tenenbaum. 2023. Compositional Visual Generation with Composable Diffusion Models. 
10.48550\/arXiv.2206.01714 arxiv:https:\/\/arXiv.org\/abs\/2206.01714","DOI":"10.48550\/arXiv.2206.01714"},{"key":"e_1_3_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3501825"},{"key":"e_1_3_3_2_32_2","doi-asserted-by":"crossref","unstructured":"Matthew Lombard and Theresa Ditton. 1997. At the heart of it all: The concept of presence. Journal of Computer-Mediated Communication 3 2 (1997).","DOI":"10.1111\/j.1083-6101.1997.tb00072.x"},{"key":"e_1_3_3_2_33_2","doi-asserted-by":"publisher","unstructured":"Yujie Lu Pan Lu Zhiyu Chen Wanrong Zhu Xin\u00a0Eric Wang and William\u00a0Yang Wang. 2023. Multimodal Procedural Planning via Dual Text-Image Prompting. 10.48550\/arXiv.2305.01795 arxiv:https:\/\/arXiv.org\/abs\/2305.01795","DOI":"10.48550\/arXiv.2305.01795"},{"key":"e_1_3_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.5040\/9781472544988"},{"key":"e_1_3_3_2_35_2","doi-asserted-by":"crossref","unstructured":"Jon McCormack Alan Dorin and Troy Innocent. 2005. Generative design: a paradigm for design research. https:\/\/api.semanticscholar.org\/CorpusID:261880350","DOI":"10.21606\/drs.2004.101"},{"key":"e_1_3_3_2_36_2","unstructured":"Midjourney Team. 2023. Midjourney: Independent Research Lab Exploring New Mediums of Thought. https:\/\/www.midjourney.com. Accessed June 2025."},{"key":"e_1_3_3_2_37_2","doi-asserted-by":"publisher","unstructured":"An\u00a0Nuur\u00a0Khairune Nisa. 2023. Las Vegas Sphere: Technological Innovations that Bring Unforgettable Travel Experiences in the City of Neon Lights. Journal of Tourism Sciences 1 3 (Dec. 2023) 133\u2013142. 10.62885\/toursci.v1i3.155","DOI":"10.62885\/toursci.v1i3.155"},{"key":"e_1_3_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/3656650.3656752"},{"key":"e_1_3_3_2_39_2","doi-asserted-by":"publisher","unstructured":"Dustin Podell Zion English Kyle Lacey Andreas Blattmann Tim Dockhorn Jonas M\u00fcller Joe Penna and Robin Rombach. 2023. 
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis. 10.48550\/arXiv.2307.01952 arxiv:https:\/\/arXiv.org\/abs\/2307.01952","DOI":"10.48550\/arXiv.2307.01952"},{"key":"e_1_3_3_2_40_2","volume-title":"Interactive sound simulation: Rendering immersive soundscapes in games and virtual reality","author":"Raghuvanshi Nikunj","year":"2020","unstructured":"Nikunj Raghuvanshi. 2020. Interactive sound simulation: Rendering immersive soundscapes in games and virtual reality. https:\/\/www.microsoft.com\/en-us\/research\/video\/interactive-sound-simulation-rendering-immersive-soundscapes-in-games-and-virtual-reality\/"},{"key":"e_1_3_3_2_41_2","doi-asserted-by":"publisher","unstructured":"Aditya Ramesh Prafulla Dhariwal Alex Nichol Casey Chu and Mark Chen. 2022. Hierarchical Text-Conditional Image Generation with CLIP Latents. 10.48550\/arXiv.2204.06125 arxiv:https:\/\/arXiv.org\/abs\/2204.06125","DOI":"10.48550\/arXiv.2204.06125"},{"key":"e_1_3_3_2_42_2","unstructured":"Aditya Ramesh Mikhail Pavlov Gabriel Goh Scott Gray Chelsea Voss Alec Radford Mark Chen and Ilya Sutskever. 2021. Zero-Shot Text-to-Image Generation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2102.12092 (Feb. 2021)."},{"key":"e_1_3_3_2_43_2","doi-asserted-by":"publisher","unstructured":"Robin Rombach Andreas Blattmann Dominik Lorenz Patrick Esser and Bj\u00f6rn Ommer. 2022. High-Resolution Image Synthesis with Latent Diffusion Models. 10.48550\/arXiv.2112.10752 arxiv:https:\/\/arXiv.org\/abs\/2112.10752","DOI":"10.48550\/arXiv.2112.10752"},{"key":"e_1_3_3_2_44_2","doi-asserted-by":"publisher","unstructured":"Mark Sabini and Gili Rusak. 2018. Painting Outside the Box: Image Outpainting with GANs. 10.48550\/arXiv.1808.08483 arxiv:https:\/\/arXiv.org\/abs\/1808.08483","DOI":"10.48550\/arXiv.1808.08483"},{"key":"e_1_3_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01162"},{"key":"e_1_3_3_2_46_2","doi-asserted-by":"publisher","unstructured":"Mel Slater. 2018. 
Immersion and the illusion of presence in virtual reality. 109 3 (2018) 431\u2013433. 10.1111\/bjop.12305","DOI":"10.1111\/bjop.12305"},{"key":"e_1_3_3_2_47_2","doi-asserted-by":"publisher","unstructured":"Maria Tsimpoukelli Jacob Menick Serkan Cabi S.\u00a0M.\u00a0Ali Eslami Oriol Vinyals and Felix Hill. 2021. Multimodal Few-Shot Learning with Frozen Language Models. 10.48550\/arXiv.2106.13884 arxiv:https:\/\/arXiv.org\/abs\/2106.13884","DOI":"10.48550\/arXiv.2106.13884"},{"key":"e_1_3_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1706.03762"},{"key":"e_1_3_3_2_49_2","unstructured":"Team Wan Ang Wang Baole Ai Bin Wen Chaojie Mao Chen-Wei Xie Di Chen Feiwu Yu Haiming Zhao Jianxiao Yang Jianyuan Zeng Jiayu Wang Jingfeng Zhang Jingren Zhou Jinkai Wang Jixuan Chen Kai Zhu Kang Zhao Keyu Yan Lianghua Huang Mengyang Feng Ningyi Zhang Pandeng Li Pingyu Wu Ruihang Chu Ruili Feng Shiwei Zhang Siyang Sun Tao Fang Tianxing Wang Tianyi Gui Tingyu Weng Tong Shen Wei Lin Wei Wang Wei Wang Wenmeng Zhou Wente Wang Wenting Shen Wenyuan Yu Xianzhong Shi Xiaoming Huang Xin Xu Yan Kou Yangyu Lv Yifei Li Yijing Liu Yiming Wang Yingya Zhang Yitong Huang Yong Li You Wu Yu Liu Yulin Pan Yun Zheng Yuntao Hong Yupeng Shi Yutong Feng Zeyinzi Jiang Zhen Han Zhi-Fan Wu and Ziyu Liu. 2025. Wan: Open and Advanced Large-Scale Video Generative Models. arxiv:https:\/\/arXiv.org\/abs\/2503.20314\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2503.20314"},{"key":"e_1_3_3_2_50_2","doi-asserted-by":"publisher","unstructured":"Hai Wang Xiaoyu Xiang Yuchen Fan and Jing-Hao Xue. 2023. Customizing 360\u2011Degree Panoramas through Text\u2011to\u2011Image Diffusion Models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2310.18840 (Oct. 2023). 10.48550\/arXiv.2310.18840","DOI":"10.48550\/arXiv.2310.18840"},{"key":"e_1_3_3_2_51_2","doi-asserted-by":"publisher","unstructured":"Xintao Wang Liangbin Xie Chao Dong and Ying Shan. 2021. 
Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data. 10.48550\/arXiv.2107.10833 arxiv:https:\/\/arXiv.org\/abs\/2107.10833","DOI":"10.48550\/arXiv.2107.10833"},{"key":"e_1_3_3_2_52_2","doi-asserted-by":"publisher","unstructured":"Denis Zavadski Johann-Friedrich Feiden and Carsten Rother. 2023. ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models. 10.48550\/arXiv.2312.06573 arxiv:https:\/\/arXiv.org\/abs\/2312.06573","DOI":"10.48550\/arXiv.2312.06573"},{"key":"e_1_3_3_2_53_2","doi-asserted-by":"publisher","unstructured":"Haiyang Zhou Xinhua Cheng Wangbo Yu Yonghong Tian and Li Yuan. 2024. HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions. 10.48550\/arXiv.2407.15187","DOI":"10.48550\/arXiv.2407.15187"},{"key":"e_1_3_3_2_54_2","doi-asserted-by":"publisher","unstructured":"Shijie Zhou Zhiwen Fan Dejia Xu Haoran Chang Pradyumna Chari Tejas Bharadwaj Suya You Zhangyang Wang and Achuta Kadambi. 2024. DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting. 
10.48550\/arXiv.2404.06903","DOI":"10.48550\/arXiv.2404.06903"}],"event":{"name":"MuC '25: Mensch und Computer 2025","location":"Chemnitz Germany","acronym":"MuC '25"},"container-title":["Proceedings of the Mensch und Computer 2025"],"original-title":[],"deposited":{"date-parts":[[2025,8,28]],"date-time":"2025-08-28T14:55:23Z","timestamp":1756392923000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3743049.3748579"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,30]]},"references-count":53,"alternative-id":["10.1145\/3743049.3748579","10.1145\/3743049"],"URL":"https:\/\/doi.org\/10.1145\/3743049.3748579","relation":{},"subject":[],"published":{"date-parts":[[2025,8,30]]},"assertion":[{"value":"2025-08-30","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}