{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T02:11:13Z","timestamp":1760667073433,"version":"build-2065373602"},"reference-count":113,"publisher":"Association for Computing Machinery (ACM)","issue":"7","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Hum.-Comput. Interact."],"published-print":{"date-parts":[[2025,10,18]]},"abstract":"<jats:p>\n            Social Virtual Reality (VR) offers immersive, interactive, and engaging mechanisms for collaborative activities within virtual environments. However, interpersonal communication in social VR remains constrained by existing mediums and channels. To address this limitation, we introduce an\n            <jats:italic toggle=\"yes\">impact-caption-inspired<\/jats:italic>\n            approach to facilitate real-time conversations in social VR. Impact captions are a type of typographic visual effect commonly employed in videos to convey verbal messages and non-verbal cues simultaneously to enhance viewer engagement. Starting with an exploration of the design space of impact captions, we subsequently developed a proof-of-concept system, SpeechCap, that enables users to communicate through speech-driven impact captions in VR. Using the system, we conducted a user study (N=14) to assess the effectiveness of the visual and interaction design of our approach, revealing its strengths in enhancing interactivity and integrating verbal and non-verbal information. 
We conclude by discussing key findings related to visual rhetoric, interactivity of communication mediums, and ambiguity, and offer design implications aimed at improving interpersonal communication in social VR.\n          <\/jats:p>","DOI":"10.1145\/3757427","type":"journal-article","created":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T17:06:01Z","timestamp":1760634361000},"page":"1-33","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["SpeechCap: Leveraging Playful Impact Captions to Facilitate Interpersonal Communication in Social Virtual Reality"],"prefix":"10.1145","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8574-111X","authenticated-orcid":false,"given":"Yu","family":"Zhang","sequence":"first","affiliation":[{"name":"City University of Hong Kong, Hong Kong, Hong Kong"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-4190-5938","authenticated-orcid":false,"given":"Yi","family":"Wen","sequence":"additional","affiliation":[{"name":"Texas A&M University, College Station, TX, USA and City University of Hong Kong, Hong Kong SAR, Hong Kong"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3824-2801","authenticated-orcid":false,"given":"Siying","family":"HU","sequence":"additional","affiliation":[{"name":"City University of Hong Kong, Hong Kong SAR, Hong Kong"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7761-6351","authenticated-orcid":false,"given":"Zhicong","family":"Lu","sequence":"additional","affiliation":[{"name":"George Mason University, Fairfax, VA, 
USA"}]}],"member":"320","published-online":{"date-parts":[[2025,10,16]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3479597"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijhcs.2022.102819"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2993171"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3584931.3606968"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPCC.2012.6408605"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642101"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/AERO.2016.7500674"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1054972.1054998"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3501920"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2021.3067783"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359251"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411764.3445752"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3472301.3484325"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1044\/2024_AJA-24-00056"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3126594.3126660"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642725"},{"volume-title":"Typographic design: Form and communication","author":"Carter Rob","key":"e_1_2_1_17_1","unstructured":"Rob Carter, Philip B Meggs, and Ben Day. 2011. Typographic design: Form and communication. John Wiley & Sons."},{"key":"e_1_2_1_18_1","volume-title":"Proceedings of the ACM on Human-Computer Interaction 9, CSCW2 (November 2025","author":"Chang Victoria","year":"2025","unstructured":"Victoria Chang, Caro Williams - Pierce, Huaishu Peng, and Ge Gao. 2025. 
Verisimilitude as Boon and Bane: How People Initiate Opportunistic Interactions at Professional Events in Social VR. Proceedings of the ACM on Human-Computer Interaction 9, CSCW2 (November 2025). To appear in PACMHCI, CSCW2, November 2025."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2702123.2702423"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642405"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411763.3451698"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3517745.3561417"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-018-6753-3"},{"key":"e_1_2_1_24_1","volume-title":"Impact caption translation on a streaming media platform: the case of a Chinese reality show. Perspectives","author":"Chow Yean Fun","year":"2023","unstructured":"Yean Fun Chow. 2023. Impact caption translation on a streaming media platform: the case of a Chinese reality show. Perspectives (2023), 1-20."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1515\/sem-2013-0079"},{"volume-title":"Basics of qualitative research: Techniques and procedures for developing grounded theory","author":"Corbin Juliet","key":"e_1_2_1_26_1","unstructured":"Juliet Corbin and Anselm Strauss. 2014. Basics of qualitative research: Techniques and procedures for developing grounded theory. 
Sage publications."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10111-007-0105-9"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v16i1.19393"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642258"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3544548.3581511"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1057237.1057241"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/3710924"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10055-021-00564-9"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3391614.3399396"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3452918.3458805"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/3432938"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3544548.3581464"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/642611.642653"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1002\/col.22171"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICVEE59738.2023.10348186"},{"key":"e_1_2_1_41_1","volume-title":"Formality of language: definition, measurement and behavioral determinants. Interner Bericht","author":"Heylighen Francis","year":"1999","unstructured":"Francis Heylighen and Jean-Marc Dewaele. 1999. Formality of language: definition, measurement and behavioral determinants. Interner Bericht, Center ''Leo Apostel'', Vrije Universiteit Br\u00fcssel 4, 1 (1999)."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/1124772.1124838"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376228"},{"key":"e_1_2_1_44_1","volume-title":"Retrieved","author":"Epic Games Inc.","year":"2024","unstructured":"Epic Games Inc. 2024. 3D Text Actor in Unreal Engine. 
Retrieved May 05, 2024 from https:\/\/dev.epicgames.com\/documentation\/en-us\/unreal-engine\/3d-text-actor-in-unreal-engine?application_version=5.1"},{"key":"e_1_2_1_45_1","volume-title":"Retrieved","author":"Epic Games Inc.","year":"2024","unstructured":"Epic Games Inc. 2024. Networking Overview. Retrieved May 05, 2024 from https:\/\/dev.epicgames.com\/documentation\/en-us\/unreal-engine\/networking-overview-for-unreal-engine?application_version=5.1"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3462244.3479946"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10639-017-9676-0"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3517542"},{"key":"e_1_2_1_49_1","volume-title":"Likert scale: Explored and explained. British journal of applied science & technology 7, 4","author":"Joshi Ankur","year":"2015","unstructured":"Ankur Joshi, Saket Kale, Satish Chandel, and D Kumar Pal. 2015. Likert scale: Explored and explained. 
British journal of applied science & technology 7, 4 (2015), 396-403."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3544548.3581130"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2442106.2442109"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3322276.3322352"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.3389\/frvir.2021.786665"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2380116.2380122"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/3139131.3139156"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3579483"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3517451"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/3526114.3558699"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3334480.3382836"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300897"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411763.3441346"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/3517428.3544817"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3526113.3545702"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818048.2819945"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/3643834.3661549"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1145\/3544548.3581566"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1089\/cpb.2007.0132"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173574.3174040"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642255"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPCC.2004.1375315"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/3410404.3414266"},{"key":"e_1_2_1_72_1","doi-asserted-by":"p
ublisher","DOI":"10.1145\/3415246"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigData59044.2023.10386476"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411763.3450377"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1080\/07370024.2021.1994860"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300794"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411764.3445503"},{"key":"e_1_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1109\/SIVE.2018.8577094"},{"volume-title":"Sonic Interactions in Virtual Environments","author":"Men Liang","key":"e_1_2_1_79_1","unstructured":"Liang Men and Nick Bryan-Kinns. 2022. Supporting sonic interaction in creative, shared virtual environments. In Sonic Interactions in Virtual Environments. Springer International Publishing Cham, 237-267."},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v11i1.14901"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10055-023-00842-8"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173574.3173784"},{"key":"e_1_2_1_83_1","volume-title":"Japanese TV entertainment: Framing humour with open caption telop. Translation, humor and the media 2","author":"O'Hagan Minako","year":"2010","unstructured":"Minako O'Hagan. 2010. Japanese TV entertainment: Framing humour with open caption telop. Translation, humor and the media 2 (2010), 70-88."},{"key":"e_1_2_1_84_1","volume-title":"Interpersonal communication and virtual reality: Mediating interpersonal relationships. Communication in the age of virtual reality","author":"Palmer Mark T","year":"1995","unstructured":"Mark T Palmer. 1995. Interpersonal communication and virtual reality: Mediating interpersonal relationships. 
Communication in the age of virtual reality (1995), 277-299."},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1145\/3490355.3490367"},{"volume-title":"Theories of emotion","author":"Plutchik Robert","key":"e_1_2_1_86_1","unstructured":"Robert Plutchik and Henry Kellerman. 2013. Theories of emotion. Vol. 1. Academic Press."},{"key":"e_1_2_1_87_1","volume-title":"Proceedings of the 40th International Conference on Machine Learning (Proceedings of Machine Learning Research","volume":"202","author":"Radford Alec","year":"2023","unstructured":"Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine Mcleavey, and Ilya Sutskever. 2023. Robust Speech Recognition via Large-Scale Weak Supervision. In Proceedings of the 40th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 202), Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett (Eds.). PMLR, 28492-28518. https:\/\/proceedings.mlr.press\/v202\/radford23a.html"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411764.3445606"},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.dcm.2014.03.003"},{"key":"e_1_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1075\/tcb.00055.sas"},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1145\/3656156.3658381"},{"key":"e_1_2_1_92_1","volume-title":"Subtitle system visualizing non-verbal expressions in voice for hearing impaired-Ambient Font. Proceeding of the 10th Asia-Pacific Industrial Engineering and Management Systems","author":"Seto Shuichi","year":"2010","unstructured":"Shuichi Seto, Hiroshi Arai, Kimikazu Sugimori, Yuko Shimomura, and Hiroyuki Kawabe. 2010. Subtitle system visualizing non-verbal expressions in voice for hearing impaired-Ambient Font. 
Proceeding of the 10th Asia-Pacific Industrial Engineering and Management Systems (2010)."},{"key":"e_1_2_1_93_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3641923"},{"key":"e_1_2_1_94_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173574.3173863"},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1023924110279"},{"key":"e_1_2_1_96_1","doi-asserted-by":"publisher","DOI":"10.1145\/3544548.3581230"},{"key":"e_1_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3502008"},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376606"},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.1145\/3512983"},{"key":"e_1_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-60881-0_24"},{"key":"e_1_2_1_101_1","volume-title":"Social VR for Socially Isolated Adolescents with Significant Illnesses. In Companion Proceedings of the 2022 Conference on Interactive Surfaces and Spaces. 50-53","author":"Hansi Shashiprabha Udapola Udapola Balage","year":"2022","unstructured":"Udapola Balage Hansi Shashiprabha Udapola. 2022. Social VR for Socially Isolated Adolescents with Significant Illnesses. In Companion Proceedings of the 2022 Conference on Interactive Surfaces and Spaces. 50-53."},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.1145\/3505284.3532976"},{"key":"e_1_2_1_103_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2613641"},{"key":"e_1_2_1_104_1","doi-asserted-by":"publisher","DOI":"10.1145\/3581783.3612438"},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1145\/3544548.3581405"},{"key":"e_1_2_1_106_1","doi-asserted-by":"publisher","DOI":"10.1145\/3565698.3565767"},{"volume-title":"International Conference on Computers Helping People with Special Needs","author":"Wieland Markus","key":"e_1_2_1_107_1","unstructured":"Markus Wieland, Lauren Thevin, Albrecht Schmidt, and Tonja Machulla. 2022. 
Non-verbal Communication and Joint Attention Between People with and Without Visual Impairments: Deriving Guidelines for Inclusive Conversations in Virtual Realities. In International Conference on Computers Helping People with Special Needs. Springer, 295-304."},{"key":"e_1_2_1_108_1","volume-title":"Color and emotion: effects of hue, saturation, and brightness. Psychological research 82, 5","author":"Wilms Lisa","year":"2018","unstructured":"Lisa Wilms and Daniel Oberfeld. 2018. Color and emotion: effects of hue, saturation, and brightness. Psychological research 82, 5 (2018), 896-914."},{"key":"e_1_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.1145\/3626473"},{"key":"e_1_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1145\/3586183.3606773"},{"key":"e_1_2_1_111_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2023.3247085"},{"key":"e_1_2_1_112_1","doi-asserted-by":"publisher","DOI":"10.1145\/3419249.3420112"},{"key":"e_1_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.1145\/3517428.3544829"}],"container-title":["Proceedings of the ACM on Human-Computer 
Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3757427","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T01:56:23Z","timestamp":1760666183000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3757427"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,16]]},"references-count":113,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2025,10,18]]}},"alternative-id":["10.1145\/3757427"],"URL":"https:\/\/doi.org\/10.1145\/3757427","relation":{},"ISSN":["2573-0142"],"issn-type":[{"type":"electronic","value":"2573-0142"}],"subject":[],"published":{"date-parts":[[2025,10,16]]},"assertion":[{"value":"2025-10-16","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}