{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T10:10:36Z","timestamp":1773137436300,"version":"3.50.1"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"9","license":[{"start":{"date-parts":[[2024,8,19]],"date-time":"2024-08-19T00:00:00Z","timestamp":1724025600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001602","name":"Science Foundation Ireland","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001602","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"crossref","award":["13\/RC\/2106_P2"],"award-info":[{"award-number":["13\/RC\/2106_P2"]}],"id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Horizon Europe Framework Program","award":["101070109"],"award-info":[{"award-number":["101070109"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,9,30]]},"abstract":"<jats:p>This article presents the results of an empirical study that aimed to investigate the influence of various types of audio (spatial and non-spatial) on the user quality of experience (QoE) of and visual attention in 360\u00b0 videos. The study compared the head pose, eye gaze, pupil dilations, heart rate, and subjective responses of 73 users who watched ten 360\u00b0 videos with different sound configurations. The configurations evaluated were no sound; non-spatial (stereo) audio; and two spatial sound conditions (first- and third-order ambisonics). The videos covered various categories and presented both indoor and outdoor scenarios. The subjective responses were analyzed using an ANOVA (Analysis of Variance) to assess mean differences between sound conditions. Data visualization was also employed to enhance the interpretability of the results. The findings reveal diverse viewing patterns, physiological responses, and subjective experiences among users watching 360\u00b0 videos with different sound conditions. Spatial audio, in particular third-order ambisonics, garnered heightened attention. This is evident in increased pupil dilation and heart rate. Furthermore, the presence of spatial audio led to more diverse head poses when sound sources were distributed across the scene. These findings have important implications for the development of effective techniques for optimizing processing, encoding, distributing, and rendering content in virtual reality (VR) and 360\u00b0 videos with spatialized audio. These insights are also relevant in the creative realms of content design and enhancement. They provide valuable guidance on how spatial audio influences user attention, physiological responses, and overall subjective experiences. Understanding these dynamics can assist content creators and designers in crafting immersive experiences that leverage spatialized audio to captivate users, enhance engagement, and optimize the overall quality of VR and 360\u00b0 video content. The dataset, scripts used for data collection, ffmpeg commands used for processing the videos, and the subjective questionnaire and its statistical analysis are publicly available.<\/jats:p>","DOI":"10.1145\/3650208","type":"journal-article","created":{"date-parts":[[2024,3,6]],"date-time":"2024-03-06T12:07:03Z","timestamp":1709726823000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["A Quality of Experience and Visual Attention Evaluation for 360\u00b0 Videos with Non-spatial and Spatial Audio"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8609-3890","authenticated-orcid":false,"given":"Amit","family":"Hirway","sequence":"first","affiliation":[{"name":"Department of Computer and Software Engineering, Technological University of the Shannon - Midlands Midwest, Athlone, Ireland"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1543-1589","authenticated-orcid":false,"given":"Yuansong","family":"Qiao","sequence":"additional","affiliation":[{"name":"Software Research Institute, Technological University of the Shannon - Midlands Midwest, Athlone, Ireland"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5919-0596","authenticated-orcid":false,"given":"Niall","family":"Murray","sequence":"additional","affiliation":[{"name":"Department of Computer and Software Engineering, Technological University of the Shannon - Midlands Midwest, Athlone, Ireland"}]}],"member":"320","published-online":{"date-parts":[[2024,8,19]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"crossref","unstructured":"R. Shafi W. Shuai and M. U. Younus. 2020. 360-degree video streaming: A survey of the state of the art. Symmetry 12 1491.","DOI":"10.3390\/sym12091491"},{"key":"e_1_3_1_3_2","unstructured":"J. E. Hoffman. 2016. Visual attention and eye movements. Attention 119--153."},{"key":"e_1_3_1_4_2","unstructured":"Laurent Itti. 2000. Models of bottom-up and top-down visual attention."},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2013.2265801"},{"key":"e_1_3_1_6_2","first-page":"129","volume-title":"Proceedings of the IEEE Virtual Reality","author":"Poeschl-Guenther S.","year":"2013","unstructured":"S. Poeschl-Guenther, K. Wall, and N. D\u00f6ring. 2013. Integration of spatial sound in immersive virtual environments: An experimental study on effects of spatial sound on presence. In Proceedings of the IEEE Virtual Reality, 129\u2013130."},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP46576.2022.9897737"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","unstructured":"Pierre Marighetto Antoine Coutrot Nicolas Riche Nathalie Guyader Matei Mancas Bernard Gosselin and Robert Laganiere. 2017. Audio-visual attention: Eye-tracking dataset and analysis toolbox. 10.1109\/ICIP.2017.8296592","DOI":"10.1109\/ICIP.2017.8296592"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/2578153.2578180"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.3233\/978-1-61499-595-1-44"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","unstructured":"Paulo Bala Raul Masu Valentina Nisi and Nuno Nunes. 2019. When the elephant trumps: A comparative study on spatial audio for orientation in 360\u00ba videos. 1--13. DOI:10.1145\/3290605.3300925","DOI":"10.1145\/3290605.3300925"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1386\/ts_00017_1"},{"key":"e_1_3_1_13_2","unstructured":"C. Yue and T. D. Planque. 2017. 3-D ambisonics experience for virtual reality."},{"key":"e_1_3_1_14_2","volume-title":"Journal of the Audio Engineering Society","unstructured":"M. A. Gerzon. 1985. Ambisonics in multichannel broadcasting and video. Journal of the Audio Engineering Society. 33, 859--871."},{"key":"e_1_3_1_15_2","first-page":"51","volume-title":"Immersive Sound","author":"Politis A.","year":"2018","unstructured":"A. Politis, S. Siltanen, and V. Pulkki. 2018. Higher-order ambisonics. Immersive Sound. Springer, Berlin, Germany, 51\u2013102."},{"key":"e_1_3_1_16_2","first-page":"1","volume-title":"Understanding and Improving Quality of Experience in Multimedia Communications","author":"Raake A.","year":"2014","unstructured":"A. Raake. 2014. Quality of experience: What it is and why it matters. Understanding and Improving Quality of Experience in Multimedia Communications. Springer, 1\u201315."},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","unstructured":"Raimund Schatz Tobias Hossfeld Lucjan Janowski and Sebastian Egger-Lampl. 2013. From packets to people: Quality of experience as a new measurement challenge. DOI:10.1007\/978-3-642-36784-7_10","DOI":"10.1007\/978-3-642-36784-7_10"},{"key":"e_1_3_1_18_2","unstructured":"Juliet Moso and Wilson Muange. 2018. Quality of experience (QoE) measurement and its challenges in mobile networks for multimedia. 7."},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","unstructured":"Ulrich Reiter Kjell Brunnstr\u00f6m Katrien De Moor Chaker Larabi Manuela Pereira Antonio Pinheiro Junyong You and Andrej Zgank. 2014. Factors influencing quality of experience. DOI:10.1007\/978-3-319-02681-7_4","DOI":"10.1007\/978-3-319-02681-7_4"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12193-022-00388-0"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2020.2982046"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","unstructured":"Tobias Hossfeld Luigi Atzori Poul Heegaard Lea Skorin-Kapov and Mart\u00edn Varela. 2019. The interplay between QoE user behavior and system blocking in QoE management. 112--117. DOI:10.1109\/ICIN.2019.8685902","DOI":"10.1109\/ICIN.2019.8685902"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.3390\/app9163384"},{"key":"e_1_3_1_24_2","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1145\/3524273.3528179","volume-title":"Proceedings of the 13th ACM Multimedia Systems Conference (MMSys\u201922)","author":"Hirway Amit","year":"2022","unstructured":"Amit Hirway, Yuansong Qiao, and Niall Murray. 2022. Spatial audio in 360\u00b0 videos: Does it influence visual attention? In Proceedings of the 13th ACM Multimedia Systems Conference (MMSys\u201922), Association for Computing Machinery, New York, NY, USA, 39\u201351."},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/3083187.3083219"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/3204949.3208139"},{"key":"e_1_3_1_27_2","first-page":"153","volume-title":"Proceedings of the 6th International Workshop on Quality of Multimedia Experience (QoMEX\u201914)","author":"Min Xiongkuo","year":"2014","unstructured":"Xiongkuo Min, Guangtao Zhai, Zhongpai Gao, Chunjia Hu, and Xiaokang Yang. 2014. Sound influences visual attention discriminately in videos. In Proceedings of the 6th International Workshop on Quality of Multimedia Experience (QoMEX\u201914). 153\u2013158."},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2017.8296592"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/3083187.3083210"},{"key":"e_1_3_1_30_2","unstructured":"M. Almquist and V. Almquist. 2018. Analysis of 360\u00b0 video viewing behaviours (dissertation). Retrieved from https:\/\/urn.kb.se\/resolve?urn=urn:nbn:se:liu:diva-144907"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2021.3124080"},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41233-022-00052-1"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSTSP.2016.2609843"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10055-019-00400-1"},{"key":"e_1_3_1_35_2","unstructured":"ISO. 2007. ISO 8589:2007 Sensory analysis.General guidance for the design of test rooms. International Standards Organization. Retrieved November 30 2023 from https:\/\/www.iso.org\/obp\/ui\/#iso:std:iso:8589:ed-2:v1:en"},{"key":"e_1_3_1_36_2","unstructured":"International Telecommunication Union. 2023. ITU-T P.910: Subjective Video Quality Assessment Methods for Multimedia Applications. Retrieved March 27 2023 from https:\/\/www.itu.int\/rec\/T-REC-P.910-202310-I\/en"},{"key":"e_1_3_1_37_2","unstructured":"TobiiPro. 2018. Tobii Pro VR Integration\u2014based on HTC Vive Development Kit Description. Retrieved March 27 2023 from https:\/\/www.tobiipro.com\/siteassets\/tobii-pro\/product-descriptions\/tobii-pro-vr-integration-product-description.pdf\/?v=1.7"},{"key":"e_1_3_1_38_2","unstructured":"Beyerdynamic. 2020. Beyerdynamic DT990 Pro. Retrieved March 27 2023 from https:\/\/europe.beyerdynamic.com\/dt-990-pro.html"},{"key":"e_1_3_1_39_2","unstructured":"GoPro. 2020. GoPro VR Player for Desktop FAQ. Retrieved March 27 2023 from https:\/\/gopro.com\/help\/articles\/block\/gopro-vr-player-for-desktop-faq"},{"key":"e_1_3_1_40_2","unstructured":"Angelo Farina. 2020. Index of \/Public. Retrieved March 7 2023 from http:\/\/www.angelofarina.it\/Public\/"},{"key":"e_1_3_1_41_2","unstructured":"Empatica. E4 Wristband Support Page. Retrieved March 27 2023 from https:\/\/support.empatica.com\/hc\/en-us\/categories\/200023126-E4-wristband"},{"key":"e_1_3_1_42_2","unstructured":"Ffmpeg.org. 2021. FFmpeg. Retrieved March 27 2023 from https:\/\/ffmpeg.org\/"},{"key":"e_1_3_1_43_2","doi-asserted-by":"publisher","DOI":"10.1109\/QoMEX.2017.7965656"},{"key":"e_1_3_1_44_2","first-page":"1","volume-title":"Proceedings of the 2019 IEEE 21st International Workshop on Multimedia Signal Processing (MMSP\u201919)","author":"Hynes E.","year":"2019","unstructured":"E. Hynes, R. Flynn, B. Lee, and N. Murray. 2019. A quality of experience evaluation comparing augmented reality and paper based instruction for complex task assistance. In Proceedings of the 2019 IEEE 21st International Workshop on Multimedia Signal Processing (MMSP\u201919). 1\u20136."},{"key":"e_1_3_1_45_2","first-page":"1","volume-title":"Proceedings of the 2016 8th International Conference on Quality of Multimedia Experience (QoMEX)","author":"Egan D.","year":"2016","unstructured":"D. Egan, S. Brennan, J. Barrett, Y. Qiao, C. Timmerer, and N. Murray. 2016. An evaluation of Heart Rate and ElectroDermal activity as an objective QoE evaluation method for immersive virtual reality environments. In Proceedings of the 2016 8th International Conference on Quality of Multimedia Experience (QoMEX). 1\u20136."},{"key":"e_1_3_1_46_2","unstructured":"International Telecommunications Union. 2016. P.913: Methods for the Subjective Assessment of Video Quality Audio Quality and Audiovisual Quality of Internet Video and Distribution Quality Television in Any Environment. Retrieved March 27 2023 from https:\/\/www.itu.int\/rec\/T-REC-P.913\/en"},{"key":"e_1_3_1_47_2","unstructured":"Provisu.ch. 2021. Snellen Eye Chart. Retrieved March 27 2023 from https:\/\/www.provisu.ch\/images\/PDF\/Snellenchart_en.pdf"},{"key":"e_1_3_1_48_2","unstructured":"Colblindor. 2021. Ishihara's Test for Colour Deficiency: 38 Plates Edition. Retrieved March 27 2023 from https:\/\/www.color-blindness.com\/ishiharas-test-for-colour-eficiency-38-plates-edition\/"},{"key":"e_1_3_1_49_2","unstructured":"D. Pigeon Online Hearing Test and Audiogram Printout. Retrieved March 27 2023 from https:\/\/hearingtest.online\/"},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3317697.3323361"},{"key":"e_1_3_1_51_2","doi-asserted-by":"crossref","unstructured":"B. G. Witmer and M. J. Singer. 1998. Measuring presence in virtual environments: A presence questionnaire. Presence: Teleoperators and Virtual Environments 7 3 (1998) 225--240.","DOI":"10.1162\/105474698565686"},{"key":"e_1_3_1_52_2","first-page":"76","article-title":"Effects of eye dominance on binocular rivalry with continuous flash suppression","volume":"130","author":"Schreiber K. M.","year":"2017","unstructured":"K. M. Schreiber and J. M. Hillis. 2017. Effects of eye dominance on binocular rivalry with continuous flash suppression. Vision Research 130 (2017), 76\u201386.","journal-title":"Vision Research"},{"key":"e_1_3_1_53_2","first-page":"102778","article-title":"Gaze behavior and cognitive processing in 360-degree virtual reality videos: An eye-tracking study","volume":"73","author":"Chen X.","year":"2020","unstructured":"X. Chen, X. Zhou, P. Xu, K. Xu, and Y. Liu. 2020. Gaze behavior and cognitive processing in 360-degree virtual reality videos: An eye-tracking study. Journal of Visual Communication and Image Representation 73 (2020), 102778.","journal-title":"Journal of Visual Communication and Image Representation"},{"key":"e_1_3_1_54_2","first-page":"43","volume-title":"Proceedings of the 19th ACM Symposium on Virtual Reality Software and Technology","author":"Bruder G.","year":"2013","unstructured":"G. Bruder, F. Steinicke, H. Ritter, and K. Hinrichs. 2013. Pupil dilation indicates cognitive processing load during 360\u00b0 video playback in head-mounted displays. In Proceedings of the 19th ACM Symposium on Virtual Reality Software and Technology. 43\u201350."},{"key":"e_1_3_1_55_2","article-title":"QoE_Visual-Attention_Spatial-Audio_360-videos","author":"Hirway A.","unstructured":"A. Hirway. Year. QoE_Visual-Attention_Spatial-Audio_360-videos. GitHub repository. Retrieved February 6, 2024 from https:\/\/github.com\/hirwaam\/QoE_Visual-Attention_Spatial-Audio_360-videos","journal-title":"GitHub repository"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3650208","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3650208","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:03:32Z","timestamp":1750291412000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3650208"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8,19]]},"references-count":54,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2024,9,30]]}},"alternative-id":["10.1145\/3650208"],"URL":"https:\/\/doi.org\/10.1145\/3650208","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,8,19]]},"assertion":[{"value":"2023-04-29","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-02-09","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-08-19","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}