{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,12]],"date-time":"2026-02-12T17:28:12Z","timestamp":1770917292264,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":56,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T00:00:00Z","timestamp":1602460800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,10,12]]},"DOI":"10.1145\/3394171.3413894","type":"proceedings-article","created":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T12:27:38Z","timestamp":1602505658000},"page":"1162-1170","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Scene-Aware Background Music Synthesis"],"prefix":"10.1145","author":[{"given":"Yujia","family":"Wang","sequence":"first","affiliation":[{"name":"Beijing Institute of Technology, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Liang","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wanwan","family":"Li","sequence":"additional","affiliation":[{"name":"George Mason University, Fairfax, VA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dingzeyu","family":"Li","sequence":"additional","affiliation":[{"name":"Adobe Research, Seattle, WA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lap-Fai","family":"Yu","sequence":"additional","affiliation":[{"name":"George Mason University, Fairfax, VA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,10,12]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"2018. Inside the booming business of background music. https:\/\/www.theguardian.com\/news\/2018\/nov\/06\/inside-the-booming-business-of-background-music.  2018. Inside the booming business of background music. https:\/\/www.theguardian.com\/news\/2018\/nov\/06\/inside-the-booming-business-of-background-music."},{"key":"e_1_3_2_2_2_1","volume-title":"\u201cvisual","author":"Abboud Sami","year":"2014","unstructured":"Sami Abboud , Shlomi Hanassy , Shelly Levy-Tzedek , Shachar Maidenbaum , and Amir Amedi . 2014. EyeMusic: Introducing a \u201cvisual \u201d colorful experience for the blind using auditory sensory substitution. Restorative neurology and neuroscience, Vol. 32 , 2 ( 2014 ), 247--257. Sami Abboud, Shlomi Hanassy, Shelly Levy-Tzedek, Shachar Maidenbaum, and Amir Amedi. 2014. EyeMusic: Introducing a \u201cvisual\u201d colorful experience for the blind using auditory sensory substitution. Restorative neurology and neuroscience, Vol. 32, 2 (2014), 247--257."},{"key":"e_1_3_2_2_3_1","volume-title":"Jibin Rajan Varghese, and Zhangyang Wang","author":"Baig Mohammed Habibullah","year":"2018","unstructured":"Mohammed Habibullah Baig , Jibin Rajan Varghese, and Zhangyang Wang . 2018 . MusicMapp: A Deep Learning Based Solution for Music Exploration and Visual Interaction. In ACM Multimedia . 1253--1255. Mohammed Habibullah Baig, Jibin Rajan Varghese, and Zhangyang Wang. 2018. MusicMapp: A Deep Learning Based Solution for Music Exploration and Visual Interaction. In ACM Multimedia. 1253--1255."},{"key":"e_1_3_2_2_4_1","volume-title":"A cross-cultural investigation of the perception of emotion in music: Psychophysical and cultural cues. Music perception: an interdisciplinary journal","author":"Balkwill Laura-Lee","year":"1999","unstructured":"Laura-Lee Balkwill and William Forde Thompson . 1999. A cross-cultural investigation of the perception of emotion in music: Psychophysical and cultural cues. Music perception: an interdisciplinary journal , Vol. 17 , 1 ( 1999 ), 43--64. Laura-Lee Balkwill and William Forde Thompson. 1999. A cross-cultural investigation of the perception of emotion in music: Psychophysical and cultural cues. Music perception: an interdisciplinary journal, Vol. 17, 1 (1999), 43--64."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"crossref","unstructured":"Jared S Bauer Alex Jansen and Jesse Cirimele. 2011. MoodMusic: a method for cooperative generative music playlist creation. In UIST. ACM 85--86.  Jared S Bauer Alex Jansen and Jesse Cirimele. 2011. MoodMusic: a method for cooperative generative music playlist creation. In UIST. ACM 85--86.","DOI":"10.1145\/2046396.2046435"},{"key":"e_1_3_2_2_6_1","volume-title":"International journal of psychophysiology","author":"Baumgartner Thomas","year":"2006","unstructured":"Thomas Baumgartner , Michaela Esslen , and Lutz Jancke . 2006. From emotion perception to emotion experience: Emotions evoked by pictures and classical music . International journal of psychophysiology , Vol. 60 , 1 ( 2006 ), 34--43. Thomas Baumgartner, Michaela Esslen, and Lutz Jancke. 2006. From emotion perception to emotion experience: Emotions evoked by pictures and classical music. International journal of psychophysiology, Vol. 60, 1 (2006), 34--43."},{"key":"e_1_3_2_2_7_1","first-page":"53","article-title":"Composition and arrangement techniques for music in interactive immersive environments","volume":"2006","author":"Berndt Axel","year":"2006","unstructured":"Axel Berndt , Knut Hartmann , Niklas R\u00f6ber , and Maic Masuch . 2006 . Composition and arrangement techniques for music in interactive immersive environments . Audio Mostly , Vol. 2006 (2006), 53 -- 59 . Axel Berndt, Knut Hartmann, Niklas R\u00f6ber, and Maic Masuch. 2006. Composition and arrangement techniques for music in interactive immersive environments. Audio Mostly, Vol. 2006 (2006), 53--59.","journal-title":"Audio Mostly"},{"key":"e_1_3_2_2_8_1","first-page":"633","article-title":"An Expert Ground Truth Set for Audio Chord Recognition and Music Analysis","volume":"11","author":"Burgoyne John Ashley","year":"2011","unstructured":"John Ashley Burgoyne , Jonathan Wild , and Ichiro Fujinaga . 2011 . An Expert Ground Truth Set for Audio Chord Recognition and Music Analysis .. In ISMIR , Vol. 11. 633 -- 638 . John Ashley Burgoyne, Jonathan Wild, and Ichiro Fujinaga. 2011. An Expert Ground Truth Set for Audio Chord Recognition and Music Analysis.. In ISMIR, Vol. 11. 633--638.","journal-title":"ISMIR"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2017.01.011"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.protcy.2013.12.117"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"crossref","unstructured":"Fu-Yin Cherng Yi-Chen Lee Jung-Tai King and Wen-Chieh Lin. 2019. Measuring the Influences of Musical Parameters on Cognitive and Behavioral Responses to Audio Notifications Using EEG and Large-scale Online Studies. In ACM SIGCHI. ACM 409.  Fu-Yin Cherng Yi-Chen Lee Jung-Tai King and Wen-Chieh Lin. 2019. Measuring the Influences of Musical Parameters on Cognitive and Behavioral Responses to Audio Notifications Using EEG and Large-scale Online Studies. In ACM SIGCHI. ACM 409.","DOI":"10.1145\/3290605.3300639"},{"key":"e_1_3_2_2_12_1","volume-title":"Musical scales and the generalized circle of fifths. The american mathematical monthly","author":"Clough John","year":"1986","unstructured":"John Clough and Gerald Myerson . 1986. Musical scales and the generalized circle of fifths. The american mathematical monthly , Vol. 93 , 9 ( 1986 ), 695--701. John Clough and Gerald Myerson. 1986. Musical scales and the generalized circle of fifths. The american mathematical monthly, Vol. 93, 9 (1986), 695--701."},{"key":"e_1_3_2_2_13_1","volume-title":"ICASSP","volume":"4","author":"Ellis Daniel PW","year":"2007","unstructured":"Daniel PW Ellis and Graham E Poliner . 2007 . Identifyingcover songs' with chroma features and dynamic programming beat tracking . In ICASSP , Vol. 4 . IEEE, IV--1429. Daniel PW Ellis and Graham E Poliner. 2007. Identifyingcover songs' with chroma features and dynamic programming beat tracking. In ICASSP, Vol. 4. IEEE, IV--1429."},{"key":"e_1_3_2_2_14_1","unstructured":"Shaojing Fan Zhiqi Shen Ming Jiang Bryan L Koenig Juan Xu Mohan S Kankanhalli and Qi Zhao. 2018. Emotional attention: A study of image sentiment and visual attention. In CVPR. 7521--7531.  Shaojing Fan Zhiqi Shen Ming Jiang Bryan L Koenig Juan Xu Mohan S Kankanhalli and Qi Zhao. 2018. Emotional attention: A study of image sentiment and visual attention. In CVPR. 7521--7531."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.compedu.2011.09.002"},{"key":"e_1_3_2_2_16_1","volume-title":"Background music and industrial efficiency - a review. Applied ergonomics","author":"Fox JG","year":"1971","unstructured":"JG Fox . 1971. Background music and industrial efficiency - a review. Applied ergonomics , Vol. 2 , 2 ( 1971 ), 70--73. JG Fox. 1971. Background music and industrial efficiency - a review. Applied ergonomics, Vol. 2, 2 (1971), 70--73."},{"key":"e_1_3_2_2_17_1","unstructured":"Heitor Guimaraes. 2018. Music Genre classification using Convolutional Neural Networks. Github.  Heitor Guimaraes. 2018. Music Genre classification using Convolutional Neural Networks. Github."},{"key":"e_1_3_2_2_18_1","volume-title":"Der General-Ba\u00df in der Composition [1728]","author":"Heinichen Johann David","year":"1969","unstructured":"Johann David Heinichen . 1969. Der General-Ba\u00df in der Composition [1728] . Hildesheim : Olms ( 1969 ). Johann David Heinichen. 1969. Der General-Ba\u00df in der Composition [1728]. Hildesheim: Olms (1969)."},{"key":"e_1_3_2_2_19_1","volume-title":"Long short-term memory. Neural computation","author":"Hochreiter Sepp","year":"1997","unstructured":"Sepp Hochreiter and J\u00fcrgen Schmidhuber . 1997. Long short-term memory. Neural computation , Vol. 9 , 8 ( 1997 ), 1735--1780. Sepp Hochreiter and J\u00fcrgen Schmidhuber. 1997. Long short-term memory. Neural computation, Vol. 9, 8 (1997), 1735--1780."},{"key":"e_1_3_2_2_20_1","volume-title":"Jean Garcia-Gathright, and Jennifer Thom.","author":"Hosey Christine","year":"2019","unstructured":"Christine Hosey , Lara Vujovi\u0107 , Brian St Thomas , Jean Garcia-Gathright, and Jennifer Thom. 2019 . Just Give Me What I Want: How People Use and Evaluate Music Search. In ACM SIGCHI. ACM , 299. Christine Hosey, Lara Vujovi\u0107, Brian St Thomas, Jean Garcia-Gathright, and Jennifer Thom. 2019. Just Give Me What I Want: How People Use and Evaluate Music Search. In ACM SIGCHI. ACM, 299."},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"crossref","unstructured":"Qibin Hou Ming-Ming Cheng Xiaowei Hu Ali Borji Zhuowen Tu and Philip HS Torr. 2017. Deeply supervised salient object detection with short connections. In CVPR. 3203--3212.  Qibin Hou Ming-Ming Cheng Xiaowei Hu Ali Borji Zhuowen Tu and Philip HS Torr. 2017. Deeply supervised salient object detection with short connections. In CVPR. 3203--3212.","DOI":"10.1109\/CVPR.2017.563"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300851"},{"key":"e_1_3_2_2_23_1","unstructured":"Danyal Imran. 2016. Music Emotion Recognition. Github. https:\/\/github.com\/danz1ka19\/Music-Emotion-Recognition.  Danyal Imran. 2016. Music Emotion Recognition. Github. https:\/\/github.com\/danz1ka19\/Music-Emotion-Recognition."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"crossref","unstructured":"Junki Kikuchi Hidekatsu Yanagi and Yoshiaki Mima. 2016. Music composition with recommendation. In UIST. ACM 137--138.  Junki Kikuchi Hidekatsu Yanagi and Yoshiaki Mima. 2016. Music composition with recommendation. In UIST. ACM 137--138.","DOI":"10.1145\/2984751.2985733"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1469-8986.1979.tb01511.x"},{"key":"e_1_3_2_2_26_1","volume-title":"motivation, and anxiety: Brain mechanisms and psychophysiology. Biological psychiatry","author":"Lang Peter J","year":"1998","unstructured":"Peter J Lang , Margaret M Bradley , and Bruce N Cuthbert . 1998. Emotion , motivation, and anxiety: Brain mechanisms and psychophysiology. Biological psychiatry , Vol. 44 , 12 ( 1998 ), 1248--1263. Peter J Lang, Margaret M Bradley, and Bruce N Cuthbert. 1998. Emotion, motivation, and anxiety: Brain mechanisms and psychophysiology. Biological psychiatry, Vol. 44, 12 (1998), 1248--1263."},{"key":"e_1_3_2_2_27_1","unstructured":"Jen-Chun Lin Wen-Li Wei James Yang Hsin-Min Wang and Hong-Yuan Mark Liao. 2017. Automatic music video generation based on simultaneous soundtrack recommendation and video editing. In ACM Multimedia. 519--527.  Jen-Chun Lin Wen-Li Wei James Yang Hsin-Min Wang and Hong-Yuan Mark Liao. 2017. Automatic music video generation based on simultaneous soundtrack recommendation and video editing. In ACM Multimedia. 519--527."},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1109\/TPAMI.2010.70","article-title":"Learning to detect a salient object","volume":"33","author":"Liu Tie","year":"2010","unstructured":"Tie Liu , Zejian Yuan , Jian Sun , Jingdong Wang , Nanning Zheng , Xiaoou Tang , and Heung-Yeung Shum . 2010 . Learning to detect a salient object . TPAMI , Vol. 33 , 2 (2010), 353 -- 367 . Tie Liu, Zejian Yuan, Jian Sun, Jingdong Wang, Nanning Zheng, Xiaoou Tang, and Heung-Yeung Shum. 2010. Learning to detect a salient object. TPAMI, Vol. 33, 2 (2010), 353--367.","journal-title":"TPAMI"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290607.3312784"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1177\/1356766708098171"},{"key":"e_1_3_2_2_31_1","volume-title":"Matthew Yee-King, and Mark d'Inverno.","author":"McCormack Jon","year":"2019","unstructured":"Jon McCormack , Toby Gifford , Patrick Hutchings , Maria Teresa Llano Rodriguez , Matthew Yee-King, and Mark d'Inverno. 2019 . In a Silent Way : Communication Between AI and Improvising Musicians Beyond Sound. In ACM SIGCHI. ACM , 38. Jon McCormack, Toby Gifford, Patrick Hutchings, Maria Teresa Llano Rodriguez, Matthew Yee-King, and Mark d'Inverno. 2019. In a Silent Way: Communication Between AI and Improvising Musicians Beyond Sound. In ACM SIGCHI. ACM, 38."},{"key":"e_1_3_2_2_32_1","volume-title":"Emotion and meaning in music","author":"Meyer Leonard B","unstructured":"Leonard B Meyer . 2008. Emotion and meaning in music . University of Chicago Press . Leonard B Meyer. 2008. Emotion and meaning in music .University of Chicago Press."},{"key":"e_1_3_2_2_33_1","volume-title":"Dagstuhl Follow-Ups","volume":"3","author":"M\u00fcller Meinard","year":"2012","unstructured":"Meinard M\u00fcller and Jonathan Driedger . 2012 . Data-driven sound track generation . In Dagstuhl Follow-Ups , Vol. 3 . Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik. Meinard M\u00fcller and Jonathan Driedger. 2012. Data-driven sound track generation. In Dagstuhl Follow-Ups, Vol. 3. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik."},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1525\/mp.2004.22.1.41"},{"key":"e_1_3_2_2_35_1","volume-title":"Ambient sound provides supervision for visual learning","author":"Owens Andrew","unstructured":"Andrew Owens , Jiajun Wu , Josh H McDermott , William T Freeman , and Antonio Torralba . 2016. Ambient sound provides supervision for visual learning . In ECCV. Springer , 801--816. Andrew Owens, Jiajun Wu, Josh H McDermott, William T Freeman, and Antonio Torralba. 2016. Ambient sound provides supervision for visual learning. In ECCV. Springer, 801--816."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.142"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.3402\/rlt.v16i3.10901"},{"key":"e_1_3_2_2_38_1","volume-title":"Proceedings of ECAI-98 Workshop on AI\/Alife and Entertainment. Citeseer.","author":"Robertson Judy","year":"1998","unstructured":"Judy Robertson , Andrew de Quincey , Tom Stapleford , and Geraint Wiggins . 1998 . Real-time music generation for a virtual environment . In Proceedings of ECAI-98 Workshop on AI\/Alife and Entertainment. Citeseer. Judy Robertson, Andrew de Quincey, Tom Stapleford, and Geraint Wiggins. 1998. Real-time music generation for a virtual environment. In Proceedings of ECAI-98 Workshop on AI\/Alife and Entertainment. Citeseer."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"crossref","unstructured":"Steve Rubin and Maneesh Agrawala. 2014. Generating emotionally relevant musical scores for audio stories. In UIST. ACM 439--448.  Steve Rubin and Maneesh Agrawala. 2014. Generating emotionally relevant musical scores for audio stories. In UIST. ACM 439--448.","DOI":"10.1145\/2642918.2647406"},{"key":"e_1_3_2_2_40_1","unstructured":"Zhengshan Shi and Gautham J Mysore. 2018. LoopMaker: Automatic Creation of Music Loops from Pre-recorded Music. In ACM SIGCHI. ACM 454.  Zhengshan Shi and Gautham J Mysore. 2018. LoopMaker: Automatic Creation of Music Loops from Pre-recorded Music. In ACM SIGCHI. ACM 454."},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2988229"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"crossref","unstructured":"Zhenyu Tang Nicolas Morales and Dinesh Manocha. 2018. Dynamic Sound Field Synthesis for Speech and Music Optimization. In ACM Multimedia. 1901--1909.  Zhenyu Tang Nicolas Morales and Dinesh Manocha. 2018. Dynamic Sound Field Synthesis for Speech and Music Optimization. In ACM Multimedia. 1901--1909.","DOI":"10.1145\/3240508.3240644"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"crossref","unstructured":"Quoc-Tuan Truong and Hady W Lauw. 2017. Visual sentiment analysis for review images with item-oriented and user-oriented CNN. In ACM Multimedia. 1274--1282.  Quoc-Tuan Truong and Hady W Lauw. 2017. Visual sentiment analysis for review images with item-oriented and user-oriented CNN. In ACM Multimedia. 1274--1282.","DOI":"10.1145\/3123266.3123374"},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1037\/0096-3445.123.4.394"},{"key":"e_1_3_2_2_45_1","volume-title":"How brains beware: neural mechanisms of emotional attention. Trends in cognitive sciences","author":"Vuilleumier Patrik","year":"2005","unstructured":"Patrik Vuilleumier . 2005. How brains beware: neural mechanisms of emotional attention. Trends in cognitive sciences , Vol. 9 , 12 ( 2005 ), 585--594. Patrik Vuilleumier. 2005. How brains beware: neural mechanisms of emotional attention. Trends in cognitive sciences, Vol. 9, 12 (2005), 585--594."},{"key":"e_1_3_2_2_46_1","volume-title":"Playing with tagging: A real-time tagging music player","author":"Wang Ju-Chiang","unstructured":"Ju-Chiang Wang , Hsin-Min Wang , and Shyh-Kang Jeng . 2012. Playing with tagging: A real-time tagging music player . In ICASSP. IEEE , 77--80. Ju-Chiang Wang, Hsin-Min Wang, and Shyh-Kang Jeng. 2012. Playing with tagging: A real-time tagging music player. In ICASSP. IEEE, 77--80."},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2019.05.026"},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3355089.3356487","article-title":"b. Comic-guided speech synthesis","volume":"38","author":"Wang Yujia","year":"2019","unstructured":"Yujia Wang , Wenguan Wang , Wei Liang , and Lap-Fai Yu . 2019 b. Comic-guided speech synthesis . TOG , Vol. 38 , 6 (2019), 1 -- 14 . Yujia Wang, Wenguan Wang, Wei Liang, and Lap-Fai Yu. 2019 b. Comic-guided speech synthesis. TOG, Vol. 38, 6 (2019), 1--14.","journal-title":"TOG"},{"key":"e_1_3_2_2_49_1","volume-title":"Gamasutra","volume":"29","author":"Whitmore Guy","year":"2003","unstructured":"Guy Whitmore . 2003 . Design with music in mind: A guide to adaptive audio for game designers . Gamasutra , May , Vol. 29 (2003). Guy Whitmore. 2003. Design with music in mind: A guide to adaptive audio for game designers. Gamasutra, May, Vol. 29 (2003)."},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2018.2879399"},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1108\/EUM0000000002577"},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2010.2064164"},{"key":"e_1_3_2_2_53_1","unstructured":"Quanzeng You Jiebo Luo Hailin Jin and Jianchao Yang. 2015. Robust image sentiment analysis using progressively trained and domain transferred deep networks. In AAAI. 381--388.  Quanzeng You Jiebo Luo Hailin Jin and Jianchao Yang. 2015. Robust image sentiment analysis using progressively trained and domain transferred deep networks. In AAAI. 381--388."},{"key":"e_1_3_2_2_54_1","unstructured":"Quanzeng You Jiebo Luo Hailin Jin and Jianchao Yang. 2016. Building a large scale dataset for image emotion recognition: The fine print and the benchmark. In AAAI. 308--314.  Quanzeng You Jiebo Luo Hailin Jin and Jianchao Yang. 2016. Building a large scale dataset for image emotion recognition: The fine print and the benchmark. In AAAI. 308--314."},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"crossref","unstructured":"Sicheng Zhao Yue Gao Xiaolei Jiang Hongxun Yao Tat-Seng Chua and Xiaoshuai Sun. 2014. Exploring principles-of-art features for image emotion recognition. In ACM Multimedia. 47--56.  Sicheng Zhao Yue Gao Xiaolei Jiang Hongxun Yao Tat-Seng Chua and Xiaoshuai Sun. 2014. Exploring principles-of-art features for image emotion recognition. In ACM Multimedia. 47--56.","DOI":"10.1145\/2647868.2654930"},{"key":"e_1_3_2_2_56_1","doi-asserted-by":"crossref","unstructured":"Yipin Zhou Zhaowen Wang Chen Fang Trung Bui and Tamara L Berg. 2018. Visual to sound: Generating natural sound for videos in the wild. In CVPR. 3550--3558.  Yipin Zhou Zhaowen Wang Chen Fang Trung Bui and Tamara L Berg. 2018. Visual to sound: Generating natural sound for videos in the wild. In CVPR. 3550--3558.","DOI":"10.1109\/CVPR.2018.00374"}],"event":{"name":"MM '20: The 28th ACM International Conference on Multimedia","location":"Seattle WA USA","acronym":"MM '20","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 28th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394171.3413894","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3394171.3413894","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:32:06Z","timestamp":1750195926000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394171.3413894"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,12]]},"references-count":56,"alternative-id":["10.1145\/3394171.3413894","10.1145\/3394171"],"URL":"https:\/\/doi.org\/10.1145\/3394171.3413894","relation":{},"subject":[],"published":{"date-parts":[[2020,10,12]]},"assertion":[{"value":"2020-10-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}