{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:19:19Z","timestamp":1750220359365,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":35,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,4,23]],"date-time":"2021-04-23T00:00:00Z","timestamp":1619136000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Development of AI for Analysis and Synthesis of Korean Pansori","award":["NRF-2021R1A2C2006895"],"award-info":[{"award-number":["NRF-2021R1A2C2006895"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,23]]},"DOI":"10.1145\/3468891.3468911","type":"proceedings-article","created":{"date-parts":[[2021,9,6]],"date-time":"2021-09-06T17:42:54Z","timestamp":1630950174000},"page":"132-137","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Music Emotion Classification with Deep Neural Nets"],"prefix":"10.1145","author":[{"given":"Yagya","family":"Raj Pandeya","sequence":"first","affiliation":[{"name":"Jeonbuk National University, South Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bhuwan","family":"Bhattarai","sequence":"additional","affiliation":[{"name":"Jeonbuk National University, South Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joonwhoan","family":"Lee","sequence":"additional","affiliation":[{"name":"Jeonbuk National University, South Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,9,6]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"1317","volume-title":"USA","author":"Qian Kun","year":"2015","unstructured":"Kun Qian , Zixing Zhang , Fabien Ringeval and Bjorn 
Schuller . 2015 . Bird sounds classification by large scale acoustic features and extreme learning machine.2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP). Orlando, FL , USA , pp. 1317 - 1321 , doi: 10.1109\/GlobalSIP.2015.7418412. Kun Qian, Zixing Zhang, Fabien Ringeval and Bjorn Schuller. 2015. Bird sounds classification by large scale acoustic features and extreme learning machine.2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP). Orlando, FL, USA, pp. 1317-1321, doi: 10.1109\/GlobalSIP.2015.7418412."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Yagya Raj Pandeya and Joonwhoan Lee Domestic Cat Sound Classification Using Transfer Learning Int. J. Fuzzy Log. Intell. Syst. 2018;18(2):154-160  Yagya Raj Pandeya and Joonwhoan Lee Domestic Cat Sound Classification Using Transfer Learning Int. J. Fuzzy Log. Intell. Syst. 2018;18(2):154-160","DOI":"10.5391\/IJFIS.2018.18.2.154"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.3390\/app8101949"},{"key":"e_1_3_2_1_4_1","first-page":"273","volume-title":"Korea (South)","author":"Pandeya Yagya R.","year":"2020","unstructured":"Yagya R. Pandeya , Bhuwan Bhattarai and Joonwhoan Lee . 2020 . Sound Event Detection in Cowshed using Synthetic Data and Convolutional Neural Network.2020 International Conference on Information and Communication Technology Convergence (ICTC), Jeju , Korea (South) , pp. 273 - 276 , doi: 10.1109\/ICTC49870.2020.9289545. Yagya R. Pandeya, Bhuwan Bhattarai and Joonwhoan Lee. 2020. Sound Event Detection in Cowshed using Synthetic Data and Convolutional Neural Network.2020 International Conference on Information and Communication Technology Convergence (ICTC), Jeju, Korea (South), pp. 273-276, doi: 10.1109\/ICTC49870.2020.9289545."},{"key":"e_1_3_2_1_5_1","first-page":"162625","volume-title":"Bhuwan Bhattarai and Joonwhoan Lee","author":"Pandeya Yagya R.","year":"2020","unstructured":"Yagya R. 
Pandeya , Bhuwan Bhattarai and Joonwhoan Lee . 2020 . Visual Object Detector for Cow Sound Event Detection. in\u00a0 IEEE Access , vol. 8 , pp. 162625 - 162633 , 2020, doi: 10.1109\/ACCESS.2020.3022058. Yagya R. Pandeya, Bhuwan Bhattarai and Joonwhoan Lee. 2020. Visual Object Detector for Cow Sound Event Detection. in\u00a0 IEEE Access, vol. 8, pp. 162625-162633, 2020, doi: 10.1109\/ACCESS.2020.3022058."},{"key":"e_1_3_2_1_6_1","volume-title":"\u201cCross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems","author":"Kong Qiuqiang","year":"2019","unstructured":"Qiuqiang Kong , Yin Cao , Turab Iqbal , Yong Xu , Wenwu Wang , and Mark D. Plumbley , \u201cCross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems ,\u201d arXiv, 2019 . [Online]. https:\/\/arxiv.org\/abs\/1904.03476 Qiuqiang Kong, Yin Cao, Turab Iqbal, Yong Xu, Wenwu Wang, and Mark D. Plumbley, \u201cCross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems,\u201d arXiv,2019. [Online]. https:\/\/arxiv.org\/abs\/1904.03476"},{"key":"e_1_3_2_1_7_1","volume-title":"Pandeya and Joonwhoan Lee","author":"Yagya","year":"2019","unstructured":"Yagya R. Pandeya and Joonwhoan Lee . 2019 . Music-Video Emotion Analysis Using Late Fusion of Multimodal. DEStech Transactions on Computer Science and Engineering . Yagya R. Pandeya and Joonwhoan Lee. 2019. Music-Video Emotion Analysis Using Late Fusion of Multimodal. DEStech Transactions on Computer Science and Engineering."},{"key":"e_1_3_2_1_8_1","volume-title":"Hernane Borges de Barros Pereira.","author":"de Freitas Piedade Melo Dirceu","year":"2020","unstructured":"Dirceu de Freitas Piedade Melo , Inacio de Sousa Fadigas , Hernane Borges de Barros Pereira. 2020 . Graph-based feature extraction: A new proposal to study the classification of music signals outside the time-frequency domain. Plos One . 
https:\/\/doi.org\/10.1371\/journal.pone.0240915 Dirceu de Freitas Piedade Melo, Inacio de Sousa Fadigas, Hernane Borges de Barros Pereira. 2020. Graph-based feature extraction: A new proposal to study the classification of music signals outside the time-frequency domain. Plos One. https:\/\/doi.org\/10.1371\/journal.pone.0240915"},{"key":"e_1_3_2_1_9_1","first-page":"1","volume-title":"Berkay \u00d6zt\u00fcrk and Nizamettin Aydin.","author":"Elbir Ahmet","year":"2018","unstructured":"Ahmet Elbir , Hilmi Bilal \u00c7am , Mehmet Emre Iyican , Berkay \u00d6zt\u00fcrk and Nizamettin Aydin. 2018 . Music Genre Classification and Recommendation by Using Machine Learning Techniques.2018 Innovations in Intelligent Systems and Applications Conference (ASYU), Adana, Turkey , pp. 1 - 5 , doi: 10.1109\/ASYU.2018.8554016. Ahmet Elbir, Hilmi Bilal \u00c7am, Mehmet Emre Iyican, Berkay \u00d6zt\u00fcrk and Nizamettin Aydin. 2018. Music Genre Classification and Recommendation by Using Machine Learning Techniques.2018 Innovations in Intelligent Systems and Applications Conference (ASYU), Adana, Turkey, pp. 1-5, doi: 10.1109\/ASYU.2018.8554016."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/INISTA.2017.8001169"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1049\/iet-spr.2019.0381"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Pablo Gimeno Ignacio Vi\u00f1als Alfonso Ortega Antonio Miguel Eduardo Lleida. 2020.\u00a0Multiclass audio segmentation based on recurrent neural networks for broadcast domain data.\u00a0 J AUDIO SPEECH MUSIC PROC.\u00a02020 \u00a05. https:\/\/doi.org\/10.1186\/s13636-020-00172-6  Pablo Gimeno Ignacio Vi\u00f1als Alfonso Ortega Antonio Miguel Eduardo Lleida. 2020.\u00a0Multiclass audio segmentation based on recurrent neural networks for broadcast domain data.\u00a0 J AUDIO SPEECH MUSIC PROC.\u00a02020 \u00a05. 
https:\/\/doi.org\/10.1186\/s13636-020-00172-6","DOI":"10.1186\/s13636-020-00172-6"},{"key":"e_1_3_2_1_13_1","first-page":"110","volume-title":"MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation.In Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC)","author":"Takahashi Naoya","year":"2018","unstructured":"Naoya Takahashi , Nabarun Goswami , and Yuki Mitsufuji . 2018. MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation.In Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC) . Tokyo, Japan , 17\u201320 September 2018 ; pp. 106\u2013 110 . Naoya Takahashi, Nabarun Goswami, and Yuki Mitsufuji. 2018. MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation.In Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC). Tokyo, Japan, 17\u201320 September 2018; pp. 106\u2013110."},{"key":"e_1_3_2_1_14_1","first-page":"206016","volume-title":"Parallel Stacked Hourglass Network for Music Source Separation,\" in\u00a0 IEEE Access","author":"Bhattarai Bhuwan","year":"2020","unstructured":"Bhuwan Bhattarai , Yagya R. Pandeya , and Joonwhoan Lee . 2020. Parallel Stacked Hourglass Network for Music Source Separation,\" in\u00a0 IEEE Access , vol. 8 , pp. 206016 - 206027 , 2020 , doi: 10.1109\/ACCESS.2020.3037773. Bhuwan Bhattarai, Yagya R. Pandeya, and Joonwhoan Lee. 2020. Parallel Stacked Hourglass Network for Music Source Separation,\" in\u00a0 IEEE Access, vol. 8, pp. 206016-206027, 2020, doi: 10.1109\/ACCESS.2020.3037773."},{"key":"e_1_3_2_1_15_1","volume-title":"Gunjan V., Garcia Diaz V., Cardona M., Solanki V., Sunitha K. (eds) ICICCT 2019 \u2013 System Reliability, Quality Control, Safety, Maintenance and Management. ICICCT","author":"Vandana Tula","year":"2019","unstructured":"Tula Vandana , Nara Kalyani , and K. 
Santhi Sree . 2020. Music Mood Categorization: A Survey . In: Gunjan V., Garcia Diaz V., Cardona M., Solanki V., Sunitha K. (eds) ICICCT 2019 \u2013 System Reliability, Quality Control, Safety, Maintenance and Management. ICICCT 2019 . Springer , Singapore . https:\/\/doi.org\/10.1007\/978-981-13-8461-5_14 Tula Vandana, Nara Kalyani, and K. Santhi Sree. 2020. Music Mood Categorization: A Survey. In: Gunjan V., Garcia Diaz V., Cardona M., Solanki V., Sunitha K. (eds) ICICCT 2019 \u2013 System Reliability, Quality Control, Safety, Maintenance and Management. ICICCT 2019. Springer, Singapore. https:\/\/doi.org\/10.1007\/978-981-13-8461-5_14"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Yeong-Seok Seo and Jun-Ho Huh. 2019. Automatic Emotion-Based Music Classification for Supporting Intelligent IoT Applications. Electronics.  Yeong-Seok Seo and Jun-Ho Huh. 2019. Automatic Emotion-Based Music Classification for Supporting Intelligent IoT Applications. Electronics.","DOI":"10.3390\/electronics8020164"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICMLA.2008.96"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Changfeng Chen and Qiang Li. 2020. A Multimodal Music Emotion Classification Method Based on Multifeature Combined Network Classifier. Mathematical Problems in Engineering.\u00a011.\u00a0https:\/\/doi.org\/10.1155\/2020\/4606027  Changfeng Chen and Qiang Li. 2020. A Multimodal Music Emotion Classification Method Based on Multifeature Combined Network Classifier. Mathematical Problems in Engineering.\u00a011.\u00a0https:\/\/doi.org\/10.1155\/2020\/4606027","DOI":"10.1155\/2020\/4606027"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5391\/IJFIS.2019.19.2.88"},{"key":"e_1_3_2_1_20_1","volume-title":"Yi-Hsuan Yang and Mohammad Soleymani","author":"Aljanaki Anna","year":"2017","unstructured":"Anna Aljanaki , Yi-Hsuan Yang and Mohammad Soleymani . 2017 . 
Developing a Benchmark for Emotional Analysis of Music. PLoS ONE 12(3): e0173392, doi:10.1371\/journal.pone.0173392 Anna Aljanaki, Yi-Hsuan Yang and Mohammad Soleymani. 2017. Developing a Benchmark for Emotional Analysis of Music. PLoS ONE 12(3): e0173392, doi:10.1371\/journal.pone.0173392"},{"key":"e_1_3_2_1_21_1","volume-title":"Dasa Ticha and Roman Jarina","author":"Malik Miroslav","year":"2017","unstructured":"Miroslav Malik , Sharath Adavanne , Konstantinos Drossos , Tuomas Virtanen , Dasa Ticha and Roman Jarina . 2017 . Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition . arXiv:1706.02292v1. Miroslav Malik, Sharath Adavanne, Konstantinos Drossos, Tuomas Virtanen, Dasa Ticha and Roman Jarina. 2017. Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition. arXiv:1706.02292v1."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2014.6890290"},{"key":"e_1_3_2_1_23_1","volume-title":"ISMIR2020","author":"Choi Woosung","year":"2020","unstructured":"Woosung Choi ,\u00a0 Minseok Kim ,\u00a0 Jaehwa Chung ,\u00a0 Daewon Lee ,\u00a0 Soonyoung Jung . 2020 . Investigating U-nets with Various Intermediate blocks for Spectrogram-Based Singing Voice Separation . ISMIR2020 . Woosung Choi,\u00a0Minseok Kim,\u00a0Jaehwa Chung,\u00a0Daewon Lee,\u00a0Soonyoung Jung. 2020. Investigating U-nets with Various Intermediate blocks for Spectrogram-Based Singing Voice Separation. ISMIR2020."},{"key":"e_1_3_2_1_24_1","volume-title":"Phasen: A phase-and-harmonics-aware speech enhancement network. arXiv preprint arXiv:1911.04697.","author":"Yin Dacheng","year":"2019","unstructured":"Dacheng Yin ,\u00a0 Chong Luo ,\u00a0 Zhiwei Xiong ,\u00a0 Wenjun Zeng . 2019 . Phasen: A phase-and-harmonics-aware speech enhancement network. arXiv preprint arXiv:1911.04697. Dacheng Yin,\u00a0Chong Luo,\u00a0Zhiwei Xiong,\u00a0Wenjun Zeng. 2019. Phasen: A phase-and-harmonics-aware speech enhancement network. 
arXiv preprint arXiv:1911.04697."},{"key":"e_1_3_2_1_25_1","volume-title":"An introduction to the psychology of hearing","author":"Moore Brian","year":"2012","unstructured":"Brian CJ- Moore . An introduction to the psychology of hearing . Brill , 2012 . Brian CJ-Moore. An introduction to the psychology of hearing. Brill, 2012."},{"key":"e_1_3_2_1_26_1","volume-title":"Pandeya and Joonwhoan Lee","author":"Yagya","year":"2020","unstructured":"Yagya R. Pandeya and Joonwhoan Lee . 2020 . Deep Learning-based Late Fusion of Multimodal Information for Emotion Classification of Music Video. Multimed Tools Appl . 537. Yagya R. Pandeya and Joonwhoan Lee. 2020. Deep Learning-based Late Fusion of Multimodal Information for Emotion Classification of Music Video. Multimed Tools Appl. 537."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1177\/0305735610362821"},{"key":"e_1_3_2_1_28_1","first-page":"165","article-title":"The Greek Audio Dataset","volume":"2014","author":"Ioannis\u00a0Karydis Lida\u00a0Kermanidis","year":"2014","unstructured":"Dimos\u00a0Makris, Katia\u00a0 Lida\u00a0Kermanidis and Ioannis\u00a0Karydis . 2014 . The Greek Audio Dataset . Artificial Intelligence Applications and Innovations - AIAI 2014 , pp. 165 \u2013 173 . Rhodos, Greece: Springer, Berlin, Heidelberg, http:\/\/doi.org\/10.1007\/978-3-662-44722-2_18 Dimos\u00a0Makris, Katia\u00a0Lida\u00a0Kermanidis and Ioannis\u00a0Karydis. 2014. The Greek Audio Dataset. Artificial Intelligence Applications and Innovations - AIAI 2014, pp. 165\u2013173. Rhodos, Greece: Springer, Berlin, Heidelberg, http:\/\/doi.org\/10.1007\/978-3-662-44722-2_18","journal-title":"Artificial Intelligence Applications and Innovations - AIAI"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2015.03.004"},{"key":"e_1_3_2_1_30_1","first-page":"60","volume-title":"Basic Emotions in Handbook of Cognition and Emotion","author":"Ekman Paul","unstructured":"Paul Ekman . 1999. 
Basic Emotions in Handbook of Cognition and Emotion . Wiley Hoboken , pp. 45\u2013 60 . Paul Ekman. 1999. Basic Emotions in Handbook of Cognition and Emotion. Wiley Hoboken, pp. 45\u201360."},{"volume-title":"A Regression Approach to Music Emotion Recognition. in\u00a0 IEEE Transactions on Audio, Speech, and Language Processing","author":"Yang Yi-Hsuan","key":"e_1_3_2_1_31_1","unstructured":"Yi-Hsuan Yang ,\u00a0 Yu-Ching Lin ,\u00a0 Ya-Fan Su ,\u00a0 Homer H. Chen . 2008. A Regression Approach to Music Emotion Recognition. in\u00a0 IEEE Transactions on Audio, Speech, and Language Processing , vol. 16 , no. 2, pp. 448-457, doi: 10.1109\/TASL.2007.911513. Yi-Hsuan Yang,\u00a0Yu-Ching Lin,\u00a0Ya-Fan Su,\u00a0Homer H. Chen. 2008. A Regression Approach to Music Emotion Recognition. in\u00a0 IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 2, pp. 448-457, doi: 10.1109\/TASL.2007.911513."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1037\/h0077714"},{"key":"e_1_3_2_1_33_1","volume-title":"Chen","author":"Yi-Hsuan","year":"2011","unstructured":"Yi-Hsuan and Homer H . Chen . 2011 . Music Emotion Recognition. CRC Press . Yi-Hsuan and Homer H. Chen. 2011. Music Emotion Recognition. CRC Press."},{"volume-title":"Novel Audio Features for Music Emotion Recognition.\u00a0 IEEE Transactions on Affective Computing","author":"Panda Renato","key":"e_1_3_2_1_34_1","unstructured":"Renato Panda ,\u00a0 Ricardo Malheiro ,\u00a0 Rui Pedro Paiva . 2020. Novel Audio Features for Music Emotion Recognition.\u00a0 IEEE Transactions on Affective Computing . vol. 11 , no. 4, pp. 614-626, 1 Oct.-Dec. 2020, doi: 10.1109\/TAFFC.2018.2820691. Renato Panda,\u00a0Ricardo Malheiro,\u00a0Rui Pedro Paiva. 2020. Novel Audio Features for Music Emotion Recognition.\u00a0 IEEE Transactions on Affective Computing. vol. 11, no. 4, pp. 614-626, 1 Oct.-Dec. 
2020, doi: 10.1109\/TAFFC.2018.2820691."},{"key":"e_1_3_2_1_35_1","volume-title":"CNN Based Music Emotion Classification","author":"Liu Xin","year":"2017","unstructured":"Xin Liu ,\u00a0 Qingcai Chen ,\u00a0 Xiangping Wu ,\u00a0 Yan Liu ,\u00a0 Yang Liu . 2017. CNN Based Music Emotion Classification . 2017 , arXiv:1704.05665. Xin Liu,\u00a0Qingcai Chen,\u00a0Xiangping Wu,\u00a0Yan Liu,\u00a0Yang Liu. 2017. CNN Based Music Emotion Classification. 2017, arXiv:1704.05665."}],"event":{"name":"ICMLT 2021: 2021 6th International Conference on Machine Learning Technologies","acronym":"ICMLT 2021","location":"Jeju Island Republic of Korea"},"container-title":["2021 6th International Conference on Machine Learning Technologies"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3468891.3468911","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3468891.3468911","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:22Z","timestamp":1750191442000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3468891.3468911"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,23]]},"references-count":35,"alternative-id":["10.1145\/3468891.3468911","10.1145\/3468891"],"URL":"https:\/\/doi.org\/10.1145\/3468891.3468911","relation":{},"subject":[],"published":{"date-parts":[[2021,4,23]]},"assertion":[{"value":"2021-09-06","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}