{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T16:33:39Z","timestamp":1775579619592,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":55,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,24]],"date-time":"2021-08-24T00:00:00Z","timestamp":1629763200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,24]]},"DOI":"10.1145\/3460426.3463619","type":"proceedings-article","created":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T22:50:29Z","timestamp":1630536629000},"page":"29-36","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":24,"title":["MS-SincResNet: Joint Learning of 1D and 2D Kernels Using Multi-scale SincNet and ResNet for Music Genre Classification"],"prefix":"10.1145","author":[{"given":"Pei-Chun","family":"Chang","sequence":"first","affiliation":[{"name":"National Yang Ming Chiao Tung University, Hsinchu, Taiwan Roc"}]},{"given":"Yong-Sheng","family":"Chen","sequence":"additional","affiliation":[{"name":"National Yang Ming Chiao Tung University, Hsinchu, Taiwan Roc"}]},{"given":"Chang-Hsing","family":"Lee","sequence":"additional","affiliation":[{"name":"Chung Hua University, Hsinchu, Taiwan Roc"}]}],"member":"320","published-online":{"date-parts":[[2021,9]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2014.2326991"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.572"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3043142"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1076\/jnmr.32.1.83.16801"},{"key":"e_1_3_2_1_5_1","volume-title":"Jamie Ryan Kiros, and Geoffrey E Hinton","author":"Ba Jimmy Lei","year":"2016","unstructured":"Jimmy Lei Ba , Jamie Ryan Kiros, and Geoffrey E Hinton . 2016 . Layer normalization. arXiv preprint arXiv:1607.06450 (2016). Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E Hinton. 2016. Layer normalization. arXiv preprint arXiv:1607.06450 (2016)."},{"key":"e_1_3_2_1_6_1","volume-title":"Audio-Based Music Classification with DenseNet and Data Augmentation. In Pacific Rim International Conference on Artificial Intelligence. Springer, 56--65","author":"Bian Wenhao","year":"2019","unstructured":"Wenhao Bian , Jie Wang , Bojin Zhuang , Jiankui Yang , Shaojun Wang , and Jing Xiao . 2019 . Audio-Based Music Classification with DenseNet and Data Augmentation. In Pacific Rim International Conference on Artificial Intelligence. Springer, 56--65 . Wenhao Bian, Jie Wang, Bojin Zhuang, Jiankui Yang, Shaojun Wang, and Jing Xiao. 2019. Audio-Based Music Classification with DenseNet and Data Augmentation. In Pacific Rim International Conference on Artificial Intelligence. Springer, 56--65."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1111\/exsy.12429"},{"key":"e_1_3_2_1_8_1","volume-title":"Music Genre Recognition Using Residual Neural Networks. In TENCON 2019--2019 IEEE Region 10 Conference (TENCON). IEEE","author":"Bisharad Dipjyoti","year":"2019","unstructured":"Dipjyoti Bisharad and Rabul Hussain Laskar . 2019 . Music Genre Recognition Using Residual Neural Networks. In TENCON 2019--2019 IEEE Region 10 Conference (TENCON). IEEE , 2063--2068. Dipjyoti Bisharad and Rabul Hussain Laskar. 2019. Music Genre Recognition Using Residual Neural Networks. In TENCON 2019--2019 IEEE Region 10 Conference (TENCON). IEEE, 2063--2068."},{"key":"e_1_3_2_1_9_1","volume-title":"ISMIR 2004 audio description contest. Music Technology Group of the Universitat Pompeu Fabra, Tech. Rep","author":"Cano Pedro","year":"2006","unstructured":"Pedro Cano , Emilia G\u00f3mez , Fabien Gouyon , Perfecto Herrera , Markus Koppenberger , Beesuan Ong , Xavier Serra , Sebastian Streich , and Nicolas Wack . 2006 . ISMIR 2004 audio description contest. Music Technology Group of the Universitat Pompeu Fabra, Tech. Rep (2006). Pedro Cano, Emilia G\u00f3mez, Fabien Gouyon, Perfecto Herrera, Markus Koppenberger, Beesuan Ong, Xavier Serra, Sebastian Streich, and Nicolas Wack. 2006. ISMIR 2004 audio description contest. Music Technology Group of the Universitat Pompeu Fabra, Tech. Rep (2006)."},{"key":"e_1_3_2_1_10_1","volume-title":"Transfer learning for music classification and regression tasks. arXiv preprint arXiv:1703.09179","author":"Choi Keunwoo","year":"2017","unstructured":"Keunwoo Choi , Gy\u00f6rgy Fazekas , Mark Sandler , and Kyunghyun Cho . 2017. Transfer learning for music classification and regression tasks. arXiv preprint arXiv:1703.09179 ( 2017 ). Keunwoo Choi, Gy\u00f6rgy Fazekas, Mark Sandler, and Kyunghyun Cho. 2017. Transfer learning for music classification and regression tasks. arXiv preprint arXiv:1703.09179 (2017)."},{"key":"e_1_3_2_1_11_1","volume-title":"An evaluation of convolutional neural networks for music classification using spectrograms. Applied soft computing 52","author":"Costa Yandre MG","year":"2017","unstructured":"Yandre MG Costa , Luiz S Oliveira , and Carlos N Silla Jr . 2017. An evaluation of convolutional neural networks for music classification using spectrograms. Applied soft computing 52 ( 2017 ), 28--38. Yandre MG Costa, Luiz S Oliveira, and Carlos N Silla Jr. 2017. An evaluation of convolutional neural networks for music classification using spectrograms. Applied soft computing 52 (2017), 28--38."},{"key":"e_1_3_2_1_12_1","unstructured":"Jonathan Driedger Meinard M\u00fcller and Sascha Disch. 2014. Extending Harmonic- Percussive Separation of Audio Signals.. In ISMIR. 611--616.  Jonathan Driedger Meinard M\u00fcller and Sascha Disch. 2014. Extending Harmonic- Percussive Separation of Audio Signals.. In ISMIR. 611--616."},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the International Conference on Digital Audio Effects (DAFx)","volume":"13","author":"Fitzgerald Derry","year":"2010","unstructured":"Derry Fitzgerald . 2010 . Harmonic\/percussive separation using median filtering . In Proceedings of the International Conference on Digital Audio Effects (DAFx) , Vol. 13 . Derry Fitzgerald. 2010. Harmonic\/percussive separation using median filtering. In Proceedings of the International Conference on Digital Audio Effects (DAFx), Vol. 13."},{"key":"e_1_3_2_1_14_1","volume-title":"Proceedings of the COST G-6 conference on Digital Audio Effects (DAFX-00)","volume":"5","author":"Gouyon Fabien","year":"2000","unstructured":"Fabien Gouyon , Fran\u00e7ois Pachet , Olivier Delerue , 2000 . On the use of zero-crossing rate for an application of classification of percussive sounds . In Proceedings of the COST G-6 conference on Digital Audio Effects (DAFX-00) , Verona, Italy , Vol. 5 . Fabien Gouyon, Fran\u00e7ois Pachet, Olivier Delerue, et al. 2000. On the use of zero-crossing rate for an application of classification of percussive sounds. In Proceedings of the COST G-6 conference on Digital Audio Effects (DAFX-00), Verona, Italy, Vol. 5."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2389824"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2012.2234114"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-35236-2_1"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2002.1035731"},{"key":"e_1_3_2_1_20_1","unstructured":"Xin Jin and Rongfang Bie. 2006. Random Forest and PCA for Self-Organizing Maps based Automatic Music Genre Discrimination.. In DMIN. 414--417.  Xin Jin and Rongfang Bie. 2006. Random Forest and PCA for Self-Organizing Maps based Automatic Music Genre Discrimination.. In DMIN. 414--417."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICICI.2017.8365395"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2004.826766"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2018.8462046"},{"key":"e_1_3_2_1_24_1","article-title":"Music genre classification using MFCC","volume":"112","author":"Kour Gursimran","year":"2015","unstructured":"Gursimran Kour and Neha Mehan . 2015 . Music genre classification using MFCC , SVM and BPNN. International Journal of Computer Applications 112 , 6 (2015). Gursimran Kour and Neha Mehan. 2015. Music genre classification using MFCC, SVM and BPNN. International Journal of Computer Applications 112, 6 (2015).","journal-title":"SVM and BPNN. International Journal of Computer Applications"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2009.2017635"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2007.4284622"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860487"},{"key":"e_1_3_2_1_28_1","unstructured":"Thomas Lidy and Andreas Rauber. 2005. Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In ISMIR. 34--41.  Thomas Lidy and Andreas Rauber. 2005. Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In ISMIR. 34--41."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2005.851880"},{"key":"e_1_3_2_1_30_1","volume-title":"Bottom-up broadcast neural network for music genre classification. Multimedia Tools and Applications","author":"Liu Caifeng","year":"2020","unstructured":"Caifeng Liu , Lin Feng , Guochao Liu , Huibing Wang , and Shenglan Liu . 2020. Bottom-up broadcast neural network for music genre classification. Multimedia Tools and Applications ( 2020 ), 1--19. Caifeng Liu, Lin Feng, Guochao Liu, Huibing Wang, and Shenglan Liu. 2020. Bottom-up broadcast neural network for music genre classification. Multimedia Tools and Applications (2020), 1--19."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2015.2409693"},{"key":"e_1_3_2_1_32_1","volume-title":"Content-based audio classification and segmentation by using support vector machines. Multimedia systems 8, 6","author":"Lu Lie","year":"2003","unstructured":"Lie Lu , Hong-Jiang Zhang , and Stan Z Li. 2003. Content-based audio classification and segmentation by using support vector machines. Multimedia systems 8, 6 ( 2003 ), 482--492. Lie Lu, Hong-Jiang Zhang, and Stan Z Li. 2003. Content-based audio classification and segmentation by using support vector machines. Multimedia systems 8, 6 (2003), 482--492."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2007.899293"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-015-2819-7"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2005.860352"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2018.8462165"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2017.01.013"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2015.09.018"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3017661"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2020.2975422"},{"key":"e_1_3_2_1_41_1","volume-title":"Fundamentals of speech recognition. Fundamentals of speech recognition","author":"Rabiner Lawrence","year":"1993","unstructured":"Lawrence Rabiner . 1993. Fundamentals of speech recognition. Fundamentals of speech recognition ( 1993 ). Lawrence Rabiner. 1993. Fundamentals of speech recognition. Fundamentals of speech recognition (1993)."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2018.8461807"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/SLT.2018.8639585"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2014.2311016"},{"key":"e_1_3_2_1_45_1","volume-title":"Voting-based music genre classification using melspectogram and convolutional neural network. In 2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","author":"Sugianto Sugianto","unstructured":"Sugianto Sugianto and Suyanto Suyanto . 2019. Voting-based music genre classification using melspectogram and convolutional neural network. In 2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI) . IEEE , 330--333. Sugianto Sugianto and Suyanto Suyanto. 2019. Voting-based music genre classification using melspectogram and convolutional neural network. In 2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI). IEEE, 330--333."},{"key":"e_1_3_2_1_46_1","volume-title":"Third International Workshop on Pattern Recognition","volume":"10828","author":"Tang Chun Pui","year":"2018","unstructured":"Chun Pui Tang , Ka Long Chui , Ying Kin Yu , Zhiliang Zeng , and Kin Hong Wong . 2018 . Music genre classification using a hierarchical long short term memory (LSTM) model . In Third International Workshop on Pattern Recognition , Vol. 10828 . International Society for Optics and Photonics, 108281B. Chun Pui Tang, Ka Long Chui, Ying Kin Yu, Zhiliang Zeng, and Kin Hong Wong. 2018. Music genre classification using a hierarchical long short term memory (LSTM) model. In Third International Workshop on Pattern Recognition, Vol. 10828. International Society for Optics and Photonics, 108281B."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.15625\/1813-9663\/36\/4\/14424"},{"key":"e_1_3_2_1_48_1","volume-title":"International Research Journal of Engineering and Technology (IRJET)","author":"Thiruvengatanadhan R","year":"2018","unstructured":"R Thiruvengatanadhan . 2018. Music Genre Classification using MFCC and AANN . International Research Journal of Engineering and Technology (IRJET) ( 2018 ). R Thiruvengatanadhan. 2018. Music Genre Classification using MFCC and AANN. International Research Journal of Engineering and Technology (IRJET) (2018)."},{"key":"e_1_3_2_1_49_1","unstructured":"Adam R Tindale Ajay Kapur George Tzanetakis and Ichiro Fujinaga. 2004. Retrieval of percussion gestures using timbre classification techniques.. In ISMIR.  Adam R Tindale Ajay Kapur George Tzanetakis and Ichiro Fujinaga. 2004. Retrieval of percussion gestures using timbre classification techniques.. In ISMIR."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2016.7472669"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2002.800560"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2002.800560"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2337842"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCCI.2018.8441340"},{"key":"e_1_3_2_1_55_1","volume-title":"Music genre recognition. Benediktsvogler.com","author":"Vogler Benedikt S","year":"2016","unstructured":"Benedikt S Vogler and Amir Othman . 2016. Music genre recognition. Benediktsvogler.com ( 2016 ). Benedikt S Vogler and Amir Othman. 2016. Music genre recognition. Benediktsvogler.com (2016)."}],"event":{"name":"ICMR '21: International Conference on Multimedia Retrieval","location":"Taipei Taiwan","acronym":"ICMR '21","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 2021 International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460426.3463619","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3460426.3463619","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:03Z","timestamp":1750191423000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460426.3463619"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,24]]},"references-count":55,"alternative-id":["10.1145\/3460426.3463619","10.1145\/3460426"],"URL":"https:\/\/doi.org\/10.1145\/3460426.3463619","relation":{},"subject":[],"published":{"date-parts":[[2021,8,24]]},"assertion":[{"value":"2021-09-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}