{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T03:32:38Z","timestamp":1775878358114,"version":"3.50.1"},"reference-count":67,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2022,4,29]],"date-time":"2022-04-29T00:00:00Z","timestamp":1651190400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Guru Nanak Dev Engineering College, Ludhiana, Punjab"},{"name":"IKG Punjab Technical University, Kapurthala, Punjab"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2022,9,30]]},"abstract":"<jats:p>As a challenge to refine the spontaneity and productivity of a machine and human coherence, speech emotion recognition has been an overriding area of research. The trustability and fulfillment of emotion recognition are largely involved with the feature extraction and selection processes. An important role is played in exploring and distinguishing audio content during the feature extraction phase. Also, the features that have been extracted should be resilient to a number of disturbances and reliable enough for an adequate classification system. This article focuses on three main components of a Speech Emotion Recognition (SER) process. The first one is the optimal feature extraction method for a Punjabi SER system. The second one is the use of an appropriate feature selection method that selects effectual features from the ones extracted in the first step and removes the redundant features to improve the conduct of emotion recognition. The third one is the classification model that has been used further for emotion recognition. So the scope of this article is to explain the three main steps of the Punjabi SER system: feature extraction, feature selection, and emotion recognition with classifier. The results have been calculated and compared for number of feature set combinations, with and without a feature selection process. A total of 10 experiments are carried out, and various performance metrics such as precision, recall, F1-score, accuracy, and so on, are used to demonstrate the results.<\/jats:p>","DOI":"10.1145\/3511888","type":"journal-article","created":{"date-parts":[[2022,3,9]],"date-time":"2022-03-09T20:23:51Z","timestamp":1646857431000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Impact of Feature Extraction and Feature Selection Algorithms on Punjabi Speech Emotion Recognition Using Convolutional Neural Network"],"prefix":"10.1145","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3542-1214","authenticated-orcid":false,"given":"Kamaldeep","family":"Kaur","sequence":"first","affiliation":[{"name":"Research Scholar, IKG Punjab Technical University, Punjab, India and Department of Computer Science &amp; Engineering, Guru Nanak Dev Engineering College, Ludhiana, Punjab, India"}]},{"given":"Parminder","family":"Singh","sequence":"additional","affiliation":[{"name":"Department of Computer Science &amp; Engineering, Guru Nanak Dev Engineering College, Ludhiana, Punjab, India"}]}],"member":"320","published-online":{"date-parts":[[2022,4,29]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12652-021-03479-0"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-020-05248-1"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1007\/s005210070006"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2010.09.020"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.apacoust.2018.11.028"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2008.09.003"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2017.07.050"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1080\/03637754109374888"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-020-10329-2"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2019.09.002"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-019-09775-8"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/CIS.2014.148"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2013.08.004"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2015.2503757"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/IKT.2015.7288756"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1049\/el.2014.3339"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-016-9364-2"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/SPA.2016.7763627"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/BIOSMART.2016.7835600"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISCAIE.2016.7575033"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2014.01.003"},{"key":"e_1_3_2_23_2","article-title":"Speech emotion recognition using Fourier parameters","author":"Wang K.","year":"2015","unstructured":"K. Wang, N. An, B. N. Li, Y. Zhang, and L. Li. 2015. Speech emotion recognition using Fourier parameters. IEEE Trans. Affect. Comput. (2015).","journal-title":"IEEE Trans. Affect. Comput"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-016-9333-9"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/TENCON.2008.4766487"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/TENCON.2016.7848296"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICICICT.2014.6781245"},{"key":"e_1_3_2_28_2","first-page":"2812","article-title":"Bengali speech emotion recognition","author":"Mohanta A.","year":"2016","unstructured":"A. Mohanta and U. Sharma. 2016. Bengali speech emotion recognition. In Proceedings of the 3rd International Conference on Computing for Sustainable Global Development (INDIACom\u201916). 2812\u20132814.","journal-title":"Proceedings of the 3rd International Conference on Computing for Sustainable Global Development (INDIACom\u201916)"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.protcy.2016.05.242"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-012-9139-3"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-012-9175-z"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDECOM.2011.5738540"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSDA.2011.6085972"},{"key":"e_1_3_2_34_2","first-page":"1865","volume-title":"Proceedings of the 2nd International Conference on Computing for Sustainable Global Development (INDIACom\u201915)","author":"Bansal S.","year":"2015","unstructured":"S. Bansal and A. Dev. 2015. Emotional Hindi speech: Feature extraction and classification. In Proceedings of the 2nd International Conference on Computing for Sustainable Global Development (INDIACom\u201915). 1865\u20131868."},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.18201\/ijisae.2021473641"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-10-8240-5_33"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1063\/1.4992990"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2016.06.032"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.5120\/ijca2018916290"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/79.911197"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2005.858051"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-016-9358-0"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-014-9239-3"},{"key":"e_1_3_2_44_2","article-title":"Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques","volume":"2","author":"Muda L.","year":"2010","unstructured":"L. Muda, M. Begam, and I. Elamvazuthi. 2010. Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques. J. Comput. 2 (2010).","journal-title":"J. Comput."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3043201"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCS45141.2019.9065620"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-017-9426-0"},{"key":"e_1_3_2_48_2","first-page":"222","volume-title":"Proceedings of the International Conference on Spoken Language Processing (ICSLP\u201900 \/INTERSPEECH\u201900)","author":"Petrushin V.","year":"2000","unstructured":"V. Petrushin. 2000. Emotion recognition in speech signal: Experimental study, development, and application. In Proceedings of the International Conference on Spoken Language Processing (ICSLP\u201900 \/INTERSPEECH\u201900). 222\u2013225."},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-018-9495-8"},{"key":"e_1_3_2_50_2","volume-title":"Proceedings of the International Conference on Power, Energy, Environment and Computer Science (PEECS\u201905)","author":"Toh A.","year":"2005","unstructured":"A. Toh, R. Togneri, and S. Nordholm. 2005. Spectral entropy as speech features for speech recognition. In Proceedings of the International Conference on Power, Energy, Environment and Computer Science (PEECS\u201905)."},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/CONFLUENCE.2016.7508171"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCCT2.2019.8824988"},{"key":"e_1_3_2_53_2","unstructured":"L. Bankert. 1994. Feature selection for case-based classification of cloud types: An empirical comparison."},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-46742-8_19"},{"key":"e_1_3_2_55_2","first-page":"61","article-title":"The chi square test: An introduction","volume":"4","author":"Ugoni A.","year":"1995","unstructured":"A. Ugoni and B. Walker. 1995. The chi square test: An introduction. COMSIG Review\/COMSIG, Chiropractors and Osteopaths Musculo-Skeletal Interest Group 4 (1995), 61\u201364.","journal-title":"COMSIG Review\/COMSIG, Chiropractors and Osteopaths Musculo-Skeletal Interest Group"},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.apnum.2020.09.013"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCB.2002.804363"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICMLA.2007.35"},{"key":"e_1_3_2_59_2","first-page":"1","volume-title":"VU Amsterdam","author":"Fonti V.","year":"2017","unstructured":"V. Fonti. 2017. Feature selection using LASSO. VU Amsterdam 1\u201326."},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1109\/WCCCT.2016.25"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-019-09727-2"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009715923555"},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11767-012-0871-2"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1109\/5.18626"},{"key":"e_1_3_2_65_2","first-page":"16","article-title":"Dimensionality reduction and classification of color features data using svm and knn","volume":"1","author":"Manjunath R.","year":"2013","unstructured":"R. Manjunath. 2013. Dimensionality reduction and classification of color features data using svm and knn. Int. J. Image Process. Vis. Commun. 1 (2013), 16\u201321.","journal-title":"Int. J. Image Process. Vis. Commun."},{"key":"e_1_3_2_66_2","first-page":"1","article-title":"Introduction to convolutional neural networks","author":"Wu J.","year":"2017","unstructured":"J. Wu. 2017. Introduction to convolutional neural networks. In Introduction to Convolutional Neural Networks. 1\u201331.","journal-title":"Introduction to Convolutional Neural Networks."},{"key":"e_1_3_2_67_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6638947"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511888","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3511888","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:48:50Z","timestamp":1750182530000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511888"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,29]]},"references-count":67,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,9,30]]}},"alternative-id":["10.1145\/3511888"],"URL":"https:\/\/doi.org\/10.1145\/3511888","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,4,29]]},"assertion":[{"value":"2021-11-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-01-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-04-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}