{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T13:41:00Z","timestamp":1753882860495,"version":"3.41.2"},"reference-count":69,"publisher":"World Scientific Pub Co Pte Ltd","issue":"14","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2022,11]]},"abstract":"<jats:p> Speech-driven user interfaces are becoming more common in our lives. To interact with such systems naturally and effectively, machines need to recognize the emotional states of users and respond to them accordingly. At the heart of the emotion recognition research done to this end lies the emotion representation that enables machines to learn and predict emotions. Speech emotion recognition studies use a wide range of low-to-high-level acoustic features for representation purposes such as LLDs, their functionals, and BoAW. In this paper, we present a new method for extracting a novel set of high-level features for classifying emotions. For this purpose, we (1) reduce the dimension of discrete-time speech signals, (2) perform a quantization operation on the new signals and assign a distinct symbol to each quantization level, (3) use the symbol sequences representing the signals to extract discriminative patterns that are capable of distinguishing different emotions from each other, and (4) generate a separate set of features for each emotion from the extracted patterns. Experimental results show that pattern features outperform Energy, Voicing, MFCC, Spectral, and RASTA feature sets. We also demonstrate that combining the pattern-based features and the acoustic features further improves the classification performance. <\/jats:p>","DOI":"10.1142\/s0218001422500458","type":"journal-article","created":{"date-parts":[[2022,10,7]],"date-time":"2022-10-07T17:32:25Z","timestamp":1665163945000},"source":"Crossref","is-referenced-by-count":1,"title":["A Pattern Mining Approach for Improving Speech Emotion Recognition"],"prefix":"10.1142","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7433-8704","authenticated-orcid":false,"given":"Umut","family":"Avci","sequence":"first","affiliation":[{"name":"Department of Software Engineering, Yasar University, Izmir, Turkey"}]}],"member":"219","published-online":{"date-parts":[[2022,11,24]]},"reference":[{"key":"S0218001422500458BIB001","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2807385"},{"key":"S0218001422500458BIB002","first-page":"31","volume-title":"Proc. 2018 IEEE Recent Advances in Intelligent Computational Systems","author":"Alex S. B.","year":"2018"},{"key":"S0218001422500458BIB003","first-page":"43","volume-title":"SGAI 2015: Research and Development in Intelligent Systems","author":"Alshdaifat E.","year":"2015"},{"key":"S0218001422500458BIB004","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1007\/978-3-030-26061-3_6","volume-title":"SPECOM 2019: Speech and Computer","volume":"11658","author":"Avci U.","year":"2019"},{"key":"S0218001422500458BIB005","first-page":"109","volume-title":"Proc. 2017 Int. Conf. Inventive Communication and Computational Technologies","author":"Basu S.","year":"2017"},{"key":"S0218001422500458BIB006","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1145\/1027933.1027968","volume-title":"Proc. 6th Int. Conf. Multimodal Interfaces","author":"Busso C.","year":"2004"},{"issue":"1","key":"S0218001422500458BIB007","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1109\/TAFFC.2016.2515617","volume":"8","author":"Busso C.","year":"2017","journal-title":"IEEE Trans. Affect. Comput."},{"key":"S0218001422500458BIB008","first-page":"4738","volume-title":"Proc. 2004 IEEE Int. Conf. Systems, Man and Cybernetics","author":"Chang F.","year":"2004"},{"key":"S0218001422500458BIB009","volume-title":"Proc. CVonline: On-Line Compendium of Computer Vision","volume":"9","author":"Chibelushi C. C.","year":"2003"},{"key":"S0218001422500458BIB011","first-page":"801","volume-title":"Proc. INTERSPEECH 2006: Ninth Int. Conf. Spoken Language Processing","author":"Devillers L.","year":"2006"},{"key":"S0218001422500458BIB012","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1146\/annurev.ps.30.020179.002523","volume":"30","author":"Ekman P.","year":"1979","journal-title":"Annu. Rev. Psychol."},{"issue":"3","key":"S0218001422500458BIB013","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1016\/j.patcog.2010.09.020","volume":"44","author":"El Ayadi M.","year":"2011","journal-title":"Pattern Recognit."},{"key":"S0218001422500458BIB014","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001420500275"},{"key":"S0218001422500458BIB015","first-page":"1459","volume-title":"Proc. 18th ACM Int. Conf. Multimedia","author":"Eyben F.","year":"2010"},{"key":"S0218001422500458BIB017","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-006-0059-1"},{"key":"S0218001422500458BIB018","doi-asserted-by":"crossref","first-page":"223","DOI":"10.21437\/Interspeech.2014-57","volume-title":"Proc. INTERSPEECH 2014: 15th Annu. Conf. International Speech Communication Association","author":"Han K.","year":"2014"},{"key":"S0218001422500458BIB019","first-page":"6822","volume-title":"Proc. 2018 IEEE Int. Conf. Acoustics, Speech and Signal Processing","author":"Han J.","year":"2018"},{"key":"S0218001422500458BIB020","first-page":"53","volume-title":"Proc. BMVA Symp. Facial Analysis and Animation","author":"Haq S.-U.","year":"2009"},{"issue":"15","key":"S0218001422500458BIB021","doi-asserted-by":"crossref","first-page":"288","DOI":"10.3182\/20130811-5-US-2037.00049","volume":"46","author":"Hartmann K.","year":"2013","journal-title":"IFAC Proc. Vol."},{"issue":"7777","key":"S0218001422500458BIB022","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1038\/d41586-019-03013-5","volume":"574","author":"Heaven D.","year":"2019","journal-title":"Nature"},{"issue":"12","key":"S0218001422500458BIB023","doi-asserted-by":"crossref","first-page":"272","DOI":"10.1007\/s10916-016-0627-x","volume":"40","author":"Hossain M. S.","year":"2016","journal-title":"J. Med. Syst."},{"key":"S0218001422500458BIB024","doi-asserted-by":"publisher","DOI":"10.1109\/72.991427"},{"key":"S0218001422500458BIB025","first-page":"5866","volume-title":"Proc. 2019 IEEE Int. Conf. Acoustics, Speech and Signal Processing","author":"Huang K.-Y.","year":"2019"},{"key":"S0218001422500458BIB026","first-page":"886","volume-title":"Proc. 2016 IEEE Int. Conf. Communication and Signal Processing","author":"Jacob A.","year":"2016"},{"key":"S0218001422500458BIB027","first-page":"8","volume-title":"Proc. Fifth Int. Conf. Data Mining","author":"Ji X.","year":"2005"},{"key":"S0218001422500458BIB028","doi-asserted-by":"publisher","DOI":"10.3390\/s19122730"},{"key":"S0218001422500458BIB029","first-page":"1017","volume-title":"Proc. 2017 Int. Conf. Wireless Communications, Signal Processing and Networking (WiSPNET)","author":"Khan A.","year":"2017"},{"key":"S0218001422500458BIB030","first-page":"5166","volume-title":"Proc. 2010 IEEE Int. Conf. Acoustics, Speech and Signal Processing","author":"Kim W.","year":"2010"},{"key":"S0218001422500458BIB032","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1017\/9781316756782","volume-title":"Communicative Functions and Linguistic Forms in Speech Interaction","author":"Kohler K. J.","year":"2017"},{"issue":"2","key":"S0218001422500458BIB034","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1007\/s10772-011-9125-1","volume":"15","author":"Koolagudi S. G.","year":"2012","journal-title":"Int. J. Speech Technol."},{"key":"S0218001422500458BIB036","doi-asserted-by":"publisher","DOI":"10.1038\/nature14539"},{"key":"S0218001422500458BIB037","doi-asserted-by":"crossref","first-page":"849","DOI":"10.1109\/IJCNN.2005.1555963","volume-title":"Proc. 2005 IEEE Int. Joint Conf. Neural Networks","volume":"2","author":"Liu Y.","year":"2005"},{"key":"S0218001422500458BIB038","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0196391"},{"key":"S0218001422500458BIB039","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1016\/j.asoc.2017.05.048","volume":"59","author":"Lucas T.","year":"2017","journal-title":"Appl. Soft Comput."},{"key":"S0218001422500458BIB040","first-page":"225","volume":"33","author":"Madzarov G.","year":"2009","journal-title":"Informatica"},{"key":"S0218001422500458BIB041","first-page":"6715","volume-title":"Proc. 2019 IEEE Int. Conf. Acoustics, Speech and Signal Processing","author":"Mao S.","year":"2019"},{"volume-title":"Proc. Tenth Int. Workshop Frontiers in Handwriting Recognition","year":"2006","author":"Milgram J.","key":"S0218001422500458BIB042"},{"issue":"1","key":"S0218001422500458BIB043","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1109\/TBME.2007.900562","volume":"55","author":"Moore E.","year":"2008","journal-title":"IEEE Trans. Biomed. Eng."},{"key":"S0218001422500458BIB044","first-page":"169","volume-title":"Proc. 13th Int. Conf. Multimodal Interfaces","author":"Morency L.-P.","year":"2011"},{"issue":"4","key":"S0218001422500458BIB045","doi-asserted-by":"crossref","first-page":"3073","DOI":"10.1121\/1.5137665","volume":"146","author":"Morgan M. M.","year":"2019","journal-title":"J. Acoust. Soc. Am."},{"issue":"7","key":"S0218001422500458BIB046","doi-asserted-by":"crossref","first-page":"1159","DOI":"10.1142\/S0218001410008329","volume":"24","author":"Mporas I.","year":"2010","journal-title":"Int. J. Pattern Recognit. Artif. Intell."},{"key":"S0218001422500458BIB047","first-page":"809","volume-title":"Proc. INTERSPEECH 2006: 9th Int. Conf. Spoken Language Processing","author":"Neiberg D.","year":"2006"},{"issue":"4","key":"S0218001422500458BIB048","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1007\/s005210070006","volume":"9","author":"Nicholson J.","year":"2000","journal-title":"Neural Comput. Appl."},{"issue":"4","key":"S0218001422500458BIB049","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1016\/S0167-6393(03)00099-2","volume":"41","author":"Nwe T. L.","year":"2003","journal-title":"Speech Commun."},{"issue":"1","key":"S0218001422500458BIB050","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1177\/0165551516677911","volume":"44","author":"Onan A.","year":"2018","journal-title":"J. Inf. Sci."},{"key":"S0218001422500458BIB051","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2945911"},{"key":"S0218001422500458BIB052","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3049734"},{"key":"S0218001422500458BIB053","doi-asserted-by":"crossref","first-page":"1103","DOI":"10.21437\/Interspeech.2017-1494","volume-title":"Proc. INTERSPEECH 2017","author":"Parthasarathy S.","year":"2017"},{"issue":"8","key":"S0218001422500458BIB054","first-page":"84","volume":"7","author":"Pervaiz M.","year":"2016","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"S0218001422500458BIB055","first-page":"547","volume-title":"Proc. 12th Int. Conf. Neural Information Processing Systems","author":"Platt J. C.","year":"2000"},{"key":"S0218001422500458BIB056","series-title":"Cambridge Handbooks in Language and Linguistics","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1017\/9781316779194.011","volume-title":"The Cambridge Handbook of Spanish Linguistics","author":"Prieto P.","year":"2018"},{"key":"S0218001422500458BIB057","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1109\/ASRU.1997.659129","volume-title":"Proc. 1997 IEEE Workshop Automatic Speech Recognition and Understanding Proceedings","author":"Rabiner L. R.","year":"1997"},{"issue":"1","key":"S0218001422500458BIB058","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/0167-6393(95)00009-D","volume":"17","author":"Reynolds D. A.","year":"1995","journal-title":"Speech Commun."},{"key":"S0218001422500458BIB059","first-page":"589","volume-title":"Proc. 9th Int. Symp. Chinese Spoken Language Processing","author":"Rieger S. A.","year":"2014"},{"issue":"3","key":"S0218001422500458BIB060","first-page":"21","volume":"30","author":"Schmandt C.","year":"1984","journal-title":"IEEE Trans. Consum. Electron."},{"key":"S0218001422500458BIB061","doi-asserted-by":"crossref","first-page":"495","DOI":"10.21437\/Interspeech.2016-1124","volume-title":"Proc. INTERSPEECH 2016","author":"Schmitt M.","year":"2016"},{"key":"S0218001422500458BIB062","first-page":"4585","volume-title":"Proc. 2009 IEEE Int. Conf. Acoustics, Speech and Signal Processing","author":"Schuller B.","year":"2009"},{"key":"S0218001422500458BIB063","first-page":"577","volume-title":"Proc. 2004 IEEE Int. Conf. Acoustics, Speech and Signal Processing","volume":"1","author":"Schuller B.","year":"2004"},{"key":"S0218001422500458BIB064","doi-asserted-by":"crossref","first-page":"148","DOI":"10.21437\/Interspeech.2013-56","volume-title":"Proc. INTERSPEECH 2013: 14th Annu. Conf. International Speech Communication Association","author":"Schuller B.","year":"2013"},{"key":"S0218001422500458BIB065","first-page":"86","volume-title":"Proc. 2013 5th Int. Conf. Knowledge and Smart Technology","author":"Seehapoch T.","year":"2013"},{"key":"S0218001422500458BIB067","first-page":"370","volume-title":"Proc. IEEE Int. Conf. Image Processing 2005","author":"Shan C.","year":"2005"},{"key":"S0218001422500458BIB068","doi-asserted-by":"crossref","first-page":"3076324","DOI":"10.1155\/2019\/3076324","volume":"2019","author":"Tiwari A.","year":"2019","journal-title":"Comput. Intell. Neurosci."},{"key":"S0218001422500458BIB069","doi-asserted-by":"crossref","first-page":"1691","DOI":"10.21437\/Interspeech.2019-1811","volume-title":"Proc. INTERSPEECH 2019","author":"Triantafyllopoulos A.","year":"2019"},{"key":"S0218001422500458BIB070","first-page":"S3G","volume-title":"Proc. 35th Annu. Conf. Frontiers in Education","author":"Wald M.","year":"2005"},{"issue":"4","key":"S0218001422500458BIB071","doi-asserted-by":"crossref","first-page":"1191","DOI":"10.1109\/TASE.2015.2467311","volume":"12","author":"Wang J.","year":"2015","journal-title":"IEEE Trans. Autom. Sci. Eng."},{"key":"S0218001422500458BIB072","doi-asserted-by":"crossref","first-page":"292","DOI":"10.3389\/fpsyg.2013.00292","volume":"4","author":"Weninger F.","year":"2013","journal-title":"Front. Psychol."},{"issue":"11","key":"S0218001422500458BIB073","first-page":"2994","volume":"26","author":"Yang H.","year":"2015","journal-title":"J. Softw."},{"key":"S0218001422500458BIB074","series-title":"Frontiers in Artificial Intelligence and Applications","first-page":"216","volume-title":"Information Technology and Intelligent Transportation Systems","volume":"314","author":"Yang N.","year":"2019"},{"issue":"8","key":"S0218001422500458BIB075","doi-asserted-by":"crossref","first-page":"1685","DOI":"10.1142\/S0218001409007764","volume":"23","author":"You M.","year":"2009","journal-title":"Int. J. Pattern Recognit. Artif. Intell."}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001422500458","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,20]],"date-time":"2022-12-20T02:51:59Z","timestamp":1671504719000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001422500458"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11]]},"references-count":69,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2022,11]]}},"alternative-id":["10.1142\/S0218001422500458"],"URL":"https:\/\/doi.org\/10.1142\/s0218001422500458","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2022,11]]},"article-number":"2250045"}}