{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,19]],"date-time":"2026-05-19T13:09:41Z","timestamp":1779196181552,"version":"3.51.4"},"reference-count":32,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2021,4,1]],"date-time":"2021-04-01T00:00:00Z","timestamp":1617235200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2021,11,17]]},"abstract":"<jats:p>Automatic recognition of human affective state using speech has been the focus of the research world for more than two decades. In the present day, with multi-lingual countries like India and Europe, population are communicating in various languages. However, majority of the existing works have put forth different strategies to recognize affect from various databases, with each comprising single language recordings. There exists a great demand for affective systems to serve the context of mixed-language scenario. Hence, this work focusses on an effective methodology to recognize human affective state using speech samples from a mixed language framework. A unique cepstral and bi-spectral speech features derived from the speech samples classified using random forest (RF) are applied for the task. This work is first of its kind with the proposed approach validated and found to be effective on a self-recorded database with speech samples comprising from eleven various diverse Indian languages. Six different affective states of angry, fear, sad, neutral, surprise and happy are considered. Three affective models have been investigated in the work. The experimental results demonstrate the proposed feature combination in addition to data augmentation show enhanced affect recognition.<\/jats:p>","DOI":"10.3233\/jifs-189868","type":"journal-article","created":{"date-parts":[[2021,4,2]],"date-time":"2021-04-02T14:37:18Z","timestamp":1617374238000},"page":"5467-5476","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":5,"title":["Investigation of automatic mixed-lingual affective state recognition system for diverse Indian languages"],"prefix":"10.1177","volume":"41","author":[{"given":"S.","family":"Lalitha","sequence":"first","affiliation":[{"name":"Department of Electronics and Communication Engineering, Amrita School of Engineering, Bengaluru, Amrita Vishwa Vidyapeetham, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Deepa","family":"Gupta","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Amrita School of Engineering, Bengaluru, Amrita Vishwa Vidyapeetham, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2021,4]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"http:\/\/canwetalk.ca\/about-mental-illness\/factors-affectingmental-health\/"},{"key":"e_1_3_1_3_2","doi-asserted-by":"crossref","unstructured":"SlotenJ.V. VerdonckP. NyssenM. and HaueisenJ. Influence of mental stress on heart rate and heart rate variability International Federation for Medical and Biological Engineering Proceedings 2008 pp. 1366\u20131369.","DOI":"10.1007\/978-3-540-89208-3_324"},{"key":"e_1_3_1_4_2","doi-asserted-by":"crossref","unstructured":"BakkerJ. PechenizkiyM. and SidorovaN. What\u2019s your current stress level? Detection of stress patterns from GSR sensor data In Proceedings of ICDM 2011 pp. 573\u2013580.a","DOI":"10.1109\/ICDMW.2011.178"},{"key":"e_1_3_1_5_2","doi-asserted-by":"crossref","unstructured":"GowdaR.K. NimbalkerV. LavanyaR. LalithaS. and TripathiS. Affective computing using speech processing for call centre applications. In: International conference on advances in computing communications and informatics (ICACCI) Udupi; 2017. p. 766\u201371..a","DOI":"10.1109\/ICACCI.2017.8125934"},{"issue":"3","key":"e_1_3_1_6_2","first-page":"1","article-title":"Enhanced speech emotion detection using deep neural networks","volume":"22","author":"Lalitha S.","year":"2018","unstructured":"LalithaS. and TripathiG.D., Enhanced speech emotion detection using deep neural networks, Int J Speech Technol22(3) (2018), 1\u201314.","journal-title":"Int J Speech Technol"},{"key":"e_1_3_1_7_2","doi-asserted-by":"crossref","unstructured":"BurkhardtF. PaeschkeA. RolfesM. SendlmeierW.F. and WeissB. A database of German emotional speech Interspeech ISCA (2005) p. 1517\u20131520.","DOI":"10.21437\/Interspeech.2005-446"},{"key":"e_1_3_1_8_2","doi-asserted-by":"crossref","unstructured":"MartinO. KotsiaI. MacqB. and PitasI. The eNTERFACE 05 audio-visual emotion database In: 22nd international conference on data engineering workshops (ICDEW\u201906) Atlanta GA USA 2006. p. 8\u20138.","DOI":"10.1109\/ICDEW.2006.145"},{"key":"e_1_3_1_9_2","unstructured":"JacksonP. and HaqS. Surrey audio-visual expressed emotion (Savee) database Guildford UK: University of Surrey 2014."},{"key":"e_1_3_1_10_2","doi-asserted-by":"crossref","unstructured":"ZhalehpourS. OnderO. AkhtarZ. and ErdemC.E. BAUM-1: a spontaneous audio visual face database of affective and mental states IEEE Transactions on Affective Computing3. vol. 8. p. 300\u2013313.","DOI":"10.1109\/TAFFC.2016.2553038"},{"key":"e_1_3_1_11_2","doi-asserted-by":"crossref","unstructured":"PoriaS. HazarikaD. MajumderN. NaikG. CambriaE. and MihalceaR. MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations (2019) 527\u2013536","DOI":"10.18653\/v1\/P19-1050"},{"key":"e_1_3_1_12_2","doi-asserted-by":"crossref","unstructured":"SefaraT.J. The Effects of Normalisation Methods on Speech Emotion Recognition 2019 International Multidisciplinary Information Technology and Engineering Conference (IMITEC) Vanderbijlpark South Africa 2019 pp. 1\u20138","DOI":"10.1109\/IMITEC45504.2019.9015895"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-012-9368-5"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1166\/jctn.2020.8666"},{"key":"e_1_3_1_15_2","doi-asserted-by":"crossref","unstructured":"DaneshfarF. JahanshahS. and NeekabadiA. Speech emotion recognition using hybrid spectral-prosodic features of speech signal\/glottal waveform metaheuristic-based dimensionality reduction and Gaussian elliptical basis function network classifier Applied Acoustics (2020).","DOI":"10.1016\/j.apacoust.2020.107360"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.3390\/a13030070"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2901352"},{"key":"e_1_3_1_18_2","doi-asserted-by":"crossref","unstructured":"LalithaS. and TripathiS. Emotion detection using perceptual based speech features 2016 IEEE Annual India Conference (INDICON) Bangalore 2016 pp. 1\u20135.","DOI":"10.1109\/INDICON.2016.7839028"},{"key":"e_1_3_1_19_2","doi-asserted-by":"crossref","unstructured":"LeeC.M. YildirimS. BulutM. KazemzadehA. BussoC. DengZ. LeeS.S. and NarayananS. Emotion recognition based on phoneme classes Int Conf Spoken Lang Process (2004) 205\u2013211.","DOI":"10.21437\/Interspeech.2004-322"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2010.09.020"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.3390\/s18113744"},{"key":"e_1_3_1_22_2","doi-asserted-by":"crossref","unstructured":"LeilaK. YoussefS. KosaiR. MohamedM. MohamedM. and CatherineC. Automatic speech emotion recognition using an optimal combination of features based on EMD-TKEO Speech Communication 2019 114.","DOI":"10.1016\/j.specom.2019.09.002"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1049\/iet-spr.2016.0336"},{"key":"e_1_3_1_24_2","doi-asserted-by":"crossref","unstructured":"ZhangY. LiuY. WeningerF. and SchullerB. Multi-task deep neural network with shared hidden layers: Breaking down the wall between emotion representations In: IEEE International conference on acoustics speech andsignal processing (ICASSP) New Orleans LA 2017.","DOI":"10.1109\/ICASSP.2017.7953106"},{"key":"e_1_3_1_25_2","doi-asserted-by":"crossref","unstructured":"LeeS. The generalization effect for multilingual speech emotion recognition across heterogeneous languages In: ICASSP 2019 \u20132019 IEEE international conference on acoustics speech and signal processing (ICASSP). p. 5881\u20135885.","DOI":"10.1109\/ICASSP.2019.8683046"},{"key":"e_1_3_1_26_2","doi-asserted-by":"crossref","unstructured":"ZhangY. LiuY. WeningerF. and SchullerB. Multi-task deep neural network with shared hidden layers: Breaking down the wall between emotion representations In: IEEE International conference on acoustics speech and signal processing (ICASSP) New Orleans LA 2017. p. 4990\u20134994.","DOI":"10.1109\/ICASSP.2017.7953106"},{"key":"e_1_3_1_27_2","first-page":"688","article-title":"Blind detection ofphotomontage using higher order statistics","volume":"685","author":"Ng T.-T.","year":"2004","unstructured":"NgT.-T., ChangS.-F. and SunQ., Blind detection ofphotomontage using higher order statistics, International Symposium on Circuits and Systems IEEE685 (2004), 688\u2013691.","journal-title":"International Symposium on Circuits and Systems IEEE"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.apacoust.2020.107519"},{"key":"e_1_3_1_29_2","unstructured":"BachuR. KopparthiS. AdapaB. and BarkanaB. Voiced\/Unvoiced Decision for Speech Signals Based on Zero-CrossingRate and Energy ElleithyK. (eds) Advanced Techniques in Computing Sciences."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10916-010-9633-6"},{"key":"e_1_3_1_31_2","doi-asserted-by":"crossref","unstructured":"ChawlaN.V. BowyerW.K. HallO.L. and KegelmeyerW.P. SMOTE: synthetic minority over-sampling technique J Artif Intell Res (2002) 321\u2013357.","DOI":"10.1613\/jair.953"},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-31865-1_25"},{"key":"e_1_3_1_33_2","unstructured":"https:\/\/www.python.org"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-189868","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-189868","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-189868","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:43:45Z","timestamp":1777455825000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-189868"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4]]},"references-count":32,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,11,17]]}},"alternative-id":["10.3233\/JIFS-189868"],"URL":"https:\/\/doi.org\/10.3233\/jifs-189868","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,4]]}}}