{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T00:31:23Z","timestamp":1773793883872,"version":"3.50.1"},"reference-count":45,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2025,7,4]],"date-time":"2025-07-04T00:00:00Z","timestamp":1751587200000},"content-version":"vor","delay-in-days":184,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/doi.wiley.com\/10.1002\/tdm_license_1.1"}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Journal of Electrical and Computer Engineering"],"published-print":{"date-parts":[[2025,1]]},"abstract":"<jats:p>This study uses deep learning to explore the influence of phonetic similarities across languages on multilingual SER systems in diverse linguistic contexts. A deep convolutional neural network (DCNN) model was employed to evaluate the performance of speech emotion detection in a multilingual context. Experimented datasets are the SUST Bangla Emotional Speech Corpus (SUBESCO), Indian Institute of Technology Kharagpur Simulated Emotion Hindi Speech Corpus (IITKGP\u2010SEHSC), SIT Bhubaneswar\u2010Odia Speech Emotion Database (SITB\u2010OSED), Ryerson Audio\u2010Visual Database of Emotional Speech and Song (RAVDESS), and EmoDB datasets of Bangla, Hindi, Odia, English, and German languages, respectively. Here, Bangla, Hindi, and Odia are of the Indo\u2010Aryan language family and English and German are of the Germanic. A baseline monolingual experiment was performed first to evaluate the models, and then cross\u2010lingual and multilingual experiments were carried out. The experimental results reveal that the models can recognize emotions of multiple language speech of the same linguistic family better than language speech from different families. The DCNN model achieved the highest multilingual emotion recognition accuracy of 83% for Indo\u2010Aryan languages, 79% for Germanic languages, and 73% when both language families were combined. These results suggest that phonetic similarities within the same language family improve recognition accuracy.<\/jats:p>","DOI":"10.1155\/jece\/4748790","type":"journal-article","created":{"date-parts":[[2025,7,4]],"date-time":"2025-07-04T07:49:36Z","timestamp":1751615376000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["A Deep Learning Approach Toward Analyzing the Cross\u2010Lingual Acoustic\u2010Phonetic Similarities in Multilingual Speech Emotion Recognition"],"prefix":"10.1155","volume":"2025","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6221-179X","authenticated-orcid":false,"given":"Syeda Tamanna Alam","family":"Monisha","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9582-8868","authenticated-orcid":false,"given":"Sadia","family":"Sultana","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2025,7,4]]},"reference":[{"key":"e_1_2_10_1_2","doi-asserted-by":"publisher","DOI":"10.1145\/3129340"},{"key":"e_1_2_10_2_2","doi-asserted-by":"publisher","DOI":"10.1037\/\/0033-295x.99.3.550"},{"key":"e_1_2_10_3_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11235-011-9624-z"},{"key":"e_1_2_10_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2008.03.012"},{"key":"e_1_2_10_5_2","unstructured":"WilliamsonJ. D. Speech Analyzer for Analyzing Pitch or Frequency Perturbations in Individual Speech Pattern to Determine the Emotional State of the Person 1978."},{"key":"e_1_2_10_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/tbme.2010.2091640"},{"key":"e_1_2_10_7_2","doi-asserted-by":"crossref","unstructured":"Ee Brian OoiK. Alex LowL.-S. LechM. andAllenN. Early Prediction of Major Depression in Adolescents Using Glottal Wave Characteristics and Teager Energy Parameters 2012 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) 2012 IEEE 4613\u20134616.","DOI":"10.1109\/ICASSP.2012.6288946"},{"key":"e_1_2_10_8_2","doi-asserted-by":"publisher","DOI":"10.3390\/app132312587"},{"key":"e_1_2_10_9_2","article-title":"A Review of the Advancement in Speech Emotion Recognition for Indo-Aryan and Dravidian Languages","volume":"2022","author":"Monisha S. T. A.","year":"2022","journal-title":"Advances in Human-Computer Interaction"},{"key":"e_1_2_10_10_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0196391"},{"key":"e_1_2_10_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-008-9076-6"},{"key":"e_1_2_10_12_2","article-title":"Surrey Audio-Visual Expressed Emotion (Savee) Database","author":"Jackson P.","year":"2014","journal-title":"University of Surrey: Guildford, UK"},{"key":"e_1_2_10_13_2","doi-asserted-by":"crossref","unstructured":"RingevalF. SondereggerA. SauerJ. andLalanneD. Introducing the Recola Multimodal Corpus of Remote Collaborative and Affective Interactions 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG) 2013 IEEE 1\u20138.","DOI":"10.1109\/FG.2013.6553805"},{"key":"e_1_2_10_14_2","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2005-446"},{"key":"e_1_2_10_15_2","unstructured":"BatlinerA. SteidlS. andN\u00f6thE. Releasing a Thoroughly Annotated and Processed Spontaneous Emotional Database: The Fau Aibo Emotion Corpus 2008."},{"key":"e_1_2_10_16_2","doi-asserted-by":"publisher","DOI":"10.21437\/speechprosody.2010-123"},{"key":"e_1_2_10_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/t-affc.2010.8"},{"key":"e_1_2_10_18_2","doi-asserted-by":"crossref","unstructured":"WangY. DingJ. MengQ. andJiangX. Multilingual Emotion Analysis and Recognition Based on Prosodic and Semantic Features 2011 International Conference in Electrics Communication and Automatic Control Proceedings 2012 1483\u20131490.","DOI":"10.1007\/978-1-4419-8849-2_191"},{"key":"e_1_2_10_19_2","doi-asserted-by":"crossref","unstructured":"BresterC. SemenkinE. KovalevI. ZelenkovP. andSidorovM. Evolutionary Feature Selection for Emotion Recognition in Multilingual Speech Analysis 2015 IEEE Congress on Evolutionary Computation (CEC) 2015 IEEE 2406\u20132411.","DOI":"10.1109\/CEC.2015.7257183"},{"key":"e_1_2_10_20_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-015-9275-7"},{"key":"e_1_2_10_21_2","doi-asserted-by":"crossref","unstructured":"FarhadM. IsmailH. HarousS. Mehedy MasudM. andBegA. Analysis of Emotion Recognition From Cross-Lingual Speech: Arabic English and Urdu 2021 2nd International Conference on Computation Automation and Knowledge Management (ICCAKM) 2021 IEEE 42\u201347.","DOI":"10.1109\/ICCAKM50778.2021.9357726"},{"key":"e_1_2_10_22_2","doi-asserted-by":"crossref","unstructured":"NeumannM. Cross-Lingual and Multilingual Speech Emotion Recognition on English and French 2018 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) 2018 IEEE 5769\u20135773.","DOI":"10.1109\/ICASSP.2018.8462162"},{"key":"e_1_2_10_23_2","doi-asserted-by":"crossref","unstructured":"LiY. ZhaoT. andKawaharaT. Improved End-Toend Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning Interspeech 2019 2803\u20132807.","DOI":"10.21437\/Interspeech.2019-2594"},{"key":"e_1_2_10_24_2","doi-asserted-by":"crossref","unstructured":"YadavA.andVishwakarmaD. K. A Multilingual Framework of Cnn and Bi-lstm for Emotion Classification 2020 International Conference on Computing Communication and Networking Technologies (ICCCNT) 2020 IEEE.","DOI":"10.1109\/ICCCNT49239.2020.9225614"},{"key":"e_1_2_10_25_2","doi-asserted-by":"crossref","unstructured":"ScottiV. GalatiF. SbattellaL. andTedescoR. Combining Deep and Unsupervised Features for Multilingual Speech Emotion Recognition International Conference on Pattern Recognition 2021 Springer 114\u2013128.","DOI":"10.1007\/978-3-030-68790-8_10"},{"key":"e_1_2_10_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/access.2021.3136251"},{"key":"e_1_2_10_27_2","doi-asserted-by":"publisher","DOI":"10.1250\/ast.44.157"},{"key":"e_1_2_10_28_2","first-page":"681","volume-title":"Recent Trends in Mechatronics towards Industry 4.0: Selected Articles From iM3F 2020, Malaysia","author":"Qadri S. A. A.","year":"2022"},{"key":"e_1_2_10_29_2","doi-asserted-by":"publisher","DOI":"10.1007\/s40747-022-00713-w"},{"key":"e_1_2_10_30_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-022-12411-3"},{"key":"e_1_2_10_31_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-99-5088-1_15"},{"key":"e_1_2_10_32_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0250173"},{"key":"e_1_2_10_33_2","doi-asserted-by":"crossref","unstructured":"KoolagudiS. G. ReddyR. YadavJ. andSreenivasa RaoK. Iitkgp-sehsc: Hindi Speech Corpus for Emotion Analysis 2011 International Conference on Devices and Communications (ICDeCom) 2011 IEEE 1\u20135.","DOI":"10.1109\/ICDECOM.2011.5738540"},{"key":"e_1_2_10_34_2","doi-asserted-by":"publisher","DOI":"10.3390\/electronics11091328"},{"key":"e_1_2_10_35_2","doi-asserted-by":"publisher","DOI":"10.25080\/Majora-7b98e3ed-003"},{"key":"e_1_2_10_36_2","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/1453\/1\/012085"},{"key":"e_1_2_10_37_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2010.09.020"},{"key":"e_1_2_10_38_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2006.04.003"},{"key":"e_1_2_10_39_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-018-09572-8"},{"key":"e_1_2_10_40_2","doi-asserted-by":"crossref","unstructured":"PandeyS. K. Hanumant Singh Shekhawat and SR Mahadeva Prasanna. Deep Learning Techniques for Speech Emotion Recognition: A Review In 2019 29th International Conference Radioelektronika (RADIOELEKTRONIKA) 2019 IEEE 1\u20136.","DOI":"10.1109\/RADIOELEK.2019.8733432"},{"key":"e_1_2_10_41_2","volume-title":"Recognition of Speech Emotion in Call Centre Conversations in a Multilingual Environment","author":"Zvarevashe K.","year":"2021"},{"key":"e_1_2_10_42_2","doi-asserted-by":"crossref","unstructured":"LikithaM. S. GuptaR. HasithaK. andUpendra RajuA. Speech Based Human Emotion Recognition Using Mfcc 2017 International Conference on Wireless Communications Signal Processing and Networking (WiSPNET) 2017 IEEE 2257\u20132260.","DOI":"10.1109\/WiSPNET.2017.8300161"},{"key":"e_1_2_10_43_2","unstructured":"NaazM. F. Krishan Kumar Goyal and Komal Alwani Ex-Plore the Integration of Multimodal Inputs With Facial Expressions for More Comprehensive Emotion Recognition."},{"key":"e_1_2_10_44_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-35314-7_2"},{"key":"e_1_2_10_45_2","doi-asserted-by":"publisher","DOI":"10.3390\/app14167426"}],"container-title":["Journal of Electrical and Computer Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/jece\/4748790","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/full-xml\/10.1155\/jece\/4748790","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/jece\/4748790","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T20:36:49Z","timestamp":1773779809000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/jece\/4748790"}},"subtitle":[],"editor":[{"given":"Raid","family":"Al-Nima","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,1]]},"references-count":45,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1]]}},"alternative-id":["10.1155\/jece\/4748790"],"URL":"https:\/\/doi.org\/10.1155\/jece\/4748790","archive":["Portico"],"relation":{},"ISSN":["2090-0147","2090-0155"],"issn-type":[{"value":"2090-0147","type":"print"},{"value":"2090-0155","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1]]},"assertion":[{"value":"2024-10-16","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-06-06","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-07-04","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"4748790"}}