{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,27]],"date-time":"2026-05-27T13:41:01Z","timestamp":1779889261241,"version":"3.53.1"},"reference-count":72,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T00:00:00Z","timestamp":1647993600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Machine Learning (ML) algorithms within a human\u2013computer framework are the leading force in speech emotion recognition (SER). However, few studies explore cross-corpora aspects of SER; this work aims to explore the feasibility and characteristics of a cross-linguistic, cross-gender SER. Three ML classifiers (SVM, Na\u00efve Bayes and MLP) are applied to acoustic features, obtained through a procedure based on Kononenko\u2019s discretization and correlation-based feature selection. The system encompasses five emotions (disgust, fear, happiness, anger and sadness), using the Emofilm database, comprised of short clips of English movies and the respective Italian and Spanish dubbed versions, for a total of 1115 annotated utterances. The results see MLP as the most effective classifier, with accuracies higher than 90% for single-language approaches, while the cross-language classifier still yields accuracies higher than 80%. The results show cross-gender tasks to be more difficult than those involving two languages, suggesting greater differences between emotions expressed by male versus female subjects than between different languages. Four feature domains, namely, RASTA, F0, MFCC and spectral energy, are algorithmically assessed as the most effective, refining existing literature and approaches based on standard sets. To our knowledge, this is one of the first studies encompassing cross-gender and cross-linguistic assessments on SER.<\/jats:p>","DOI":"10.3390\/s22072461","type":"journal-article","created":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T22:08:06Z","timestamp":1648073286000},"page":"2461","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":31,"title":["The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8675-5532","authenticated-orcid":false,"given":"Giovanni","family":"Costantini","sequence":"first","affiliation":[{"name":"Department of Electronic Engineering, University of Rome Tor Vergata, 00133 Rome, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1843-3632","authenticated-orcid":false,"given":"Emilia","family":"Parada-Cabaleiro","sequence":"additional","affiliation":[{"name":"Institute of Computational Perception, Johannes Kepler University, 4040 Linz, Austria"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Daniele","family":"Casali","sequence":"additional","affiliation":[{"name":"Department of Electronic Engineering, University of Rome Tor Vergata, 00133 Rome, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Valerio","family":"Cesarini","sequence":"additional","affiliation":[{"name":"Department of Electronic Engineering, University of Rome Tor Vergata, 00133 Rome, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,3,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"507","DOI":"10.3758\/BF03199574","article-title":"Irrelevant thoughts, emotional mood states, and cognitive task performance","volume":"19","author":"Seibert","year":"1991","journal-title":"Mem. Cognit."},{"key":"ref_2","unstructured":"Frijda, N.H. (1993). Moods, emotion episodes, and emotions. Handbook of Emotions, The Guilford Press."},{"key":"ref_3","first-page":"349","article-title":"Emotion and memory: Effect of mood states on immediate and unexpected delayed recall","volume":"10","author":"Ellis","year":"1995","journal-title":"Psychol. J. Soc. Behav. Personal."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Kwon, O.-W., Chan, K., Hao, J., and Lee, T.-W. (2003, January 1\u20134). Emotion recognition by speech signals. Proceedings of the 8th European Conference on Speech Communication and Technology, Eurospeech 2003\u2014Interspeech 2003, Geneva, Switzerland.","DOI":"10.21437\/Eurospeech.2003-80"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1016\/j.patcog.2010.09.020","article-title":"Survey on speech emotion recognition: Features, classification schemes, and databases","volume":"44","author":"Kamel","year":"2011","journal-title":"Pattern Recognit."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1016\/S0167-6393(03)00099-2","article-title":"Speech emotion recognition using hidden Markov models","volume":"41","author":"Nwe","year":"2003","journal-title":"Speech Commun."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Nicholson, J., Takahashi, K., and Nakatsu, R. (2000). Emotion Recognition in Speech Using Neural Networks. Neural Comput. Appl.","DOI":"10.1007\/s005210070006"},{"key":"ref_8","unstructured":"Cullen, C., Vaughan, B., Kousidis, S., Wang, Y., McDonnell, C., and Campbell, D. (2006, January 25\u201328). Generation of High Quality Audio Natural Emotional Speech Corpus using Task Based Mood Induction. Proceedings of the International Conference on Multidisciplinary Information Sciences and Technologies Extremadura (InSciT), Merida, Spain."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1007\/BF00992107","article-title":"The velten mood induction procedure: A methodological review","volume":"10","author":"Kenealy","year":"1986","journal-title":"Motiv. Emot."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"121","DOI":"10.3758\/BF03335211","article-title":"A convenient self-referencing mood induction procedure","volume":"29","author":"Seibert","year":"1991","journal-title":"Bull. Psychon. Soc."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1177\/0146167291173013","article-title":"Meta-Analysis of Experimental Manipulations: Some Factors Affecting the Velten Mood Induction Procedure","volume":"17","author":"Larsen","year":"1991","journal-title":"Pers. Soc. Psychol. Bull."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1002\/per.466","article-title":"Trait Emotional Intelligence: Behavioural Validation in Two Studies of Emotion Recognition and Reactivity to Mood Induction","volume":"17","author":"Petrides","year":"2003","journal-title":"Eur. J. Personal."},{"key":"ref_13","first-page":"341","article-title":"DEMoS: An Italian emotional speech corpus: Elicitation methods, machine learning, and perception","volume":"54","author":"Costantini","year":"2019","journal-title":"Lang. Resour. Eval."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1037\/h0077714","article-title":"A Circumplex Model of Affect","volume":"39","author":"Russell","year":"1980","journal-title":"J. Pers. Soc. Psychol."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"102","DOI":"10.55612\/s-5002-015-008","article-title":"An exploration on possible correlations among perception and physical characteristics of EMOVO emotional portrayals","volume":"15","author":"Giovannella","year":"2012","journal-title":"IxD&A"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"2637","DOI":"10.22214\/ijraset.2021.37375","article-title":"Speech Emotion Recognition","volume":"9","author":"Swethashrree","year":"2021","journal-title":"Int. J. Res. Appl. Sci. Eng. Technol."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Xiao, Z., Wu, D., Zhang, X., and Tao, Z. (2016, January 23\u201325). Speech emotion recognition cross language families: Mandarin vs. western languages. Proceedings of the 2016 International Conference on Progress in Informatics and Computing (PIC), Shanghai, China.","DOI":"10.1109\/PIC.2016.7949505"},{"key":"ref_18","first-page":"1259","article-title":"Speech emotion recognition based on SVM and KNN classifications fusion","volume":"11","author":"Jawad","year":"2021","journal-title":"Int. J. Electr. Comput. Eng."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Costantini, G., Cesarini, V., and Casali, D. (2022, January 9\u201311). A Subset of Acoustic Features for Machine Learning-Based and Statistical Approaches in Speech Emotion Recognition. Proceedings of the BIOSIGNALS 2022: 15th International Conference on Bio-Inspired Systems and Signal Processing, Online Streaming.","DOI":"10.5220\/0010912500003123"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"9554","DOI":"10.1016\/j.eswa.2015.07.062","article-title":"New approach in quantification of emotional intensity from the speech signal: Emotional temperature","volume":"42","author":"Alonso","year":"2015","journal-title":"Expert Syst. Appl."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1945630","DOI":"10.1155\/2017\/1945630","article-title":"Random Deep Belief Networks for Recognizing Emotions from Speech Signals","volume":"2017","author":"Wen","year":"2017","journal-title":"Comput. Intell. Neurosci."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1186\/s13636-018-0145-5","article-title":"Decision tree SVM model with Fisher feature selection for speech emotion recognition","volume":"2019","author":"Sun","year":"2019","journal-title":"EURASIP J. Audio Speech Music Process."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Kaur, J., and Kumar, A. (2021). Speech Emotion Recognition Using CNN, k-NN, MLP and Random Forest. Computer Networks and Inventive Communication Technologies, Springer.","DOI":"10.1007\/978-981-15-9647-6_39"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"14","DOI":"10.3389\/fcomp.2020.00014","article-title":"Real-Time Speech Emotion Recognition Using a Pre-trained Image Classification Network: Effects of Bandwidth Reduction and Companding","volume":"2","author":"Lech","year":"2020","journal-title":"Front. Comput. Sci."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Aftab, A., Morsali, A., Ghaemmaghami, S., and Champagne, B. (2021). Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition. arXiv.","DOI":"10.1109\/ICASSP43922.2022.9746679"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Gat, I., Aronowitz, H., Zhu, W., Morais, E., and Hoory, R. (2022). Speaker Normalization for Self-supervised Speech Emotion Recognition. arXiv.","DOI":"10.1109\/ICASSP43922.2022.9747460"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"4486","DOI":"10.1007\/s00034-016-0284-9","article-title":"A Subspace Projection Approach for Analysis of Speech Under Stressed Condition","volume":"35","author":"Shukla","year":"2016","journal-title":"Circuits Syst. Signal Process."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1401","DOI":"10.1002\/mds.28508","article-title":"Voice Analysis with Machine Learning: One Step Closer to an Objective Diagnosis of Essential Tremor","volume":"36","author":"Suppa","year":"2021","journal-title":"Mov. Disord."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., and Weiss, B. (2005, January 4\u20138). A database of German emotional speech. Proceedings of the Interspeech 2005, Lisbon, Portugal.","DOI":"10.21437\/Interspeech.2005-446"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1007\/s10579-008-9076-6","article-title":"IEMOCAP: Interactive emotional dyadic motion capture database","volume":"42","author":"Busso","year":"2008","journal-title":"Lang. Resour. Eval."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Wang, W. (2010). Machine Audition: Principles, Algorithms and Systems, IGI Global.","DOI":"10.4018\/978-1-61520-919-4"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1238","DOI":"10.1121\/1.1913238","article-title":"Emotions and speech: Some acoustical correlates","volume":"52","author":"Williams","year":"1972","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_34","unstructured":"Costantini, G., Iaderola, I., Paoloni, A., and Todisco, M. (2014, January 26\u201331). EMOVO Corpus: An Italian Emotional Speech Database. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC\u201914), Reykjavik, Iceland."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1093\/mind\/os-IX.34.188","article-title":"II.\u2014What Is an Emotion?","volume":"os-IX","author":"James","year":"1884","journal-title":"Mind"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"614","DOI":"10.1037\/0022-3514.70.3.614","article-title":"Acoustic Profiles in Vocal Emotion Expression","volume":"70","author":"Banse","year":"1996","journal-title":"J. Pers. Soc. Psychol."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Rajoo, R., and Aun, C. (2016, January 30\u201331). Influences of languages in speech emotion recognition: A comparative study using Malay, English and Mandarin languages. Proceedings of the IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE), Penang, Malaysia.","DOI":"10.1109\/ISCAIE.2016.7575033"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Fu, C., Dissanayake, T., Hosoda, K., Maekawa, T., and Ishiguro, H. (2020, January 3\u20135). Similarity of Speech Emotion in Different Languages Revealed by A Neural Network with Attention. Proceedings of the IEEE 14th International Conference on Semantic Computing (ICSC), San Diego, CA, USA.","DOI":"10.1109\/ICSC.2020.00076"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.specom.2019.04.004","article-title":"Improving multilingual speech emotion recognition by combining acoustic features in a three-layer model","volume":"110","author":"Li","year":"2019","journal-title":"Speech Commun."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Tamulevi\u010dius, G., Korvel, G., Yayak, A.B., Treigys, P., Bernatavi\u010dien\u0117, J., and Kostek, B. (2020). A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces. Electronics, 9.","DOI":"10.3390\/electronics9101725"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"47795","DOI":"10.1109\/ACCESS.2021.3068045","article-title":"A Comprehensive Review of Speech Emotion Recognition Systems","volume":"9","author":"Wani","year":"2021","journal-title":"IEEE Access"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.parkreldis.2020.03.012","article-title":"Voice analysis in adductor spasmodic dysphonia: Objective diagnosis and response to botulinum toxin","volume":"Volume 73","author":"Suppa","year":"2020","journal-title":"Parkinsonism & Related Disorders"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Parada-Cabaleiro, E., Costantini, G., Batliner, A., Baird, A., and Schuller, B. (2018, January 2\u20136). Categorical vs. Dimensional Perception of Italian Emotional Speech. Proceedings of the Interspeech 2018, Hyderabad, India.","DOI":"10.21437\/Interspeech.2018-47"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Hansen, J.H.L., and Bou-Ghazale, S.E. (1997, January 22\u201325). Getting started with SUSAS: A speech under simulated and actual stress database. Proceedings of the 5th European Conference on Speech Communication and Technology (EUROSPEECH 1997), Rhodes, Greece.","DOI":"10.21437\/Eurospeech.1997-494"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Kerkeni, L., Serrestou, Y., Mbarki, M., Raoof, K., Mahjoub, M.A., and Cleder, C. (2019). Automatic Speech Emotion Recognition Using Machine Learning. Social Media and Machine Learning, IntechOpen.","DOI":"10.5772\/intechopen.84856"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1845","DOI":"10.1007\/s40747-020-00250-4","article-title":"Cross corpus multi-lingual speech emotion recognition using ensemble learning","volume":"7","author":"Zehra","year":"2021","journal-title":"Complex Intell. Syst."},{"key":"ref_47","unstructured":"Shih, J. (2020). The Rise of the Italian Dubbing Industry, JBI Localization. Available online: https:\/\/jbilocalization.com\/italian-dubbing-growing-industry\/."},{"key":"ref_48","unstructured":"Benavides, L. (2022, February 19). Dubbing Movies Into Spanish Is Big Business for Spain\u2019s Voice Actors, npr.org. Available online: https:\/\/www.npr.org\/2018\/11\/27\/671090473\/dubbing-movies-into-spanish-is-big-business-for-spains-voice-actors."},{"key":"ref_49","unstructured":"Kononenko, I. (1995, January 20\u201325). On biases in estimating multi-valued attributes. Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, QC, Canada."},{"key":"ref_50","unstructured":"Eibe, F., Hall, M.A., and Witten, I.H. (2016). The WEKA Workbench. Online Appendix for \u201cData Mining: Practical Machine Learning Tools and Techniques\u201d, Morgan Kauffman. [4th ed.]."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Kacur, J., Puterka, B., Pavlovicova, J., and Oravec, M. (2021). On the Speech Properties and Feature Extraction Methods in Speech Emotion Recognition. Sensors, 21.","DOI":"10.3390\/s21051888"},{"key":"ref_52","unstructured":"Bimbot, F., Cerisara, C., Cecile, F., Gravier, G., Lamel, L., Pellegrino, F., and Perrier, P. (2013, January 25\u201329). In Proceedings of the Interspeech 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1145\/2729095.2729097","article-title":"openSMILE:): The Munich open-source large-scale multimedia feature extractor","volume":"6","author":"Eyben","year":"2015","journal-title":"ACM SIGMultimedia Rec."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Gr\u00fcnwald, P.D. (2007). The Minimum Description Length Principle. Adaptive Computation and Machine Learning Series, MIT Press.","DOI":"10.7551\/mitpress\/4643.001.0001"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"1930001","DOI":"10.1142\/S2661335219300018","article-title":"Minimum Description Length Revisited","volume":"11","author":"Roos","year":"2019","journal-title":"Int. J. Math. Ind."},{"key":"ref_56","unstructured":"Kira, K., and Rendell, L.A. (1992, January 12\u201316). The Feature Selection Problem: Traditional Methods and a New Algorithm. Proceedings of the 10th National Conference on Artificial Intelligence, San Jose, CA, USA."},{"key":"ref_57","unstructured":"Cestnik, B. (1989, January 26\u201328). Informativity-Based Splitting of Numerical Attributes into Intervals. Proceedings of the IASTED International Conference on Expert Systems, Theory and Applications, Zurich, Switzerland."},{"key":"ref_58","unstructured":"Hall, M.A. (1999). Correlation-Based Feature Selection for Machine Learning, University of Waikato."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Sammut, C., and Webb, G.I. (2010). Na\u00efve Bayes. Encyclopedia of Machine Learning, Springer.","DOI":"10.1007\/978-0-387-30164-8"},{"key":"ref_60","first-page":"83","article-title":"The Elements of Statistical Learning: Data Mining, Inference, and Prediction","volume":"27","author":"Hastie","year":"2004","journal-title":"Math. Intell."},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"80","DOI":"10.2307\/3001968","article-title":"Individual Comparisons by Ranking Methods","volume":"1","author":"Wilcoxon","year":"1945","journal-title":"Biom. Bull."},{"key":"ref_62","unstructured":"McDonald, J.H. (2022, March 12). Wilcoxon Signed-Rank Test\u2014Handbook of Biological Statistics. Available online: http:\/\/www.biostathandbook.com\/wilcoxonsignedrank.html."},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Student (1908). The probable error of a mean. Biometrika, 4, 1\u201325.","DOI":"10.2307\/2331554"},{"key":"ref_64","unstructured":"Dair, Z., Donovan, R., and O\u2019Reilly, R. (2021). Linguistic and Gender Variation in Speech Emotion Recognition using Spectral Features. arXiv."},{"key":"ref_65","unstructured":"Bogert, B.P. (1963, January 11\u201314). The quefrency alanysis of time series for echoes; Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking. Proceedings of the Symposium on Time Series Analysis, New York, NY, USA."},{"key":"ref_66","unstructured":"Saggio, G., and Costantini, G. (2020). Worldwide Healthy Adult Voice Baseline Parameters: A Comprehensive Review. J. Voice."},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1109\/89.326616","article-title":"RASTA processing of speech","volume":"2","author":"Hermansky","year":"1994","journal-title":"IEEE Trans. Speech Audio Process."},{"key":"ref_68","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1121\/1.396427","article-title":"Measurement of pitch by subharmonic summation","volume":"83","author":"Hermes","year":"1988","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_69","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1186\/s13636-017-0100-x","article-title":"Efficiency of chosen speech descriptors in relation to emotion recognition","volume":"2017","author":"Anbarjafari","year":"2017","journal-title":"EURASIP J. Audio Speech Music Process."},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Cesarini, V., Casiddu, N., Porfirione, C., Massazza, G., Saggio, G., and Costantini, G. (2021, January 7\u20139). A Machine Learning-Based Voice Analysis for the Detection of Dysphagia Biomarkers. Proceedings of the 2021 IEEE International Workshop on Metrology for Industry 4.0 IoT (MetroInd4.0 IoT), Roma, Italy.","DOI":"10.1109\/MetroInd4.0IoT51437.2021.9488503"},{"key":"ref_71","unstructured":"Robotti, C., Costantini, G., Saggio, G., Cesarini, V., Calastri, A., Maiorano, E., Piloni, D., Perrone, T., Sabatini, U., and Ferretti, V.V. (2021). Machine Learning-based Voice Assessment for the Detection of Positive and Recovered COVID-19 Patients. J. Voice."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Gupta, K., and Gupta, D. (2016, January 14\u201315). An analysis on LPC, RASTA and MFCC techniques in Automatic Speech recognition system. Proceedings of the 6th International Conference\u2014Cloud System and Big Data Engineering (Confluence), Noida, India.","DOI":"10.1109\/CONFLUENCE.2016.7508170"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/7\/2461\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:41:27Z","timestamp":1760136087000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/7\/2461"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,23]]},"references-count":72,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2022,4]]}},"alternative-id":["s22072461"],"URL":"https:\/\/doi.org\/10.3390\/s22072461","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,23]]}}}