{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T02:06:44Z","timestamp":1774922804952,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":54,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,5,6]],"date-time":"2021-05-06T00:00:00Z","timestamp":1620259200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Higher Education Commission of Pakistan","award":["CIPL-NCBC"],"award-info":[{"award-number":["CIPL-NCBC"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,5,6]]},"DOI":"10.1145\/3411764.3445171","type":"proceedings-article","created":{"date-parts":[[2021,5,8]],"date-time":"2021-05-08T07:01:49Z","timestamp":1620457309000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["SEMOUR: A Scripted Emotional Speech Repository for Urdu"],"prefix":"10.1145","author":[{"given":"Nimra","family":"Zaheer","sequence":"first","affiliation":[{"name":"Computer Science Information Technology University, Pakistan"}]},{"given":"Obaid Ullah","family":"Ahmad","sequence":"additional","affiliation":[{"name":"Computer Science Information Technology University, Pakistan"}]},{"given":"Ammar","family":"Ahmed","sequence":"additional","affiliation":[{"name":"Computer Science Information Technology University, Pakistan"}]},{"given":"Muhammad Shehryar","family":"Khan","sequence":"additional","affiliation":[{"name":"Computer Science Information Technology University, Pakistan"}]},{"given":"Mudassir","family":"Shabbir","sequence":"additional","affiliation":[{"name":"Computer Science Information Technology University, Pakistan"}]}],"member":"320","published-online":{"date-parts":[[2021,5,7]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"2012 international conference on electronics computer technology (ICECT","author":"Ali Hazrat","year":"2012","unstructured":"Hazrat Ali , Nasir Ahmad , Khawaja\u00a0 M Yahya , and Omar Farooq . 2012 . A medium vocabulary Urdu isolated words balanced corpus for automatic speech recognition . In 2012 international conference on electronics computer technology (ICECT 2012). IEEE, Kanyakumari, India, 473\u2013476. Hazrat Ali, Nasir Ahmad, Khawaja\u00a0M Yahya, and Omar Farooq. 2012. A medium vocabulary Urdu isolated words balanced corpus for automatic speech recognition. In 2012 international conference on electronics computer technology (ICECT 2012). IEEE, Kanyakumari, India, 473\u2013476."},{"key":"e_1_3_2_1_2_1","volume-title":"3rd International Conference on Learning Representations, ICLR","author":"Bahdanau Dzmitry","year":"2015","unstructured":"Dzmitry Bahdanau , Kyunghyun Cho , and Yoshua Bengio . 2015 . Neural machine translation by jointly learning to align and translate . In 3rd International Conference on Learning Representations, ICLR 2015. Computational and Biological Learning Society, San Diego, CA, USA, 15\u00a0pages. Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In 3rd International Conference on Learning Representations, ICLR 2015. Computational and Biological Learning Society, San Diego, CA, USA, 15\u00a0pages."},{"key":"e_1_3_2_1_3_1","volume-title":"ISCA tutorial and research workshop (ITRW) on speech and emotion","author":"Batliner Anton","unstructured":"Anton Batliner , Kerstin Fischer , Richard Huber , J\u00f6rg Spilker , and Elmar N\u00f6th . 2000. Desperately seeking emotions or: Actors, wizards, and human beings . In ISCA tutorial and research workshop (ITRW) on speech and emotion . International Speech Communication Association , Newcastle, Northern Ireland, UK, 6\u00a0pages. Anton Batliner, Kerstin Fischer, Richard Huber, J\u00f6rg Spilker, and Elmar N\u00f6th. 2000. Desperately seeking emotions or: Actors, wizards, and human beings. In ISCA tutorial and research workshop (ITRW) on speech and emotion. International Speech Communication Association, Newcastle, Northern Ireland, UK, 6\u00a0pages."},{"key":"e_1_3_2_1_4_1","volume-title":"Proc. of a Satellite Workshop of LREC. European Language Resources Association, Marrakesh, Morocco, 28","author":"Batliner Anton","year":"2008","unstructured":"Anton Batliner , Stefan Steidl , and Elmar N\u00f6th . 2008 . Releasing a thoroughly annotated and processed spontaneous emotional database: the FAU Aibo Emotion Corpus . In Proc. of a Satellite Workshop of LREC. European Language Resources Association, Marrakesh, Morocco, 28 . Anton Batliner, Stefan Steidl, and Elmar N\u00f6th. 2008. Releasing a thoroughly annotated and processed spontaneous emotional database: the FAU Aibo Emotion Corpus. In Proc. of a Satellite Workshop of LREC. European Language Resources Association, Marrakesh, Morocco, 28."},{"key":"e_1_3_2_1_5_1","volume-title":"Ninth European Conference on Speech Communication and Technology. International Speech Communication Association","author":"Burkhardt Felix","year":"2005","unstructured":"Felix Burkhardt , Astrid Paeschke , Miriam Rolfes , Walter\u00a0 F Sendlmeier , and Benjamin Weiss . 2005 . A database of German emotional speech . In Ninth European Conference on Speech Communication and Technology. International Speech Communication Association , Lisbon, Portugal, 1517\u20131520. Felix Burkhardt, Astrid Paeschke, Miriam Rolfes, Walter\u00a0F Sendlmeier, and Benjamin Weiss. 2005. A database of German emotional speech. In Ninth European Conference on Speech Communication and Technology. International Speech Communication Association, Lisbon, Portugal, 1517\u20131520."},{"key":"e_1_3_2_1_6_1","volume-title":"IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation 42, 4","author":"Busso Carlos","year":"2008","unstructured":"Carlos Busso , Murtaza Bulut , Chi-Chun Lee , Abe Kazemzadeh , Emily Mower , Samuel Kim , Jeannette\u00a0 N Chang , Sungbok Lee , and Shrikanth\u00a0 S Narayanan . 2008 . IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation 42, 4 (2008), 335. Carlos Busso, Murtaza Bulut, Chi-Chun Lee, Abe Kazemzadeh, Emily Mower, Samuel Kim, Jeannette\u00a0N Chang, Sungbok Lee, and Shrikanth\u00a0S Narayanan. 2008. IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation 42, 4 (2008), 335."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2016.2515617"},{"key":"e_1_3_2_1_8_1","volume-title":"A Framework for Recognizing and Regulating Emotions in the Elderly","author":"Castillo Jos\u00e9\u00a0Carlos","unstructured":"Jos\u00e9\u00a0Carlos Castillo , Antonio Fern\u00e1ndez-Caballero , \u00c1lvaro Castro-Gonz\u00e1lez , Miguel\u00a0 A. Salichs , and Mar\u00eda\u00a0 T. L\u00f3pez . 2014. A Framework for Recognizing and Regulating Emotions in the Elderly . In Ambient Assisted Living and Daily Activities, Leandro Pecchia, Liming\u00a0Luke Chen, Chris Nugent, and Jos\u00e9 Bravo (Eds.). Springer International Publishing , Cham , 320\u2013327. Jos\u00e9\u00a0Carlos Castillo, Antonio Fern\u00e1ndez-Caballero, \u00c1lvaro Castro-Gonz\u00e1lez, Miguel\u00a0A. Salichs, and Mar\u00eda\u00a0T. L\u00f3pez. 2014. A Framework for Recognizing and Regulating Emotions in the Elderly. In Ambient Assisted Living and Daily Activities, Leandro Pecchia, Liming\u00a0Luke Chen, Chris Nugent, and Jos\u00e9 Bravo (Eds.). Springer International Publishing, Cham, 320\u2013327."},{"key":"e_1_3_2_1_9_1","volume-title":"ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion. International Speech Communication Association","author":"Cauldwell T","year":"2000","unstructured":"Richard\u00a0 T Cauldwell . 2000 . Where did the anger go? The role of context in interpreting emotion in speech . In ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion. International Speech Communication Association , Newcastle, Northern Ireland, UK, 5\u00a0pages. Richard\u00a0T Cauldwell. 2000. Where did the anger go? The role of context in interpreting emotion in speech. In ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion. International Speech Communication Association, Newcastle, Northern Ireland, UK, 5\u00a0pages."},{"key":"e_1_3_2_1_10_1","volume-title":"Data Augmentation Using GANs for Speech Emotion Recognition","author":"Chatziagapi Aggelina","unstructured":"Aggelina Chatziagapi , Georgios Paraskevopoulos , Dimitris Sgouropoulos , Georgios Pantazopoulos , Malvina Nikandrou , Theodoros Giannakopoulos , Athanasios Katsamanis , Alexandros Potamianos , and Shrikanth Narayanan . 2019. Data Augmentation Using GANs for Speech Emotion Recognition .. In INTERSPEECH. International Speech Communication Association , Graz, Austria , 171\u2013175. Aggelina Chatziagapi, Georgios Paraskevopoulos, Dimitris Sgouropoulos, Georgios Pantazopoulos, Malvina Nikandrou, Theodoros Giannakopoulos, Athanasios Katsamanis, Alexandros Potamianos, and Shrikanth Narayanan. 2019. Data Augmentation Using GANs for Speech Emotion Recognition.. In INTERSPEECH. International Speech Communication Association, Graz, Austria, 171\u2013175."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2018.2860246"},{"key":"e_1_3_2_1_12_1","volume-title":"International Conference on Language Resources and Evaluation (LREC","author":"Costantini Giovanni","year":"2014","unstructured":"Giovanni Costantini , Iacopo Iaderola , Andrea Paoloni , and Massimiliano Todisco . 2014 . EMOVO corpus: an Italian emotional speech database . In International Conference on Language Resources and Evaluation (LREC 2014). European Language Resources Association (ELRA), European Language Resources Association, Reykjavik, Iceland, 3501\u20133504. Giovanni Costantini, Iacopo Iaderola, Andrea Paoloni, and Massimiliano Todisco. 2014. EMOVO corpus: an Italian emotional speech database. In International Conference on Language Resources and Evaluation (LREC 2014). European Language Resources Association (ELRA), European Language Resources Association, Reykjavik, Iceland, 3501\u20133504."},{"key":"e_1_3_2_1_13_1","volume-title":"Emotional speech: Towards a new generation of databases. Speech communication 40, 1-2","author":"Douglas-Cowie Ellen","year":"2003","unstructured":"Ellen Douglas-Cowie , Nick Campbell , Roddy Cowie , and Peter Roach . 2003. Emotional speech: Towards a new generation of databases. Speech communication 40, 1-2 ( 2003 ), 33\u201360. Ellen Douglas-Cowie, Nick Campbell, Roddy Cowie, and Peter Roach. 2003. Emotional speech: Towards a new generation of databases. Speech communication 40, 1-2 (2003), 33\u201360."},{"key":"e_1_3_2_1_14_1","volume-title":"Ninth European conference on speech communication and technology. International Speech Communication Association","author":"Douglas-Cowie Ellen","year":"2005","unstructured":"Ellen Douglas-Cowie , Laurence Devillers , Jean-Claude Martin , Roddy Cowie , Suzie Savvidou , Sarkis Abrilian , and Cate Cox . 2005 . Multimodal databases of everyday emotion: Facing up to complexity . In Ninth European conference on speech communication and technology. International Speech Communication Association , Lisbon, Portugal, 4. Ellen Douglas-Cowie, Laurence Devillers, Jean-Claude Martin, Roddy Cowie, Suzie Savvidou, Sarkis Abrilian, and Cate Cox. 2005. Multimodal databases of everyday emotion: Facing up to complexity. In Ninth European conference on speech communication and technology. International Speech Communication Association, Lisbon, Portugal, 4."},{"key":"e_1_3_2_1_15_1","volume-title":"Ethnologue: Languages of the world","author":"Eberhard M","year":"2020","unstructured":"David\u00a0 M Eberhard , Gary\u00a0 F Simons , and Charles\u00a0 D Fennig . 2020 . Ethnologue: Languages of the world . 23 rd edn. Dallas . David\u00a0M Eberhard, Gary\u00a0F Simons, and Charles\u00a0D Fennig. 2020. Ethnologue: Languages of the world. 23rd edn. Dallas.","edition":"23"},{"key":"e_1_3_2_1_16_1","volume-title":"The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing","author":"Eyben Florian","year":"2015","unstructured":"Florian Eyben , Klaus\u00a0 R Scherer , Bj\u00f6rn\u00a0 W Schuller , Johan Sundberg , Elisabeth Andr\u00e9 , Carlos Busso , Laurence\u00a0 Y Devillers , Julien Epps , Petri Laukka , Shrikanth\u00a0 S Narayanan , 2015. The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing . IEEE transactions on affective computing 7, 2 ( 2015 ), 190\u2013202. Florian Eyben, Klaus\u00a0R Scherer, Bj\u00f6rn\u00a0W Schuller, Johan Sundberg, Elisabeth Andr\u00e9, Carlos Busso, Laurence\u00a0Y Devillers, Julien Epps, Petri Laukka, Shrikanth\u00a0S Narayanan, 2015. The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing. IEEE transactions on affective computing 7, 2 (2015), 190\u2013202."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2017.02.013"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICOMET.2018.8346370"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2008.4607572"},{"key":"e_1_3_2_1_20_1","volume-title":"Ordinal Learning for Emotion Recognition in Customer Service Calls. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE","author":"Han Wenjing","year":"2020","unstructured":"Wenjing Han , Tao Jiang , Yan Li , Bj\u00f6rn Schuller , and Huabin Ruan . 2020 . Ordinal Learning for Emotion Recognition in Customer Service Calls. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE , Barcelona, Spain, 6494\u20136498. Wenjing Han, Tao Jiang, Yan Li, Bj\u00f6rn Schuller, and Huabin Ruan. 2020. Ordinal Learning for Emotion Recognition in Customer Service Calls. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, Barcelona, Spain, 6494\u20136498."},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)","author":"Ali\u00a0Raza Haris Agha","year":"2018","unstructured":"Agha Ali\u00a0Raza Haris Bin\u00a0Zia and Awais Athar . 2018 . PronouncUR: An Urdu Pronunciation Lexicon Generator . In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (Miyazaki, Japan, 7-12). European Language Resources Association (ELRA), Paris, France, 5\u00a0pages. Agha Ali\u00a0Raza Haris Bin\u00a0Zia and Awais Athar. 2018. PronouncUR: An Urdu Pronunciation Lexicon Generator. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)(Miyazaki, Japan, 7-12). European Language Resources Association (ELRA), Paris, France, 5\u00a0pages."},{"key":"e_1_3_2_1_22_1","volume-title":"the Proceedings of Conference on Language Technology (CLT07)","author":"Ijaz Madiha","year":"2007","unstructured":"Madiha Ijaz and Sarmad Hussain . 2007 . Corpus based Urdu lexicon development . In the Proceedings of Conference on Language Technology (CLT07) , University of Peshawar, Pakistan, Vol.\u00a073. Academia, Pakistan, 12. Madiha Ijaz and Sarmad Hussain. 2007. Corpus based Urdu lexicon development. In the Proceedings of Conference on Language Technology (CLT07), University of Peshawar, Pakistan, Vol.\u00a073. Academia, Pakistan, 12."},{"key":"e_1_3_2_1_23_1","unstructured":"P Jackson and S Haq. 2014. Surrey audio-visual expressed emotion (savee) database.  P Jackson and S Haq. 2014. Surrey audio-visual expressed emotion (savee) database."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10919-015-0209-5"},{"key":"e_1_3_2_1_25_1","unstructured":"Hasan Kabir and Abdul\u00a0Mannan Saleem. 2002. Speech assessment methods phonetic alphabet (SAMPA): Analysis of Urdu. 6\u00a0pages.  Hasan Kabir and Abdul\u00a0Mannan Saleem. 2002. Speech assessment methods phonetic alphabet (SAMPA): Analysis of Urdu. 6\u00a0pages."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-69052-8_32"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/FIT.2018.00023"},{"key":"e_1_3_2_1_28_1","volume-title":"Sixth International Conference on Spoken Language Processing. International Speech Communication Association","author":"Li Aijun","year":"2000","unstructured":"Aijun Li , Fang Zheng , William Byrne , Pascale Fung , Terri Kamm , Yi Liu , Zhanjiang Song , Umar Ruhi , Veera Venkataramani , and Xiaoxia Chen . 2000 . CASS: A phonetically transcribed corpus of Mandarin spontaneous speech . In Sixth International Conference on Spoken Language Processing. International Speech Communication Association , Beijing, China, 485\u2013488. Aijun Li, Fang Zheng, William Byrne, Pascale Fung, Terri Kamm, Yi Liu, Zhanjiang Song, Umar Ruhi, Veera Venkataramani, and Xiaoxia Chen. 2000. CASS: A phonetically transcribed corpus of Mandarin spontaneous speech. In Sixth International Conference on Spoken Language Processing. International Speech Communication Association, Beijing, China, 485\u2013488."},{"key":"e_1_3_2_1_29_1","volume-title":"Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile","author":"Li Jeng-Lin","unstructured":"Jeng-Lin Li and Chi-Chun Lee . 2019. Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile .. In INTERSPEECH. International Speech Communication Association, Graz , Austria , 211\u2013215. Jeng-Lin Li and Chi-Chun Lee. 2019. Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile.. In INTERSPEECH. International Speech Communication Association, Graz, Austria, 211\u2013215."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12652-016-0406-z"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0196391"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.25080\/Majora-7b98e3ed-003"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASAR.2017.8067775"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2006.262725"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-018-9427-x"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1186\/1687-4722-2013-26"},{"key":"e_1_3_2_1_37_1","volume-title":"Analysis of Deep Learning Architectures for Cross-Corpus Speech Emotion Recognition","author":"Parry Jack","unstructured":"Jack Parry , Dimitri Palaz , Georgia Clarke , Pauline Lecomte , Rebecca Mead , Michael Berger , and Gregor Hofer . 2019. Analysis of Deep Learning Architectures for Cross-Corpus Speech Emotion Recognition .. In INTERSPEECH. International Speech Communication Association, Graz , Austria , 1656\u20131660. Jack Parry, Dimitri Palaz, Georgia Clarke, Pauline Lecomte, Rebecca Mead, Michael Berger, and Gregor Hofer. 2019. Analysis of Deep Learning Architectures for Cross-Corpus Speech Emotion Recognition.. In INTERSPEECH. International Speech Communication Association, Graz, Austria, 1656\u20131660."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSDA.2016.7918979"},{"key":"e_1_3_2_1_39_1","volume-title":"Design and development of phonetically rich Urdu speech corpus. In 2009 oriental COCOSDA international conference on speech database and assessments","author":"Raza Agha\u00a0Ali","unstructured":"Agha\u00a0Ali Raza , Sarmad Hussain , Huda Sarfraz , Inam Ullah , and Zahid Sarfraz . 2009. Design and development of phonetically rich Urdu speech corpus. In 2009 oriental COCOSDA international conference on speech database and assessments . IEEE , Urumqi, China , 38\u201343. Agha\u00a0Ali Raza, Sarmad Hussain, Huda Sarfraz, Inam Ullah, and Zahid Sarfraz. 2009. Design and development of phonetically rich Urdu speech corpus. In 2009 oriental COCOSDA international conference on speech database and assessments. IEEE, Urumqi, China, 38\u201343."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/FG.2013.6553805"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1016\/0092-6566(77)90037-X"},{"key":"e_1_3_2_1_42_1","volume-title":"VESUS: A Crowd-Annotated Database to Study Emotion Production and Perception in Spoken English.","author":"Sager Jacob","year":"2019","unstructured":"Jacob Sager , Ravi Shankar , Jacob Reinhold , and Archana Venkataraman . 2019 . VESUS: A Crowd-Annotated Database to Study Emotion Production and Perception in Spoken English. . In INTERSPEECH. International Speech Communication Association, Graz , Austria , 316\u2013320. Jacob Sager, Ravi Shankar, Jacob Reinhold, and Archana Venkataraman. 2019. VESUS: A Crowd-Annotated Database to Study Emotion Production and Perception in Spoken English.. In INTERSPEECH. International Speech Communication Association, Graz, Austria, 316\u2013320."},{"key":"e_1_3_2_1_43_1","volume-title":"Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. Speech communication 54, 4","author":"Sahidullah Md","year":"2012","unstructured":"Md Sahidullah and Goutam Saha . 2012. Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. Speech communication 54, 4 ( 2012 ), 543\u2013565. Md Sahidullah and Goutam Saha. 2012. Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. Speech communication 54, 4 (2012), 543\u2013565."},{"key":"e_1_3_2_1_44_1","volume-title":"Vocal communication of emotion: A review of research paradigms. Speech communication 40, 1-2","author":"Scherer R","year":"2003","unstructured":"Klaus\u00a0 R Scherer . 2003. Vocal communication of emotion: A review of research paradigms. Speech communication 40, 1-2 ( 2003 ), 227\u2013256. Klaus\u00a0R Scherer. 2003. Vocal communication of emotion: A review of research paradigms. Speech communication 40, 1-2 (2003), 227\u2013256."},{"key":"e_1_3_2_1_45_1","volume-title":"The interspeech 2016 computational paralinguistics challenge: Deception, sincerity & native language. In 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH","author":"Schuller Bj\u00f6rn","year":"2016","unstructured":"Bj\u00f6rn Schuller , Stefan Steidl , Anton Batliner , Julia Hirschberg , Judee\u00a0 K Burgoon , Alice Baird , Aaron Elkins , Yue Zhang , Eduardo Coutinho , Keelan Evanini , 2016. The interspeech 2016 computational paralinguistics challenge: Deception, sincerity & native language. In 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016 ), VOLS 1-5. International Speech Communication Association , San Francisco, CA, USA, 2001\u20132005. Bj\u00f6rn Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee\u00a0K Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini, 2016. The interspeech 2016 computational paralinguistics challenge: Deception, sincerity & native language. In 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5. International Speech Communication Association, San Francisco, CA, USA, 2001\u20132005."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376338"},{"key":"e_1_3_2_1_47_1","volume-title":"Fusion Techniques for Utterance-Level Emotion Recognition Combining Speech and Transcripts","author":"Sebastian Jilt","unstructured":"Jilt Sebastian and Piero Pierucci . 2019. Fusion Techniques for Utterance-Level Emotion Recognition Combining Speech and Transcripts .. In INTERSPEECH. International Speech Communication Association , Graz, Austria , 51\u201355. Jilt Sebastian and Piero Pierucci. 2019. Fusion Techniques for Utterance-Level Emotion Recognition Combining Speech and Transcripts.. In INTERSPEECH. International Speech Communication Association, Graz, Austria, 51\u201355."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.1915893"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-018-9491-z"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300302"},{"key":"e_1_3_2_1_51_1","volume-title":"Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition","author":"Xu Xinzhou","unstructured":"Xinzhou Xu , Jun Deng , Nicholas Cummins , Zixing Zhang , Li Zhao , and Bj\u00f6rn\u00a0 W Schuller . 2019. Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition .. In INTERSPEECH. International Speech Communication Association, Graz , Austria , 949\u2013953. Xinzhou Xu, Jun Deng, Nicholas Cummins, Zixing Zhang, Li Zhao, and Bj\u00f6rn\u00a0W Schuller. 2019. Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition.. In INTERSPEECH. International Speech Communication Association, Graz, Austria, 949\u2013953."},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2016.2553038"},{"key":"e_1_3_2_1_53_1","volume-title":"The Blizzard Challenge 2008 workshop. International Speech Communication Association","author":"Fangzhou Liu\u00a0Meng Zhang Jianhua Tao","year":"2008","unstructured":"Jianhua Tao Fangzhou Liu\u00a0Meng Zhang and Huibin Jia . 2008 . Design of speech corpus for mandarin text to speech . In The Blizzard Challenge 2008 workshop. International Speech Communication Association , Brisbane, Australia, 4. Jianhua Tao Fangzhou Liu\u00a0Meng Zhang and Huibin Jia. 2008. Design of speech corpus for mandarin text to speech. In The Blizzard Challenge 2008 workshop. International Speech Communication Association, Brisbane, Australia, 4."},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.bspc.2018.08.035"}],"event":{"name":"CHI '21: CHI Conference on Human Factors in Computing Systems","location":"Yokohama Japan","acronym":"CHI '21","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3411764.3445171","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3411764.3445171","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:28:42Z","timestamp":1750195722000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3411764.3445171"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,6]]},"references-count":54,"alternative-id":["10.1145\/3411764.3445171","10.1145\/3411764"],"URL":"https:\/\/doi.org\/10.1145\/3411764.3445171","relation":{},"subject":[],"published":{"date-parts":[[2021,5,6]]},"assertion":[{"value":"2021-05-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}