{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:16:27Z","timestamp":1750220187916,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":60,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,10,5]],"date-time":"2021-10-05T00:00:00Z","timestamp":1633392000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,5]]},"DOI":"10.1145\/3489088.3489121","type":"proceedings-article","created":{"date-parts":[[2022,2,14]],"date-time":"2022-02-14T05:09:46Z","timestamp":1644815386000},"page":"136-140","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Front-end Based Robust Speech Recognition Methods: A Review"],"prefix":"10.1145","author":[{"given":"Vicky","family":"Zilvan","sequence":"first","affiliation":[{"name":"Research Center for Informatics, Indonesian Institute of Sciences, Indonesia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ana","family":"Heryana","sequence":"additional","affiliation":[{"name":"Research Center for Informatics, Indonesian Institute of Sciences, Indonesia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Asri Rizki","family":"Yuliani","sequence":"additional","affiliation":[{"name":"Research Center for Informatics, Indonesian Institute of Sciences, Indonesia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dikdik","family":"Krisnandi","sequence":"additional","affiliation":[{"name":"Research Center for Informatics, Indonesian Institute of Sciences, Indonesia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"R. Sandra","family":"Yuwana","sequence":"additional","affiliation":[{"name":"Research Center for Informatics, Indonesian Institute of Sciences, Indonesia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hilman F","family":"Pardede","sequence":"additional","affiliation":[{"name":"Research Center for Informatics, Indonesian Institute of Sciences, Indonesia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,2,13]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Nagaraj Adiga Yannis Pantazis Vassilis Tsiaras and Yannis Stylianou. 2019. Speech Enhancement for Noise-Robust Speech Synthesis Using Wasserstein GAN.. In INTERSPEECH. 1821\u20131825. Nagaraj Adiga Yannis Pantazis Vassilis Tsiaras and Yannis Stylianou. 2019. Speech Enhancement for Noise-Robust Speech Synthesis Using Wasserstein GAN.. In INTERSPEECH. 1821\u20131825.","DOI":"10.21437\/Interspeech.2019-2648"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSTSP.2019.2913965"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7177943"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2017.02.006"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.sigpro.2015.09.002"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/97.988717"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1980.1163420"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2003.820201"},{"key":"#cr-split#-e_1_3_2_1_9_1.1","doi-asserted-by":"crossref","unstructured":"Cong-Thanh Do Rama Doddipatla and Thomas Hain. 2021. Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). 6978-6982. https:\/\/doi.org\/10.1109\/ICASSP39728.2021.9414414 10.1109\/ICASSP39728.2021.9414414","DOI":"10.1109\/ICASSP39728.2021.9414414"},{"key":"#cr-split#-e_1_3_2_1_9_1.2","doi-asserted-by":"crossref","unstructured":"Cong-Thanh Do Rama Doddipatla and Thomas Hain. 2021. Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). 6978-6982. https:\/\/doi.org\/10.1109\/ICASSP39728.2021.9414414","DOI":"10.1109\/ICASSP39728.2021.9414414"},{"key":"e_1_3_2_1_10_1","volume-title":"Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 5024\u20135028","author":"Donahue Chris","year":"2018","unstructured":"Chris Donahue , Bo Li , and Rohit Prabhavalkar . 2018 . Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 5024\u20135028 . https:\/\/doi.org\/10.1109\/ICASSP.2018.8462581 10.1109\/ICASSP.2018.8462581 Chris Donahue, Bo Li, and Rohit Prabhavalkar. 2018. Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 5024\u20135028. https:\/\/doi.org\/10.1109\/ICASSP.2018.8462581"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2014-148"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1981.1163530"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Meng Ge Longbiao Wang Nan Li Hao Shi Jianwu Dang and Xiangang Li. 2019. Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement.. In INTERSPEECH. 3153\u20133157. Meng Ge Longbiao Wang Nan Li Hao Shi Jianwu Dang and Xiangang Li. 2019. Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement.. In INTERSPEECH. 3153\u20133157.","DOI":"10.21437\/Interspeech.2019-1477"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Kun Han Yanzhang He Deblin Bagchi Eric Fosler-Lussier and DeLiang Wang. 2015. Deep neural network based spectral feature mapping for robust speech recognition. In Sixteenth annual conference of the international speech communication association. Kun Han Yanzhang He Deblin Bagchi Eric Fosler-Lussier and DeLiang Wang. 2015. Deep neural network based spectral feature mapping for robust speech recognition. In Sixteenth annual conference of the international speech communication association.","DOI":"10.21437\/Interspeech.2015-536"},{"key":"e_1_3_2_1_15_1","first-page":"1","article-title":"A review of signal subspace speech enhancement and its application to noise robust speech recognition","volume":"2007","author":"Hermus Kris","year":"2006","unstructured":"Kris Hermus , Patrick Wambacq , and Hugo Van\u00a0Hamme . 2006 . A review of signal subspace speech enhancement and its application to noise robust speech recognition . EURASIP Journal on Advances in Signal Processing 2007 (2006), 1 \u2013 15 . Kris Hermus, Patrick Wambacq, and Hugo Van\u00a0Hamme. 2006. A review of signal subspace speech enhancement and its application to noise robust speech recognition. EURASIP Journal on Advances in Signal Processing 2007 (2006), 1\u201315.","journal-title":"EURASIP Journal on Advances in Signal Processing"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2015.7404841"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.2307\/1268779"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2017.03.001"},{"key":"e_1_3_2_1_19_1","volume-title":"A spectral masking approach to noise-robust speech recognition using deep neural networks","author":"Li Bo","year":"2014","unstructured":"Bo Li and Khe\u00a0Chai Sim . 2014. A spectral masking approach to noise-robust speech recognition using deep neural networks . IEEE\/ACM transactions on audio, speech, and language processing 22, 8( 2014 ), 1296\u20131305. Bo Li and Khe\u00a0Chai Sim. 2014. A spectral masking approach to noise-robust speech recognition using deep neural networks. IEEE\/ACM transactions on audio, speech, and language processing 22, 8(2014), 1296\u20131305."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2012.6288962"},{"key":"e_1_3_2_1_21_1","volume-title":"2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.\u00a01. IEEE, I\u201361","author":"Li Jin-Yu","year":"2004","unstructured":"Jin-Yu Li , Bo Liu , Ren-Hua Wang , and Li-Rong Dai . 2004 . A complexity reduction of ETSI advanced front-end for DSR . In 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.\u00a01. IEEE, I\u201361 . Jin-Yu Li, Bo Liu, Ren-Hua Wang, and Li-Rong Dai. 2004. A complexity reduction of ETSI advanced front-end for DSR. In 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.\u00a01. IEEE, I\u201361."},{"key":"e_1_3_2_1_22_1","volume-title":"Head-Synchronous Decoding for Transformer-Based Streaming ASR. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 5909\u20135913","author":"Li Mohan","year":"2021","unstructured":"Mohan Li , C\u0103t\u0103lin Zoril\u0103 , and Rama Doddipatla . 2021 . Head-Synchronous Decoding for Transformer-Based Streaming ASR. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 5909\u20135913 . https:\/\/doi.org\/10.1109\/ICASSP39728.2021.9414103 10.1109\/ICASSP39728.2021.9414103 Mohan Li, C\u0103t\u0103lin Zoril\u0103, and Rama Doddipatla. 2021. Head-Synchronous Decoding for Transformer-Based Streaming ASR. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 5909\u20135913. https:\/\/doi.org\/10.1109\/ICASSP39728.2021.9414103"},{"key":"e_1_3_2_1_23_1","volume-title":"Transformer-Based Online Speech Recognition with Decoder-end Adaptive Computation Steps. In 2021 IEEE Spoken Language Technology Workshop (SLT). 1\u20137. https:\/\/doi.org\/10","author":"Li Mohan","year":"2021","unstructured":"Mohan Li , C\u0103t\u0103lin Zoril\u0103 , and Rama Doddipatla . 2021 . Transformer-Based Online Speech Recognition with Decoder-end Adaptive Computation Steps. In 2021 IEEE Spoken Language Technology Workshop (SLT). 1\u20137. https:\/\/doi.org\/10 .1109\/SLT48900.2021.9383613 10.1109\/SLT48900.2021.9383613 Mohan Li, C\u0103t\u0103lin Zoril\u0103, and Rama Doddipatla. 2021. Transformer-Based Online Speech Recognition with Decoder-end Adaptive Computation Steps. In 2021 IEEE Spoken Language Technology Workshop (SLT). 1\u20137. https:\/\/doi.org\/10.1109\/SLT48900.2021.9383613"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2014.11.004"},{"key":"e_1_3_2_1_25_1","volume-title":"IEEE International Conference on, Vol.\u00a01. IEEE Computer Society, 265\u2013268","author":"Lockwood Philip","year":"1992","unstructured":"Philip Lockwood , J\u00e9r\u00f4me Boudy , and Marc Blanchet . 1992 . Non-linear spectral subtraction (NSS) and hidden Markov models for robust speech recognition in car noise environments. In Acoustics, Speech, and Signal Processing , IEEE International Conference on, Vol.\u00a01. IEEE Computer Society, 265\u2013268 . Philip Lockwood, J\u00e9r\u00f4me Boudy, and Marc Blanchet. 1992. Non-linear spectral subtraction (NSS) and hidden Markov models for robust speech recognition in car noise environments. In Acoustics, Speech, and Signal Processing, IEEE International Conference on, Vol.\u00a01. IEEE Computer Society, 265\u2013268."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2001.940811"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2002.1005724"},{"key":"e_1_3_2_1_28_1","volume-title":"The AMI meeting corpus. In Proceedings of the 5th international conference on methods and techniques in behavioral research, Vol.\u00a088","author":"McCowan Iain","year":"2005","unstructured":"Iain McCowan , Jean Carletta , Wessel Kraaij , Simone Ashby , S Bourban , M Flynn , M Guillemot , Thomas Hain , J Kadlec , Vasilis Karaiskos , 2005 . The AMI meeting corpus. In Proceedings of the 5th international conference on methods and techniques in behavioral research, Vol.\u00a088 . Citeseer, 100. Iain McCowan, Jean Carletta, Wessel Kraaij, Simone Ashby, S Bourban, M Flynn, M Guillemot, Thomas Hain, J Kadlec, Vasilis Karaiskos, 2005. The AMI meeting corpus. In Proceedings of the 5th international conference on methods and techniques in behavioral research, Vol.\u00a088. Citeseer, 100."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2012.6288824"},{"key":"e_1_3_2_1_30_1","volume-title":"Nips workshop on deep learning for speech recognition and related applications, Vol.\u00a01","author":"Dahl George","year":"2009","unstructured":"Abdel-rahman Mohamed, George Dahl , Geoffrey Hinton , 2009 . Deep belief networks for phone recognition . In Nips workshop on deep learning for speech recognition and related applications, Vol.\u00a01 . Vancouver, Canada, 39. Abdel-rahman Mohamed, George Dahl, Geoffrey Hinton, 2009. Deep belief networks for phone recognition. In Nips workshop on deep learning for speech recognition and related applications, Vol.\u00a01. Vancouver, Canada, 39."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2010.09.053"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639038"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2305833"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2896880"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2013.02.004"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2012-385"},{"key":"e_1_3_2_1_37_1","first-page":"87","article-title":"On the Effect of the Implementation of Human Auditory Systems on Q-Log-Based Features for Robustness of Speech Recognition Against Noise","volume":"35","author":"Pardede F","year":"2019","unstructured":"Hilman\u00a0 F Pardede , Asri\u00a0 R Yuliani , and Agus Subekti . 2019 . On the Effect of the Implementation of Human Auditory Systems on Q-Log-Based Features for Robustness of Speech Recognition Against Noise . J. Inf. Sci. Eng. 35 , 1 (2019), 87 \u2013 104 . Hilman\u00a0F Pardede, Asri\u00a0R Yuliani, and Agus Subekti. 2019. On the Effect of the Implementation of Human Auditory Systems on Q-Log-Based Features for Robustness of Speech Recognition Against Noise.J. Inf. Sci. Eng. 35, 1 (2019), 87\u2013104.","journal-title":"J. Inf. Sci. Eng."},{"key":"e_1_3_2_1_38_1","volume-title":"2004 12th European Signal Processing Conference. IEEE, 553\u2013556","author":"Parihar Naveen","year":"2004","unstructured":"Naveen Parihar , Joseph Picone , David Pearce , and Hans-G\u00fcnter Hirsch . 2004 . Performance analysis of the Aurora large vocabulary baseline system . In 2004 12th European Signal Processing Conference. IEEE, 553\u2013556 . Naveen Parihar, Joseph Picone, David Pearce, and Hans-G\u00fcnter Hirsch. 2004. Performance analysis of the Aurora large vocabulary baseline system. In 2004 12th European Signal Processing Conference. IEEE, 553\u2013556."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2013.6707722"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2016.2602884"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2016.7472808"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7953204"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639100"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2008.03.004"},{"volume-title":"Noise reduction in speech applications","author":"Singh Rita","key":"e_1_3_2_1_45_1","unstructured":"Rita Singh , Richard\u00a0 M Stern , and Bhiksha Raj . 2018. Signal and feature compensation methods for robust speech recognition . In Noise reduction in speech applications . CRC Press , 219\u2013244. Rita Singh, Richard\u00a0M Stern, and Bhiksha Raj. 2018. Signal and feature compensation methods for robust speech recognition. In Noise reduction in speech applications. CRC Press, 219\u2013244."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2006.09.003"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2005.1415143"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1109\/TASL.2010.2061226","article-title":"Extended VTS for noise-robust speech recognition","volume":"19","author":"van Dalen C","year":"2010","unstructured":"Rogier\u00a0 C van Dalen and Mark\u00a0 JF Gales . 2010 . Extended VTS for noise-robust speech recognition . IEEE Transactions on Audio, Speech, and Language Processing 19 , 4(2010), 733 \u2013 743 . Rogier\u00a0C van Dalen and Mark\u00a0JF Gales. 2010. Extended VTS for noise-robust speech recognition. IEEE Transactions on Audio, Speech, and Language Processing 19, 4(2010), 733\u2013743.","journal-title":"IEEE Transactions on Audio, Speech, and Language Processing"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1998.675369"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-6393(98)00033-8"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6637622"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2012.2198059"},{"key":"e_1_3_2_1_53_1","volume-title":"CAVBench: A Benchmark Suite for Connected and Autonomous Vehicles. In 2018 IEEE\/ACM Symposium on Edge Computing (SEC). 30\u201342","author":"Wang Yifan","year":"2018","unstructured":"Yifan Wang , Shaoshan Liu , Xiaopei Wu , and Weisong Shi . 2018 . CAVBench: A Benchmark Suite for Connected and Autonomous Vehicles. In 2018 IEEE\/ACM Symposium on Edge Computing (SEC). 30\u201342 . https:\/\/doi.org\/10.1109\/SEC.2018.00010 10.1109\/SEC.2018.00010 Yifan Wang, Shaoshan Liu, Xiaopei Wu, and Weisong Shi. 2018. CAVBench: A Benchmark Suite for Connected and Autonomous Vehicles. In 2018 IEEE\/ACM Symposium on Edge Computing (SEC). 30\u201342. https:\/\/doi.org\/10.1109\/SEC.2018.00010"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6854661"},{"key":"e_1_3_2_1_55_1","volume-title":"International conference on latent variable analysis and signal separation. Springer, 91\u201399","author":"Weninger Felix","year":"2015","unstructured":"Felix Weninger , Hakan Erdogan , Shinji Watanabe , Emmanuel Vincent , Jonathan Le\u00a0Roux , John\u00a0 R Hershey , and Bj\u00f6rn Schuller . 2015 . Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR . In International conference on latent variable analysis and signal separation. Springer, 91\u201399 . Felix Weninger, Hakan Erdogan, Shinji Watanabe, Emmanuel Vincent, Jonathan Le\u00a0Roux, John\u00a0R Hershey, and Bj\u00f6rn Schuller. 2015. Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR. In International conference on latent variable analysis and signal separation. Springer, 91\u201399."},{"key":"e_1_3_2_1_56_1","volume-title":"2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). 759\u2013763","author":"Yan Bi-Cheng","year":"2020","unstructured":"Bi-Cheng Yan , Meng-Che Wu , and Berlin Chen . 2020 . Exploring Feature Enhancement in The Modulation Spectrum Domain via Ideal Ratio Mask for Robust Speech Recognition . In 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). 759\u2013763 . Bi-Cheng Yan, Meng-Che Wu, and Berlin Chen. 2020. Exploring Feature Enhancement in The Modulation Spectrum Domain via Ideal Ratio Mask for Robust Speech Recognition. In 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). 759\u2013763."},{"key":"e_1_3_2_1_57_1","volume-title":"2008 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 4041\u20134044","author":"Yu Dong","year":"2008","unstructured":"Dong Yu , Li Deng , Jasha Droppo , Jian Wu , Yifan Gong , and Alex Acero . 2008 . A minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition . In 2008 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 4041\u20134044 . Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, and Alex Acero. 2008. A minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition. In 2008 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 4041\u20134044."},{"key":"e_1_3_2_1_58_1","article-title":"Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments","volume":"9","author":"Zhang Zixing","year":"2018","unstructured":"Zixing Zhang , J\u00fcrgen Geiger , Jouni Pohjalainen , Amr El-Desoky Mousa , Wenyu Jin , and Bj\u00f6rn Schuller . 2018 . Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments . ACM Trans. Intell. Syst. Technol. 9 , 5, Article 49 (April 2018), 28\u00a0pages. https:\/\/doi.org\/10.1145\/3178115 10.1145\/3178115 Zixing Zhang, J\u00fcrgen Geiger, Jouni Pohjalainen, Amr El-Desoky Mousa, Wenyu Jin, and Bj\u00f6rn Schuller. 2018. Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments. ACM Trans. Intell. Syst. Technol. 9, 5, Article 49 (April 2018), 28\u00a0pages. https:\/\/doi.org\/10.1145\/3178115","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2010.5495669"}],"event":{"name":"IC3INA 2021: The 2021 International Conference on Computer, Control, Informatics and Its Applications","acronym":"IC3INA 2021","location":"Virtual\/online conference Indonesia"},"container-title":["Proceedings of the 2021 International Conference on Computer, Control, Informatics and Its Applications"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3489088.3489121","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3489088.3489121","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:26Z","timestamp":1750186946000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3489088.3489121"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,5]]},"references-count":60,"alternative-id":["10.1145\/3489088.3489121","10.1145\/3489088"],"URL":"https:\/\/doi.org\/10.1145\/3489088.3489121","relation":{},"subject":[],"published":{"date-parts":[[2021,10,5]]},"assertion":[{"value":"2022-02-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}