{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T06:29:08Z","timestamp":1778048948678,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,14]],"date-time":"2021-08-14T00:00:00Z","timestamp":1628899200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,14]]},"DOI":"10.1145\/3447548.3467156","type":"proceedings-article","created":{"date-parts":[[2021,8,12]],"date-time":"2021-08-12T06:13:10Z","timestamp":1628748790000},"page":"3569-3575","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries"],"prefix":"10.1145","author":[{"given":"Sukhdeep S.","family":"Sodhi","sequence":"first","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Ellie Ka-In","family":"Chio","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Ambarish","family":"Jash","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Santiago","family":"Onta\u00f1\u00f3n","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Ajit","family":"Apte","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Ankit","family":"Kumar","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Ayooluwakunmi","family":"Jeje","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Dima","family":"Kuzmin","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Harry","family":"Fung","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Heng-Tze","family":"Cheng","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Jon","family":"Effrat","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Tarush","family":"Bali","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Nitin","family":"Jindal","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Pei","family":"Cao","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Sarvjeet","family":"Singh","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Senqiang","family":"Zhou","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Tameen","family":"Khan","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Amol","family":"Wankhede","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Moustafa","family":"Alzantot","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Allen","family":"Wu","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]},{"given":"Tushar","family":"Chandra","sequence":"additional","affiliation":[{"name":"Google Research, Mountain View, CA, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,8,14]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Olivier Deroo, Stephane Dupont, Teodora Erbes, Denis Jouvet, Luciano Fissore, Pietro Laface, Alfred Mertins, Christophe Ris, et al.","author":"Benzeghiba Mohamed","year":"2007","unstructured":"Mohamed Benzeghiba , Renato De Mori , Olivier Deroo, Stephane Dupont, Teodora Erbes, Denis Jouvet, Luciano Fissore, Pietro Laface, Alfred Mertins, Christophe Ris, et al. 2007 . Automatic speech recognition and speech variability: A review. Speech communication, Vol. 49 , 10--11 (2007), 763--786. Mohamed Benzeghiba, Renato De Mori, Olivier Deroo, Stephane Dupont, Teodora Erbes, Denis Jouvet, Luciano Fissore, Pietro Laface, Alfred Mertins, Christophe Ris, et al. 2007. Automatic speech recognition and speech variability: A review. Speech communication, Vol. 49, 10--11 (2007), 763--786."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639104"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-39593-2_7"},{"key":"e_1_3_2_1_4_1","volume-title":"BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2018.03.005"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2009.10.001"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8683745"},{"key":"e_1_3_2_1_8_1","volume-title":"ContextNet: Improving convolutional neural networks for automatic speech recognition with global context. arXiv preprint arXiv:2005.03191","author":"Han Wei","year":"2020","unstructured":"Wei Han , Zhengdong Zhang , Yu Zhang , Jiahui Yu , Chung-Cheng Chiu , James Qin , Anmol Gulati , Ruoming Pang , and Yonghui Wu. 2020. ContextNet: Improving convolutional neural networks for automatic speech recognition with global context. arXiv preprint arXiv:2005.03191 ( 2020 ). Wei Han, Zhengdong Zhang, Yu Zhang, Jiahui Yu, Chung-Cheng Chiu, James Qin, Anmol Gulati, Ruoming Pang, and Yonghui Wu. 2020. ContextNet: Improving convolutional neural networks for automatic speech recognition with global context. arXiv preprint arXiv:2005.03191 (2020)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2004.01.006"},{"key":"e_1_3_2_1_10_1","volume-title":"ICASSP 2020--2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Hrinchuk Oleksii","unstructured":"Oleksii Hrinchuk , Mariya Popova , and Boris Ginsburg . 2020. Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model . In ICASSP 2020--2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . IEEE , 7074--7078. Oleksii Hrinchuk, Mariya Popova, and Boris Ginsburg. 2020. Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model. In ICASSP 2020--2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 7074--7078."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2015-711"},{"key":"e_1_3_2_1_12_1","volume-title":"Error-tolerant Language Understanding for Spoken Dialogue Systems. In Sixth International Conference on Spoken Language Processing .","author":"Lin Yi-Chung","year":"2000","unstructured":"Yi-Chung Lin and Huei-Ming Wang . 2000 . Error-tolerant Language Understanding for Spoken Dialogue Systems. In Sixth International Conference on Spoken Language Processing . Yi-Chung Lin and Huei-Ming Wang. 2000. Error-tolerant Language Understanding for Spoken Dialogue Systems. In Sixth International Conference on Spoken Language Processing ."},{"key":"e_1_3_2_1_13_1","volume-title":"Characterizing and predicting corrections in spoken dialogue systems. Computational linguistics","author":"Litman Diane","year":"2006","unstructured":"Diane Litman , Julia Hirschberg , and Marc Swerts . 2006. Characterizing and predicting corrections in spoken dialogue systems. Computational linguistics , Vol. 32 , 3 ( 2006 ), 417--438. Diane Litman, Julia Hirschberg, and Marc Swerts. 2006. Characterizing and predicting corrections in spoken dialogue systems. Computational linguistics, Vol. 32, 3 (2006), 417--438."},{"key":"e_1_3_2_1_14_1","volume-title":"ASR Error Correction and Domain Adaptation Using Machine Translation. In ICASSP 2020--2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 6344--6348","author":"Mani Anirudh","year":"2020","unstructured":"Anirudh Mani , Shruti Palaskar , Nimshi Venkat Meripo , Sandeep Konam , and Florian Metze . 2020 . ASR Error Correction and Domain Adaptation Using Machine Translation. In ICASSP 2020--2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 6344--6348 . Anirudh Mani, Shruti Palaskar, Nimshi Venkat Meripo, Sandeep Konam, and Florian Metze. 2020. ASR Error Correction and Domain Adaptation Using Machine Translation. In ICASSP 2020--2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 6344--6348."},{"key":"e_1_3_2_1_15_1","volume-title":"Transformers with convolutional context for ASR. arXiv preprint arXiv:1904.11660","author":"Mohamed Abdelrahman","year":"2019","unstructured":"Abdelrahman Mohamed , Dmytro Okhonko , and Luke Zettlemoyer . 2019. Transformers with convolutional context for ASR. arXiv preprint arXiv:1904.11660 ( 2019 ). Abdelrahman Mohamed, Dmytro Okhonko, and Luke Zettlemoyer. 2019. Transformers with convolutional context for ASR. arXiv preprint arXiv:1904.11660 (2019)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173574.3173580"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178964"},{"key":"e_1_3_2_1_18_1","volume-title":"Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 311--318","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni , Salim Roukos , Todd Ward , and Wei-Jing Zhu . 2002 . BLEU: a method for automatic evaluation of machine translation . In Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 311--318 . Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 311--318."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i08.7022"},{"key":"e_1_3_2_1_20_1","volume-title":"Percent Change Estimation in Large Scale Online Experiments. arXiv preprint arXiv:1711.00562","author":"Soriano Jacopo","year":"2017","unstructured":"Jacopo Soriano . 2017. Percent Change Estimation in Large Scale Online Experiments. arXiv preprint arXiv:1711.00562 ( 2017 ). Jacopo Soriano. 2017. Percent Change Estimation in Large Scale Online Experiments. arXiv preprint arXiv:1711.00562 (2017)."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6854012"},{"key":"e_1_3_2_1_22_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008.  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3178115","article-title":"Deep learning for environmentally robust speech recognition: An overview of recent developments","volume":"9","author":"Zhang Zixing","year":"2018","unstructured":"Zixing Zhang , J\u00fcrgen Geiger , Jouni Pohjalainen , Amr El-Desoky Mousa , Wenyu Jin , and Bj\u00f6rn Schuller . 2018 . Deep learning for environmentally robust speech recognition: An overview of recent developments . ACM Transactions on Intelligent Systems and Technology (TIST) , Vol. 9 , 5 (2018), 1 -- 28 . Zixing Zhang, J\u00fcrgen Geiger, Jouni Pohjalainen, Amr El-Desoky Mousa, Wenyu Jin, and Bj\u00f6rn Schuller. 2018. Deep learning for environmentally robust speech recognition: An overview of recent developments. ACM Transactions on Intelligent Systems and Technology (TIST), Vol. 9, 5 (2018), 1--28.","journal-title":"ACM Transactions on Intelligent Systems and Technology (TIST)"}],"event":{"name":"KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Virtual Event Singapore","acronym":"KDD '21","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery &amp; Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447548.3467156","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3447548.3467156","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:18:27Z","timestamp":1750191507000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447548.3467156"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,14]]},"references-count":23,"alternative-id":["10.1145\/3447548.3467156","10.1145\/3447548"],"URL":"https:\/\/doi.org\/10.1145\/3447548.3467156","relation":{},"subject":[],"published":{"date-parts":[[2021,8,14]]},"assertion":[{"value":"2021-08-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}