{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,29]],"date-time":"2026-05-29T11:18:45Z","timestamp":1780053525619,"version":"3.54.0"},"publisher-location":"New York, NY, USA","reference-count":79,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T00:00:00Z","timestamp":1636934400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,11,15]]},"DOI":"10.1145\/3485730.3485945","type":"proceedings-article","created":{"date-parts":[[2021,11,11]],"date-time":"2021-11-11T11:41:43Z","timestamp":1636630903000},"page":"97-110","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":82,"title":["Wavoice"],"prefix":"10.1145","author":[{"given":"Tiantian","family":"Liu","sequence":"first","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ming","family":"Gao","sequence":"additional","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Feng","family":"Lin","sequence":"additional","affiliation":[{"name":"Zhejiang University, Hangzhou, China and Key Laboratory of Blockchain and Cyberspace Governance of Zhejiang Province, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chao","family":"Wang","sequence":"additional","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhongjie","family":"Ba","sequence":"additional","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jinsong","family":"Han","sequence":"additional","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wenyao","family":"Xu","sequence":"additional","affiliation":[{"name":"University at Buffalo, the State University of New York, Buffalo, New York, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kui","family":"Ren","sequence":"additional","affiliation":[{"name":"Zhejiang University, Hangzhou, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2021,11,15]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Joon Son Chung, and Andrew Zisserman","author":"Afouras Triantafyllos","year":"2018","unstructured":"Triantafyllos Afouras , Joon Son Chung, and Andrew Zisserman . 2018 . The conversation: Deep audio-visual speech enhancement. arXiv preprint arXiv:1804.04121 (2018). Triantafyllos Afouras, Joon Son Chung, and Andrew Zisserman. 2018. The conversation: Deep audio-visual speech enhancement. arXiv preprint arXiv:1804.04121 (2018)."},{"key":"e_1_3_2_1_2_1","volume-title":"Improved Prosthetic Hand Control with Synchronous Use of Voice Recognition and Inertial Measurements. IOP Conference Series: Materials Science and Engineering 745 (2020","author":"Alkhafaf Omer Saad","year":"2088","unstructured":"Omer Saad Alkhafaf , Mousa K. Wali , and Ali H . Al-Timemy. 2020 . Improved Prosthetic Hand Control with Synchronous Use of Voice Recognition and Inertial Measurements. IOP Conference Series: Materials Science and Engineering 745 (2020 ), 01 2088 . Omer Saad Alkhafaf, Mousa K. Wali, and Ali H. Al-Timemy. 2020. Improved Prosthetic Hand Control with Synchronous Use of Voice Recognition and Inertial Measurements. IOP Conference Series: Materials Science and Engineering 745 (2020), 012088."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSEN.2020.3023243"},{"key":"e_1_3_2_1_4_1","unstructured":"Amazon.com. 2021. https:\/\/www.amazon.com\/echo\/ title = Amazon echo.  Amazon.com. 2021. https:\/\/www.amazon.com\/echo\/ title = Amazon echo."},{"key":"e_1_3_2_1_5_1","volume-title":"International conference on machine learning. PMLR, 173--182","author":"Amodei Dario","year":"2016","unstructured":"Dario Amodei , Sundaram Ananthanarayanan , Rishita Anubhai , Jingliang Bai , Eric Battenberg , Carl Case , Jared Casper , Bryan Catanzaro , Qiang Cheng , Guoliang Chen , 2016 . Deep speech 2: End-to-end speech recognition in english and mandarin . In International conference on machine learning. PMLR, 173--182 . Dario Amodei, Sundaram Ananthanarayanan, Rishita Anubhai, Jingliang Bai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Qiang Cheng, Guoliang Chen, et al. 2016. Deep speech 2: End-to-end speech recognition in english and mandarin. In International conference on machine learning. PMLR, 173--182."},{"key":"e_1_3_2_1_6_1","volume-title":"The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines. arXiv preprint arXiv:1803.10609","author":"Barker Jon","year":"2018","unstructured":"Jon Barker , Shinji Watanabe , Emmanuel Vincent , and Jan Trmal . 2018. The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines. arXiv preprint arXiv:1803.10609 ( 2018 ). Jon Barker, Shinji Watanabe, Emmanuel Vincent, and Jan Trmal. 2018. The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines. arXiv preprint arXiv:1803.10609 (2018)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2016.7472621"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.3390\/s20040956"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.3390\/s16010050"},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of International Conference on Learning Representations.","author":"Choi Hyeong-Seok","year":"2019","unstructured":"Hyeong-Seok Choi , Jang-Hyun Kim , Jaesung Huh , Adrian Kim , Jung-Woo Ha , and Kyogu Lee . 2019 . Phase-Aware Speech Enhancement with Deep Complex U-Net . In Proceedings of International Conference on Learning Representations. Hyeong-Seok Choi, Jang-Hyun Kim, Jaesung Huh, Adrian Kim, Jung-Woo Ha, and Kyogu Lee. 2019. Phase-Aware Speech Enhancement with Deep Complex U-Net. In Proceedings of International Conference on Learning Representations."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1155\/2012\/928591"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1117\/12.2558272"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2018.8462581"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3372224.3419210"},{"key":"e_1_3_2_1_15_1","unstructured":"Gmtd. 2021. GM-A906 microphone. [online].  Gmtd. 2021. GM-A906 microphone. [online]."},{"key":"e_1_3_2_1_16_1","unstructured":"Google. 2019. https:\/\/www.androidcentral.com\/how-does-googles-soli-chipwork title = Here's how the Pixel 4's Soli radar works and why Motion Sense has so much potential .  Google. 2019. https:\/\/www.androidcentral.com\/how-does-googles-soli-chipwork title = Here's how the Pixel 4's Soli radar works and why Motion Sense has so much potential ."},{"key":"e_1_3_2_1_17_1","unstructured":"Google. 2021. https:\/\/store.google.com\/product\/google_home\/ title = Google home.  Google. 2021. https:\/\/store.google.com\/product\/google_home\/ title = Google home."},{"key":"e_1_3_2_1_18_1","unstructured":"Google. 2021. ok-google.io. https:\/\/ok-google.io\/  Google. 2021. ok-google.io. https:\/\/ok-google.io\/"},{"key":"e_1_3_2_1_19_1","volume-title":"Journal of Physics: Conference Series","volume":"1817","author":"Nishu","year":"2016","unstructured":"Nishu Gupta et al. 2021. A Novel Voice Controlled Robotic Vehicle For Smart City Applications . In Journal of Physics: Conference Series , Vol. 1817 . IOP Publishing, 01 2016 . Nishu Gupta et al. 2021. A Novel Voice Controlled Robotic Vehicle For Smart City Applications. In Journal of Physics: Conference Series, Vol. 1817. IOP Publishing, 012016."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2015.7404843"},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing.","author":"Hirsch H. G.","unstructured":"H. G. Hirsch and C. Ehrlicher . 1995. Noise estimation techniques for robust speech recognition . In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. H. G. Hirsch and C. Ehrlicher. 1995. Noise estimation techniques for robust speech recognition. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.3390\/s16081181"},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of Advances in Neural Information Processing Systems.","author":"Hou Ruibing","year":"2019","unstructured":"Ruibing Hou , Hong Chang , Bingpeng Ma , Shiguang Shan , and Xilin Chen . 2019 . Cross Attention Network for Few-shot Classification . In Proceedings of Advances in Neural Information Processing Systems. Ruibing Hou, Hong Chang, Bingpeng Ma, Shiguang Shan, and Xilin Chen. 2019. Cross Attention Network for Few-shot Classification. In Proceedings of Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_1_24_1","volume-title":"Squeeze-and-Excitation Networks. In Proceedings of Conference on Computer Vision and Pattern Recognition.","author":"Hu Jie","year":"2018","unstructured":"Jie Hu , Li Shen , and Gang Sun . 2018 . Squeeze-and-Excitation Networks. In Proceedings of Conference on Computer Vision and Pattern Recognition. Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-Excitation Networks. In Proceedings of Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_3_2_1_25_1","unstructured":"Apple Inc. 2021. https:\/\/www.apple.com\/au\/siri\/ title = Siri - Apple.  Apple Inc. 2021. https:\/\/www.apple.com\/au\/siri\/ title = Siri - Apple."},{"key":"e_1_3_2_1_26_1","volume-title":"Amy Gimma, Kiesha Prem, Petra Klepac, G James Rubin, and W John Edmunds.","author":"Jarvis Christopher I","year":"2020","unstructured":"Christopher I Jarvis , Kevin Van Zandvoort , Amy Gimma, Kiesha Prem, Petra Klepac, G James Rubin, and W John Edmunds. 2020 . Quantifying the impact of physical distance measures on the transmission of COVID-19 in the UK. BMC medicine 18 (2020), 1--10. Christopher I Jarvis, Kevin Van Zandvoort, Amy Gimma, Kiesha Prem, Petra Klepac, G James Rubin, and W John Edmunds. 2020. Quantifying the impact of physical distance measures on the transmission of COVID-19 in the UK. BMC medicine 18 (2020), 1--10."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2007.896450"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1186\/s13634-016-0306-6"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/LWC.2019.2899571"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2019.03.008"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3384419.3430779"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1113"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.2528\/PIER12052207"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.2528\/PIERB08063001"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/SP40000.2020.00004"},{"key":"e_1_3_2_1_36_1","volume-title":"Speech enhancement: theory and practice","author":"Loizou Philipos C","unstructured":"Philipos C Loizou . 2013. Speech enhancement: theory and practice . CRC press . Philipos C Loizou. 2013. Speech enhancement: theory and practice. CRC press."},{"key":"e_1_3_2_1_37_1","first-page":"1080006","article-title":"Fusion of millimeter wave radar and RGB-depth sensors for assisted navigation of the visually impaired","volume":"10800","author":"Long Ningbo","year":"2018","unstructured":"Ningbo Long , Kaiwei Wang , Ruiqi Cheng , Kailun Yang , and Jian Bai . 2018 . Fusion of millimeter wave radar and RGB-depth sensors for assisted navigation of the visually impaired . In Proceedings of the Millimetre Wave and Terahertz Sensors and Technology XI , Vol. 10800. 1080006 . Ningbo Long, Kaiwei Wang, Ruiqi Cheng, Kailun Yang, and Jian Bai. 2018. Fusion of millimeter wave radar and RGB-depth sensors for assisted navigation of the visually impaired. In Proceedings of the Millimetre Wave and Terahertz Sensors and Technology XI, Vol. 10800. 1080006.","journal-title":"Proceedings of the Millimetre Wave and Terahertz Sensors and Technology XI"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3386901.3388945"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3384419.3430776"},{"key":"e_1_3_2_1_40_1","volume-title":"Proceedings of Advances in Neural Information Processing Systems.","author":"Lu Jiasen","year":"2016","unstructured":"Jiasen Lu , Jianwei Yang , Dhruv Batra , and Devi Parikh . 2016 . Hierarchical Question-Image Co-Attention for Visual Question Answering . In Proceedings of Advances in Neural Information Processing Systems. Jiasen Lu, Jianwei Yang, Dhruv Batra, and Devi Parikh. 2016. Hierarchical Question-Image Co-Attention for Visual Question Answering. In Proceedings of Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3025453.3025786"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvoice.2021.01.013"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8682061"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/1622176.1622181"},{"key":"e_1_3_2_1_45_1","volume-title":"Mitchell McLaren, Colleen Richey, Aaron Lawson, and Maria Alejandra Barrios.","author":"Nandwana Mahesh Kumar","year":"2019","unstructured":"Mahesh Kumar Nandwana , Julien Van Hout , Mitchell McLaren, Colleen Richey, Aaron Lawson, and Maria Alejandra Barrios. 2019 . The voices from a distance challenge 2019 evaluation plan. arXiv preprint arXiv:1902.10828 (2019). Mahesh Kumar Nandwana, Julien Van Hout, Mitchell McLaren, Colleen Richey, Aaron Lawson, and Maria Alejandra Barrios. 2019. The voices from a distance challenge 2019 evaluation plan. arXiv preprint arXiv:1902.10828 (2019)."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2019.2913512"},{"key":"e_1_3_2_1_47_1","volume-title":"SEGAN: Speech enhancement generative adversarial network. arXiv preprint arXiv:1703.09452","author":"Pascual Santiago","year":"2017","unstructured":"Santiago Pascual , Antonio Bonafonte , and Joan Serra . 2017 . SEGAN: Speech enhancement generative adversarial network. arXiv preprint arXiv:1703.09452 (2017). Santiago Pascual, Antonio Bonafonte, and Joan Serra. 2017. SEGAN: Speech enhancement generative adversarial network. arXiv preprint arXiv:1703.09452 (2017)."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.2307\/2981392"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-233"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8683535"},{"key":"e_1_3_2_1_51_1","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.","author":"Peihua Li Wangmeng Zuo Pengfei Zhu","year":"2020","unstructured":"Pengfei Zhu Peihua Li Wangmeng Zuo Qilong Wang , Banggu Wu and Qinghua Hu . 2020 . ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks . In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Pengfei Zhu Peihua Li Wangmeng Zuo Qilong Wang, Banggu Wu and Qinghua Hu. 2020. ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICR.1996.573784"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3191789.3191799"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1587\/transele.E100.C.790"},{"key":"e_1_3_2_1_55_1","volume-title":"https:\/\/www.chinadaily.com.cn\/a\/202003\/13\/WS5e6b3fcca31012821727ef88.html, urldate =","author":"AI.","year":"2020","unstructured":"Sound AI. 2020. https:\/\/www.chinadaily.com.cn\/a\/202003\/13\/WS5e6b3fcca31012821727ef88.html, urldate = March 13, 2020 , title = Voice-controlled Elevator System Put into Use in Beijing . SoundAI. 2020. https:\/\/www.chinadaily.com.cn\/a\/202003\/13\/WS5e6b3fcca31012821727ef88.html, urldate = March 13, 2020, title = Voice-controlled Elevator System Put into Use in Beijing."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3447993.3448626"},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2019-2822"},{"key":"e_1_3_2_1_58_1","unstructured":"Telsa. 2021. https:\/\/www.tesla.com\/ title = Model S\/3\/X\/Y.  Telsa. 2021. https:\/\/www.tesla.com\/ title = Model S\/3\/X\/Y."},{"key":"e_1_3_2_1_59_1","unstructured":"TI. 2021. DCA1000EVM. https:\/\/www.ti.com\/tool\/DCA1000EVM.  TI. 2021. DCA1000EVM. https:\/\/www.ti.com\/tool\/DCA1000EVM."},{"key":"e_1_3_2_1_60_1","unstructured":"TI. 2021. IWR1642. https:\/\/www.ti.com\/tool\/IWR1642BOOST.  TI. 2021. IWR1642. https:\/\/www.ti.com\/tool\/IWR1642BOOST."},{"key":"e_1_3_2_1_61_1","unstructured":"TI. 2021. mmWave Studio. https:\/\/www.ti.com\/tool\/MMWAVE-STUDIO.  TI. 2021. mmWave Studio. https:\/\/www.ti.com\/tool\/MMWAVE-STUDIO."},{"key":"e_1_3_2_1_62_1","volume-title":"Proceedings of the International Conference on Learning Representations.","author":"Trabelsi Chiheb","unstructured":"Chiheb Trabelsi , Olexa Bilaniuk , Ying Zhang , Dmitriy Serdyuk , Sandeep Subramanian , Jo\u00e3o Felipe Santos , Soroush Mehri , Negar Rostamzadeh , Yoshua Bengio , and Christopher J. Pal . 2018. Deep Complex Networks . In Proceedings of the International Conference on Learning Representations. Chiheb Trabelsi, Olexa Bilaniuk, Ying Zhang, Dmitriy Serdyuk, Sandeep Subramanian, Jo\u00e3o Felipe Santos, Soroush Mehri, Negar Rostamzadeh, Yoshua Bengio, and Christopher J. Pal. 2018. Deep Complex Networks. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_2_1_63_1","volume-title":"Proceedings of Advances in Neural Information Processing Systems.","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N. Gomez , Lukasz Kaiser , and Illia Polosukhin . 2017 . Attention is All you Need . In Proceedings of Advances in Neural Information Processing Systems. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In Proceedings of Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.3390\/sym11081018"},{"key":"e_1_3_2_1_65_1","volume-title":"We can hear you with Wi-Fi! IEEE Transactions on Mobile Computing 15, 11","author":"Wang Guanhua","year":"2016","unstructured":"Guanhua Wang , Yongpan Zou , Zimu Zhou , Kaishun Wu , and Lionel M Ni. 2016. We can hear you with Wi-Fi! IEEE Transactions on Mobile Computing 15, 11 ( 2016 ), 2907--2920. Guanhua Wang, Yongpan Zou, Zimu Zhou, Kaishun Wu, and Lionel M Ni. 2016. We can hear you with Wi-Fi! IEEE Transactions on Mobile Computing 15, 11 (2016), 2907--2920."},{"key":"e_1_3_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/MWC.001.1900409"},{"key":"e_1_3_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.3390\/su13063575"},{"key":"e_1_3_2_1_68_1","volume-title":"Speech commands: A dataset for limited-vocabulary speech recognition. arXiv preprint arXiv:1804.03209","author":"Warden Pete","year":"2018","unstructured":"Pete Warden . 2018. Speech commands: A dataset for limited-vocabulary speech recognition. arXiv preprint arXiv:1804.03209 ( 2018 ). Pete Warden. 2018. Speech commands: A dataset for limited-vocabulary speech recognition. arXiv preprint arXiv:1804.03209 (2018)."},{"key":"e_1_3_2_1_69_1","volume-title":"Chime-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings. arXiv preprint arXiv:2004.09249","author":"Watanabe Shinji","year":"2020","unstructured":"Shinji Watanabe , Michael Mandel , Jon Barker , and Emmanuel Vincent . 2020. Chime-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings. arXiv preprint arXiv:2004.09249 ( 2020 ). Shinji Watanabe, Michael Mandel, Jon Barker, and Emmanuel Vincent. 2020. Chime-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings. arXiv preprint arXiv:2004.09249 (2020)."},{"key":"e_1_3_2_1_70_1","volume-title":"https:\/\/www.canalys.com\/newsroom\/canalys-global-smart-speaker-market-2021-forecast, urldate =","author":"Canalys","year":"2020","unstructured":"Canalys website. 2020. https:\/\/www.canalys.com\/newsroom\/canalys-global-smart-speaker-market-2021-forecast, urldate = October 22, 2020 , title = Global Smart Speaker Market 2021 Forecast . Canalys website. 2020. https:\/\/www.canalys.com\/newsroom\/canalys-global-smart-speaker-market-2021-forecast, urldate = October 22, 2020, title = Global Smart Speaker Market 2021 Forecast."},{"key":"e_1_3_2_1_71_1","volume-title":"https:\/\/www.designnews.com\/design-hardware-software\/covid-19-giving-touchless-interfaces-chance-make-impression-0, urldate =","author":"Wiltz Chris","year":"2020","unstructured":"Chris Wiltz . 2020. https:\/\/www.designnews.com\/design-hardware-software\/covid-19-giving-touchless-interfaces-chance-make-impression-0, urldate = June 03, 2020 , title = COVID-19 Giving Touchless Interfaces a Chance to Make an Impression . Chris Wiltz. 2020. https:\/\/www.designnews.com\/design-hardware-software\/covid-19-giving-touchless-interfaces-chance-make-impression-0, urldate = June 03, 2020, title = COVID-19 Giving Touchless Interfaces a Chance to Make an Impression."},{"key":"e_1_3_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2005.858066"},{"key":"e_1_3_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/MWC.2017.1600374"},{"key":"e_1_3_2_1_74_1","unstructured":"Xiaomi. [n.d.]. 'Not science fiction': Xiaomi's revolutionary new wireless charging tech can charge your devices remotely. https:\/\/www.financialexpress.com\/industry\/technology\/not-science-fiction-xiaomis-revolutionary-new-wireless-charging-tech-can-charge-your-devices-remotely\/2181661\/.  Xiaomi. [n.d.]. 'Not science fiction': Xiaomi's revolutionary new wireless charging tech can charge your devices remotely. https:\/\/www.financialexpress.com\/industry\/technology\/not-science-fiction-xiaomis-revolutionary-new-wireless-charging-tech-can-charge-your-devices-remotely\/2181661\/."},{"key":"e_1_3_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307334.3326073"},{"key":"e_1_3_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6489"},{"key":"e_1_3_2_1_77_1","volume-title":"AUTOMATIC SPEECH RECOGNITION","author":"Yu Dong","unstructured":"Dong Yu and Li Deng . 2016. AUTOMATIC SPEECH RECOGNITION . Springer . Dong Yu and Li Deng. 2016. AUTOMATIC SPEECH RECOGNITION. Springer."},{"key":"e_1_3_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.1078"},{"key":"e_1_3_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2942382"}],"event":{"name":"SenSys '21: The 19th ACM Conference on Embedded Networked Sensor Systems","location":"Coimbra Portugal","acronym":"SenSys '21","sponsor":["SIGMETRICS ACM Special Interest Group on Measurement and Evaluation","SIGCOMM ACM Special Interest Group on Data Communication","SIGMOBILE ACM Special Interest Group on Mobility of Systems, Users, Data and Computing","SIGOPS ACM Special Interest Group on Operating Systems","SIGBED ACM Special Interest Group on Embedded Systems","SIGARCH ACM Special Interest Group on Computer Architecture"]},"container-title":["Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3485730.3485945","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3485730.3485945","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:12:10Z","timestamp":1750191130000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3485730.3485945"}},"subtitle":["A Noise-resistant Multi-modal Speech Recognition System Fusing mmWave and Audio Signals"],"short-title":[],"issued":{"date-parts":[[2021,11,15]]},"references-count":79,"alternative-id":["10.1145\/3485730.3485945","10.1145\/3485730"],"URL":"https:\/\/doi.org\/10.1145\/3485730.3485945","relation":{},"subject":[],"published":{"date-parts":[[2021,11,15]]},"assertion":[{"value":"2021-11-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}