{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T18:11:17Z","timestamp":1772907077810,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":64,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,5,30]],"date-time":"2022-05-30T00:00:00Z","timestamp":1653868800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["CNS-1950171; CCF-2007159"],"award-info":[{"award-number":["CNS-1950171; CCF-2007159"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000055","name":"National Institute on Deafness and Other Communication Disorders","doi-asserted-by":"publisher","award":["R01DC012315"],"award-info":[{"award-number":["R01DC012315"]}],"id":[{"id":"10.13039\/100000055","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,5,30]]},"DOI":"10.1145\/3488932.3517420","type":"proceedings-article","created":{"date-parts":[[2022,5,24]],"date-time":"2022-05-24T04:23:26Z","timestamp":1653366206000},"page":"1019-1033","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["SUPERVOICE"],"prefix":"10.1145","author":[{"given":"Hanqing","family":"Guo","sequence":"first","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qiben","family":"Yan","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, 
USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nikolay","family":"Ivanov","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ying","family":"Zhu","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Li","family":"Xiao","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eric J.","family":"Hunter","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,5,30]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"1993. TIMIT Acoustic-Phonetic Continuous Speech Corpus. https:\/\/catalog.ldc.upenn.edu\/LDC93S1. Accessed: 2020-05-04."},{"key":"e_1_3_2_2_2_1","unstructured":"2020. FFmpeg. https:\/\/www.ffmpeg.org\/. Accessed: 2021-11-02."},{"key":"e_1_3_2_2_3_1","volume-title":"Iljoo Kim, Taekkyung Oh, and Hyoungshick Kim.","author":"Ahmed Muhammad Ejaz","year":"2020","unstructured":"Muhammad Ejaz Ahmed, Il-Youp Kwak, Jun Ho Huh, Iljoo Kim, Taekkyung Oh, and Hyoungshick Kim. 2020. Void: A fast and light voice liveness detection system. In USENIX Security."},{"key":"e_1_3_2_2_4_1","volume-title":"2012 Proceedings of the 20th european signal processing conference (EUSIPCO). 
IEEE, 36--40","author":"Alegre Federico","year":"2012","unstructured":"Federico Alegre, Ravichander Vipperla, Nicholas Evans, and Beno\u00eft Fauve. 2012. On the vulnerability of automatic speaker recognition to spoofing attacks with artificial signals. In 2012 Proceedings of the 20th european signal processing conference (EUSIPCO). IEEE, 36--40."},{"key":"e_1_3_2_2_5_1","unstructured":"Avisoft. [n.d.] a. http:\/\/www.avisoft.com\/ultrasound-microphones\/cm16-cmpa\/."},{"key":"e_1_3_2_2_6_1","unstructured":"Avisoft. [n.d.] b. http:\/\/www.avisoft.com\/playback\/vifa\/."},{"key":"e_1_3_2_2_7_1","unstructured":"Bose. [n.d.]. https:\/\/www.bose.com\/en_us\/support\/products\/bose_speakers_support\/bose_smarthome_speakers_support\/soundtouch-10-wireless-system.html."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2006.870086"},{"key":"e_1_3_2_2_9_1","volume-title":"Sixteenth Annual Conference of the International Speech Communication Association.","author":"Lopez-Moreno Ignacio","year":"2015","unstructured":"
Yu-hsin Chen, Ignacio Lopez-Moreno, Tara N Sainath, Mirk\u00f3 Visontai, Raziel Alvarez, and Carolina Parada. 2015. Locally-connected and convolutional neural networks for small footprint speaker recognition. In Sixteenth Annual Conference of the International Speech Communication Association."},{"key":"e_1_3_2_2_10_1","unstructured":"Noam Chomsky and Morris Halle. 1968. The sound pattern of English. (1968)."},{"key":"e_1_3_2_2_11_1","volume-title":"Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals","author":"Chowdhury Anurag","year":"2019","unstructured":"Anurag Chowdhury and Arun Ross. 2019. Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals. IEEE Transactions on Information Forensics and Security (2019)."},{"key":"e_1_3_2_2_12_1","volume-title":"Voxceleb2: Deep speaker recognition. arXiv preprint arXiv:1806.05622","author":"Chung Joon Son","year":"2018","unstructured":"Joon Son Chung, Arsha Nagrani, and Andrew Zisserman. 2018. Voxceleb2: Deep speaker recognition. 
arXiv preprint arXiv:1806.05622 (2018)."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2010.2064307"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3117811.3117823"},{"key":"e_1_3_2_2_15_1","first-page":"191","article-title":"Comparative evaluation of various MFCC implementations on the speaker verification task","volume":"1","author":"Ganchev Todor","year":"2005","unstructured":"Todor Ganchev, Nikos Fakotakis, and George Kokkinakis. 2005. Comparative evaluation of various MFCC implementations on the speaker verification task. In Proceedings of the SPECOM, Vol. 1. 191--194.","journal-title":"Proceedings of the SPECOM"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.6028\/NIST.IR.4930"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1994.389336"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1995.479538"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2016.7472652"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178847"},{"key":"e_1_3_2_2_22_1","unstructured":"HOTENDA. 2013. https:\/\/www.hotenda.com\/datasheet-html\/2493\/1\/SPU0410LR5H-QB.html."},{"key":"e_1_3_2_2_23_1","unstructured":"Roman Jakobson C Gunnar Fant and Morris Halle. 1951. Preliminaries to speech analysis: The distinctive features and their correlates. 
(1951)."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1002\/sec.1499"},{"key":"e_1_3_2_2_25_1","volume-title":"Digital Coding of Waveforms, Principles and Applications to Speech and Video","author":"Jayant N. S.","year":"1984","unstructured":"N. S. Jayant and Peter Noll. 1984. Digital Coding of Waveforms, Principles and Applications to Speech and Video. Prentice-Hall, Englewood Cliffs NJ, USA, 688. N. S. Jayant: Bell Laboratories; ISBN 0-13-211913-7."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.1288413"},{"key":"e_1_3_2_2_27_1","volume-title":"Montreal,(Report) CRIM-06\/08-13","author":"Kenny Patrick","year":"2005","unstructured":"Patrick Kenny. 2005. Joint factor analysis of speaker and session variability: Theory and algorithms. CRIM, Montreal,(Report) CRIM-06\/08-13, Vol. 14 (2005), 28--29."},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639151"},{"key":"e_1_3_2_2_29_1","volume-title":"An overview of text-independent speaker recognition: From features to supervectors. Speech communication","author":"Kinnunen Tomi","year":"2010","unstructured":"Tomi Kinnunen and Haizhou Li. 2010. An overview of text-independent speaker recognition: From features to supervectors. Speech communication, Vol. 
52, 1 (2010), 12--40."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"crossref","unstructured":"Tomi Kinnunen Md Sahidullah H\u00e9ctor Delgado Massimiliano Todisco Nicholas Evans Junichi Yamagishi and Kong Aik Lee. 2017. The ASVspoof 2017 challenge: Assessing the limits of replay spoofing attack detection. (2017).","DOI":"10.21437\/Interspeech.2017-1111"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"crossref","unstructured":"Galina Lavrentyeva Sergey Novoselov Egor Malykh Alexander Kozlov Oleg Kudashev and Vadim Shchemelinin. 2017. Audio Replay Attack Detection with Deep Learning Frameworks.. In Interspeech. 82--86.","DOI":"10.21437\/Interspeech.2017-360"},{"key":"e_1_3_2_2_32_1","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"van der Maaten Laurens","year":"2008","unstructured":"Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research, Vol. 9, Nov (2008), 2579--2605.","journal-title":"Journal of machine learning research"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3209582.3209591"},{"key":"e_1_3_2_2_34_1","volume-title":"The perceptual significance of high-frequency energy in the human voice. Frontiers in psychology","author":"Monson Brian B","year":"2014","unstructured":"Brian B Monson, Eric J Hunter, Andrew J Lotto, and Brad H Story. 2014. 
The perceptual significance of high-frequency energy in the human voice. Frontiers in psychology, Vol. 5 (2014), 587."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2019.101027"},{"key":"e_1_3_2_2_36_1","volume-title":"Joon Son Chung, and Andrew Zisserman","author":"Nagrani Arsha","year":"2017","unstructured":"Arsha Nagrani, Joon Son Chung, and Andrew Zisserman. 2017. Voxceleb: a large-scale speaker identification dataset. arXiv preprint arXiv:1706.08612 (2017)."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"crossref","unstructured":"Mahesh Kumar Nandwana Julien van Hout Mitchell McLaren Allen R Stauffer Colleen Richey Aaron Lawson and Martin Graciarena. 2018. Robust Speaker Recognition from Distant Speech under Real Reverberant Environments Using Speaker Embeddings.. In Interspeech. 1106--1110.","DOI":"10.21437\/Interspeech.2018-2221"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/SLT.2018.8639585"},{"key":"e_1_3_2_2_39_1","volume-title":"Speaker verification using adapted Gaussian mixture models. Digital signal processing","author":"Reynolds Douglas A","year":"2000","unstructured":"
Douglas A Reynolds, Thomas F Quatieri, and Robert B Dunn. 2000. Speaker verification using adapted Gaussian mixture models. Digital signal processing, Vol. 10, 1--3 (2000), 19--41."},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3081333.3081366"},{"key":"e_1_3_2_2_41_1","unstructured":"Sada. [n.d.]. https:\/\/www.aliexpress.com\/item\/4001241222763.html."},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2015-1"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.1910954"},{"key":"e_1_3_2_2_44_1","unstructured":"Stephen Shum Najim Dehak Reda Dehak and James R Glass. 2010. Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification.. In Odyssey. 16."},{"key":"e_1_3_2_2_45_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_3_2_2_46_1","unstructured":"SiSonic. [n.d.]. https:\/\/www.digikey.com\/product-detail\/en\/knowles\/SPU0410LR5H-QB-7\/423-1139-1-ND\/2420983."},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"crossref","unstructured":"David Snyder Daniel Garcia-Romero Daniel Povey and Sanjeev Khudanpur. 2017. 
Deep Neural Network Embeddings for Text-Independent Speaker Verification.. In Interspeech. 999--1003.","DOI":"10.21437\/Interspeech.2017-620"},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1177\/00238309010440010301"},{"key":"e_1_3_2_2_50_1","article-title":"Hey siri: An on-device dnn-powered voice trigger for apple's personal assistant","volume":"1","author":"Team Siri","year":"2017","unstructured":"Siri Team. 2017. Hey siri: An on-device dnn-powered voice trigger for apple's personal assistant. Apple Machine Learning Journal, Vol. 1, 6 (2017).","journal-title":"Apple Machine Learning Journal"},{"key":"e_1_3_2_2_51_1","unstructured":"Francis Tom Mohit Jain and Prasenjit Dey. 2018. End-To-End Audio Replay Attack Detection Using Deep Convolutional Networks with Attention.. In Interspeech. 681--685."},{"key":"e_1_3_2_2_52_1","unstructured":"usbank. 2020. How voice-activated devices work with banks. 
https:\/\/www.usbank.com\/financialiq\/manage-your-household\/personal-finance\/how-voice-activated-devices-work-with-banks.html."},{"key":"e_1_3_2_2_53_1","volume-title":"Ignacio Lopez Moreno, and Javier Gonzalez-Dominguez","author":"Variani Ehsan","year":"2014","unstructured":"Ehsan Variani, Xin Lei, Erik McDermott, Ignacio Lopez Moreno, and Javier Gonzalez-Dominguez. 2014. Deep neural networks for small footprint text-dependent speaker verification. In 2014 IEEE ICASSP. IEEE, 4052--4056."},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCST.2011.6095943"},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2018.8462665"},{"key":"e_1_3_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2004.841042"},{"key":"e_1_3_2_2_57_1","unstructured":"WILDLIFE. [n.d.]. https:\/\/www.wildlifeacoustics.com\/products\/echo-meter-touch-2-pro-ios."},{"key":"e_1_3_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2014.10.005"},{"key":"e_1_3_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3319535.3354248"},{"key":"e_1_3_2_2_60_1","volume-title":"SurfingAttack: Interactive Hidden Attack on Voice Assistants Using Ultrasonic Guided Wave. In Network and Distributed Systems Security (NDSS) Symposium.","author":"Yan Qiben","year":"2020","unstructured":"Qiben Yan, Kehai Liu, Qin Zhou, Hanqing Guo, and Ning Zhang. 2020. 
SurfingAttack: Interactive Hidden Attack on Voice Assistants Using Ultrasonic Guided Wave. In Network and Distributed Systems Security (NDSS) Symposium."},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"e_1_3_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/3133956.3134052"},{"key":"e_1_3_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/3133956.3133962"},{"key":"e_1_3_2_2_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/2976749.2978296"}],"event":{"name":"ASIA CCS '22: ACM Asia Conference on Computer and Communications Security","location":"Nagasaki Japan","acronym":"ASIA CCS '22","sponsor":["SIGSAC ACM Special Interest Group on Security, Audit, and Control"]},"container-title":["Proceedings of the 2022 ACM on Asia Conference on Computer and Communications Security"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3488932.3517420","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/abs\/10.1145\/3488932.3517420","content-type":"text\/html","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3488932.3517420","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3488932.3517420","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:31:27Z","timestamp":1750188687000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3488932.3517420"}},"subtitle":["Text-Independent Speaker Verification Using Ultrasound Energy in Human 
Speech"],"short-title":[],"issued":{"date-parts":[[2022,5,30]]},"references-count":64,"alternative-id":["10.1145\/3488932.3517420","10.1145\/3488932"],"URL":"https:\/\/doi.org\/10.1145\/3488932.3517420","relation":{},"subject":[],"published":{"date-parts":[[2022,5,30]]},"assertion":[{"value":"2022-05-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}