{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T04:51:48Z","timestamp":1776315108102,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":32,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,10,18]],"date-time":"2021-10-18T00:00:00Z","timestamp":1634515200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,18]]},"DOI":"10.1145\/3461615.3491109","type":"proceedings-article","created":{"date-parts":[[2021,12,18]],"date-time":"2021-12-18T04:57:40Z","timestamp":1639803460000},"page":"91-96","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Efficient Gradient-Based Neural Architecture Search For End-to-End ASR"],"prefix":"10.1145","author":[{"given":"Xian","family":"Shi","sequence":"first","affiliation":[{"name":"School of Computer Scienc, Northwestern Polytechnical University, China"}]},{"given":"Pan","family":"Zhou","sequence":"additional","affiliation":[{"name":"AI Interaction Division, Sogou Inc., China"}]},{"given":"Wei","family":"Chen","sequence":"additional","affiliation":[{"name":"AI Interaction Division, Sogou Inc., China"}]},{"given":"Lei","family":"Xie","sequence":"additional","affiliation":[{"name":"School of Computer Scienc, Northwestern Polytechnical University, China"}]}],"member":"320","published-online":{"date-parts":[[2021,12,17]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Bowen Baker Otkrist Gupta Nikhil Naik and Ramesh Raskar. 2016. Designing neural network architectures using reinforcement learning. arXiv preprint arXiv:1611.02167(2016).  Bowen Baker Otkrist Gupta Nikhil Naik and Ramesh Raskar. 2016. Designing neural network architectures using reinforcement learning. arXiv preprint arXiv:1611.02167(2016)."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSDA.2017.8384449"},{"key":"e_1_3_2_1_3_1","volume-title":"Proxylessnas: Direct neural architecture search on target task and hardware. arXiv preprint arXiv:1812.00332(2018).","author":"Cai Han","year":"2018","unstructured":"Han Cai , Ligeng Zhu , and Song Han . 2018 . Proxylessnas: Direct neural architecture search on target task and hardware. arXiv preprint arXiv:1812.00332(2018). Han Cai, Ligeng Zhu, and Song Han. 2018. Proxylessnas: Direct neural architecture search on target task and hardware. arXiv preprint arXiv:1812.00332(2018)."},{"key":"e_1_3_2_1_4_1","volume-title":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Chan W.","unstructured":"W. Chan , N. Jaitly , Q. Le , and O. Vinyals . 2016. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition . In 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). W. Chan, N. Jaitly, Q. Le, and O. Vinyals. 2016. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition. In 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)."},{"key":"e_1_3_2_1_5_1","first-page":"1","article-title":"Neural architecture search: A survey.J","volume":"20","author":"Elsken Thomas","year":"2019","unstructured":"Thomas Elsken , Jan\u00a0Hendrik Metzen , Frank Hutter , 2019 . Neural architecture search: A survey.J . Mach. Learn. Res. 20 , 55 (2019), 1 \u2013 21 . Thomas Elsken, Jan\u00a0Hendrik Metzen, Frank Hutter, 2019. Neural architecture search: A survey.J. Mach. Learn. Res. 20, 55 (2019), 1\u201321.","journal-title":"Mach. Learn. Res."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143891"},{"key":"e_1_3_2_1_7_1","volume-title":"Conformer: Convolution-augmented transformer for speech recognition. arXiv preprint arXiv:2005.08100(2020).","author":"Gulati Anmol","year":"2020","unstructured":"Anmol Gulati , James Qin , Chung-Cheng Chiu , Niki Parmar , Yu Zhang , Jiahui Yu , Wei Han , Shibo Wang , Zhengdong Zhang , Yonghui Wu , 2020 . Conformer: Convolution-augmented transformer for speech recognition. arXiv preprint arXiv:2005.08100(2020). Anmol Gulati, James Qin, Chung-Cheng Chiu, Niki Parmar, Yu Zhang, Jiahui Yu, Wei Han, Shibo Wang, Zhengdong Zhang, Yonghui Wu, 2020. Conformer: Convolution-augmented transformer for speech recognition. arXiv preprint arXiv:2005.08100(2020)."},{"key":"e_1_3_2_1_8_1","unstructured":"Liqiang He Dan Su and Dong Yu. 2020. Learned Transferable Architectures Can Surpass Hand-Designed Architectures for Large Scale Speech Recognition. ArXiv abs\/2008.11589(2020).  Liqiang He Dan Su and Dong Yu. 2020. Learned Transferable Architectures Can Surpass Hand-Designed Architectures for Large Scale Speech Recognition. ArXiv abs\/2008.11589(2020)."},{"key":"e_1_3_2_1_9_1","unstructured":"Shoukang Hu Xurong Xie Shansong Liu Mengzhe Geng Xunying Liu and Helen Meng. 2020. Neural architecture search for speech recognition. arXiv preprint arXiv:2007.08818(2020).  Shoukang Hu Xurong Xie Shansong Liu Mengzhe Geng Xunying Liu and Helen Meng. 2020. Neural architecture search for speech recognition. arXiv preprint arXiv:2007.08818(2020)."},{"key":"e_1_3_2_1_10_1","first-page":"1788","article-title":"Evolved Speech-Transformer: Applying Neural Architecture Search to End-to-End Automatic Speech Recognition","volume":"2020","author":"Kim Jihwan","year":"2020","unstructured":"Jihwan Kim , Jisung Wang , Sangki Kim , and Yeha Lee . 2020 . Evolved Speech-Transformer: Applying Neural Architecture Search to End-to-End Automatic Speech Recognition . Proc. Interspeech 2020 (2020), 1788 \u2013 1792 . Jihwan Kim, Jisung Wang, Sangki Kim, and Yeha Lee. 2020. Evolved Speech-Transformer: Applying Neural Architecture Search to End-to-End Automatic Speech Recognition. Proc. Interspeech 2020(2020), 1788\u20131792.","journal-title":"Proc. Interspeech"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7953075"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU46091.2019.9003906"},{"key":"e_1_3_2_1_13_1","unstructured":"Hanwen Liang Shifeng Zhang Jiacheng Sun Xingqiu He Weiran Huang Kechen Zhuang and Zhenguo Li. 2019. Darts+: Improved differentiable architecture search with early stopping. arXiv preprint arXiv:1909.06035(2019).  Hanwen Liang Shifeng Zhang Jiacheng Sun Xingqiu He Weiran Huang Kechen Zhuang and Zhenguo Li. 2019. Darts+: Improved differentiable architecture search with early stopping. arXiv preprint arXiv:1909.06035(2019)."},{"key":"e_1_3_2_1_14_1","volume-title":"Darts: Differentiable architecture search. arXiv preprint arXiv:1806.09055(2018).","author":"Liu Hanxiao","year":"2018","unstructured":"Hanxiao Liu , Karen Simonyan , and Yiming Yang . 2018 . Darts: Differentiable architecture search. arXiv preprint arXiv:1806.09055(2018). Hanxiao Liu, Karen Simonyan, and Yiming Yang. 2018. Darts: Differentiable architecture search. arXiv preprint arXiv:1806.09055(2018)."},{"key":"e_1_3_2_1_15_1","unstructured":"Yiping Lu Zhuohan Li Di He Zhiqing Sun Bin Dong Tao Qin Liwei Wang and Tie-Yan Liu. 2019. Understanding and improving transformer from a multi-particle dynamic system point of view. arXiv preprint arXiv:1906.02762(2019).  Yiping Lu Zhuohan Li Di He Zhiqing Sun Bin Dong Tao Qin Liwei Wang and Tie-Yan Liu. 2019. Understanding and improving transformer from a multi-particle dynamic system point of view. arXiv preprint arXiv:1906.02762(2019)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Hanna Mazzawi Xavi Gonzalvo Aleks Kracun Prashant Sridhar Niranjan Subrahmanya Ignacio Lopez-Moreno Hyun-Jin Park and Patrick Violette. 2019. Improving Keyword Spotting and Language Identification via Neural Architecture Search at Scale.. In INTERSPEECH. 1278\u20131282.  Hanna Mazzawi Xavi Gonzalvo Aleks Kracun Prashant Sridhar Niranjan Subrahmanya Ignacio Lopez-Moreno Hyun-Jin Park and Patrick Violette. 2019. Improving Keyword Spotting and Language Identification via Neural Architecture Search at Scale.. In INTERSPEECH. 1278\u20131282.","DOI":"10.21437\/Interspeech.2019-1916"},{"key":"e_1_3_2_1_17_1","volume-title":"NAS-Bench-ASR: Reproducible Neural Architecture Search for Speech Recognition. In International Conference on Learning Representations (ICLR).","author":"Mehrotra Abhinav","year":"2021","unstructured":"Abhinav Mehrotra , Alberto\u00a0Gil Ramos , Sourav Bhattacharya , \u0141ukasz Dudziak , Ravichander Vipperla , Thomas Chau , Mohamed\u00a0 S Abdelfattah , Samin Ishtiaq , and Nicholas\u00a0 D Lane . 2021 . NAS-Bench-ASR: Reproducible Neural Architecture Search for Speech Recognition. In International Conference on Learning Representations (ICLR). Abhinav Mehrotra, Alberto\u00a0Gil Ramos, Sourav Bhattacharya, \u0141ukasz Dudziak, Ravichander Vipperla, Thomas Chau, Mohamed\u00a0S Abdelfattah, Samin Ishtiaq, and Nicholas\u00a0D Lane. 2021. NAS-Bench-ASR: Reproducible Neural Architecture Search for Speech Recognition. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_18_1","unstructured":"Microsoft. [n. d.]. Neural Network Intelligence (NNI). http:\/\/nni.readthedocs.io.  Microsoft. [n. d.]. Neural Network Intelligence (NNI). http:\/\/nni.readthedocs.io."},{"key":"e_1_3_2_1_19_1","unstructured":"Abdelrahman Mohamed Dmytro Okhonko and Luke Zettlemoyer. 2019. Transformers with convolutional context for ASR. arXiv preprint arXiv:1904.11660(2019).  Abdelrahman Mohamed Dmytro Okhonko and Luke Zettlemoyer. 2019. Transformers with convolutional context for ASR. arXiv preprint arXiv:1904.11660(2019)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2017.8268935"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33014780"},{"key":"e_1_3_2_1_22_1","volume-title":"International Conference on Machine Learning. PMLR, 2902\u20132911","author":"Real Esteban","year":"2017","unstructured":"Esteban Real , Sherry Moore , Andrew Selle , Saurabh Saxena , Yutaka\u00a0Leon Suematsu , Jie Tan , Quoc\u00a0 V Le , and Alexey Kurakin . 2017 . Large-scale evolution of image classifiers . In International Conference on Machine Learning. PMLR, 2902\u20132911 . Esteban Real, Sherry Moore, Andrew Selle, Saurabh Saxena, Yutaka\u00a0Leon Suematsu, Jie Tan, Quoc\u00a0V Le, and Alexey Kurakin. 2017. Large-scale evolution of image classifiers. In International Conference on Machine Learning. PMLR, 2902\u20132911."},{"key":"e_1_3_2_1_23_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. arXiv preprint arXiv:1706.03762(2017).  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. arXiv preprint arXiv:1706.03762(2017)."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8683305"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2018-1456"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8682256"},{"key":"e_1_3_2_1_27_1","unstructured":"Sirui Xie Hehui Zheng Chunxiao Liu and Liang Lin. 2018. SNAS: stochastic neural architecture search. arXiv preprint arXiv:1812.09926(2018).  Sirui Xie Hehui Zheng Chunxiao Liu and Liang Lin. 2018. SNAS: stochastic neural architecture search. arXiv preprint arXiv:1812.09926(2018)."},{"key":"e_1_3_2_1_28_1","unstructured":"Bo Zhang WenFeng Li Qingyuan Li Weiji Zhuang Xiangxiang Chu and Yujun Wang. 2020. AutoKWS: Keyword Spotting with Differentiable Architecture Search. arXiv preprint arXiv:2009.03658(2020).  Bo Zhang WenFeng Li Qingyuan Li Weiji Zhuang Xiangxiang Chu and Yujun Wang. 2020. AutoKWS: Keyword Spotting with Differentiable Architecture Search. arXiv preprint arXiv:2009.03658(2020)."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"crossref","unstructured":"Shiliang Zhang Zhifu Gao Haoneng Luo Ming Lei Jie Gao Zhijie Yan and Lei Xie. 2020. Streaming chunk-aware multihead attention for online end-to-end speech recognition. arXiv preprint arXiv:2006.01712(2020).  Shiliang Zhang Zhifu Gao Haoneng Luo Ming Lei Jie Gao Zhijie Yan and Lei Xie. 2020. Streaming chunk-aware multihead attention for online end-to-end speech recognition. arXiv preprint arXiv:2006.01712(2020).","DOI":"10.21437\/Interspeech.2020-1972"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Huahuan Zheng Keyu An and Zhijian Ou. 2020. Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients. arXiv preprint arXiv:2011.05649(2020).  Huahuan Zheng Keyu An and Zhijian Ou. 2020. Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients. arXiv preprint arXiv:2011.05649(2020).","DOI":"10.1109\/SLT48900.2021.9383527"},{"key":"e_1_3_2_1_31_1","unstructured":"Barret Zoph and Quoc\u00a0V Le. 2016. Neural architecture search with reinforcement learning. arXiv preprint arXiv:1611.01578(2016).  Barret Zoph and Quoc\u00a0V Le. 2016. Neural architecture search with reinforcement learning. arXiv preprint arXiv:1611.01578(2016)."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00907"}],"event":{"name":"ICMI '21: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION","location":"Montreal QC Canada","acronym":"ICMI '21","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Companion Publication of the 2021 International Conference on Multimodal Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3461615.3491109","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3461615.3491109","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:49:04Z","timestamp":1750193344000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3461615.3491109"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,18]]},"references-count":32,"alternative-id":["10.1145\/3461615.3491109","10.1145\/3461615"],"URL":"https:\/\/doi.org\/10.1145\/3461615.3491109","relation":{},"subject":[],"published":{"date-parts":[[2021,10,18]]},"assertion":[{"value":"2021-12-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}