{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,25]],"date-time":"2026-04-25T14:56:43Z","timestamp":1777129003658,"version":"3.51.4"},"reference-count":45,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2023,5,9]],"date-time":"2023-05-09T00:00:00Z","timestamp":1683590400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62172438, 61702160"],"award-info":[{"award-number":["62172438, 61702160"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2021YFB3900601"],"award-info":[{"award-number":["2021YFB3900601"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Key Project of Shenzhen City Special Fund for Fundamental Research","award":["202208183000751"],"award-info":[{"award-number":["202208183000751"]}]},{"name":"Key Laboratory of AI and Information Processing"},{"DOI":"10.13039\/501100011823","name":"Education Department of Guangxi Zhuang Autonomous Region","doi-asserted-by":"crossref","award":["2022GXZDSY014"],"award-info":[{"award-number":["2022GXZDSY014"]}],"id":[{"id":"10.13039\/501100011823","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"crossref","award":["B220202074"],"award-info":[{"award-number":["B220202074"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Fundamental Research Funds for the Central Universities, JLU"},{"name":"Joint Foundation of the Ministry of Education","award":["8091B022123"],"award-info":[{"award-number":["8091B022123"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2023,5,31]]},"abstract":"<jats:p>How to accurately understand low-resource languages is the core of the task-oriented human-computer dialogue system. Language understanding consists of two sub-tasks, i.e., intent detection and slot filling. Intent detection still faces challenges due to semantic ambiguity and implicit intentions with users\u2019 input. Moreover, separately modeling intent detection and slot filling significantly decrease the correctness and relevance between questions and answers. To address these issues, we propose a joint intent detection method using asynchronous training strategy. The proposed method firstly encodes local text information extracted by CNN and relationship information among words emphasized by attention structure. Later, a joint intent detection model with asynchronous training strategy is proposed by either fusing hidden states of intent detection and slot filling layers, or adopting the key information to fine-tune the whole network, greatly increasing the relevance of intent detection and slot filling subtasks. The accuracy achieved by the proposed method tested on an open-source airline travel dataset and a self-collected electricity service dataset, i.e., ATIS and ECSF, are 97.49% and 89.68%, respectively, which proves the effectiveness of joint learning and asynchronous training.<\/jats:p>","DOI":"10.1145\/3558096","type":"journal-article","created":{"date-parts":[[2022,8,22]],"date-time":"2022-08-22T11:41:58Z","timestamp":1661168518000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Joint Intent Detection Model for Task-oriented Human-Computer Dialogue System using Asynchronous Training"],"prefix":"10.1145","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3022-3718","authenticated-orcid":false,"given":"Yirui","family":"Wu","sequence":"first","affiliation":[{"name":"College of Computer and Information, Hohai University, Nanjing City, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7434-3896","authenticated-orcid":false,"given":"Hao","family":"Li","sequence":"additional","affiliation":[{"name":"College of Computer and Information, Hohai University, Nanjing City, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4324-4016","authenticated-orcid":false,"given":"Lilai","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Computer and Information, Hohai University, Nanjing City, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7900-6282","authenticated-orcid":false,"given":"Chen","family":"Dong","sequence":"additional","affiliation":[{"name":"College of Computer and Information, Hohai University, Nanjing City, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5625-0402","authenticated-orcid":false,"given":"Qian","family":"Huang","sequence":"additional","affiliation":[{"name":"College of Computer and Information, Hohai University, Nanjing City, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7013-9081","authenticated-orcid":false,"given":"Shaohua","family":"Wan","sequence":"additional","affiliation":[{"name":"Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen City, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,5,9]]},"reference":[{"key":"e_1_3_1_2_2","article-title":"Efficient intent detection with dual sentence encoders","volume":"2003","author":"Casanueva I\u00f1igo","year":"2020","unstructured":"I\u00f1igo Casanueva, Tadas Temcinas, Daniela Gerz, Matthew Henderson, and Ivan Vulic. 2020. Efficient intent detection with dual sentence encoders. CoRR abs\/2003.04807.","journal-title":"CoRR"},{"key":"e_1_3_1_3_2","first-page":"4960","volume-title":"Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing","author":"Chan William","year":"2016","unstructured":"William Chan, Navdeep Jaitly, Quoc V. Le, and Oriol Vinyals. 2016. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. 4960\u20134964."},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2022.03.010"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/3430505"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1053"},{"key":"e_1_3_1_7_2","unstructured":"Alice Coucke Alaa Saade Adrien Ball Th\u00e9odore Bluche Alexandre Caulier David Leroy Cl\u00e9ment Doumouro Thibault Gisselbrecht Francesco Caltagirone Thibaut Lavril Ma\u00ebl Primet and Joseph Dureau. 2018. Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces. CoRR abs\/1805.10190."},{"key":"e_1_3_1_8_2","first-page":"2454","volume-title":"Proceedings of Association for Computational Linguistics","author":"Dopierre Thomas","year":"2021","unstructured":"Thomas Dopierre, Christophe Gravier, and Wilfried Logerais. 2021. ProtAugment: Intent detection meta-learning through unsupervised diverse paraphrasing. In Proceedings of Association for Computational Linguistics. 2454\u20132466."},{"key":"e_1_3_1_9_2","first-page":"5467","volume-title":"Proceedings of Association for Computational Linguistics","author":"Haihong E.","year":"2019","unstructured":"E. Haihong, Peiqing Niu, Zhongfu Chen, and Meina Song. 2019. A novel bi-directional interrelated model for joint intent detection and slot filling. In Proceedings of Association for Computational Linguistics. 5467\u20135471."},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1162\/dint_a_00090"},{"key":"e_1_3_1_11_2","first-page":"7468","volume-title":"Empirical Methods in Natural Language Processing","author":"Gerz Daniela","year":"2021","unstructured":"Daniela Gerz, Pei-Hao Su, et\u00a0al. 2021. Multilingual and cross-lingual intent detection from spoken data. In Empirical Methods in Natural Language Processing. 7468\u20137475."},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-2118"},{"key":"e_1_3_1_13_2","first-page":"554","volume-title":"Proceedings of IEEE Spoken Language Technology Workshop","author":"Guo Zhaohan Daniel","year":"2014","unstructured":"Zhaohan Daniel Guo, G\u00f6khan T\u00fcr, Wen-tau Yih, and Geoffrey Zweig. 2014. Joint semantic utterance classification and slot filling with recursive neural networks. In Proceedings of IEEE Spoken Language Technology Workshop. 554\u2013559."},{"key":"e_1_3_1_14_2","first-page":"715","volume-title":"Proceedings of Annual Conference of the International Speech Communication Association","author":"Hakkani-T\u00fcr Dilek","year":"2016","unstructured":"Dilek Hakkani-T\u00fcr, G\u00f6khan T\u00fcr, Asli Celikyilmaz, Yun-Nung Chen, Jianfeng Gao, Li Deng, and Ye-Yi Wang. 2016. Multi-domain joint semantic frame parsing using bi-directional RNN-LSTM. In Proceedings of Annual Conference of the International Speech Communication Association. 715\u2013719."},{"issue":"7","key":"e_1_3_1_15_2","doi-asserted-by":"crossref","first-page":"1287","DOI":"10.1109\/TASL.2008.925143","article-title":"Triangular-chain conditional random fields","volume":"16","author":"Jeong Minwoo","year":"2008","unstructured":"Minwoo Jeong and Gary Geunbae Lee. 2008. Triangular-chain conditional random fields. IEEE Trans. Speech Audio Process. 16, 7, 1287\u20131302.","journal-title":"IEEE Trans. Speech Audio Process."},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1181"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1223"},{"key":"e_1_3_1_18_2","first-page":"282","volume-title":"Proceedings of International Conference on Machine Learning","author":"Lafferty John D.","year":"2001","unstructured":"John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of International Conference on Machine Learning. 282\u2013289."},{"key":"e_1_3_1_19_2","first-page":"1311","volume-title":"Empirical Methods in Natural Language Processing","author":"Larson Stefan","year":"2019","unstructured":"Stefan Larson, Anish Mahendran, et\u00a0al. 2019. An evaluation dataset for intent classification and out-of-scope prediction. In Empirical Methods in Natural Language Processing. 1311\u20131316."},{"key":"e_1_3_1_20_2","first-page":"3824","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing","author":"Li Changliang","year":"2018","unstructured":"Changliang Li, Liang Li, and Ji Qi. 2018. A self-attentive model with gate mechanism for spoken language understanding. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 3824\u20133833."},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324905003955"},{"key":"e_1_3_1_22_2","first-page":"106548","article-title":"ASRNN: A recurrent neural network with an attention model for sequence labeling","author":"Lin Jerry Chun-Wei","year":"2021","unstructured":"Jerry Chun-Wei Lin, Yinan Shao, Youcef Djenouri, and Unil Yun. 2021. ASRNN: A recurrent neural network with an attention model for sequence labeling. Knowl. Based Syst., 106548.","journal-title":"Knowl. Based Syst."},{"key":"e_1_3_1_23_2","article-title":"Attention-based recurrent neural network models for joint intent detection and slot filling","author":"Liu Bing","year":"2016","unstructured":"Bing Liu and Ian Lane. 2016. Attention-based recurrent neural network models for joint intent detection and slot filling. arXiv preprint arXiv:1609.01454 (2016).","journal-title":"arXiv preprint arXiv:1609.01454"},{"key":"e_1_3_1_24_2","first-page":"685","volume-title":"Proceedings of International Speech Communication Association","author":"Liu Bing","year":"2016","unstructured":"Bing Liu and Ian R. Lane. 2016. Attention-based recurrent neural network models for joint intent detection and slot filling. In Proceedings of International Speech Communication Association. 685\u2013689."},{"key":"e_1_3_1_25_2","first-page":"165","volume-title":"Proceedings of International Workshop on Spoken Dialog System Technology","author":"Liu Xingkun","year":"2019","unstructured":"Xingkun Liu, Arash Eshghi, Pawel Swietojanski, and Verena Rieser. 2019. Benchmarking natural language understanding services for building conversational agents. In Proceedings of International Workshop on Spoken Dialog System Technology. 165\u2013183."},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1166"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.4018\/JDM.2021100105"},{"key":"e_1_3_1_28_2","first-page":"57","volume-title":"1st Workshop on Behavioral Change and Ambient Intelligence for Sustainability and 2nd Workshop on Affective Interaction with Avatars and Robots","author":"Merdivan Erinc","year":"2018","unstructured":"Erinc Merdivan, Deepika Singh, Sten Hanke, and Andreas Holzinger. 2018. Dialogue systems for intelligent human computer interactions. In 1st Workshop on Behavioral Change and Ambient Intelligence for Sustainability and 2nd Workshop on Affective Interaction with Avatars and Robots. 57\u201371."},{"key":"e_1_3_1_29_2","first-page":"2852","volume-title":"Proceedings of Association for Computational Linguistics","author":"Ouyang Yawen","year":"2021","unstructured":"Yawen Ouyang, Jiasheng Ye, Yu Chen, Xinyu Dai, Shujian Huang, and Jiajun Chen. 2021. Energy-based unknown intent detection with data manipulation. In Proceedings of Association for Computational Linguistics. 2852\u20132861."},{"key":"e_1_3_1_30_2","first-page":"135","volume-title":"Proceedings of International Speech Communication Association","author":"Ravuri Suman V.","year":"2015","unstructured":"Suman V. Ravuri and Andreas Stolcke. 2015. Recurrent neural network and LSTM models for lexical utterance classification. In Proceedings of International Speech Communication Association. 135\u2013139."},{"key":"e_1_3_1_31_2","first-page":"3795","volume-title":"Proceedings of North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Schuster Sebastian","year":"2019","unstructured":"Sebastian Schuster, Sonal Gupta, Rushin Shah, and Mike Lewis. 2019. Cross-lingual transfer learning for multilingual task oriented dialog. In Proceedings of North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 3795\u20133805."},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/j.patrec.2021.02.008","article-title":"Self-attention-based conditional random fields latent variables model for sequence labeling","author":"Shao Yinan","year":"2021","unstructured":"Yinan Shao, Jerry Chun-Wei Lin, Gautam Srivastava, Alireza Jolfaei, Dongdong Guo, and Yi Hu. 2021. Self-attention-based conditional random fields latent variables model for sequence labeling. Pattern Recognit. Lett., 157\u2013164.","journal-title":"Pattern Recognit. Lett."},{"key":"e_1_3_1_33_2","first-page":"3104","volume-title":"Proceedings of Neural Information Processing Systems","author":"Sutskever Ilya","year":"2014","unstructured":"Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Proceedings of Neural Information Processing Systems. 3104\u20133112."},{"key":"e_1_3_1_34_2","first-page":"13943","volume-title":"Proceedings of AAAI Conference on Artificial Intelligence","author":"Wang Jixuan","year":"2021","unstructured":"Jixuan Wang, Kai Wei, Martin Radfar, Weiwei Zhang, and Clement Chung. 2021. Encoding syntactic knowledge in transformer encoder for intent detection and slot filling. In Proceedings of AAAI Conference on Artificial Intelligence. 13943\u201313951."},{"key":"e_1_3_1_35_2","first-page":"5324","volume-title":"Proceedings of America Control Conference","author":"Wang Yu","year":"2017","unstructured":"Yu Wang. 2017. A new concept using LSTM neural networks for dynamic system identification. In Proceedings of America Control Conference. 5324\u20135329."},{"key":"e_1_3_1_36_2","article-title":"A bi-model based RNN semantic frame parsing model for intent detection and slot filling","author":"Wang Yu","year":"2018","unstructured":"Yu Wang, Yilin Shen, and Hongxia Jin. 2018. A bi-model based RNN semantic frame parsing model for intent detection and slot filling. arXiv preprint arXiv:1812.10235 (2018).","journal-title":"arXiv preprint arXiv:1812.10235"},{"key":"e_1_3_1_37_2","article-title":"Edge computing driven low-light image dynamic enhancement for object detection","author":"Wu Yirui","year":"2022","unstructured":"Yirui Wu, Haifeng Guo, Chinmay Chakraborty, Mohammad Khosravi, Stefano Berretti, and Shaohua Wan. 2022. Edge computing driven low-light image dynamic enhancement for object detection. IEEE Transactions on Network Science and Engineering (2022).","journal-title":"IEEE Transactions on Network Science and Engineering"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvcir.2021.103261"},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2021.116319"},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1348"},{"key":"e_1_3_1_41_2","first-page":"78","volume-title":"Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding","author":"Xu Puyang","year":"2013","unstructured":"Puyang Xu and Ruhi Sarikaya. 2013. Convolutional neural network based triangular CRF for joint intent detection and slot filling. In Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding. 78\u201383."},{"key":"e_1_3_1_42_2","first-page":"5052","volume-title":"Empirical Methods in Natural Language Processing","author":"Xu Weijia","year":"2020","unstructured":"Weijia Xu, Batool Haider, and Saab Mansour. 2020. End-to-end slot alignment and recognition for cross-lingual NLU. In Empirical Methods in Natural Language Processing. 5052\u20135063."},{"key":"e_1_3_1_43_2","first-page":"189","volume-title":"Proceedings of IEEE Spoken Language Technology","author":"Yao Kaisheng","year":"2014","unstructured":"Kaisheng Yao, Baolin Peng, Yu Zhang, Dong Yu, Geoffrey Zweig, and Yangyang Shi. 2014. Spoken language understanding using long short-term memory neural networks. In Proceedings of IEEE Spoken Language Technology. 189\u2013194."},{"key":"e_1_3_1_44_2","first-page":"3521","volume-title":"Proceedings of Association for Computational Linguistics","author":"Zhan Li-Ming","year":"2021","unstructured":"Li-Ming Zhan, Haowen Liang, Bo Liu, Lu Fan, Xiao-Ming Wu, and Albert Y. S. Lam. 2021. Out-of-scope intent detection with self-supervision and discriminative training. In Proceedings of Association for Computational Linguistics. 3521\u20133532."},{"key":"e_1_3_1_45_2","first-page":"2993","volume-title":"Proceedings of International Joint Conference on Artificial Intelligence","author":"Zhang Xiaodong","year":"2016","unstructured":"Xiaodong Zhang and Houfeng Wang. 2016. A joint model of intent determination and slot filling for spoken language understanding. In Proceedings of International Joint Conference on Artificial Intelligence. 2993\u20132999."},{"key":"e_1_3_1_46_2","first-page":"5675","volume-title":"Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing","author":"Zhu Su","year":"2017","unstructured":"Su Zhu and Kai Yu. 2017. Encoder-decoder with focus-mechanism for sequence labelling based spoken language understanding. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. 5675\u20135679."}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3558096","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3558096","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:32Z","timestamp":1750182572000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3558096"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,9]]},"references-count":45,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2023,5,31]]}},"alternative-id":["10.1145\/3558096"],"URL":"https:\/\/doi.org\/10.1145\/3558096","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,9]]},"assertion":[{"value":"2021-12-31","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-08-09","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-05-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}