{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T03:34:48Z","timestamp":1772854488549,"version":"3.50.1"},"reference-count":71,"publisher":"Association for Computing Machinery (ACM)","issue":"3s","license":[{"start":{"date-parts":[[2023,3,14]],"date-time":"2023-03-14T00:00:00Z","timestamp":1678752000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Key Research and Development Program of China","award":["2018AAA0100604"],"award-info":[{"award-number":["2018AAA0100604"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61720106006, 62036012, 62072455, 61721004, U1836220, 61872424"],"award-info":[{"award-number":["61720106006, 62036012, 62072455, 61721004, U1836220, 61872424"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Beijing Natural Science Foundation","award":["L201001"],"award-info":[{"award-number":["L201001"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2023,6,30]]},"abstract":"<jats:p>\n            Humans can easily infer the motivations behind human actions from only visual data by comprehensively analyzing the complex context information and utilizing abundant life experiences. Inspired by humans\u2019 reasoning ability, existing motivation prediction methods have improved image-based deep classification models using the commonsense knowledge learned by pre-trained language models. However, the knowledge learned from public text corpora is probably incompatible with the task-specific data of the motivation prediction, which may impact the model performance. 
To address this problem, this paper proposes a\n            <jats:bold>dual scene graph convolutional network (dual-SGCN)<\/jats:bold>\n            to comprehensively explore the complex visual information and semantic context prior from the image data for motivation prediction. The proposed dual-SGCN has a visual branch and a semantic branch. For the visual branch, we build a visual graph based on the scene graph, where object nodes and relation edges are represented by visual features. For the semantic branch, we build a semantic graph where nodes and edges are directly represented by the word embeddings of the object and relation labels. In each branch, node-oriented and edge-oriented message passing is adopted to propagate interaction information between different nodes and edges. In addition, a multi-modal interactive attention mechanism is adopted to cooperatively attend to and fuse the visual and semantic information. The proposed dual-SGCN is trained end-to-end with a multi-task co-training scheme. In the inference stage, Total Direct Effect is adopted to alleviate the bias caused by the semantic context prior. 
Extensive experiments demonstrate that the proposed method achieves state-of-the-art performance.\n          <\/jats:p>","DOI":"10.1145\/3572914","type":"journal-article","created":{"date-parts":[[2022,12,1]],"date-time":"2022-12-01T12:41:10Z","timestamp":1669898470000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Dual Scene Graph Convolutional Network for Motivation Prediction"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7045-8031","authenticated-orcid":false,"given":"Yuyang","family":"Wanyan","sequence":"first","affiliation":[{"name":"National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences (CASIA); School of Artificial Intelligence, University of Chinese Academy of Sciences(UCAS), Beijing,  China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5453-9755","authenticated-orcid":false,"given":"Xiaoshan","family":"Yang","sequence":"additional","affiliation":[{"name":"National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences (CASIA); School of Artificial Intelligence, University of Chinese Academy of Sciences(UCAS), China and Peng Cheng Laboratory, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4598-9505","authenticated-orcid":false,"given":"Xuan","family":"Ma","sequence":"additional","affiliation":[{"name":"National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences (CASIA); School of Artificial Intelligence, University of Chinese Academy of Sciences (UCAS), Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8343-9665","authenticated-orcid":false,"given":"Changsheng","family":"Xu","sequence":"additional","affiliation":[{"name":"National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences (CASIA); School of Artificial Intelligence, University of Chinese Academy of 
Sciences(UCAS), China and Peng Cheng Laboratory, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2023,3,14]]},"reference":[{"key":"e_1_3_1_2_2","first-page":"841","volume-title":"Uncertainty in Artificial Intelligence","author":"Abu-El-Haija Sami","year":"2020","unstructured":"Sami Abu-El-Haija, Amol Kapoor, Bryan Perozzi, and Joonseok Lee. 2020. N-GCN: Multi-scale graph convolution for semi-supervised node classification. In Uncertainty in Artificial Intelligence. PMLR, 841\u2013851."},{"key":"e_1_3_1_3_2","article-title":"Neural machine translation by jointly learning to align and translate","author":"Bahdanau Dzmitry","year":"2014","unstructured":"Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).","journal-title":"arXiv preprint arXiv:1409.0473"},{"key":"e_1_3_1_4_2","doi-asserted-by":"crossref","first-page":"987521","DOI":"10.1117\/12.2228523","volume-title":"Eighth International Conference on Machine Vision (ICMV 2015)","volume":"9875","author":"Burnaev Evgeny","year":"2015","unstructured":"Evgeny Burnaev, Pavel Erofeev, and Artem Papanov. 2015. Influence of resampling on accuracy of imbalanced classification. In Eighth International Conference on Machine Vision (ICMV 2015), Vol. 9875. International Society for Optics and Photonics, 987521."},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58577-8_7"},{"key":"e_1_3_1_6_2","first-page":"149","article-title":"A model of intentional communication: AIRBUS (asymmetric intention recognition with Bayesian updating of signals)","author":"Ruiter Jan-Peter De","year":"2012","unstructured":"Jan-Peter De Ruiter and Chris Cummins. 2012. A model of intentional communication: AIRBUS (asymmetric intention recognition with Bayesian updating of signals). 
Proceedings of SemDial 2012 (2012), 149\u2013150.","journal-title":"Proceedings of SemDial 2012"},{"key":"e_1_3_1_7_2","first-page":"3844","volume-title":"Advances in Neural Information Processing Systems","author":"Defferrard Micha\u00ebl","year":"2016","unstructured":"Micha\u00ebl Defferrard, Xavier Bresson, and Pierre Vandergheynst. 2016. Convolutional neural networks on graphs with fast localized spectral filtering. In Advances in Neural Information Processing Systems. 3844\u20133852. arxiv:1606.09375."},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01525"},{"key":"e_1_3_1_9_2","first-page":"1462","volume-title":"International Conference on Machine Learning","author":"Gregor Karol","year":"2015","unstructured":"Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Rezende, and Daan Wierstra. 2015. Draw: A recurrent neural network for image generation. In International Conference on Machine Learning. PMLR, 1462\u20131471."},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01113"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-57351-9_30"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.5555\/3294771.3294869"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/tkde.2008.239"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.322"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_1_16_2","article-title":"Deep convolutional networks on graph-structured data","author":"Henaff Mikael","year":"2015","unstructured":"Mikael Henaff, Joan Bruna, and Yann LeCun. 2015. Deep convolutional networks on graph-structured data. arXiv preprint arXiv:1506.05163 (2015). 
arxiv:1506.05163http:\/\/arxiv.org\/abs\/1506.05163.","journal-title":"arXiv preprint arXiv:1506.05163"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2016.102"},{"key":"e_1_3_1_18_2","first-page":"12986","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Jia Menglin","year":"2020","unstructured":"Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, and Ser-Nam Lim. 2020. Intentonomy: A dataset and study towards human intent understanding. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 12986\u201312996. arxiv:2011.05558http:\/\/arxiv.org\/abs\/2011.05558."},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-009-5108-8"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00133"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298990"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.35"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00010"},{"key":"e_1_3_1_24_2","first-page":"1564","volume-title":"Advances in Neural Information Processing Systems","author":"Kim Jin Hwa","year":"2018","unstructured":"Jin Hwa Kim, Jaehyun Jun, and Byoung Tak Zhang. 2018. Bilinear attention networks. In Advances in Neural Information Processing Systems, Vol. 2018-Decem. 1564\u20131574. arxiv:1805.07932."},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2019.2931042"},{"key":"e_1_3_1_26_2","article-title":"ViLT: Vision-and-language transformer without convolution or region supervision","author":"Kim Wonjae","year":"2021","unstructured":"Wonjae Kim, Bokyung Son, and Ildoo Kim. 2021. ViLT: Vision-and-language transformer without convolution or region supervision. 
arXiv preprint arXiv:2102.03334 (2021).","journal-title":"arXiv preprint arXiv:2102.03334"},{"key":"e_1_3_1_27_2","article-title":"Semi-supervised classification with graph convolutional networks","author":"Kipf Thomas N.","year":"2016","unstructured":"Thomas N. Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).","journal-title":"arXiv preprint arXiv:1609.02907"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3065386"},{"key":"e_1_3_1_29_2","first-page":"1378","volume-title":"International Conference on Machine Learning","author":"Kumar Ankit","year":"2016","unstructured":"Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, and Richard Socher. 2016. Ask me anything: Dynamic memory networks for natural language processing. In International Conference on Machine Learning. PMLR, 1378\u20131387."},{"key":"e_1_3_1_30_2","article-title":"A hierarchical neural autoencoder for paragraphs and documents","author":"Li Jiwei","year":"2015","unstructured":"Jiwei Li, Minh-Thang Luong, and Dan Jurafsky. 2015. A hierarchical neural autoencoder for paragraphs and documents. arXiv preprint arXiv:1506.01057 (2015).","journal-title":"arXiv preprint arXiv:1506.01057"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58577-8_8"},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.142"},{"key":"e_1_3_1_33_2","article-title":"Gated graph sequence neural networks","author":"Li Yujia","year":"2015","unstructured":"Yujia Li, Daniel Tarlow, Marc Brockschmidt, and Richard Zemel. 2015. Gated graph sequence neural networks. 
arXiv preprint arXiv:1511.05493 (2015).","journal-title":"arXiv preprint arXiv:1511.05493"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.324"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_1_36_2","first-page":"289","volume-title":"Advances in Neural Information Processing Systems (NIPS)","author":"Lu Jiasen","year":"2016","unstructured":"Jiasen Lu, Jianwei Yang, Dhruv Batra, and Devi Parikh. 2016. Hierarchical question-image co-attention for visual question answering. In Advances in Neural Information Processing Systems (NIPS). 289\u2013297."},{"key":"e_1_3_1_37_2","first-page":"2204","volume-title":"Advances in Neural Information Processing Systems","author":"Mnih Volodymyr","year":"2014","unstructured":"Volodymyr Mnih, Nicolas Heess, and Alex Graves. 2014. Recurrent models of visual attention. In Advances in Neural Information Processing Systems. 2204\u20132212."},{"key":"e_1_3_1_38_2","first-page":"2172","volume-title":"Advances in Neural Information Processing Systems","author":"Newell Alejandro","year":"2017","unstructured":"Alejandro Newell and Jia Deng. 2017. Pixels to graphs by associative embedding. In Advances in Neural Information Processing Systems, Vol. 2017-Decem. 2172\u20132181. arxiv:1706.07365."},{"key":"e_1_3_1_39_2","first-page":"2014","volume-title":"International Conference on Machine Learning","author":"Niepert Mathias","year":"2016","unstructured":"Mathias Niepert, Mohamed Ahmed, and Konstantin Kutzkov. 2016. Learning convolutional neural networks for graphs. In International Conference on Machine Learning. PMLR, 2014\u20132023."},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2018.09.001"},{"key":"e_1_3_1_41_2","article-title":"Dual coordinate solvers for large-scale structural SVMs","author":"Ramanan Deva","year":"2013","unstructured":"Deva Ramanan. 2013. Dual coordinate solvers for large-scale structural SVMs. 
arXiv preprint arXiv:1312.1743 (2013).","journal-title":"arXiv preprint arXiv:1312.1743"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"e_1_3_1_43_2","article-title":"A neural attention model for abstractive sentence summarization","author":"Rush Alexander M.","year":"2015","unstructured":"Alexander M. Rush, Sumit Chopra, and Jason Weston. 2015. A neural attention model for abstractive sentence summarization. arXiv preprint arXiv:1509.00685 (2015).","journal-title":"arXiv preprint arXiv:1509.00685"},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1109\/WACV.2019.00019"},{"key":"e_1_3_1_45_2","article-title":"SCAN: A spatial context attentive network for joint multi-agent intent prediction","author":"Sekhon Jasmine","year":"2021","unstructured":"Jasmine Sekhon and Cody Fleming. 2021. SCAN: A spatial context attentive network for joint multi-agent intent prediction. arXiv preprint arXiv:2102.00109 (2021). arxiv:2102.00109. http:\/\/arxiv.org\/abs\/2102.00109","journal-title":"arXiv preprint arXiv:2102.00109"},{"key":"e_1_3_1_46_2","article-title":"Very deep convolutional networks for large-scale image recognition","author":"Simonyan Karen","year":"2015","unstructured":"Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings (2015). arxiv:1409.1556.","journal-title":"3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings"},{"key":"e_1_3_1_47_2","article-title":"Deep networks with internal selective attention through feedback connections","author":"Stollenga Marijn","year":"2014","unstructured":"Marijn Stollenga, Jonathan Masci, Faustino Gomez, and J\u00fcrgen Schmidhuber. 2014. Deep networks with internal selective attention through feedback connections. 
arXiv preprint arXiv:1407.3068 (2014).","journal-title":"arXiv preprint arXiv:1407.3068"},{"key":"e_1_3_1_48_2","first-page":"2647","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Sylvain Tristan","year":"2020","unstructured":"Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R. Devon Hjelm, and Shikhar Sharma. 2020. Object-centric image generation from layouts. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 2647\u20132655. arxiv:2003.07449. http:\/\/arxiv.org\/abs\/2003.07449."},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00377"},{"key":"e_1_3_1_50_2","article-title":"Directed graph convolutional network","author":"Tong Zekun","year":"2020","unstructured":"Zekun Tong, Yuxuan Liang, Changsheng Sun, David S. Rosenblum, and Andrew Lim. 2020. Directed graph convolutional network. arXiv preprint arXiv:2004.13970 (2020).","journal-title":"arXiv preprint arXiv:2004.13970"},{"key":"e_1_3_1_51_2","article-title":"SG2Caps: Revisiting scene graphs for image captioning","author":"Tripathi Subarna","year":"2021","unstructured":"Subarna Tripathi, Kien Nguyen, Tanaya Guha, Bang Du, and Truong Q. Nguyen. 2021. SG2Caps: Revisiting scene graphs for image captioning. arXiv preprint arXiv:2102.04990 (2021). arxiv:2102.04990http:\/\/arxiv.org\/abs\/2102.04990.","journal-title":"arXiv preprint arXiv:2102.04990"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0065-2601(08)60239-7"},{"key":"e_1_3_1_53_2","first-page":"5999","article-title":"Attention is all you need","volume":"2017","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 2017-Decem. (2017), 5999\u20136009. 
arxiv:1706.03762.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_54_2","article-title":"Graph attention networks","author":"Veli\u010dkovi\u0107 Petar","year":"2018","unstructured":"Petar Veli\u010dkovi\u0107, Arantxa Casanova, Pietro Li\u00f2, Guillem Cucurull, Adriana Romero, and Yoshua Bengio. 2018. Graph attention networks. 6th International Conference on Learning Representations, ICLR 2018 - Conference Track Proceedings (2018). arxiv:1710.10903.","journal-title":"6th International Conference on Learning Representations, ICLR 2018 - Conference Track Proceedings"},{"key":"e_1_3_1_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.327"},{"key":"e_1_3_1_56_2","doi-asserted-by":"publisher","DOI":"10.1109\/BigData47090.2019.9006464"},{"key":"e_1_3_1_57_2","first-page":"15920","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Wiles Olivia","year":"2020","unstructured":"Olivia Wiles, Sebastien Ehrhardt, and Andrew Zisserman. 2020. Co-attention for conditioned image matching. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 15920\u201315929. arxiv:2007.08480. http:\/\/arxiv.org\/abs\/2007.08480."},{"key":"e_1_3_1_58_2","doi-asserted-by":"publisher","DOI":"10.1016\/0010-0277(83)90004-5"},{"key":"e_1_3_1_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.15"},{"key":"e_1_3_1_60_2","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v14i1.7338"},{"key":"e_1_3_1_61_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.634"},{"key":"e_1_3_1_62_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.330"},{"key":"e_1_3_1_63_2","first-page":"2048","volume-title":"International Conference on Machine Learning","author":"Xu Kelvin","year":"2015","unstructured":"Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. 2015. 
Show, attend and tell: Neural image caption generation with visual attention. In International Conference on Machine Learning. PMLR, 2048\u20132057."},{"key":"e_1_3_1_64_2","first-page":"15039","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Yoon Jae Shin","year":"2020","unstructured":"Jae Shin Yoon, Lingjie Liu, Vladislav Golyanik, Kripasindhu Sarkar, Hyun Soo Park, and Christian Theobalt. 2020. Pose-guided human animation from a single image in the wild. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 15039\u201315048. arxiv:2012.03796http:\/\/arxiv.org\/abs\/2012.03796."},{"key":"e_1_3_1_65_2","first-page":"10718","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Yoon Sangwoong","year":"2020","unstructured":"Sangwoong Yoon, Woo Young Kang, Sungwook Jeon, SeongEun Lee, Changjin Han, Jonghun Park, and Eun-Sol Kim. 2020. Image-to-image retrieval by learning similarity between scene graphs. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 10718\u201310726. arxiv:2012.14700. http:\/\/arxiv.org\/abs\/2012.14700"},{"key":"e_1_3_1_66_2","first-page":"3208","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Yu Fei","year":"2020","unstructured":"Fei Yu, Jiji Tang, Weichong Yin, Yu Sun, Hao Tian, Hua Wu, and Haifeng Wang. 2020. ERNIE-ViL: Knowledge enhanced vision-language representations through scene graph. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 3208\u20133216. 
arxiv:2006.16934. http:\/\/arxiv.org\/abs\/2006.16934."},{"key":"e_1_3_1_67_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00644"},{"key":"e_1_3_1_68_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00611"},{"key":"e_1_3_1_69_2","first-page":"14374","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Zhang Hanlei","year":"2020","unstructured":"Hanlei Zhang, Hua Xu, and Ting-En Lin. 2020. Deep open intent classification with adaptive decision boundary. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 14374\u201314382. arxiv:2012.10209. http:\/\/arxiv.org\/abs\/2012.10209."},{"key":"e_1_3_1_70_2","first-page":"14365","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Zhang Hanlei","year":"2020","unstructured":"Hanlei Zhang, Hua Xu, Ting-En Lin, and Rui Lyu. 2020. Discovering new intents with deep aligned clustering. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 14365\u201314373. 
arxiv:2012.08987http:\/\/arxiv.org\/abs\/2012.08987."},{"key":"e_1_3_1_71_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-020-01300-7"},{"key":"e_1_3_1_72_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.319"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3572914","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3572914","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:51:38Z","timestamp":1750182698000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3572914"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,14]]},"references-count":71,"journal-issue":{"issue":"3s","published-print":{"date-parts":[[2023,6,30]]}},"alternative-id":["10.1145\/3572914"],"URL":"https:\/\/doi.org\/10.1145\/3572914","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,14]]},"assertion":[{"value":"2022-02-05","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-11-20","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-03-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}