{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T07:56:11Z","timestamp":1775548571226,"version":"3.50.1"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,2,6]],"date-time":"2023-02-06T00:00:00Z","timestamp":1675641600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61832001"],"award-info":[{"award-number":["61832001"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Shenzhen Science and Technology Program","award":["JCYJ20210324121213037"],"award-info":[{"award-number":["JCYJ20210324121213037"]}]},{"name":"Zhejiang Lab\u2019s International Talent Fund for Young Professionals"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2023,5,31]]},"abstract":"<jats:p>\n            Knowledge graphs often suffer from incompleteness, and\n            <jats:bold>knowledge graph completion (KGC)<\/jats:bold>\n            aims at inferring the missing triplets through knowledge graph embedding from known factual triplets. However, most existing knowledge graph embedding methods only use the relational information of knowledge graph and treat the entities and relations as IDs with simple embedding layer, ignoring the multi-modal information among triplets, such as text descriptions, images, etc. In this work, we propose a novel network to incorporate different modal information with graph structure information for more precise representation of multi-modal knowledge graph, termed as\n            <jats:bold>hyper-node relational graph attention (HRGAT)<\/jats:bold>\n            network. In HRGAT, we use low-rank multi-modal fusion to model the intra-modality and inter-modality dynamics, which transforms the original knowledge graph to a hyper-node graph. Then,\n            <jats:bold>relational graph attention (RGAT)<\/jats:bold>\n            network is used, which contains relation-specific attention and entity-relation fusion operation to capture the graph structure information. Finally, we aggregate the updated multi-modal information and graph structure information to generate the final embeddings of knowledge graph to achieve KGC. By exploring multi-modal information and graph structure information, HRGAT embraces faster convergence speed and achieves the state-of-the-art for KGC on the standard datasets. Implementation code is available at https:\/\/github.com\/broliang\/HRGAT.\n          <\/jats:p>","DOI":"10.1145\/3545573","type":"journal-article","created":{"date-parts":[[2022,6,30]],"date-time":"2022-06-30T10:21:03Z","timestamp":1656584463000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":50,"title":["Hyper-node Relational Graph Attention Network for Multi-modal Knowledge Graph Completion"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7387-2801","authenticated-orcid":false,"given":"Shuang","family":"Liang","sequence":"first","affiliation":[{"name":"University of Electronic Science and Technology of China, Chengdu, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4634-7961","authenticated-orcid":false,"given":"Anjie","family":"Zhu","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China, Chengdu, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5640-2020","authenticated-orcid":false,"given":"Jiasheng","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China, Chengdu, China; Sichuan Artificial Intelligence Research Institute, Yibin, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2615-1555","authenticated-orcid":false,"given":"Jie","family":"Shao","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China, Chengdu, China; Shenzhen Institute for Advanced Study, UESTC, China"}]}],"member":"320","published-online":{"date-parts":[[2023,2,6]]},"reference":[{"key":"e_1_3_1_2_2","first-page":"745","volume-title":"Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, Volume 1 (Long Papers)","author":"An Bo","year":"2018","unstructured":"Bo An, Bo Chen, Xianpei Han, and Le Sun. 2018. Accurate text-enhanced knowledge graph representation learning. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, Volume 1 (Long Papers). 745\u2013755."},{"key":"e_1_3_1_3_2","first-page":"722","volume-title":"The Semantic Web, 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference, ISWC 2007 + ASWC 2007","author":"Auer S\u00f6ren","year":"2007","unstructured":"S\u00f6ren Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary G. Ives. 2007. DBpedia: A nucleus for a web of open data. In The Semantic Web, 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference, ISWC 2007 + ASWC 2007. 722\u2013735."},{"key":"e_1_3_1_4_2","first-page":"553","volume-title":"Artificial Neural Networks and Machine Learning - ICANN 2019-28th International Conference on Artificial Neural Networks","author":"Balazevic Ivana","year":"2019","unstructured":"Ivana Balazevic, Carl Allen, and Timothy M. Hospedales. 2019. Hypernetwork knowledge graph embeddings. In Artificial Neural Networks and Machine Learning - ICANN 2019-28th International Conference on Artificial Neural Networks. 553\u2013565."},{"key":"e_1_3_1_5_2","doi-asserted-by":"crossref","first-page":"5184","DOI":"10.18653\/v1\/D19-1522","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019","author":"Balazevic Ivana","year":"2019","unstructured":"Ivana Balazevic, Carl Allen, and Timothy M. Hospedales. 2019. TuckER: Tensor factorization for knowledge graph completion. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019. 5184\u20135193."},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/1376616.1376746"},{"key":"e_1_3_1_7_2","first-page":"2787","volume-title":"Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013","author":"Bordes Antoine","year":"2013","unstructured":"Antoine Bordes, Nicolas Usunier, Alberto Garc\u00eda-Dur\u00e1n, Jason Weston, and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. In Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. 2787\u20132795."},{"key":"e_1_3_1_8_2","volume-title":"Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2011","author":"Bordes Antoine","year":"2011","unstructured":"Antoine Bordes, Jason Weston, Ronan Collobert, and Yoshua Bengio. 2011. Learning structured embeddings of knowledge bases. In Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2011."},{"issue":"1","key":"e_1_3_1_9_2","first-page":"13:1\u201313:23","article-title":"Image retrieval for complex queries using knowledge embedding","volume":"16","author":"Chaudhary Chandramani","year":"2020","unstructured":"Chandramani Chaudhary, Poonam Goyal, Navneet Goyal, and Yi-Ping Phoebe Chen. 2020. Image retrieval for complex queries using knowledge embedding. ACM Trans. Multim. Comput. Commun. Appl. 16, 1 (2020), 13:1\u201313:23.","journal-title":"ACM Trans. Multim. Comput. Commun. Appl."},{"key":"e_1_3_1_10_2","first-page":"134","volume-title":"Knowledge Science, Engineering and Management - 13th International Conference, KSEM 2020","author":"Chen Liyi","year":"2020","unstructured":"Liyi Chen, Zhi Li, Yijun Wang, Tong Xu, Zhefeng Wang, and Enhong Chen. 2020. MMEA: Entity alignment for multi-modal knowledge graph. In Knowledge Science, Engineering and Management - 13th International Conference, KSEM 2020. 134\u2013147."},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.107181"},{"key":"e_1_3_1_12_2","volume-title":"9th International Conference on Learning Representations, ICLR 2021","author":"Chen Zhenfang","year":"2021","unstructured":"Zhenfang Chen, Jiayuan Mao, Jiajun Wu, Kwan-Yee Kenneth Wong, Joshua B. Tenenbaum, and Chuang Gan. 2021. Grounding physical concepts of objects and events through dynamic visual reasoning. In 9th International Conference on Learning Representations, ICLR 2021."},{"key":"e_1_3_1_13_2","volume-title":"10th International Conference on Learning Representations, ICLR 2022","author":"Chen Zhenfang","year":"2022","unstructured":"Zhenfang Chen, Kexin Yi, Yunzhu Li, Mingyu Ding, Joshua B. Tenenbaum, Antonio Torralba, and Chuang Gan. 2022. ComPhy: Compositional physical reasoning of objects and events from videos. In 10th International Conference on Learning Representations, ICLR 2022."},{"key":"e_1_3_1_14_2","first-page":"1811","volume-title":"Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, AAAI 2018","author":"Dettmers Tim","year":"2018","unstructured":"Tim Dettmers, Pasquale Minervini, Pontus Stenetorp, and Sebastian Riedel. 2018. Convolutional 2D knowledge graph embeddings. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, AAAI 2018. 1811\u20131818."},{"key":"e_1_3_1_15_2","first-page":"4171","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Volume 1 (Long and Short Papers)","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Volume 1 (Long and Short Papers). 4171\u20134186."},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/2623330.2623623"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2019.2893920"},{"key":"e_1_3_1_18_2","first-page":"372","volume-title":"Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, UAI 2018","author":"Garc\u00eda-Dur\u00e1n Alberto","year":"2018","unstructured":"Alberto Garc\u00eda-Dur\u00e1n and Mathias Niepert. 2018. KBlrn: End-to-end learning of knowledge base representations with latent, relational, and numerical features. In Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, UAI 2018. 372\u2013381."},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.107310"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.107188"},{"key":"e_1_3_1_21_2","first-page":"2693","volume-title":"Proceedings of the 35th International Conference on Machine Learning, ICML 2018","author":"Kipf Thomas N.","year":"2018","unstructured":"Thomas N. Kipf, Ethan Fetaya, Kuan-Chieh Wang, Max Welling, and Richard S. Zemel. 2018. Neural relational inference for interacting systems. In Proceedings of the 35th International Conference on Machine Learning, ICML 2018. 2693\u20132702."},{"key":"e_1_3_1_22_2","volume-title":"5th International Conference on Learning Representations, ICLR 2017","author":"Kipf Thomas N.","year":"2017","unstructured":"Thomas N. Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. In 5th International Conference on Learning Representations, ICLR 2017."},{"key":"e_1_3_1_23_2","first-page":"2181","volume-title":"Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI 2015","author":"Lin Yankai","year":"2015","unstructured":"Yankai Lin, Zhiyuan Liu, Maosong Sun, Yang Liu, and Xuan Zhu. 2015. Learning entity and relation embeddings for knowledge graph completion. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI 2015. 2181\u20132187."},{"key":"e_1_3_1_24_2","first-page":"2168","volume-title":"Proceedings of the 34th International Conference on Machine Learning, ICML 2017","author":"Liu Hanxiao","year":"2017","unstructured":"Hanxiao Liu, Yuexin Wu, and Yiming Yang. 2017. Analogical inference for multi-relational embeddings. In Proceedings of the 34th International Conference on Machine Learning, ICML 2017. 2168\u20132178."},{"key":"e_1_3_1_25_2","first-page":"459","volume-title":"The Semantic Web - 16th International Conference, ESWC 2019","author":"Liu Ye","year":"2019","unstructured":"Ye Liu, Hui Li, Alberto Garc\u00eda-Dur\u00e1n, Mathias Niepert, Daniel O\u00f1oro-Rubio, and David S. Rosenblum. 2019. MMKG: Multi-modal knowledge graphs. In The Semantic Web - 16th International Conference, ESWC 2019. 459\u2013474."},{"key":"e_1_3_1_26_2","first-page":"2247","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Volume 1: Long Papers","author":"Liu Zhun","year":"2018","unstructured":"Zhun Liu, Ying Shen, Varun Bharadhwaj Lakshminarasimhan, Paul Pu Liang, Amir Zadeh, and Louis-Philippe Morency. 2018. Efficient low-rank multimodal fusion with modality-specific factors. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Volume 1: Long Papers. 2247\u20132256."},{"key":"e_1_3_1_27_2","first-page":"4710","volume-title":"Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019","author":"Nathani Deepak","year":"2019","unstructured":"Deepak Nathani, Jatin Chauhan, Charu Sharma, and Manohar Kaul. 2019. Learning attention-based embeddings for relation prediction in knowledge graphs. In Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019. 4710\u20134723."},{"key":"e_1_3_1_28_2","first-page":"327","volume-title":"Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, Volume 2 (Short Papers)","author":"Nguyen Dai Quoc","year":"2018","unstructured":"Dai Quoc Nguyen, Tu Dinh Nguyen, Dat Quoc Nguyen, and Dinh Q. Phung. 2018. A novel embedding model for knowledge base completion based on convolutional neural network. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, Volume 2 (Short Papers). 327\u2013333."},{"key":"e_1_3_1_29_2","first-page":"809","volume-title":"Proceedings of the 28th International Conference on Machine Learning, ICML 2011","author":"Nickel Maximilian","year":"2011","unstructured":"Maximilian Nickel, Volker Tresp, and Hans-Peter Kriegel. 2011. A three-way model for collective learning on multi-relational data. In Proceedings of the 28th International Conference on Machine Learning, ICML 2011. 809\u2013816."},{"key":"e_1_3_1_30_2","doi-asserted-by":"crossref","first-page":"3208","DOI":"10.18653\/v1\/D18-1359","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, EMNLP 2018","author":"Pezeshkpour Pouya","year":"2018","unstructured":"Pouya Pezeshkpour, Liyan Chen, and Sameer Singh. 2018. Embedding multimodal relational data for knowledge base completion. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, EMNLP 2018. 3208\u20133218."},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3451215"},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","first-page":"3980","DOI":"10.18653\/v1\/D19-1410","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019","author":"Reimers Nils","year":"2019","unstructured":"Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence embeddings using siamese BERT-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019. 3980\u20133990."},{"key":"e_1_3_1_33_2","first-page":"593","volume-title":"The Semantic Web - 15th International Conference, ESWC 2018","author":"Schlichtkrull Michael Sejr","year":"2018","unstructured":"Michael Sejr Schlichtkrull, Thomas N. Kipf, Peter Bloem, Rianne van den Berg, Ivan Titov, and Max Welling. 2018. Modeling relational data with graph convolutional networks. In The Semantic Web - 15th International Conference, ESWC 2018. 593\u2013607."},{"key":"e_1_3_1_34_2","first-page":"3060","volume-title":"The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019","author":"Shang Chao","year":"2019","unstructured":"Chao Shang, Yun Tang, Jing Huang, Jinbo Bi, Xiaodong He, and Bowen Zhou. 2019. End-to-end structure-aware convolutional networks for knowledge base completion. In The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019. 3060\u20133067."},{"key":"e_1_3_1_35_2","volume-title":"3rd International Conference on Learning Representations, ICLR 2015","author":"Simonyan Karen","year":"2015","unstructured":"Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In 3rd International Conference on Learning Representations, ICLR 2015."},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242667"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3411947"},{"key":"e_1_3_1_38_2","volume-title":"7th International Conference on Learning Representations, ICLR 2019","author":"Sun Zhiqing","year":"2019","unstructured":"Zhiqing Sun, Zhihong Deng, Jianyun Nie, and Jian Tang. 2019. RotatE: Knowledge graph embedding by relational rotation in complex space. In 7th International Conference on Learning Representations, ICLR 2019."},{"key":"e_1_3_1_39_2","doi-asserted-by":"crossref","first-page":"57","DOI":"10.18653\/v1\/W15-4007","volume-title":"Proceedings of the 3rd Workshop on Continuous Vector Space Models and their Compositionality, CVSC 2015","author":"Toutanova Kristina","year":"2015","unstructured":"Kristina Toutanova and Danqi Chen. 2015. Observed versus latent features for knowledge base and text inference. In Proceedings of the 3rd Workshop on Continuous Vector Space Models and their Compositionality, CVSC 2015. 57\u201366."},{"key":"e_1_3_1_40_2","first-page":"2071","volume-title":"Proceedings of the 33rd International Conference on Machine Learning, ICML 2016","author":"Trouillon Th\u00e9o","year":"2016","unstructured":"Th\u00e9o Trouillon, Johannes Welbl, Sebastian Riedel, \u00c9ric Gaussier, and Guillaume Bouchard. 2016. Complex embeddings for simple link prediction. In Proceedings of the 33rd International Conference on Machine Learning, ICML 2016. 2071\u20132080."},{"key":"e_1_3_1_41_2","volume-title":"8th International Conference on Learning Representations, ICLR 2020","author":"Vashishth Shikhar","year":"2020","unstructured":"Shikhar Vashishth, Soumya Sanyal, Vikram Nitin, and Partha P. Talukdar. 2020. Composition-based multi-relational graph convolutional networks. In 8th International Conference on Learning Representations, ICLR 2020."},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1145\/2629489"},{"key":"e_1_3_1_43_2","volume-title":"Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1 (NeurIPS Datasets and Benchmarks 2021)","author":"Wu Bo","year":"2021","unstructured":"Bo Wu, Sijia Liu, Pin-Yu Chen, Gaoyuan Zhang, and Chuang Gan. 2021. STAR: A benchmark for situated reasoning in real-world videos. In Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1 (NeurIPS Datasets and Benchmarks 2021)."},{"key":"e_1_3_1_44_2","volume-title":"Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, Volume 1: Long Papers","author":"Xiao Han","year":"2016","unstructured":"Han Xiao, Minlie Huang, and Xiaoyan Zhu. 2016. TransG : A generative model for knowledge graph embedding. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, Volume 1: Long Papers."},{"key":"e_1_3_1_45_2","first-page":"3140","volume-title":"Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017","author":"Xie Ruobing","year":"2017","unstructured":"Ruobing Xie, Zhiyuan Liu, Huanbo Luan, and Maosong Sun. 2017. Image-embodied knowledge representation learning. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017. 3140\u20133146."},{"key":"e_1_3_1_46_2","volume-title":"3rd International Conference on Learning Representations, ICLR 2015","author":"Yang Bishan","year":"2015","unstructured":"Bishan Yang, Wen-tau Yih, Xiaodong He, Jianfeng Gao, and Li Deng. 2015. Embedding entities and relations for learning and inference in knowledge bases. In 3rd International Conference on Learning Representations, ICLR 2015."},{"key":"e_1_3_1_47_2","article-title":"KG-BERT: BERT for knowledge graph completion","volume":"1909","author":"Yao Liang","year":"2019","unstructured":"Liang Yao, Chengsheng Mao, and Yuan Luo. 2019. KG-BERT: BERT for knowledge graph completion. CoRR abs\/1909.03193 (2019).","journal-title":"CoRR"},{"key":"e_1_3_1_48_2","volume-title":"8th International Conference on Learning Representations, ICLR 2020","author":"Yi Kexin","year":"2020","unstructured":"Kexin Yi, Chuang Gan, Yunzhu Li, Pushmeet Kohli, Jiajun Wu, Antonio Torralba, and Joshua B. Tenenbaum. 2020. CLEVRER: Collision events for video representation and reasoning. In 8th International Conference on Learning Representations, ICLR 2020."},{"key":"e_1_3_1_49_2","first-page":"1103","volume-title":"Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017","author":"Zadeh Amir","year":"2017","unstructured":"Amir Zadeh, Minghai Chen, Soujanya Poria, Erik Cambria, and Louis-Philippe Morency. 2017. Tensor fusion network for multimodal sentiment analysis. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017. 1103\u20131114."},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.107369"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413736"},{"key":"e_1_3_1_52_2","doi-asserted-by":"crossref","first-page":"1089","DOI":"10.1145\/3343031.3351033","volume-title":"Proceedings of the 27th ACM International Conference on Multimedia, MM 2019","author":"Zhang Yingying","year":"2019","unstructured":"Yingying Zhang, Shengsheng Qian, Quan Fang, and Changsheng Xu. 2019. Multi-modal knowledge-aware hierarchical attention network for explainable medical question answering. In Proceedings of the 27th ACM International Conference on Multimedia, MM 2019. 1089\u20131097."},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2020.3005952"},{"key":"e_1_3_1_54_2","first-page":"739","volume-title":"Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2020","author":"Zheng Da","year":"2020","unstructured":"Da Zheng, Xiang Song, Chao Ma, Zeyuan Tan, Zihao Ye, Jin Dong, Hao Xiong, Zheng Zhang, and George Karypis. 2020. DGL-KE: Training knowledge graph embeddings at scale. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2020. 739\u2013748."},{"key":"e_1_3_1_55_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.108100"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3545573","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3545573","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:45Z","timestamp":1750186965000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3545573"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,6]]},"references-count":54,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,5,31]]}},"alternative-id":["10.1145\/3545573"],"URL":"https:\/\/doi.org\/10.1145\/3545573","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,6]]},"assertion":[{"value":"2022-01-26","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-06-23","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-02-06","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}