{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T04:53:44Z","timestamp":1778648024504,"version":"3.51.4"},"reference-count":71,"publisher":"Association for Computing Machinery (ACM)","issue":"6","funder":[{"DOI":"10.13039\/501100018537","name":"National Science and Technology Major Project","doi-asserted-by":"crossref","award":["2022ZD0117104"],"award-info":[{"award-number":["2022ZD0117104"]}],"id":[{"id":"10.13039\/501100018537","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2025,6,30]]},"abstract":"<jats:p>\n            Multimodal sentiment analysis leverages information from multiple sensors to achieve a comprehensive interpretation of emotions. However, different modalities do not always boost each other as expected. They compete with each other, leading to some modalities being under-optimized during the training process. To address this issue, we propose\n            <jats:bold>Adaptive Gradient Scaling with Sparse Mixture-of-Experts (AGS-SMoE)<\/jats:bold>\n            . We first discuss the issue of modal preemption in unified multimodal learning from the perspective of causal preemption. Driven by actual cause, we use the gradient norms from different encoders at two fusion stages as evidence, estimating the current modal preemption state using a parameter-free method. Then, based on the dynamic preemption factor, we design a gradient scaling method to balance optimization for different encoders. Furthermore, we use Mixture-of-Experts to sparsify and perceive multimodal tokens in different preemption states. As a result, our experiments on four multimodal sentiment analysis datasets have achieved state-of-the-art results. Moreover, our method improves modal representation learning at different stages. Extensive experiments confirm that our method can alleviate the modal preemption problem in a plug-and-play manner. Our code is available at\n            <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/TheShy-Dream\/AGS-SMoE\">https:\/\/github.com\/TheShy-Dream\/AGS-SMoE<\/jats:ext-link>\n            .\n          <\/jats:p>","DOI":"10.1145\/3736415","type":"journal-article","created":{"date-parts":[[2025,5,20]],"date-time":"2025-05-20T13:06:27Z","timestamp":1747746387000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["Actual Cause-Guided Adaptive Gradient Scaling for Balanced Multimodal Sentiment Analysis"],"prefix":"10.1145","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-0794-1204","authenticated-orcid":false,"given":"Jili","family":"Chen","sequence":"first","affiliation":[{"name":"Zhejiang Key Laboratory of Intelligent Education Technology and Application, College of Education, Zhejiang Normal University, Jinhua, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5041-6093","authenticated-orcid":false,"given":"Qionghao","family":"Huang","sequence":"additional","affiliation":[{"name":"Zhejiang Key Laboratory of Intelligent Education Technology and Application, College of Education, Zhejiang Normal University, Jinhua, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1371-2608","authenticated-orcid":false,"given":"Changqin","family":"Huang","sequence":"additional","affiliation":[{"name":"College of Education, Zhejiang University, Hangzhou, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6084-1851","authenticated-orcid":false,"given":"Xiaodi","family":"Huang","sequence":"additional","affiliation":[{"name":"School of Computing, Mathematics and Engineering, Charles Sturt University, Albury, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,7,8]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-7908-2604-3_16"},{"issue":"2","key":"e_1_3_1_3_2","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1007\/BF00354164","article-title":"Causal preemption and counterfactuals","volume":"37","author":"Bunzl Martin","year":"1980","unstructured":"Martin Bunzl. 1980. Causal preemption and counterfactuals. Philosophical Studies 37, 2 (1980), 115\u2013124.","journal-title":"Philosophical Studies"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3586075"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6853739"},{"key":"e_1_3_1_6_2","first-page":"8632","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Du Chenzhuang","year":"2023","unstructured":"Chenzhuang Du, Jiaye Teng, Tingle Li, Yichen Liu, Tianyuan Yuan, Yue Wang, Yang Yuan, and Hang Zhao. 2023. On uni-modal feature learning in supervised multi-modal learning. In Proceedings of the International Conference on Machine Learning. PMLR, 8632\u20138656."},{"issue":"1","key":"e_1_3_1_7_2","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1093\/bjps\/35.1.55","article-title":"Probabilistic causality and preemption","volume":"35","author":"Ehring Douglas","year":"1984","unstructured":"Douglas Ehring. 1984. Probabilistic causality and preemption. The British Journal for the Philosophy of Science 35, 1 (1984), 55\u201357.","journal-title":"The British Journal for the Philosophy of Science"},{"issue":"1","key":"e_1_3_1_8_2","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1007\/BF00873194","article-title":"Preemption, direct causation, and identity","volume":"85","author":"Ehring Douglas","year":"1990","unstructured":"Douglas Ehring. 1990. Preemption, direct causation, and identity. Synthese 85, 1 (1990), 55\u201370.","journal-title":"Synthese"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01918"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2024.120682"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2022.09.025"},{"key":"e_1_3_1_12_2","volume-title":"Proceedings of the 38th Annual Conference on Neural Information Processing Systems","author":"Guo Zirun","year":"2024","unstructured":"Zirun Guo, Tao Jin, Jingyuan Chen, and Zhou Zhao. 2024. Classifier-guided gradient modulation for enhanced multimodal learning. In Proceedings of the 38th Annual Conference on Neural Information Processing Systems."},{"key":"e_1_3_1_13_2","first-page":"6","volume-title":"Proceedings of the 2021 International Conference on Multimodal Interaction","author":"Han Wei","unstructured":"Wei Han, Hui Chen, Alexander Gelbukh, Amir Zadeh, Louis-Philippe Morency, and Soujanya Poria. 2021. Bi-bimodal modality fusion for correlation-controlled multimodal sentiment analysis. In Proceedings of the 2021 International Conference on Multimodal Interaction, 6\u201315."},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.723"},{"key":"e_1_3_1_15_2","doi-asserted-by":"crossref","first-page":"2046","DOI":"10.18653\/v1\/D19-1211","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Hasan Md Kamrul","year":"2019","unstructured":"Md Kamrul Hasan, Wasifur Rahman, AmirAli Bagher Zadeh, Jianyuan Zhong, Md Iftekhar Tanveer, Louis Philippe Morency, and Mohammed Ehsan Hoque. 2019. UR-FUNNY: A multimodal language dataset for understanding humor. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2046\u20132056."},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413678"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.534"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2023.110502"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2023.111346"},{"issue":"4","key":"e_1_3_1_21_2","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1007\/s00530-024-01421-w","article-title":"Text-centered cross-sample fusion network for multimodal sentiment analysis","volume":"30","author":"Huang Qionghao","year":"2024","unstructured":"Qionghao Huang, Jili Chen, Changqin Huang, Xiaodi Huang, and Yi Wang. 2024. Text-centered cross-sample fusion network for multimodal sentiment analysis. Multimedia Systems 30, 4 (2024), 228.","journal-title":"Multimedia Systems"},{"key":"e_1_3_1_22_2","first-page":"9226","volume-title":"Proceedings of the International Conference on Machine Learning. PMLR","author":"Huang Yu","year":"2022","unstructured":"Yu Huang, Junyang Lin, Chang Zhou, Hongxia Yang, and Longbo Huang. 2022. Modality competition: What makes joint training of multi-modal network fail in deep learning? (provably). In Proceedings of the International Conference on Machine Learning. PMLR, 9226\u20139259."},{"key":"e_1_3_1_23_2","first-page":"16006","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Huo Fushuo","year":"2024","unstructured":"Fushuo Huo, Wenchao Xu, Jingcai Guo, Haozhao Wang, and Song Guo. 2024. C2KD: Bridging the modality gap for cross-modal knowledge distillation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 16006\u201316015."},{"key":"e_1_3_1_24_2","unstructured":"Albert Q. Jiang Alexandre Sablayrolles Arthur Mensch Chris Bamford Devendra Singh Chaplot Diego de las Casas Florian Bressand Gianna Lengyel Guillaume Lample Lucile Saulnier et al. 2023. Mistral 7B. arXiv:2310.06825. Retrieved from https:\/\/arxiv.org\/abs\/2310.06825"},{"key":"e_1_3_1_25_2","first-page":"4171","volume-title":"Proceedings of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Jacob Devlin","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 4171\u20134186."},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2022.11.022"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.02030"},{"key":"e_1_3_1_28_2","unstructured":"Jiachen Li Xinyao Wang Sijie Zhu Chia-Wen Kuo Lu Xu Fan Chen Jitesh Jain Humphrey Shi and Longyin Wen. 2024. Cumo: Scaling multimodal LLM with co-upcycled mixture-of-experts. arXiv:2405.05949. Retrieved from https:\/\/arxiv.org\/abs\/2405.05949"},{"key":"e_1_3_1_29_2","unstructured":"Yunxin Li Shenyuan Jiang Baotian Hu Longyue Wang Wanqi Zhong Wenhan Luo Lin Ma and Min Zhang. 2024. Uni-MoE: Scaling unified multimodal LLMs with mixture of experts. arXiv:2405.11273. Retrieved from https:\/\/arxiv.org\/abs\/2405.11273"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2023.101891"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2024.124236"},{"key":"e_1_3_1_32_2","unstructured":"Bin Lin Zhenyu Tang Yang Ye Jiaxi Cui Bin Zhu Peng Jin Junwu Zhang Munan Ning and Li Yuan. 2024. MoE-LLaVA: Mixture of experts for large vision-language models. arXiv:2401.15947. Retrieved from https:\/\/arxiv.org\/abs\/2401.15947"},{"key":"e_1_3_1_33_2","first-page":"150","volume-title":"Proceedings of the 2024 16th International Conference on Advanced Computational Intelligence (ICACI)","author":"Liu Dahuang","year":"2024","unstructured":"Dahuang Liu, Zhenguo Yang, and Zhiwei Guo. 2024. Progressive fusion network with mixture of experts for multimodal sentiment analysis. In Proceedings of the 2024 16th International Conference on Advanced Computational Intelligence (ICACI). IEEE, 150\u2013157."},{"key":"e_1_3_1_34_2","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics","author":"Liu Zhun","year":"2018","unstructured":"Zhun Liu and Ying Shen. 2018. Efficient low-rank multimodal fusion with modality-specific factors. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics."},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2024.112011"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2023.126836"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2022.3172360"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-demo.20"},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-012-9338-y"},{"issue":"417","key":"e_1_3_1_40_2","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1093\/mind\/105.417.85","article-title":"Probabilistic causation and the pre-emption problem","volume":"105","author":"Menzies Peter","year":"1996","unstructured":"Peter Menzies. 1996. Probabilistic causation and the pre-emption problem. Mind 105, 417 (1996), 85\u2013117.","journal-title":"Mind"},{"key":"e_1_3_1_41_2","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-85729-997-0","volume-title":"Visual Analysis of Humans","author":"Moeslund Thomas B.","year":"2011","unstructured":"Thomas B. Moeslund, Adrian Hilton, Volker Kr\u00fcger, and Leonid Sigal. 2011. Visual Analysis of Humans. Springer."},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/79.543975"},{"key":"e_1_3_1_43_2","first-page":"9564","article-title":"Multimodal contrastive learning with limoe: The language-image mixture of experts","volume":"35","author":"Mustafa Basil","year":"2022","unstructured":"Basil Mustafa, Carlos Riquelme, Joan Puigcerver, Rodolphe Jenatton, and Neil Houlsby. 2022. Multimodal contrastive learning with limoe: The language-image mixture of experts. Advances in Neural Information Processing Systems 35 (2022), 9564\u20139576.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_44_2","unstructured":"M. Z. Naser. 2022. Causality causal discovery and causal inference in structural engineering. arXiv:2204.01543. Retrieved from https:\/\/arxiv.org\/abs\/2204.01543"},{"key":"e_1_3_1_45_2","volume-title":"Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference","author":"Judea Pearl","year":"1988","unstructured":"Judea Pearl. 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann."},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511803161"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00806"},{"key":"e_1_3_1_48_2","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"Rahman Wasifur","unstructured":"Wasifur Rahman, Md Kamrul Hasan, Sangwu Lee, AmirAli Bagher Zadeh, Chengfeng Mao, Louis-Philippe Morency, and Ehsan Hoque. 2020. Integrating multimodal information in large pretrained transformers. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics."},{"key":"e_1_3_1_49_2","unstructured":"Noam Shazeer Azalia Mirhoseini Krzysztof Maziarz Andy Davis Quoc Le Geoffrey Hinton and Jeff Dean. 2017. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. arXiv:1701.06538. Retrieved from https:\/\/arxiv.org\/abs\/1701.06538"},{"key":"e_1_3_1_50_2","unstructured":"Li Shen Anke Tang Enneng Yang Guibing Guo Yong Luo Lefei Zhang Xiaochun Cao Bo Du and Dacheng Tao. 2024. Efficient and effective weight-ensembling mixture of experts for multi-task model merging. arXiv:2410.21804. Retrieved from https:\/\/arxiv.org\/abs\/2410.21804"},{"key":"e_1_3_1_51_2","doi-asserted-by":"crossref","first-page":"11329","DOI":"10.18653\/v1\/2023.findings-emnlp.758","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics (EMNLP \u201923)","author":"Shen Sheng","year":"2023","unstructured":"Sheng Shen, Zhewei Yao, Chunyuan Li, Trevor Darrell, Kurt Keutzer, and Yuxiong He. 2023. Scaling vision-language models with sparse mixture of experts. In Proceedings of the Findings of the Association for Computational Linguistics (EMNLP \u201923), 11329\u201311344."},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2023.111149"},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/3507918"},{"key":"e_1_3_1_54_2","first-page":"6558","volume-title":"Proceedings of the Conference on Association for Computational Linguistics","volume":"2019","author":"Tsai Yao-Hung Hubert","year":"2019","unstructured":"Yao-Hung Hubert Tsai, Shaojie Bai, Paul Pu Liang, J. Zico Kolter, Louis-Philippe Morency, and Ruslan Salakhutdinov. 2019. Multimodal transformer for unaligned multimodal language sequences. In Proceedings of the Conference on Association for Computational Linguistics. NIH Public Access, Vol. 2019, 6558."},{"key":"e_1_3_1_55_2","first-page":"11","article-title":"Visualizing data using t-SNE","volume":"9","author":"Van der Maaten Laurens","year":"2008","unstructured":"Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9 (2008), 11.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_1_56_2","article-title":"Attention is all you need","volume":"30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 30.","journal-title":"Proceedings of the Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_57_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2022.109259"},{"key":"e_1_3_1_58_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01271"},{"key":"e_1_3_1_59_2","doi-asserted-by":"crossref","unstructured":"Thomas Winterbottom Sarah Xiao Alistair McLean and Noura Al Moubayed. 2020. On modality bias in the tvqa dataset. arXiv:2012.10210. Retrieved from https:\/\/arxiv.org\/abs\/2012.10210","DOI":"10.5244\/C.34.122"},{"key":"e_1_3_1_60_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.107676"},{"key":"e_1_3_1_61_2","unstructured":"Fuzhao Xue Zian Zheng Yao Fu Jinjie Ni Zangwei Zheng Wangchunshu Zhou and Yang You. 2024. Openmoe: An early effort on open mixture-of-experts language models. arXiv:2402.01739. Retrieved from https:\/\/arxiv.org\/abs\/2402.01739"},{"key":"e_1_3_1_62_2","unstructured":"Shu Yang Muhammad Asif Ali Cheng-Long Wang Lijie Hu and Di Wang. 2024. MoRAL: MoE augmented LoRA for LLMs\u2019 lifelong learning. arXiv:2402.11260. Retrieved from https:\/\/arxiv.org\/abs\/2402.11260"},{"key":"e_1_3_1_63_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.343"},{"key":"e_1_3_1_64_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i12.17289"},{"key":"e_1_3_1_65_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1115"},{"key":"e_1_3_1_66_2","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"32","author":"Zadeh Amir","unstructured":"Amir Zadeh, Paul Pu Liang, Navonil Mazumder, Soujanya Poria, Erik Cambria, and Louis-Philippe Morency. 2018. Memory fusion network for multi-view sequential learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32."},{"key":"e_1_3_1_67_2","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2016.94"},{"key":"e_1_3_1_68_2","first-page":"2236","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics","author":"Zadeh AmirAli Bagher","year":"2018","unstructured":"AmirAli Bagher Zadeh, Paul Pu Liang, Soujanya Poria, Erik Cambria, and Louis-Philippe Morency. 2018. Multimodal language analysis in the wild: CMU-MOSEI dataset and interpretable dynamic fusion graph. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2236\u20132246."},{"key":"e_1_3_1_69_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.02592"},{"key":"e_1_3_1_70_2","first-page":"263","volume-title":"Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention","author":"Zhang Yupei","year":"2024","unstructured":"Yupei Zhang, Xiaofei Wang, Fangliangzi Meng, Jin Tang, and Chao Li. 2024. Knowledge-driven subspace fusion and gradient coordination for multi-modal learning. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 263\u2013273."},{"key":"e_1_3_1_71_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2023.101958"},{"key":"e_1_3_1_72_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2023.02.028"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3736415","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,8]],"date-time":"2025-07-08T17:07:32Z","timestamp":1751994452000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3736415"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,30]]},"references-count":71,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2025,6,30]]}},"alternative-id":["10.1145\/3736415"],"URL":"https:\/\/doi.org\/10.1145\/3736415","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,6,30]]},"assertion":[{"value":"2024-11-20","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-05-01","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-07-08","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}