{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,19]],"date-time":"2026-07-19T09:45:28Z","timestamp":1784454328489,"version":"3.55.0"},"reference-count":49,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2025,8,28]],"date-time":"2025-08-28T00:00:00Z","timestamp":1756339200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2022YFB3103600"],"award-info":[{"award-number":["2022YFB3103600"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>Implicit emotions are often expressed through implicit and weak clues between modalities due to the lack of explicit emotional feature words, representing a significant challenge for multimodal sentiment analysis. In order to improve implicit emotion recognition, this paper proposes a multimodal sentiment analysis method that integrates KAN and the modal dynamic fusion mechanism. This method first introduces the KAN structure to construct a modal feature encoder to enhance the emotional expression ability of features. Then, the emotional contribution weight of each modality is calculated using the difference between the unimodal and multimodal sentiment scores, and the cross-attention mechanism guided by the main modality is used for feature fusion. Experiments on four datasets, CH-SIMS, CH-SIMSv2, MOSI, and MOSEI, show that the proposed method significantly outperforms the mainstream model in multiple indicators, especially when dealing with samples with implicit or ambiguous emotional expressions. The results verify the effectiveness of enhancing feature encoding capabilities and utilizing modal asymmetry information in implicit sentiment analysis.<\/jats:p>","DOI":"10.3390\/sym17091401","type":"journal-article","created":{"date-parts":[[2025,8,28]],"date-time":"2025-08-28T07:43:16Z","timestamp":1756366996000},"page":"1401","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["KD-MSA: A Multimodal Implicit Sentiment Analysis Approach Based on KAN and Asymmetric Contribution-Aware Dynamic Fusion"],"prefix":"10.3390","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-5386-7891","authenticated-orcid":false,"given":"Zhiyuan","family":"Hou","sequence":"first","affiliation":[{"name":"School of Information and Electrical Engineering, Hebei University of Engineering, Handan 056038, China"},{"name":"Information Research Center of Military Sciences, Academy of Military Sciences, Beijing 100142, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-8652-2940","authenticated-orcid":false,"given":"Qiang","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information and Electrical Engineering, Hebei University of Engineering, Handan 056038, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-7267-2509","authenticated-orcid":false,"given":"Ziwei","family":"Lei","sequence":"additional","affiliation":[{"name":"Information Research Center of Military Sciences, Academy of Military Sciences, Beijing 100142, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zheng","family":"Zeng","sequence":"additional","affiliation":[{"name":"Information Research Center of Military Sciences, Academy of Military Sciences, Beijing 100142, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ruijun","family":"Jia","sequence":"additional","affiliation":[{"name":"Information Research Center of Military Sciences, Academy of Military Sciences, Beijing 100142, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2025,8,28]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"478","DOI":"10.1109\/JSTSP.2020.2987728","article-title":"Multimodal Intelligence: Representation Learning, Information Fusion, and Applications","volume":"14","author":"Zhang","year":"2020","journal-title":"IEEE J. Sel. Top. Signal Process."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1109\/TPAMI.2018.2798607","article-title":"Multimodal Machine Learning: A Survey and Taxonomy","volume":"41","author":"Ahuja","year":"2019","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"107676","DOI":"10.1016\/j.knosys.2021.107676","article-title":"Video sentiment analysis with bimodal information-augmented multi-head attention","volume":"235","author":"Wu","year":"2022","journal-title":"Knowl.-Based Syst."},{"key":"ref_4","unstructured":"Zhang, H., Li, M., and Zhang, J. (2025). Implicit Sentiment Analysis for Chinese Texts based on Multimodal Information Fusion. Comput. Eng. Appl., 179\u2013190."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"111346","DOI":"10.1016\/j.knosys.2023.111346","article-title":"TMBL: Transformer-based multimodal binding learning model for multimodal sentiment analysis","volume":"285","author":"Huang","year":"2024","journal-title":"Knowl.-Based Syst."},{"key":"ref_6","unstructured":"Bouamor, H., Pino, J., and Bali, K. (2023, January 6\u201310). Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore."},{"key":"ref_7","unstructured":"Wang, Y., Lu, C., and Chen, Z. (2024). Multimodal sentiment analysis model with cross-modal text-information enhancement. J. Comput. Appl., 1\u201310."},{"key":"ref_8","unstructured":"Liu, Z., Wang, Y., Vaidya, S., Ruehle, F., Halverson, J., Solja\u010di\u0107, M., Hou, T.Y., and Tegmark, M. (2024). KAN: Kolmogorov-Arnold Networks. arXiv."},{"key":"ref_9","unstructured":"Barzilay, R., and Johnson, M. (2011, January 27\u201331). Literal and Metaphorical Sense Identification through Concrete and Abstract Context. Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, UK."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1016\/j.knosys.2018.11.023","article-title":"Identification of fact-implied implicit sentiment based on multi-level semantic fused representation","volume":"165","author":"Liao","year":"2019","journal-title":"Knowl.-Based Syst."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Gandy, L., Allan, N., Atallah, M., Frieder, O., Howard, N., Kanareykin, S., Koppel, M., Last, M., Neuman, Y., and Argamon, S. (2013, January 14\u201318). Automatic identification of conceptual metaphors with limited knowledge. Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, Bellevue, WA, USA. AAAI\u201913.","DOI":"10.1609\/aaai.v27i1.8648"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1016\/j.neucom.2019.11.054","article-title":"BiLSTM with Multi-Polarity Orthogonal Attention for Implicit Sentiment Analysis","volume":"383","author":"Wei","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Fu, L., and Liu, S. (2022, January 12\u201314). A Syntax-based BSGCN Model for Chinese Implicit Sentiment Analysis with Multi-classification. Proceedings of the 2022 IEEE 16th International Conference on Application of Information and Communication Technologies (AICT), Washington DC, USA.","DOI":"10.1109\/AICT55583.2022.10013562"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"7642","DOI":"10.1109\/TNNLS.2022.3219615","article-title":"DualGCN: Exploring Syntactic and Semantic Information for Aspect-Based Sentiment Analysis","volume":"35","author":"Li","year":"2024","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_15","first-page":"738","article-title":"Aspect-Level Sentiment Analysis Based on Enhanced Syntactic Information and Multi-feature Graph Convolutional Fusion","volume":"19","author":"Tian","year":"2025","journal-title":"J. Front. Comput. Sci. Technol."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zhou, X., Wan, X., and Xiao, J. (2016, January 1\u20134). Attention-based LSTM network for cross-lingual sentiment classification. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, USA.","DOI":"10.18653\/v1\/D16-1024"},{"key":"ref_17","first-page":"142","article-title":"Implicit Sentiment Analysis Based on RoBERTa Fused with BiLSTM and Attention Mechanism","volume":"58","author":"Zhang","year":"2022","journal-title":"Comput. Eng. Appl."},{"key":"ref_18","first-page":"2236","article-title":"Metaphorical Aspect Sentiment Analysis Based on RoBERTa and Attention Mechanism","volume":"44","author":"Ma","year":"2023","journal-title":"J. Chin. Comput. Syst."},{"key":"ref_19","unstructured":"Knight, K., Nenkova, A., and Rambow, O. (2016, January 12\u201317). Black Holes and White Rabbits: Metaphor Identification with Visual Features. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Zhang, D., Zhang, M., Guo, T., Peng, C., Saikrishna, V., and Xia, F. (2021, January 18\u201322). In Your Face: Sentiment Analysis of Metaphor with Facial Expressive Features. Proceedings of the 2021 International Joint Conference on Neural Networks (IJCNN), Shenzhen, China.","DOI":"10.1109\/IJCNN52387.2021.9533972"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Chen, M., Ubul, K., Xu, X., Aysa, A., and Muhammat, M. (2022). Connecting Text Classification with Image Classification: A New Preprocessing Method for Implicit Sentiment Text Classification. Sensors, 22.","DOI":"10.3390\/s22051899"},{"key":"ref_22","first-page":"309","article-title":"Emotion recognition based on visual and auditory information","volume":"57","author":"Fan","year":"2021","journal-title":"J. Nanjing Univ. (Nat. Sci.)"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"109259","DOI":"10.1016\/j.patcog.2022.109259","article-title":"TETFN: A text enhanced transformer fusion network for multimodal sentiment analysis","volume":"136","author":"Wang","year":"2023","journal-title":"Pattern Recognit."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"101958","DOI":"10.1016\/j.inffus.2023.101958","article-title":"SKEAFN: Sentiment knowledge enhanced attention fusion network for multimodal sentiment analysis","volume":"100","author":"Zhu","year":"2023","journal-title":"Inf. Fusion"},{"key":"ref_25","unstructured":"Moens, M.F., Huang, X., Specia, L., and Yih, S.W.T. (2021, January 7\u201311). Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Online and Punta Cana, Dominican Republic."},{"key":"ref_26","unstructured":"Calzolari, N., Huang, C.R., Kim, H., Pustejovsky, J., Wanner, L., Choi, K.S., Ryu, P.M., Chen, H.H., Donatelli, L., and Ji, H. (2022, January 12\u201317). AMOA: Global Acoustic Feature Enhanced Modal-Order-Aware Network for Multimodal Sentiment Analysis. Proceedings of the 29th International Conference on Computational Linguistics, Gyeongju, Republic of Korea."},{"key":"ref_27","unstructured":"Goldberg, Y., Kozareva, Z., and Zhang, Y. (2022, January 7\u201311). Multimodal Contrastive Learning via Uni-Modal Coding and Cross-Modal Prediction for Multimodal Sentiment Analysis. Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, Abu Dhabi, United Arab Emirates."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Feng, X., Lin, Y., He, L., Li, Y., Chang, L., and Zhou, Y. (2024, January 12\u201316). Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis. Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, FL, USA.","DOI":"10.18653\/v1\/2024.findings-emnlp.865"},{"key":"ref_29","first-page":"46","article-title":"Multimodal Sentiment Analysis Based on Bidirectional Mask Attention Mechanism","volume":"7","author":"Zhang","year":"2023","journal-title":"Data Anal. Knowl. Discov."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"127181","DOI":"10.1016\/j.neucom.2023.127181","article-title":"Multimodal transformer with adaptive modality weighting for multimodal sentiment analysis","volume":"572","author":"Wang","year":"2024","journal-title":"Neurocomputing"},{"key":"ref_31","unstructured":"Korhonen, A., Traum, D., and M\u00e0rquez, L. (August, January 28). Multimodal Transformer for Unaligned Multimodal Language Sequences. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"4909","DOI":"10.1109\/TMM.2022.3183830","article-title":"Cross-Modal Enhancement Network for Multimodal Sentiment Analysis","volume":"25","author":"Wang","year":"2023","journal-title":"IEEE Trans. Multimed."},{"key":"ref_33","first-page":"67","article-title":"Multimodal Sentiment Analysis Method Based on Cross-Modal Attention and Gated Unit Fusion Network","volume":"8","author":"Chen","year":"2024","journal-title":"Data Anal. Knowl. Discov."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Mengara Mengara, A.G., and Moon, Y.k. (2025). CAG-MoE: Multimodal Emotion Recognition with Cross-Attention Gated Mixture of Experts. Mathematics, 13.","DOI":"10.3390\/math13121907"},{"key":"ref_35","unstructured":"Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., and Stoyanov, V. (2019). RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Cui, Y., Che, W., Liu, T., Qin, B., Wang, S., and Hu, G. (2020, January 16\u201320). Revisiting Pre-Trained Models for Chinese Natural Language Processing. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, Online.","DOI":"10.18653\/v1\/2020.findings-emnlp.58"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Baltrusaitis, T., Zadeh, A., Lim, Y.C., and Morency, L.P. (2018, January 15\u201319). OpenFace 2.0: Facial Behavior Analysis Toolkit. Proceedings of the 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), Xi\u2019an, China.","DOI":"10.1109\/FG.2018.00019"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"18","DOI":"10.25080\/Majora-7b98e3ed-003","article-title":"librosa: Audio and music signal analysis in python","volume":"2015","author":"McFee","year":"2015","journal-title":"SciPy"},{"key":"ref_39","unstructured":"Teh, Y.W., and Titterington, M. (2010, January 13\u201315). Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna Resort, Sardinia, Italy."},{"key":"ref_40","unstructured":"Jurafsky, D., Chai, J., Schluter, N., and Tetreault, J. (2020, January 5\u201310). CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotation of Modality. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Liu, Y., Yuan, Z., Mao, H., Liang, Z., Yang, W., Qiu, Y., Cheng, T., Li, X., Xu, H., and Gao, K. (2022, January 7\u201311). Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module. Proceedings of the 2022 International Conference on Multimodal Interaction, Bengaluru, India.","DOI":"10.1145\/3536221.3556630"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1109\/MIS.2016.94","article-title":"Multimodal Sentiment Intensity Analysis in Videos: Facial Gestures and Verbal Messages","volume":"31","author":"Zadeh","year":"2016","journal-title":"IEEE Intell. Syst."},{"key":"ref_43","unstructured":"Gurevych, I., and Miyao, Y. (2018, January 15\u201320). Multimodal Language Analysis in the Wild: CMU-MOSEI Dataset and Interpretable Dynamic Fusion Graph. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Melbourne, Australia."},{"key":"ref_44","unstructured":"Palmer, M., Hwa, R., and Riedel, S. (2017, January 9\u201311). Tensor Fusion Network for Multimodal Sentiment Analysis. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark."},{"key":"ref_45","unstructured":"Gurevych, I., and Miyao, Y. (2018, January 15\u201320). Efficient Low-rank Multimodal Fusion With Modality-Specific Factors. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Melbourne, Australia."},{"key":"ref_46","unstructured":"Hazarika, D., Zimmermann, R., and Poria, S. (2020, January 12\u201316). MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis. Proceedings of the 28th ACM International Conference on Multimedia, Seattle, WA, USA. MM \u201920."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Han, W., Chen, H., Gelbukh, A., Zadeh, A., Morency, L.p., and Poria, S. (2021, January 18\u201322). Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment Analysis. Proceedings of the 2021 International Conference on Multimodal Interaction, Montr\u00e9al, QC, Canada. ICMI \u201921.","DOI":"10.1145\/3462244.3479919"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Sun, H., Wang, H., Liu, J., Chen, Y.W., and Lin, L. (2022, January 10\u201314). CubeMLP: An MLP-based Model for Multimodal Sentiment Analysis and Depression Estimation. Proceedings of the 30th ACM International Conference on Multimedia, Lisboa, Portugal. MM\u201922.","DOI":"10.1145\/3503161.3548025"},{"key":"ref_49","unstructured":"Basile, V., Kozareva, Z., and Stajner, S. (2022, January 22\u201327). M-SENA: An Integrated Platform for Multimodal Sentiment Analysis. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Dublin, Ireland."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/17\/9\/1401\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:34:08Z","timestamp":1760034848000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/17\/9\/1401"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,28]]},"references-count":49,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2025,9]]}},"alternative-id":["sym17091401"],"URL":"https:\/\/doi.org\/10.3390\/sym17091401","relation":{},"ISSN":["2073-8994"],"issn-type":[{"value":"2073-8994","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,28]]}}}