{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,14]],"date-time":"2026-04-14T16:32:18Z","timestamp":1776184338396,"version":"3.50.1"},"reference-count":35,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2025,8,7]],"date-time":"2025-08-07T00:00:00Z","timestamp":1754524800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Gansu Province Higher Education Institutions Industrial Support Program","award":["2020C-29"],"award-info":[{"award-number":["2020C-29"]}]},{"name":"Gansu Province Higher Education Institutions Industrial Support Program","award":["61562002"],"award-info":[{"award-number":["61562002"]}]},{"name":"National Natural Science Foundation of China","award":["2020C-29"],"award-info":[{"award-number":["2020C-29"]}]},{"name":"National Natural Science Foundation of China","award":["61562002"],"award-info":[{"award-number":["61562002"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>In recent years, the proliferation of fake news and misinformation has grown exponentially, far surpassing that of genuine news and posing a serious threat to social stability. Existing research in fake news detection primarily applies contrastive learning methods with a single-hot labeling strategy. The issue does not lie with contrastive learning as a technique but with its current application in fake news detection systems. Specifically, these systems penalize all negative samples equally due to the use of single-hot labeling, thus overlooking the underlying semantic relationships among negative samples. As a result, contrastive learning models tend to learn from simple samples while neglecting highly deceptive samples located at the boundary between true and false, as well as the heterogeneity of text-image features, which complicates cross-modal fusion. To mitigate these known limitations in current applications, this paper proposes a fake news detection method based on contrastive learning and cross-modal interaction. First, a consistency-aware soft-label contrastive learning mechanism based on semantic similarity is designed to provide more granular supervision signals for contrastive learning. Secondly, a difficult negative sample mining strategy based on a similarity matrix is designed to optimize the symmetry alignment of image and text features, which effectively improves the model\u2019s ability to discriminate boundary samples. To further optimize the feature fusion process, a cross-modal interaction module is designed to learn the symmetric interaction relationship between image and text features. Finally, an attention mechanism is designed to adaptively adjust the contributions of text-image features and interaction features, forming the final multimodal feature representation. Experiments are conducted on two major social media platform datasets, and compared with existing methods, the proposed method effectively improves the detection capability of fake news.<\/jats:p>","DOI":"10.3390\/sym17081260","type":"journal-article","created":{"date-parts":[[2025,8,7]],"date-time":"2025-08-07T08:33:06Z","timestamp":1754555586000},"page":"1260","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Fake News Detection Based on Contrastive Learning and Cross-Modal Interaction"],"prefix":"10.3390","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-8441-9534","authenticated-orcid":false,"given":"Zhenxiang","family":"He","sequence":"first","affiliation":[{"name":"School of Cyberspace Security, Gansu University of Political Science and Law, No. 6 Anning West Road, Lanzhou 730070, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-0133-2129","authenticated-orcid":false,"given":"Hanbin","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Cyberspace Security, Gansu University of Political Science and Law, No. 6 Anning West Road, Lanzhou 730070, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-9078-1341","authenticated-orcid":false,"given":"Le","family":"Li","sequence":"additional","affiliation":[{"name":"School of Cyberspace Security, Gansu University of Political Science and Law, No. 6 Anning West Road, Lanzhou 730070, China"}]}],"member":"1968","published-online":{"date-parts":[[2025,8,7]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1007\/s13735-023-00296-3","article-title":"A comprehensive survey of multimodal fake news detection techniques: Advances, challenges, and opportunities","volume":"12","author":"Tufchi","year":"2023","journal-title":"Int. J. Multimed. Inf. Retr."},{"key":"ref_2","first-page":"397","article-title":"Research and Outlook on Technology for Detecting False Information on the Internet","volume":"53","author":"Wang","year":"2022","journal-title":"\u592a\u539f\u7406\u5de5\u5927\u5b66\u5b66\u62a5"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Su, J., Cardie, C., and Nakov, P. (2023). Adapting fake news detection to the era of large language models. arXiv.","DOI":"10.18653\/v1\/2024.findings-naacl.95"},{"key":"ref_4","first-page":"489","article-title":"A Review of Deep Learning Research on Cross-Modal Image Retrieval","volume":"16","author":"Liu","year":"2022","journal-title":"Comput. Sci. Explor."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1146","DOI":"10.1126\/science.aap9559","article-title":"The spread of true and false news online","volume":"359","author":"Vosoughi","year":"2018","journal-title":"Science"},{"key":"ref_6","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017). Attention is all you need. Adv. Neural Inf. Process. Syst., 30."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"21503","DOI":"10.1007\/s00521-021-06086-4","article-title":"Predicting image credibility in fake news over social media using multi-modal approach","volume":"34","author":"Singh","year":"2022","journal-title":"Neural Comput. Appl."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"5153","DOI":"10.1007\/s40747-024-01413-3","article-title":"Clip-GCN: An adaptive detection model for multimodal emergent fake news domains","volume":"10","author":"Zhou","year":"2024","journal-title":"Complex Intell. Syst."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Liu, Y., Liu, Y., Li, Z., Yao, R., Zhang, Y., and Wang, D. (May, January 28). Modality interactive mixture-of-experts for fake news detection. Proceedings of the ACM on Web Conference 2025, Sydney, Australia.","DOI":"10.1145\/3696410.3714522"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Lai, H., and Nissim, M. (2024). mCoT: Multilingual instruction tuning for reasoning consistency in language models. arXiv.","DOI":"10.18653\/v1\/2024.acl-long.649"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"102160","DOI":"10.1016\/j.jksuci.2024.102160","article-title":"Sard: Fake news detection based on clip contrastive learning and multimodal semantic alignment","volume":"36","author":"Yan","year":"2024","journal-title":"J. King Saud-Univ.-Comput. Inf. Sci."},{"key":"ref_12","unstructured":"Li, J., Li, D., Xiong, C., and Hoi, S. (2022, January 17\u201323). Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation. Proceedings of the International Conference on Machine Learning, Baltimore, MD, USA."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Yang, J., Li, C., Zhang, P., Xiao, B., Liu, C., Yuan, L., and Gao, J. (2022, January 18\u201324). Unified contrastive learning in image-text-label space. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01857"},{"key":"ref_14","first-page":"32897","article-title":"Vlmo: Unified vision-language pre-training with mixture-of-modality-experts","volume":"35","author":"Bao","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"102","DOI":"10.11834\/jig.220556","article-title":"Advances in Image Fusion Technology in the Era of Deep Learning","volume":"28","author":"Yifan","year":"2023","journal-title":"Chin. J. Image Graph."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"4649","DOI":"10.1109\/TCSS.2022.3177359","article-title":"Detecting and mitigating the dissemination of fake news: Challenges and future research opportunities","volume":"11","author":"Shahid","year":"2022","journal-title":"IEEE Trans. Comput. Soc. Syst."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1002\/pra2.2015.145052010082","article-title":"Automatic deception detection: Methods for finding fake news","volume":"52","author":"Conroy","year":"2015","journal-title":"Proc. Assoc. Inf. Sci. Technol."},{"key":"ref_19","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019). Bert: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Wang, Y., Ma, F., Jin, Z., Yuan, Y., Xun, G., Jha, K., Su, L., and Gao, J. (2018, January 19\u201323). Eann: Event adversarial neural networks for multi-modal fake news detection. Proceedings of the 24th ACM Sigkdd International Conference on Knowledge Discovery & Data Mining, London, UK.","DOI":"10.1145\/3219819.3219903"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Khattar, D., Goud, J.S., Gupta, M., and Varma, V. (2019, January 13\u201317). Mvae: Multimodal variational autoencoder for fake news detection. Proceedings of the World Wide Web Conference, San Francisco, CA, USA.","DOI":"10.1145\/3308558.3313552"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wu, Y., Zhan, P., Zhang, Y., Wang, L., and Xu, Z. (2021, January 1\u20136). Multimodal fusion with co-attention networks for fake news detection. Proceedings of the Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, Online Event.","DOI":"10.18653\/v1\/2021.findings-acl.226"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Chen, Y., Li, D., Zhang, P., Sui, J., Lv, Q., Tun, L., and Shang, L. (2022, January 25\u201329). Cross-modal ambiguity learning for multimodal fake news detection. Proceedings of the ACM Web Conference 2022, Virtual Event.","DOI":"10.1145\/3485447.3511968"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Singhal, S., Pandey, T., Mrig, S., Shah, R.R., and Kumaraguru, P. (2022, January 25\u201329). Leveraging intra and inter modality relationship for multimodal fake news detection. Proceedings of the Companion Web Conference 2022, Virtual Event.","DOI":"10.1145\/3487553.3524650"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"5851","DOI":"10.1007\/s40747-024-01473-5","article-title":"Multimodal fake news detection through intra-modality feature aggregation and inter-modality semantic fusion","volume":"10","author":"Zhu","year":"2024","journal-title":"Complex Intell. Syst."},{"key":"ref_26","unstructured":"Zhang, M., Chang, K., and Wu, Y. (2024). Multi-modal Semantic Understanding with Contrastive Cross-modal Feature Alignment. arXiv."},{"key":"ref_27","unstructured":"Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., and Clark, J. (2021, January 18\u201324). Learning transferable visual models from natural language supervision. Proceedings of the International Conference on Machine Learning, Virtual."},{"key":"ref_28","unstructured":"Jia, C., Yang, Y., Xia, Y., Chen, Y.T., Parekh, Z., Pham, H., Le, Q., Sung, Y.H., Li, Z., and Duerig, T. (2021, January 18\u201324). Scaling up visual and vision-language representation learning with noisy text supervision. Proceedings of the International Conference on Machine Learning, Virtual."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Zhou, Y., Yang, Y., Ying, Q., Qian, Z., and Zhang, X. (2023, January 10\u201314). Multimodal fake news detection via clip-guided learning. Proceedings of the 2023 IEEE International Conference on Multimedia and Expo (ICME), Brisbane, Australia.","DOI":"10.1109\/ICME55011.2023.00480"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1007\/s13278-024-01267-0","article-title":"GraMuFeN: Graph-based multi-modal fake news detection in social media","volume":"14","author":"Kananian","year":"2024","journal-title":"Soc. Netw. Anal. Min."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"101944","DOI":"10.1016\/j.inffus.2023.101944","article-title":"MFIR: Multimodal fusion and inconsistency reasoning for explainable fake news detection","volume":"100","author":"Wu","year":"2023","journal-title":"Inf. Fusion"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1109\/TPAMI.2022.3152247","article-title":"A survey on vision transformer","volume":"45","author":"Han","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1007\/s13735-017-0143-x","article-title":"Detection and visualization of misleading content on Twitter","volume":"7","author":"Boididou","year":"2018","journal-title":"Int. J. Multimed. Inf. Retr."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ma, J., Gao, W., and Wong, K.F. (2017). Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning, Association for Computational Linguistics.","DOI":"10.18653\/v1\/P17-1066"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"103822","DOI":"10.1016\/j.csi.2023.103822","article-title":"MRAN: Multimodal relationship-aware attention network for fake news detection","volume":"89","author":"Yang","year":"2024","journal-title":"Comput. Stand. Interfaces"}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/17\/8\/1260\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:25:10Z","timestamp":1760034310000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/17\/8\/1260"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,7]]},"references-count":35,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2025,8]]}},"alternative-id":["sym17081260"],"URL":"https:\/\/doi.org\/10.3390\/sym17081260","relation":{},"ISSN":["2073-8994"],"issn-type":[{"value":"2073-8994","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,7]]}}}