{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T05:41:40Z","timestamp":1775886100057,"version":"3.50.1"},"reference-count":53,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2025,1,20]],"date-time":"2025-01-20T00:00:00Z","timestamp":1737331200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62162033, 62462038, U21B2027"],"award-info":[{"award-number":["62162033, 62462038, U21B2027"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Yunnan Provincial Major Science and Technology Special Plan","award":["202303AP140008, 202402AD080001"],"award-info":[{"award-number":["202303AP140008, 202402AD080001"]}]},{"name":"Yunnan Foundation Research Projects","award":["202101AT070438, 202101BE070001-056"],"award-info":[{"award-number":["202101AT070438, 202101BE070001-056"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2025,1,31]]},"abstract":"<jats:p>\n            Multilingual document clustering (MDC) aims to partition multilingual documents into distinct clusters based on topic categories in an unsupervised manner. However, existing MDC methods still suffer from several limitations in practice tasks. Firstly, most of them optimize multiple objectives within the same feature space, thereby leading to the conflict between learning consistently shared semantics and reconstructing inconsistent view-specific information. Secondly, several methods directly integrate information from multilingual documents during the fusion stage, thereby overlooking the semantic differences between different language features. To address the aforementioned problems, we propose a novel multi-view learning method, called Semantic Feature Graph Consistency with Contrastive Cluster Assignments (SFGC\n            <jats:sup>3<\/jats:sup>\n            A), for MDC. Specifically, the proposed SFGC\n            <jats:sup>3<\/jats:sup>\n            A method implements consistency objective and reconstruction objective in different feature spaces, thus effectively avoiding conflicts between consistency learning and inconsistency reconstruction. Subsequently, we design the semantic feature graph consistency and semantic label consistency modules to further explore consistent semantic information among multilingual documents, thereby reducing the semantic differences among different language views. Extensive experiments on several multilingual document datasets have shown the effectiveness of the proposed SFGC\n            <jats:sup>3<\/jats:sup>\n            A method in MDC tasks. The source codes for this work will be released later.\n          <\/jats:p>","DOI":"10.1145\/3708887","type":"journal-article","created":{"date-parts":[[2024,12,19]],"date-time":"2024-12-19T11:12:11Z","timestamp":1734606731000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Semantic Feature Graph Consistency with Contrastive Cluster Assignments for Multilingual Document Clustering"],"prefix":"10.1145","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-0768-0363","authenticated-orcid":false,"given":"Teng","family":"Sun","sequence":"first","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5737-3383","authenticated-orcid":false,"given":"Zhenqiu","family":"Shu","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1277-6212","authenticated-orcid":false,"given":"Yuxin","family":"Huang","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2176-2998","authenticated-orcid":false,"given":"Hongbin","family":"Wang","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8952-8984","authenticated-orcid":false,"given":"Zhengtao","family":"Yu","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China"}]}],"member":"320","published-online":{"date-parts":[[2025,1,20]]},"reference":[{"key":"e_1_3_1_2_2","first-page":"19","volume-title":"Proceedings of the IEEE International Conference on Data Mining","volume":"4","author":"Bickel Steffen","year":"2004","unstructured":"Steffen Bickel and Tobias Scheffer. 2004. Multi-view clustering. In Proceedings of the IEEE International Conference on Data Mining. Vol. 4, Citeseer, 19\u201326."},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298657"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/TAI.2021.3065894"},{"key":"e_1_3_1_5_2","volume-title":"Proceedings of the 18th International Conference on Computational Linguistics","author":"Chen Hsin-Hsi","year":"2000","unstructured":"Hsin-Hsi Chen and Chuan-Jie Lin. 2000. A muitilingual news summarizer. In Proceedings of the 18th International Conference on Computational Linguistics (COLING 2000). Vol. 1."},{"key":"e_1_3_1_6_2","doi-asserted-by":"crossref","unstructured":"Jie Chen Hua Mao Wai Lok Woo and Xi Peng. 2023. Deep multiview clustering by contrasting cluster assignments. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV). 16706\u201316715.","DOI":"10.1109\/ICCV51070.2023.01536"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41019-022-00190-8"},{"key":"e_1_3_1_8_2","first-page":"1597","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Chen Ting","year":"2020","unstructured":"Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A simple framework for contrastive learning of visual representations. In Proceedings of the International Conference on Machine Learning. PMLR, 1597\u20131607."},{"key":"e_1_3_1_9_2","first-page":"2973","volume-title":"Proceedings of the 29th International Conference on International Joint Conferences on Artificial Intelligence","author":"Cheng Jiafeng","year":"2021","unstructured":"Jiafeng Cheng, Qianqian Wang, Zhiqiang Tao, Deyan Xie, and Quanxue Gao. 2021. Multi-view attribute graph convolution networks for clustering. In Proceedings of the 29th International Conference on International Joint Conferences on Artificial Intelligence. 2973\u20132979."},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41019-021-00159-z"},{"key":"e_1_3_1_11_2","doi-asserted-by":"crossref","first-page":"1275","DOI":"10.1145\/1989323.1989474","volume-title":"Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data","author":"Flaounas Ilias","year":"2011","unstructured":"Ilias Flaounas, Omar Ali, Marco Turchi, Tristan Snowsill, Florent Nicart, Tijl De Bie, and Nello Cristianini. 2011. NOAM: News outlets analysis and monitoring system. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data. 1275\u20131278."},{"key":"e_1_3_1_12_2","first-page":"4116","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Hassani Kaveh","year":"2020","unstructured":"Kaveh Hassani and Amir Hosein Khasahmadi. 2020. Contrastive multi-view representation learning on graphs. In Proceedings of the International Conference on Machine Learning. PMLR, 4116\u20134126."},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-017-4838-z"},{"key":"e_1_3_1_14_2","doi-asserted-by":"crossref","unstructured":"Bruno Pouliquen Ralf Steinberger Camelia Ignat Emilia K\u00e4spers and Irina Temnikova. 2004. Multilingual and cross-lingual news topic tracking. In Proceedings of the 20th International Conference on Computational Linguistics (COLING\u201904). 959\u2013965.","DOI":"10.3115\/1220355.1220493"},{"key":"e_1_3_1_15_2","article-title":"A clustering-guided contrastive fusion for multi-view representation learning","author":"Ke Guanzhou","year":"2023","unstructured":"Guanzhou Ke, Guoqing Chao, Xiaoli Wang, Chenyang Xu, Yongqi Zhu, and Yang Yu. 2023. A clustering-guided contrastive fusion for multi-view representation learning. IEEE Transactions on Circuits and Systems for Video Technology 34, 4 (2023), 2056\u20132069.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"e_1_3_1_16_2","first-page":"1945","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Kusner Matt J.","year":"2017","unstructured":"Matt J. Kusner, Brooks Paige, and Jos\u00e9 Miguel Hern\u00e1ndez-Lobato. 2017. Grammar variational autoencoder. In Proceedings of the International Conference on Machine Learning. PMLR, 1945\u20131954."},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2023.126521"},{"key":"e_1_3_1_18_2","volume-title":"Proceedings of the International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019","author":"Li Zhaoyang","year":"2019","unstructured":"Zhaoyang Li, Qianqian Wang, Zhiqiang Tao, Quanxue Gao, Zhaohua Yang2019. Deep adversarial multi-view clustering network. In Proceedings of the International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019, Sarit Kraus (Ed.). ijcai.org, 2952\u20132958."},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2020.10.021"},{"issue":"4","key":"e_1_3_1_20_2","first-page":"4447","article-title":"Dual contrastive prediction for incomplete multi-view representation learning","volume":"45","author":"Lin Yijie","year":"2022","unstructured":"Yijie Lin, Yuanbiao Gou, Xiaotian Liu, Jinfeng Bai, Jiancheng Lv, and Xi Peng. 2022. Dual contrastive prediction for incomplete multi-view representation learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 4 (2022), 4447\u20134461.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01102"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","unstructured":"Chengliang Liu Jie Wen Zhihao Wu Xiaoling Luo Chao Huang and Yong Xu. 2023. Information recovery-driven deep incomplete multi-view clustering network. IEEE Transactions on Neural Networks and Learning Systems 35 11 (2023) 15442\u201315452. DOI:10.1109\/TNNLS.2023.3286918","DOI":"10.1109\/TNNLS.2023.3286918"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972832.28"},{"key":"e_1_3_1_24_2","first-page":"1590","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Luo Keyang","year":"2020","unstructured":"Keyang Luo, Tao Guan, Lili Ju, Yuesong Wang, Zhuo Chen, and Yawei Luo. 2020. Attention-aware multi-view stereo. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1590\u20131599."},{"key":"e_1_3_1_25_2","first-page":"116","volume-title":"Proceedings of the RIAO","author":"Mathieu Benoit","year":"2004","unstructured":"Benoit Mathieu, Romaric Besan\u00e7on, and Christian Fluhr. 2004. Multilingual document clusters discovery. In Proceedings of the RIAO. Citeseer, 116\u2013125."},{"key":"e_1_3_1_26_2","first-page":"2148","article-title":"Multi-view contrastive graph clustering","author":"Pan Erlin","year":"2021","unstructured":"Erlin Pan and Zhao Kang. 2021. Multi-view contrastive graph clustering. In Proceedings of the 35th International Conference on Neural Information Processing Systems . 2148\u20132159.","journal-title":"Proceedings of the 35th International Conference on Neural Information Processing Systems"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2019.105339"},{"key":"e_1_3_1_28_2","volume-title":"Proceedings of the 3rd All-Russian Conference Digital Libraries: Advanced Methods and Technologies","author":"Rauber Andreas","year":"2001","unstructured":"Andreas Rauber, Michael Dittenbach, and Dieter Merkl. 2001. Towards automatic content-based organization of multilingual digital libraries: An English, French, and German view of the Russian information agency novosti news. In Proceedings of the 3rd All-Russian Conference Digital Libraries: Advanced Methods and Technologies."},{"key":"e_1_3_1_29_2","doi-asserted-by":"crossref","first-page":"600","DOI":"10.3115\/v1\/D14-1065","volume-title":"Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Romeo Salvatore","year":"2014","unstructured":"Salvatore Romeo, Andrea Tagarelli, and Dino Ienco. 2014. Semantic-based multilingual document clustering via tensor modeling. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 600\u2013609."},{"issue":"2","key":"e_1_3_1_30_2","doi-asserted-by":"crossref","first-page":"58","DOI":"10.26555\/jifo.v14i2.a17513","article-title":"State of the art document clustering algorithms based on semantic similarity","volume":"14","author":"Salih Niyaz Mohammed","year":"2020","unstructured":"Niyaz Mohammed Salih and Karwan Jacksi. 2020. State of the art document clustering algorithms based on semantic similarity. Jurnal Informatika 14, 2 (2020), 58\u201375.","journal-title":"Jurnal Informatika"},{"key":"e_1_3_1_31_2","unstructured":"Jo\u00e3o Santos Afonso Mendes and Sebasti\u00e3o Miranda. 2022. Simplifying multilingual news clustering through projection from a shared space. ArXiv preprint abs\/2204.13418 (2022). https:\/\/arxiv.org\/abs\/2204.13418"},{"key":"e_1_3_1_32_2","article-title":"Robust dual-graph regularized deep matrix factorization for multi-view clustering","author":"Shu Zhenqiu","year":"2023","unstructured":"Zhenqiu Shu, Bin Li, Cong Hu, Zhengtao Yu, and Xiao-Jun Wu. 2023. Robust dual-graph regularized deep matrix factorization for multi-view clustering. Neural Processing Letters 55, 5 (2023), 6067\u20136087.","journal-title":"Neural Processing Letters"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00030"},{"key":"e_1_3_1_34_2","doi-asserted-by":"crossref","first-page":"776","DOI":"10.1007\/978-3-030-58621-8_45","volume-title":"Proceedings of the 16th European Conference on Computer Vision\u2013ECCV 2020, Part XI 16","author":"Tian Yonglong","year":"2020","unstructured":"Yonglong Tian, Dilip Krishnan, and Phillip Isola. 2020. Contrastive multiview coding. In Proceedings of the 16th European Conference on Computer Vision\u2013ECCV 2020, Part XI 16. Springer, 776\u2013794."},{"key":"e_1_3_1_35_2","first-page":"6827","article-title":"What makes for good views for contrastive learning? In","author":"Tian Yonglong","year":"2020","unstructured":"Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, and Phillip Isola. 2020. What makes for good views for contrastive learning? In Proceedings of the 34th International Conference on Neural Information Processing Systems . 6827\u20136839.","journal-title":"Proceedings of the 34th International Conference on Neural Information Processing Systems"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00131"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2019.2903810"},{"key":"e_1_3_1_38_2","article-title":"Adversarial multiview clustering networks with adaptive fusion","author":"Wang Qianqian","year":"2022","unstructured":"Qianqian Wang, Zhiqiang Tao, Wei Xia, Quanxue Gao, Xiaochun Cao, and Licheng Jiao. 2022. Adversarial multiview clustering networks with adaptive fusion. IEEE Transactions on Neural Networks and Learning Systems 34, 10 (2022), 7635\u20137647.","journal-title":"IEEE Transactions on Neural Networks and Learning Systems"},{"key":"e_1_3_1_39_2","first-page":"9929","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Wang Tongzhou","year":"2020","unstructured":"Tongzhou Wang and Phillip Isola. 2020. Understanding contrastive representation learning through alignment and uniformity on the hypersphere. In Proceedings of the International Conference on Machine Learning. PMLR, 9929\u20139939."},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2021.3136098"},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.dss.2007.07.008"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2023.3243521"},{"key":"e_1_3_1_43_2","first-page":"8761","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"36","author":"Xu Jie","year":"2022","unstructured":"Jie Xu, Chao Li, Yazhou Ren, Liang Peng, Yujie Mo, Xiaoshuang Shi, and Xiaofeng Zhu. 2022. Deep incomplete multi-view clustering via mining cluster complementarity. In Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 36, 8761\u20138769."},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2020.12.073"},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00910"},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2022.3193569"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00333"},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01902"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2024.3357087"},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2022.3155499"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1145\/3581783.3611951"},{"key":"e_1_3_1_52_2","first-page":"6688","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Yin Ming","year":"2020","unstructured":"Ming Yin, Weitian Huang, and Junbin Gao. 2020. Shared generative latent representation learning for multi-view clustering. In Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 34, 6688\u20136695."},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2023.10.001"},{"key":"e_1_3_1_54_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01463"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3708887","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3708887","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:17:54Z","timestamp":1750295874000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3708887"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,20]]},"references-count":53,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1,31]]}},"alternative-id":["10.1145\/3708887"],"URL":"https:\/\/doi.org\/10.1145\/3708887","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,20]]},"assertion":[{"value":"2024-03-09","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-12-11","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-01-20","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}