{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T19:25:42Z","timestamp":1773775542707,"version":"3.50.1"},"reference-count":52,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2024,5,17]],"date-time":"2024-05-17T00:00:00Z","timestamp":1715904000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62106059, 62302227 and U20B2053"],"award-info":[{"award-number":["62106059, 62302227 and U20B2053"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2024,6,30]]},"abstract":"<jats:p>Bipartite graph representation learning aims to obtain node embeddings by compressing sparse vectorized representations of interactions between two types of nodes, e.g., users and items. Incorporating structural attributes among homogeneous nodes, such as user communities, improves the identification of similar interaction preferences, namely, user\/item embeddings, for downstream tasks. However, existing methods often fail to proactively discover and fully utilize these latent structural attributes. Moreover, the manual collection and labeling of structural attributes is always costly. In this article, we propose a novel approach called Dirichlet Max-margin Matrix Factorization (DM3F), which adopts a self-supervised strategy to discover latent structural attributes and model discriminative node representations. Specifically, in self-supervised learning, our approach generates pseudo group labels (i.e., structural attributes) as a supervised signal using the Dirichlet process without relying on manual collection and labeling, and employs them in a max-margin classification. Additionally, we introduce a Variational Markov Chain Monte Carlo algorithm (Variational MCMC) to effectively update the parameters. The experimental results on six real datasets demonstrate that, in the majority of cases, the proposed method outperforms existing approaches based on matrix factorization and neural networks. Furthermore, the modularity analysis confirms the effectiveness of our model in capturing structural attributes to produce high-quality user embeddings.<\/jats:p>","DOI":"10.1145\/3645098","type":"journal-article","created":{"date-parts":[[2024,3,8]],"date-time":"2024-03-08T09:19:56Z","timestamp":1709889596000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Self-supervised Bipartite Graph Representation Learning: A Dirichlet Max-margin Matrix Factorization Approach"],"prefix":"10.1145","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7455-5453","authenticated-orcid":false,"given":"Shenghai","family":"Zhong","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Beihang University, Haidian District, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9660-0291","authenticated-orcid":false,"given":"Shu","family":"Guo","sequence":"additional","affiliation":[{"name":"National Computer Network Emergency Response Technical Team\/Coordination Center of China, Chaoyang District, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-3957-6441","authenticated-orcid":false,"given":"Jing","family":"Liu","sequence":"additional","affiliation":[{"name":"National Computer Network Emergency Response Technical Team\/Coordination Center of China, Chaoyang District, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-2793-0186","authenticated-orcid":false,"given":"Hongren","family":"Huang","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Beihang University, Haidian District, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0179-2364","authenticated-orcid":false,"given":"Lihong","family":"Wang","sequence":"additional","affiliation":[{"name":"National Computer Network Emergency Response Technical Team\/Coordination Center of China, Chaoyang District, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5152-0055","authenticated-orcid":false,"given":"Jianxin","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Beihang University, Haidian District, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4175-1658","authenticated-orcid":false,"given":"Chen","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Beihang University, Haidian District, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0794-9932","authenticated-orcid":false,"given":"Yiming","family":"Hei","sequence":"additional","affiliation":[{"name":"School of Cyber Science and Technology, Beihang University, Haidian District, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,5,17]]},"reference":[{"key":"e_1_3_2_2_2","article-title":"Probabilistic matrix factorization","author":"Mnih Andriy","year":"2007","unstructured":"Andriy Mnih and Russ R. Salakhutdinov. 2007. Probabilistic matrix factorization. Advances in Neural Information Processing Systems 20 (2007), 1257\u20131264.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/3159652.3159706"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482266"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441759"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/3209978.3209987"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE48307.2020.00149"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441783"},{"key":"e_1_3_2_9_2","first-page":"3889","volume-title":"Proceedings of the 25th International Joint Conference on Artificial Intelligence","volume":"2016","author":"Tu Cunchao","year":"2016","unstructured":"Cunchao Tu, Weicheng Zhang, Zhiyuan Liu, and Maosong Sun. 2016. Max-margin deepwalk: Discriminative learning of network representation. In Proceedings of the 25th International Joint Conference on Artificial Intelligence, Vol. 2016. 3889\u20133895."},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.6048"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM50108.2020.00059"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbad235"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2020.01.033"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNSE.2021.3110677"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.5555\/3524938.3525946"},{"issue":"1","key":"e_1_3_2_16_2","first-page":"857","article-title":"Self-supervised learning: Generative or contrastive","volume":"35","author":"Liu Xiao","year":"2021","unstructured":"Xiao Liu, Fanjin Zhang, Zhenyu Hou, Li Mian, Zhaoyu Wang, Jing Zhang, and Jie Tang. 2021. Self-supervised learning: Generative or contrastive. IEEE Transactions on Knowledge and Data Engineering 35, 1 (2021), 857\u2013876.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482392"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v36i4.20333"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3031549"},{"key":"e_1_3_2_20_2","article-title":"Efficient estimation of word representations in vector space","author":"Mikolov Tomas","year":"2013","unstructured":"Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).","journal-title":"arXiv preprint arXiv:1301.3781"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.202"},{"key":"e_1_3_2_22_2","article-title":"Improved deep metric learning with multi-class n-pair loss objective","volume":"29","author":"Sohn Kihyuk","year":"2016","unstructured":"Kihyuk Sohn. 2016. Improved deep metric learning with multi-class n-pair loss objective. Advances in Neural Information Processing Systems 29 (2016).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_23_2","article-title":"Representation learning with contrastive predictive coding","author":"Oord Aaron van den","year":"2018","unstructured":"Aaron van den Oord, Yazhe Li, and Oriol Vinyals. 2018. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 (2018).","journal-title":"arXiv preprint arXiv:1807.03748"},{"key":"e_1_3_2_24_2","first-page":"297","volume-title":"Proceedings of the 13th International Conference on Artificial Intelligence and Statistics","author":"Gutmann Michael","year":"2010","unstructured":"Michael Gutmann and Aapo Hyv\u00e4rinen. 2010. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics. JMLR Workshop and Conference Proceedings, 297\u2013304."},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00393"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01264-9_9"},{"key":"e_1_3_2_27_2","first-page":"12559","article-title":"Self-supervised graph transformer on large-scale molecular data","volume":"33","author":"Rong Yu","year":"2020","unstructured":"Yu Rong, Yatao Bian, Tingyang Xu, Weiyang Xie, Ying Wei, Wenbing Huang, and Junzhou Huang. 2020. Self-supervised graph transformer on large-scale molecular data. Advances in Neural Information Processing Systems 33 (2020), 12559\u201312571.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00305"},{"key":"e_1_3_2_29_2","article-title":"Self-labelling via simultaneous clustering and representation learning","author":"Asano Yuki Markus","year":"2019","unstructured":"Yuki Markus Asano, Christian Rupprecht, and Andrea Vedaldi. 2019. Self-labelling via simultaneous clustering and representation learning. arXiv preprint arXiv:1911.05371 (2019).","journal-title":"arXiv preprint arXiv:1911.05371"},{"key":"e_1_3_2_30_2","first-page":"81","article-title":"Bayesian nonparametric models.","volume":"1","author":"Orbanz Peter","year":"2010","unstructured":"Peter Orbanz and Yee Whye Teh. 2010. Bayesian nonparametric models. Encyclopedia of Machine Learning 1 (2010), 81\u201389.","journal-title":"Encyclopedia of Machine Learning"},{"key":"e_1_3_2_31_2","first-page":"280","article-title":"Dirichlet process.","volume":"1063","author":"Teh Yee Whye","year":"2010","unstructured":"Yee Whye Teh et\u00a0al. 2010. Dirichlet process. Encyclopedia of Machine Learning 1063 (2010), 280\u2013287.","journal-title":"Encyclopedia of Machine Learning"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-014-0790-7"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206800"},{"key":"e_1_3_2_34_2","article-title":"Monte Carlo methods for maximum margin supervised topic models","volume":"25","author":"Jiang Qixia","year":"2012","unstructured":"Qixia Jiang, Jun Zhu, Maosong Sun, and Eric Xing. 2012. Monte Carlo methods for maximum margin supervised topic models. Advances in Neural Information Processing Systems 25 (2012), 1601\u20131609.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_35_2","article-title":"Large margin learning of upstream scene understanding models","volume":"23","author":"Zhu Jun","year":"2010","unstructured":"Jun Zhu, Li-Jia Li, Li Fei-Fei, and Eric Xing. 2010. Large margin learning of upstream scene understanding models. Advances in Neural Information Processing Systems 23 (2010), 2586\u20132594.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1080\/00031305.1988.10475585"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2638592"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.5555\/647235.720242"},{"key":"e_1_3_2_39_2","volume-title":"Proceedings of the WSEAS International Conference on Neural Networks and Applications (NNA)","author":"Takabatake Kazuya","year":"2004","unstructured":"Kazuya Takabatake. 2004. Information geometry of Gibbs sampler. In Proceedings of the WSEAS International Conference on Neural Networks and Applications (NNA)."},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1287\/moor.1120.0569"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150429"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1145\/2827872"},{"key":"e_1_3_2_43_2","article-title":"Geometric matrix completion with recurrent multi-graph neural networks","volume":"30","author":"Monti Federico","year":"2017","unstructured":"Federico Monti, Michael Bronstein, and Xavier Bresson. 2017. Geometric matrix completion with recurrent multi-graph neural networks. Advances in Neural Information Processing Systems 30 (2017), 3697\u20133707.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_44_2","first-page":"3","volume-title":"Proceedings of KDD Cup 2011","author":"Dror Gideon","year":"2012","unstructured":"Gideon Dror, Noam Koenigstein, Yehuda Koren, and Markus Weimer. 2012. The Yahoo! music dataset and KDD-cup\u201911. In Proceedings of KDD Cup 2011. PMLR, 3\u201318."},{"key":"e_1_3_2_45_2","first-page":"189","volume-title":"Proceedings of the 20th International Conference on Machine Learning","author":"Lu Qing","year":"2003","unstructured":"Qing Lu and Lise Getoor. 2003. Link-based classification. In Proceedings of the 20th International Conference on Machine Learning. 189\u2013207."},{"key":"e_1_3_2_46_2","article-title":"Collaborative filtering with graph information: Consistency and scalable methods","volume":"28","author":"Rao Nikhil","year":"2015","unstructured":"Nikhil Rao, Hsiang-Fu Yu, Pradeep K. Ravikumar, and Inderjit S. Dhillon. 2015. Collaborative filtering with graph information: Consistency and scalable methods. Advances in Neural Information Processing Systems 28 (2015), 2107\u20132115.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.6043"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/3038912.3052639"},{"key":"e_1_3_2_49_2","article-title":"On the difficulty of evaluating baselines: A study on recommender systems","author":"Rendle Steffen","year":"2019","unstructured":"Steffen Rendle, Li Zhang, and Yehuda Koren. 2019. On the difficulty of evaluating baselines: A study on recommender systems. arXiv preprint arXiv:1905.01395 (2019).","journal-title":"arXiv preprint arXiv:1905.01395"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/963770.963772"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1145\/1864708.1864721"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.5555\/3104322.3104413"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/3172867"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3645098","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3645098","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:03:27Z","timestamp":1750291407000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3645098"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,17]]},"references-count":52,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,6,30]]}},"alternative-id":["10.1145\/3645098"],"URL":"https:\/\/doi.org\/10.1145\/3645098","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,17]]},"assertion":[{"value":"2022-04-30","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-01-16","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-05-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}