{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:14:47Z","timestamp":1750220087920,"version":"3.41.0"},"reference-count":55,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,2,20]],"date-time":"2023-02-20T00:00:00Z","timestamp":1676851200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2023,4,30]]},"abstract":"<jats:p>In real-world applications, a single instance could have more than one label. To solve this task, multi-label learning methods emerged in recent years. It is a more challenging problem for many reasons, such as complex label correlation, long-tail label distribution, and data shortage. In general, overcoming these challenges and bettering learning performance could be achieved by utilizing more training samples and including label correlations. However, these solutions are expensive and inflexible. Large-scale, well-labeled datasets are difficult to obtain, and building label correlation maps requires task-specific semantic information as prior knowledge. To address these limitations, we propose a general and compact Multi-Label Correlation Learning (MUCO) framework. MUCO explicitly and effectively learns the latent label correlations by updating a label correlation tensor, which provides highly accurate and interpretable prediction results. In addition, a multi-label generative strategy is deployed to handle the long-tail label distribution challenge. It borrows the visual clues from limited samples and synthesizes more diverse samples. All networks in our model are optimized simultaneously. Extensive experiments illustrate the effectiveness and efficiency of MUCO. Ablation studies further prove the effectiveness of all the modules.<\/jats:p>","DOI":"10.1145\/3538708","type":"journal-article","created":{"date-parts":[[2022,6,6]],"date-time":"2022-06-06T09:58:11Z","timestamp":1654509491000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Generative Multi-Label Correlation Learning"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3741-9492","authenticated-orcid":false,"given":"Lichen","family":"Wang","sequence":"first","affiliation":[{"name":"Northeastern University, Boston, Massachusetts"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6994-5278","authenticated-orcid":false,"given":"Zhengming","family":"Ding","sequence":"additional","affiliation":[{"name":"Tulane University, New Orleans, Louisiana"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7956-1876","authenticated-orcid":false,"given":"Kasey","family":"Lee","sequence":"additional","affiliation":[{"name":"Northeastern University, Boston, Massachusetts"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7293-1419","authenticated-orcid":false,"given":"Seungju","family":"Han","sequence":"additional","affiliation":[{"name":"Samsung Electronics, Suwon-si, Gyeonggi-do, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6505-6529","authenticated-orcid":false,"given":"Jae-Joon","family":"Han","sequence":"additional","affiliation":[{"name":"Samsung Electronics, Suwon-si, Gyeonggi-do, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1470-8662","authenticated-orcid":false,"given":"Changkyu","family":"Choi","sequence":"additional","affiliation":[{"name":"Samsung Electronics, Suwon-si, Gyeonggi-do, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5098-2853","authenticated-orcid":false,"given":"Yun","family":"Fu","sequence":"additional","affiliation":[{"name":"Northeastern University, Boston, Massachusetts"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,2,20]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2004.1326716"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2004.03.009"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403368"},{"key":"e_1_3_1_5_2","unstructured":"Tong Che Yanran Li Athul Paul Jacob Yoshua Bengio and Wenjie Li. 2017. Mode regularized generative adversarial networks. In Proceedings of the International Conference on Learning Representations ."},{"key":"e_1_3_1_6_2","first-page":"1274","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Chen Minmin","year":"2013","unstructured":"Minmin Chen, Alice Zheng, and Kilian Weinberger. 2013. Fast image tagging. In Proceedings of the International Conference on Machine Learning. 1274\u20131282."},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.12230"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.5555\/645318.649254"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00139"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/1099554.1099591"},{"key":"e_1_3_1_11_2","first-page":"2672","volume-title":"Proceedings of the Neural Information Processing Systems","author":"Goodfellow Ian","year":"2014","unstructured":"Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Proceedings of the Neural Information Processing Systems. 2672\u20132680."},{"key":"e_1_3_1_12_2","volume-title":"Proceedings of the OntoImage","author":"Grubinger Michael","year":"2006","unstructured":"Michael Grubinger, Paul Clough, Henning M\u00fcller, and Thomas Deselaers. 2006. The IAPR TC-12 benchmark: A new evaluation resource for visual information systems. In Proceedings of the OntoImage."},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2009.5459266"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2016.0113"},{"key":"e_1_3_1_15_2","doi-asserted-by":"crossref","unstructured":"Ting Jiang Deqing Wang Leilei Sun Huayi Yang Zhengyang Zhao and Fuzhen Zhuang. 2021. LightXML: Transformer with dynamic negative sampling for high-performance extreme multi-label text classification. 35 (2021) 7987\u20137994.","DOI":"10.1609\/aaai.v35i9.16974"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.90"},{"key":"e_1_3_1_17_2","unstructured":"Diederik Kingma and Jimmy Ba. 2015. ADAM: A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations ."},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.473"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/tpami.2013.140"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00170"},{"key":"e_1_3_1_21_2","first-page":"421","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Liu Yi","year":"2006","unstructured":"Yi Liu, Rong Jin, and Liu Yang. 2006. Semi-supervised multi-label learning by constrained non-negative matrix factorization. In Proceedings of the AAAI Conference on Artificial Intelligence. 421\u2013426."},{"key":"e_1_3_1_22_2","unstructured":"Qianqian Ma Yang-Yu Liu and Alex Olshevsky. 2020. Optimal Lockdown for Pandemic Control. arXiv:2010.12923. Retrieved from https:\/\/arxiv.org\/abs\/2010.12923."},{"key":"e_1_3_1_23_2","unstructured":"Qianqian Ma Yang-Yu Liu and Alex Olshevsky. 2021. Optimal vaccine allocation for pandemic stabilization. arXiv:2109.04612. Retrieved from https:\/\/arxiv.org\/abs\/2109.04612."},{"key":"e_1_3_1_24_2","first-page":"21841","volume-title":"Proceedings of the Neural Information Processing Systems","author":"Ma Qianqian","year":"2020","unstructured":"Qianqian Ma and Alex Olshevsky. 2020. Adversarial crowdsourcing through robust rank-one matrix completion. In Proceedings of the Neural Information Processing Systems. 21841\u201321852."},{"key":"e_1_3_1_25_2","unstructured":"Xudong Mao Qing Li Haoran Xie Raymond Y. K. Lau Zhen Wang and Stephen Paul Smolley. 2017. Least squares generative adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision ."},{"key":"e_1_3_1_26_2","first-page":"1","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"McCallum Andrew","year":"1999","unstructured":"Andrew McCallum. 1999. Multi-label text classification with a mixture model trained by EM. In Proceedings of the AAAI Conference on Artificial Intelligence. 1\u20137."},{"key":"e_1_3_1_27_2","unstructured":"Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv:1411.1784. Retrieved from https:\/\/arxiv.org\/abs\/1411.1784."},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441807"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/3442381.3449815"},{"key":"e_1_3_1_30_2","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Nair Vinod","year":"2010","unstructured":"Vinod Nair and Geoffrey E. Hinton. 2010. Rectified linear units improve restricted boltzmann machines. In Proceedings of the International Conference on Machine Learning."},{"key":"e_1_3_1_31_2","first-page":"2642","volume-title":"Proceedings of the Journal of Machine Learning Research","author":"Odena Augustus","year":"2017","unstructured":"Augustus Odena, Christopher Olah, and Jonathon Shlens. 2017. Conditional image synthesis with auxiliary classifier GANs. In Proceedings of the Journal of Machine Learning Research. 2642\u20132651."},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2009.191"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6247998"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-019-01265-2"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1145\/1291233.1291245"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611976700.65"},{"key":"e_1_3_1_37_2","first-page":"2234","volume-title":"Proceedings of the Neural Information Processing Systems","author":"Salimans Tim","year":"2016","unstructured":"Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. 2016. Improved techniques for training GANs. In Proceedings of the Neural Information Processing Systems. 2234\u20132242."},{"key":"e_1_3_1_38_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Simonyan Karen","year":"2015","unstructured":"Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1162\/NECO_a_00320"},{"issue":"1","key":"e_1_3_1_40_2","first-page":"3221","article-title":"Accelerating t-SNE using tree-based algorithms","volume":"15","author":"Maaten Laurens Van Der","year":"2014","unstructured":"Laurens Van Der Maaten. 2014. Accelerating t-SNE using tree-based algorithms. Journal of Machine Learning Research 15, 1 (2014), 3221\u20133245.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1145\/985692.985733"},{"key":"e_1_3_1_42_2","volume-title":"The Caltech-UCSD Birds-200-2011 Dataset","author":"Wah C.","year":"2011","unstructured":"C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie. 2011. The Caltech-UCSD Birds-200-2011 Dataset. Technical Report CNS-TR-2011-001."},{"key":"e_1_3_1_43_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/388"},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/3451884"},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2019.00069"},{"key":"e_1_3_1_46_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Wang Lichen","year":"2020","unstructured":"Lichen Wang, Bo Zong, Qianqian Ma, Wei Cheng, Jingchao Ni, Wenchao Yu, Yanchi Liu, Dongjin Song, Haifeng Chen, and Yun Fu. 2020. Inductive and unsupervised representation learning on graph structured objects. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2003.819861"},{"issue":"7","key":"e_1_3_1_48_2","first-page":"2315","article-title":"Does tail label help for large-scale multi-label learning?","volume":"31","author":"Wei Tong","year":"2019","unstructured":"Tong Wei and Yu-Feng Li. 2019. Does tail label help for large-scale multi-label learning? IEEE Transactions on Neural Networks and Learning Systems 31, 7 (2019), 2315\u20132324.","journal-title":"IEEE Transactions on Neural Networks and Learning Systems"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00831"},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-018-1085-3"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.473"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58548-8_10"},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00855"},{"key":"e_1_3_1_54_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01246-5_25"},{"key":"e_1_3_1_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.244"},{"key":"e_1_3_1_56_2","first-page":"912","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Zhu Xiaojin","year":"2003","unstructured":"Xiaojin Zhu, Zoubin Ghahramani, and John D. Lafferty. 2003. Semi-supervised learning using Gaussian fields and harmonic functions. In Proceedings of the International Conference on Machine Learning. 912\u2013919."}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3538708","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3538708","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:38Z","timestamp":1750183778000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3538708"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,20]]},"references-count":55,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,4,30]]}},"alternative-id":["10.1145\/3538708"],"URL":"https:\/\/doi.org\/10.1145\/3538708","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"type":"print","value":"1556-4681"},{"type":"electronic","value":"1556-472X"}],"subject":[],"published":{"date-parts":[[2023,2,20]]},"assertion":[{"value":"2021-11-18","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-05-07","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-02-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}