{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,18]],"date-time":"2025-12-18T19:57:26Z","timestamp":1766087846203,"version":"3.41.0"},"reference-count":52,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2023,1,5]],"date-time":"2023-01-05T00:00:00Z","timestamp":1672876800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2017YFA0700800"],"award-info":[{"award-number":["2017YFA0700800"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62131003 and 62021001"],"award-info":[{"award-number":["62131003 and 62021001"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2023,1,31]]},"abstract":"<jats:p>Domain generalization aims at generalizing the network trained on multiple domains to unknown but related domains. Under the assumption that different domains share the same classes, previous works can build relationships across domains. However, in realistic scenarios, the change of domains is always followed by the change of categories, which raises a difficulty for collecting sufficient aligned categories across domains. Bearing this in mind, this article introduces union domain generalization (UDG) as a new domain generalization scenario, in which the label space varies across domains, and the categories in unknown domains belong to the union of all given domain categories. The absence of categories in given domains is the main obstacle to aligning different domain distributions and obtaining domain-invariant information. To address this problem, we propose category-stitch learning (CSL), which aims at jointly learning the domain-invariant information and completing missing categories in all domains through an improved variational autoencoder and generators. The domain-invariant information extraction and sample generation cross-promote each other to better generalizability. Additionally, we decouple category and domain information and propose explicitly regularizing the semantic information by the classification loss with transferred samples. Thus our method can breakthrough the category limit and generate samples of missing categories in each domain. Extensive experiments and visualizations are conducted on MNIST, VLCS, PACS, Office-Home, and DomainNet datasets to demonstrate the effectiveness of our proposed method.<\/jats:p>","DOI":"10.1145\/3524136","type":"journal-article","created":{"date-parts":[[2022,3,17]],"date-time":"2022-03-17T13:36:53Z","timestamp":1647524213000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Category-Stitch Learning for Union Domain Generalization"],"prefix":"10.1145","volume":"19","author":[{"given":"Yajing","family":"Liu","sequence":"first","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhiwei","family":"Xiong","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ya","family":"Li","sequence":"additional","affiliation":[{"name":"iFLYTEK Research, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuning","family":"Lu","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xinmei","family":"Tian","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zheng-Jun","family":"Zha","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,1,5]]},"reference":[{"key":"e_1_3_1_2_2","first-page":"998","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Balaji Yogesh","year":"2018","unstructured":"Yogesh Balaji, Swami Sankaranarayanan, and Rama Chellappa. 2018. Metareg: Towards domain generalization using meta-regularization. In Proceedings of the Advances in Neural Information Processing Systems. 998\u20131008."},{"key":"e_1_3_1_3_2","first-page":"37","volume-title":"Proceedings of the ICML Workshop on Unsupervised and Transfer Learning","author":"Baldi Pierre","year":"2012","unstructured":"Pierre Baldi. 2012. Autoencoders, unsupervised learning, and deep architectures. In Proceedings of the ICML Workshop on Unsupervised and Transfer Learning. 37\u201349."},{"key":"e_1_3_1_4_2","doi-asserted-by":"crossref","unstructured":"Shai Ben-David John Blitzer Koby Crammer and Fernando Pereira. 2007. Analysis of representations for domain adaptation. In Proceedings of the Advances in Neural Information Processing Systems . 137\u2013144.","DOI":"10.7551\/mitpress\/7503.003.0022"},{"key":"e_1_3_1_5_2","first-page":"135","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Cao Zhangjie","year":"2018","unstructured":"Zhangjie Cao, Lijia Ma, Mingsheng Long, and Jianmin Wang. 2018. Partial adversarial domain adaptation. In Proceedings of the European Conference on Computer Vision. 135\u2013150."},{"key":"e_1_3_1_6_2","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition.","author":"Cao Zhangjie","year":"2019","unstructured":"Zhangjie Cao, Kaichao You, Mingsheng Long, Jianmin Wang, and Qiang Yang. 2019. Learning to transfer examples for partial domain adaptation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00233"},{"key":"e_1_3_1_8_2","unstructured":"Myung Jin Choi Joseph J. Lim Antonio Torralba and Alan S. Willsky. 2010. Exploiting hierarchical context on a large database of object categories. In Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00916"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/3369393"},{"key":"e_1_3_1_11_2","article-title":"Tutorial on variational autoencoders","author":"Doersch Carl","year":"2016","unstructured":"Carl Doersch. 2016. Tutorial on variational autoencoders. arXiv:1606.05908. Retrieved from https:\/\/arxiv.org\/abs\/1606.05908.","journal-title":"arXiv:1606.05908."},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0275-4"},{"key":"e_1_3_1_13_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"French Geoff","year":"2018","unstructured":"Geoff French, Michal Mackiewicz, and Mark Fisher. 2018. Self-ensembling for visual domain adaptation. In Proceedings of the International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=rkpoTaxA-."},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.5555\/2946645.2946704"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/tpami.2016.2599532"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.293"},{"key":"e_1_3_1_17_2","unstructured":"Gregory Griffin Alex Holub and Pietro Perona. 2007. Caltech-256 Object Category Dataset . California Institute of Technology."},{"key":"e_1_3_1_18_2","first-page":"158","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Khosla Aditya","year":"2012","unstructured":"Aditya Khosla, Tinghui Zhou, Tomasz Malisiewicz, Alexei A. Efros, and Antonio Torralba. 2012. Undoing the damage of dataset bias. In Proceedings of the European Conference on Computer Vision. Springer, 158\u2013171."},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/3418213"},{"key":"e_1_3_1_20_2","article-title":"Auto-encoding variational bayes","author":"Kingma Diederik P.","year":"2013","unstructured":"Diederik P. Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv:1312.6114. Retrieved from https:\/\/arxiv.org\/abs\/1312.6114.","journal-title":"arXiv:1312.6114."},{"key":"e_1_3_1_21_2","first-page":"6444","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Klys Jack","year":"2018","unstructured":"Jack Klys, Jake Snell, and Richard Zemel. 2018. Learning latent subspaces in variational autoencoders. In Proceedings of the Advances in Neural Information Processing Systems. 6444\u20136454."},{"key":"e_1_3_1_22_2","first-page":"1097","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Proceedings of the Advances in Neural Information Processing Systems. 1097\u20131105."},{"key":"e_1_3_1_23_2","volume-title":"Information Theory and Statistics","author":"Kullback Solomon","year":"1997","unstructured":"Solomon Kullback. 1997. Information Theory and Statistics. Courier Corporation."},{"key":"e_1_3_1_24_2","first-page":"5543","volume-title":"Proceedings of the 2017 IEEE International Conference on Computer Vision.","author":"Li Da","year":"2017","unstructured":"Da Li, Yongxin Yang, Yi-Zhe Song, and Timothy M. Hospedales. 2017. Deeper, broader and artier domain generalization. In Proceedings of the 2017 IEEE International Conference on Computer Vision. IEEE, 5543\u20135551."},{"key":"e_1_3_1_25_2","volume-title":"Proceedings of the 32nd AAAI Conference on Artificial Intelligence","author":"Li Da","year":"2018","unstructured":"Da Li, Yongxin Yang, Yi-Zhe Song, and Timothy M. Hospedales. 2018. Learning to generalize: Meta-learning for domain generalization. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence."},{"key":"e_1_3_1_26_2","volume-title":"Proceedings of the International Conference on Computer Vision","author":"Li Da","year":"2019","unstructured":"Da Li, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song, and Timothy M. Hospedales. 2019. Episodic training for domain generalization. In Proceedings of the International Conference on Computer Vision. Institute of Electrical and Electronics Engineers (IEEE)."},{"key":"e_1_3_1_27_2","first-page":"624","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Li Ya","year":"2018","unstructured":"Ya Li, Xinmei Tian, Mingming Gong, Yajing Liu, Tongliang Liu, Kun Zhang, and Dacheng Tao. 2018. Deep domain generalization via conditional invariant adversarial networks. In Proceedings of the European Conference on Computer Vision. 624\u2013639."},{"key":"e_1_3_1_28_2","first-page":"3915","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Li Yiying","year":"2019","unstructured":"Yiying Li, Yongxin Yang, Wei Zhou, and Timothy Hospedales. 2019. Feature-critic networks for heterogeneous domain generalization. In Proceedings of the International Conference on Machine Learning. 3915\u20133924."},{"key":"e_1_3_1_29_2","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Liu Yajing","year":"2019","unstructured":"Yajing Liu, Xinmei Tian, Ya Li, Zhiwei Xiong, and Feng Wu. 2019. Compact feature learning for multi-domain image classification. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2021.3121564"},{"key":"e_1_3_1_31_2","first-page":"1084","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Mitsuzumi Yu","year":"2021","unstructured":"Yu Mitsuzumi, Go Irie, Daiki Ikami, and Takashi Shibata. 2021. Generalized domain adaptation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1084\u20131093."},{"key":"e_1_3_1_32_2","first-page":"10","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Muandet Krikamol","year":"2013","unstructured":"Krikamol Muandet, David Balduzzi, and Bernhard Sch\u00f6lkopf. 2013. Domain generalization via invariant feature representation. In Proceedings of the International Conference on Machine Learning. 10\u201318."},{"key":"e_1_3_1_33_2","first-page":"2642","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Odena Augustus","year":"2017","unstructured":"Augustus Odena, Christopher Olah, and Jonathon Shlens. 2017. Conditional image synthesis with auxiliary classifier gans. In Proceedings of the International Conference on Machine Learning. PMLR, 2642\u20132651."},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2009.191"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01388"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00234"},{"key":"e_1_3_1_37_2","unstructured":"G. Parascandolo A. Neitz A. Orvieto L. Gresele and B. Schlkopf. 2020. Learning explanations that are hard to vary. (2020)."},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00149"},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00149"},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-007-0090-8"},{"key":"e_1_3_1_41_2","first-page":"2168","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Sariyildiz Mert Bulent","year":"2019","unstructured":"Mert Bulent Sariyildiz and Ramazan Gokberk Cinbis. 2019. Gradient matching generative networks for zero-shot learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2168\u20132178."},{"key":"e_1_3_1_42_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Shankar Shiv","year":"2018","unstructured":"Shiv Shankar, Vihari Piratla, Soumen Chakrabarti, Siddhartha Chaudhuri, Preethi Jyothi, and Sunita Sarawagi. 2018. Generalizing across domains via cross-gradient training. In Proceedings of the International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=r1Dx7fbCW."},{"key":"e_1_3_1_43_2","first-page":"3483","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Sohn Kihyuk","year":"2015","unstructured":"Kihyuk Sohn, Honglak Lee, and Xinchen Yan. 2015. Learning structured output representation using deep conditional generative models. In Proceedings of the Advances in Neural Information Processing Systems. 3483\u20133491."},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/2998574"},{"key":"e_1_3_1_45_2","unstructured":"Yingtao Tian and Jesse Engel. 2018. Latent domain transfer: Crossing modalities with bridging autoencoders. In Proceedings of the ICLR 2019 Conference on Blind Submission."},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995347"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.572"},{"key":"e_1_3_1_48_2","doi-asserted-by":"crossref","first-page":"1267","DOI":"10.1145\/3343031.3351004","volume-title":"Proceedings of the 27th ACM International Conference on Multimedia","author":"Wang Yaxing","year":"2019","unstructured":"Yaxing Wang, Abel Gonzalez-Garcia, Joost van de Weijer, and Luis Herranz. 2019. SDIT: Scalable and diverse cross-domain image translation. In Proceedings of the 27th ACM International Conference on Multimedia. 1267\u20131276."},{"key":"e_1_3_1_49_2","first-page":"3964","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Xu Ruijia","year":"2018","unstructured":"Ruijia Xu, Ziliang Chen, Wangmeng Zuo, Junjie Yan, and Liang Lin. 2018. Deep cocktail network: Multi-source unsupervised domain adaptation with category shift. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3964\u20133973."},{"key":"e_1_3_1_50_2","first-page":"628","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Xu Zheng","year":"2014","unstructured":"Zheng Xu, Wen Li, Li Niu, and Dong Xu. 2014. Exploiting low-rank structure from latent domains for domain generalization. In Proceedings of the European Conference on Computer Vision. Springer, 628\u2013643."},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1145\/3343031.3350955"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3458280"},{"key":"e_1_3_1_53_2","article-title":"Domain generalization via entropy regularization","volume":"33","author":"Zhao Shanshan","year":"2020","unstructured":"Shanshan Zhao, Mingming Gong, Tongliang Liu, Huan Fu, and Dacheng Tao. 2020. Domain generalization via entropy regularization. Advances in Neural Information Processing Systems 33 (2020).","journal-title":"Advances in Neural Information Processing Systems"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3524136","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3524136","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:31:05Z","timestamp":1750188665000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3524136"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,5]]},"references-count":52,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,1,31]]}},"alternative-id":["10.1145\/3524136"],"URL":"https:\/\/doi.org\/10.1145\/3524136","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2023,1,5]]},"assertion":[{"value":"2021-06-18","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-03-06","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-01-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}