{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:16:32Z","timestamp":1750220192507,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":37,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,12,27]],"date-time":"2022-12-27T00:00:00Z","timestamp":1672099200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,12,27]]},"DOI":"10.1145\/3574131.3574431","type":"proceedings-article","created":{"date-parts":[[2023,1,14]],"date-time":"2023-01-14T18:25:51Z","timestamp":1673720751000},"page":"1-8","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["MMTrans: MultiModal Transformer for realistic video virtual try-on"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6563-669X","authenticated-orcid":false,"given":"Xinrong","family":"Hu","sequence":"first","affiliation":[{"name":"Wuhan Textile University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3371-7190","authenticated-orcid":false,"given":"Ziyi","family":"Zhang","sequence":"additional","affiliation":[{"name":"Wuhan Textile University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1806-7452","authenticated-orcid":false,"given":"Ruiqi","family":"Luo","sequence":"additional","affiliation":[{"name":"Wuhan Textile University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3388-0094","authenticated-orcid":false,"given":"Junjie","family":"Huang","sequence":"additional","affiliation":[{"name":"Wuhan Textile University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7570-1827","authenticated-orcid":false,"given":"Jinxing","family":"Liang","sequence":"additional","affiliation":[{"name":"Wuhan Textile University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6214-9781","authenticated-orcid":false,"given":"Jin","family":"Huang","sequence":"additional","affiliation":[{"name":"Wuhan Textile University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1085-7246","authenticated-orcid":false,"given":"Tao","family":"Peng","sequence":"additional","affiliation":[{"name":"Wuhan Textile University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6076-5169","authenticated-orcid":false,"given":"Hao","family":"Cai","sequence":"additional","affiliation":[{"name":"Wuhan Textile University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,1,13]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.502"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01355"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01391"},{"key":"e_1_3_2_1_4_1","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision. 14638\u201314647","author":"Cui Aiyu","year":"2021","unstructured":"Aiyu Cui , Daniel McKee , and Svetlana Lazebnik . 2021 . Dressing in order: Recurrent person image generation for pose transfer, virtual try-on and outfit editing . In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 14638\u201314647 . Aiyu Cui, Daniel McKee, and Svetlana Lazebnik. 2021. Dressing in order: Recurrent person image generation for pose transfer, virtual try-on and outfit editing. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 14638\u201314647."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00912"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00125"},{"key":"e_1_3_2_1_7_1","unstructured":"Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929(2020).  Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929(2020)."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01665"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00838"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3422622"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.01057"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00787"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00685"},{"key":"e_1_3_2_1_14_1","volume-title":"Viton-gan: Virtual try-on image generator trained with adversarial loss. arXiv preprint arXiv:1911.07926(2019).","author":"Honda Shion","year":"2019","unstructured":"Shion Honda . 2019 . Viton-gan: Virtual try-on image generator trained with adversarial loss. arXiv preprint arXiv:1911.07926(2019). Shion Honda. 2019. Viton-gan: Virtual try-on image generator trained with adversarial loss. arXiv preprint arXiv:1911.07926(2019)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58565-5_37"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2017.269"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58565-5_2"},{"key":"e_1_3_2_1_18_1","volume-title":"CVPR Workshops.","author":"Minar Matiur\u00a0Rahman","year":"2020","unstructured":"Matiur\u00a0Rahman Minar , Thai\u00a0Thanh Tuan , Heejune Ahn , Paul Rosin , and Yu-Kun Lai . 2020 . Cp-vton+: Clothing shape and texture preserving image-based virtual try-on . In CVPR Workshops. Matiur\u00a0Rahman Minar, Thai\u00a0Thanh Tuan, Heejune Ahn, Paul Rosin, and Yu-Kun Lai. 2020. Cp-vton+: Clothing shape and texture preserving image-based virtual try-on. In CVPR Workshops."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00705"},{"key":"e_1_3_2_1_20_1","unstructured":"Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784(2014).  Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784(2014)."},{"key":"e_1_3_2_1_21_1","unstructured":"Alec Radford Luke Metz and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434(2015).  Alec Radford Luke Metz and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434(2015)."},{"key":"e_1_3_2_1_22_1","unstructured":"Bin Ren Hao Tang Fanyang Meng Runwei Ding Ling Shao Philip\u00a0HS Torr and Nicu Sebe. 2021. Cloth interactive transformer for virtual try-on. arXiv preprint arXiv:2104.05519(2021).  Bin Ren Hao Tang Fanyang Meng Runwei Ding Ling Shao Philip\u00a0HS Torr and Nicu Sebe. 2021. Cloth interactive transformer for virtual try-on. arXiv preprint arXiv:2104.05519(2021)."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3416270"},{"key":"e_1_3_2_1_24_1","unstructured":"Hao Tang Song Bai Philip\u00a0HS Torr and Nicu Sebe. 2020b. Bipartite graph reasoning gans for person image generation. arXiv preprint arXiv:2008.04381(2020).  Hao Tang Song Bai Philip\u00a0HS Torr and Nicu Sebe. 2020b. Bipartite graph reasoning gans for person image generation. arXiv preprint arXiv:2008.04381(2020)."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58595-2_43"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1656"},{"key":"e_1_3_2_1_27_1","volume-title":"Attention is all you need. Advances in neural information processing systems 30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan\u00a0 N Gomez , \u0141ukasz Kaiser , and Illia Polosukhin . 2017. Attention is all you need. Advances in neural information processing systems 30 ( 2017 ). Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan\u00a0N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 (2017)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01261-8_36"},{"key":"e_1_3_2_1_29_1","unstructured":"Jiahang Wang Wei Zhang Weizhong Liu and Tao Mei. 2019. Down to the last detail: Virtual try-on with detail carving. arXiv preprint arXiv:1912.06324(2019).  Jiahang Wang Wei Zhang Weizhong Liu and Tao Mei. 2019. Down to the last detail: Virtual try-on with detail carving. arXiv preprint arXiv:1912.06324(2019)."},{"key":"e_1_3_2_1_30_1","unstructured":"Ting-Chun Wang Ming-Yu Liu Jun-Yan Zhu Guilin Liu Andrew Tao Jan Kautz and Bryan Catanzaro. 2018b. Video-to-video synthesis. arXiv preprint arXiv:1808.06601(2018).  Ting-Chun Wang Ming-Yu Liu Jun-Yan Zhu Guilin Liu Andrew Tao Jan Kautz and Bryan Catanzaro. 2018b. Video-to-video synthesis. arXiv preprint arXiv:1808.06601(2018)."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00813"},{"key":"e_1_3_2_1_32_1","volume-title":"Image quality assessment: from error visibility to structural similarity","author":"Wang Zhou","year":"2004","unstructured":"Zhou Wang , Alan\u00a0 C Bovik , Hamid\u00a0 R Sheikh , and Eero\u00a0 P Simoncelli . 2004. Image quality assessment: from error visibility to structural similarity . IEEE transactions on image processing 13, 4 ( 2004 ), 600\u2013612. Zhou Wang, Alan\u00a0C Bovik, Hamid\u00a0R Sheikh, and Eero\u00a0P Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing 13, 4 (2004), 600\u2013612."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00787"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00787"},{"key":"e_1_3_2_1_35_1","unstructured":"Honglun Zhang Wenqing Chen Hao He and Yaohui Jin. 2019. Disentangled makeup transfer with generative adversarial network. arXiv preprint arXiv:1907.01144(2019).  Honglun Zhang Wenqing Chen Hao He and Yaohui Jin. 2019. Disentangled makeup transfer with generative adversarial network. arXiv preprint arXiv:1907.01144(2019)."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00068"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475269"}],"event":{"name":"VRCAI '22: The 18th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and its Applications in Industry","sponsor":["SIGGRAPH ACM Special Interest Group on Computer Graphics and Interactive Techniques"],"location":"Guangzhou China","acronym":"VRCAI '22"},"container-title":["Proceedings of the 18th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and its Applications in Industry"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3574131.3574431","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3574131.3574431","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:33Z","timestamp":1750186953000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3574131.3574431"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,27]]},"references-count":37,"alternative-id":["10.1145\/3574131.3574431","10.1145\/3574131"],"URL":"https:\/\/doi.org\/10.1145\/3574131.3574431","relation":{},"subject":[],"published":{"date-parts":[[2022,12,27]]},"assertion":[{"value":"2023-01-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}