{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,5]],"date-time":"2025-07-05T04:47:05Z","timestamp":1751690825258,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":30,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,10,15]],"date-time":"2018-10-15T00:00:00Z","timestamp":1539561600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Research Foundation of Singapore"},{"name":"NUS","award":["MOE Tier-II R-263-000-D17-112","ECRA R-263-000-C87-133","IDS R-263-000-C67-646"],"award-info":[{"award-number":["MOE Tier-II R-263-000-D17-112","ECRA R-263-000-C87-133","IDS R-263-000-C67-646"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,10,15]]},"DOI":"10.1145\/3240508.3240515","type":"proceedings-article","created":{"date-parts":[[2018,10,18]],"date-time":"2018-10-18T17:52:08Z","timestamp":1539885128000},"page":"45-53","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":24,"title":["Multi-Human Parsing Machines"],"prefix":"10.1145","author":[{"given":"Jianshu","family":"Li","sequence":"first","affiliation":[{"name":"National University of Singapore & SAP Machine Learning, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jian","family":"Zhao","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yunpeng","family":"Chen","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sujoy","family":"Roy","sequence":"additional","affiliation":[{"name":"SAP Machine Learning, 
Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shuicheng","family":"Yan","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiashi","family":"Feng","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Terence","family":"Sim","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,10,15]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proceedings of the International Conference on Machine Learning (ICML). 214--223","author":"Arjovsky Martin","year":"2017","unstructured":"Martin Arjovsky, Soumith Chintala, and L\u00e9on Bottou. 2017. Wasserstein generative adversarial networks. In Proceedings of the International Conference on Machine Learning (ICML). 214--223."},{"key":"e_1_3_2_1_2_1","volume-title":"Yuille","author":"Chen Liang-Chieh","year":"2016","unstructured":"Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. 2016. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. 
arXiv:1606.00915 (2016)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.254"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Jifeng Dai, Kaiming He, and Jian Sun. 2016. Instance-aware semantic segmentation via multi-task network cascades. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3150--3158.","DOI":"10.1109\/CVPR.2016.343"},{"key":"e_1_3_2_1_5_1","volume-title":"Semantic instance segmentation with a discriminative loss function. arXiv preprint arXiv:1708.02551","author":"Brabandere Bert De","year":"2017","unstructured":"Bert De Brabandere, Davy Neven, and Luc Van Gool. 2017. Semantic instance segmentation with a discriminative loss function. arXiv preprint arXiv:1708.02551 (2017)."},{"key":"e_1_3_2_1_6_1","volume-title":"Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing. arXiv preprint arXiv:1703.05446","author":"Gong Ke","year":"2017","unstructured":"Ke Gong, Xiaodan Liang, Xiaohui Shen, and Liang Lin. 2017. Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing. arXiv preprint arXiv:1703.05446 (2017)."},{"key":"e_1_3_2_1_7_1","volume-title":"Proceedings of the Advances in Neural Information Processing Systems (NIPS). 
2672--2680","author":"Goodfellow Ian","year":"2014","unstructured":"Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Proceedings of the Advances in Neural Information Processing Systems (NIPS). 2672--2680."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.322"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_10_1","volume-title":"Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis. arXiv preprint arXiv:1704.04086","author":"Huang Rui","year":"2017","unstructured":"Rui Huang, Shu Zhang, Tianyu Li, and Ran He. 2017. Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis. arXiv preprint arXiv:1704.04086 (2017)."},{"key":"e_1_3_2_1_11_1","unstructured":"Max Jaderberg, Karen Simonyan, Andrew Zisserman, et al. 2015. Spatial transformer networks. In Advances in neural information processing systems. 2017--2025."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.366"},{"key":"e_1_3_2_1_13_1","volume-title":"Towards Real World Human Parsing: Multiple-Human Parsing in the Wild. 
arXiv preprint arXiv:1705.07206","author":"Li Jianshu","year":"2017","unstructured":"Jianshu Li, Jian Zhao, Yunchao Wei, Congyan Lang, Yidong Li, and Jiashi Feng. 2017. Towards Real World Human Parsing: Multiple-Human Parsing in the Wild. arXiv preprint arXiv:1705.07206 (2017)."},{"key":"e_1_3_2_1_14_1","volume-title":"Fully Convolutional Instance-aware Semantic Segmentation. arXiv preprint arXiv:1611.07709","author":"Li Yi","year":"2016","unstructured":"Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, and Yichen Wei. 2016. Fully Convolutional Instance-aware Semantic Segmentation. arXiv preprint arXiv:1611.07709 (2016)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2408360"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Xiaodan Liang, Xiaohui Shen, Donglai Xiang, Jiashi Feng, Liang Lin, and Shuicheng Yan. 2016. Semantic object parsing with local-global long short-term memory. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3185--3193.","DOI":"10.1109\/CVPR.2016.347"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.163"},{"key":"e_1_3_2_1_18_1","volume-title":"ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing. 
arXiv preprint arXiv:1803.01837","author":"Lin Chen-Hsuan","year":"2018","unstructured":"Chen-Hsuan Lin, Ersin Yumer, Oliver Wang, Eli Shechtman, and Simon Lucey. 2018. ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing. arXiv preprint arXiv:1803.01837 (2018)."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298748"},{"key":"e_1_3_2_1_20_1","unstructured":"Alejandro Newell, Zhiao Huang, and Jia Deng. 2017. Associative embedding: End-to-end learning for joint detection and grouping. In Advances in Neural Information Processing Systems. 2274--2284."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"crossref","unstructured":"Ning Zhang, Manohar Paluri, Yaniv Taigman, Rob Fergus, and Lubomir Bourdev. 2015. Beyond Frontal Faces: Improving Person Recognition Using Multiple Cues. arXiv:1501.05703","DOI":"10.1109\/CVPR.2015.7299113"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.299"},{"key":"e_1_3_2_1_23_1","first-page":"12","article-title":"Human Instance Segmentation from Video using Detector-based Conditional Random Fields","volume":"2","author":"Vineet Vibhav","year":"2011","unstructured":"Vibhav Vineet, Jonathan Warrell, Lubor Ladicky, and Philip HS Torr. 2011. Human Instance Segmentation from Video using Detector-based Conditional Random Fields. In BMVC, Vol. 2. 12--15.","journal-title":"BMVC"},
{"key":"e_1_3_2_1_24_1","volume-title":"Gp-gan: Towards realistic high-resolution image blending. arXiv preprint arXiv:1703.07195","author":"Wu Huikai","year":"2017","unstructured":"Huikai Wu, Shuai Zheng, Junge Zhang, and Kaiqi Huang. 2017. Gp-gan: Towards realistic high-resolution image blending. arXiv preprint arXiv:1703.07195 (2017)."},{"volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3570--3577","author":"Yamaguchi Kota","key":"e_1_3_2_1_25_1","unstructured":"Kota Yamaguchi, M. Hadi Kiapour, Luis E. Ortiz, and Tamara L. Berg. 2012. Parsing clothing in fashion photographs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3570--3577."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-017-1055-1"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2017.204"},{"key":"e_1_3_2_1_28_1","volume-title":"Places: A 10 million Image Database for Scene Recognition","author":"Zhou Bolei","year":"2017","unstructured":"Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba. 2017. 
Places: A 10 million Image Database for Scene Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (2017)."},{"key":"e_1_3_2_1_29_1","volume-title":"Semantic understanding of scenes through the ade20k dataset. arXiv preprint arXiv:1608.05442","author":"Zhou Bolei","year":"2016","unstructured":"Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso, and Antonio Torralba. 2016. Semantic understanding of scenes through the ade20k dataset. arXiv preprint arXiv:1608.05442 (2016)."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.544"}],"event":{"name":"MM '18: ACM Multimedia Conference","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Seoul, Republic of Korea","acronym":"MM '18"},"container-title":["Proceedings of the 26th ACM international conference on 
Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3240508.3240515","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3240508.3240515","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:44:00Z","timestamp":1750207440000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3240508.3240515"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,15]]},"references-count":30,"alternative-id":["10.1145\/3240508.3240515","10.1145\/3240508"],"URL":"https:\/\/doi.org\/10.1145\/3240508.3240515","relation":{},"subject":[],"published":{"date-parts":[[2018,10,15]]},"assertion":[{"value":"2018-10-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}