{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,23]],"date-time":"2025-08-23T05:25:05Z","timestamp":1755926705420,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Key R&D Project","award":["2021YFC3320301"],"award-info":[{"award-number":["2021YFC3320301"]}]},{"name":"National Natural Science Foundation of China","award":["62171325"],"award-info":[{"award-number":["62171325"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3548014","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:42:35Z","timestamp":1665416555000},"page":"6618-6626","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Towards Causality Inference for Very Important Person Localization"],"prefix":"10.1145","author":[{"given":"Xiao","family":"Wang","sequence":"first","affiliation":[{"name":"Wuhan University of Science and Technology, Wuhan, China"}]},{"given":"Zheng","family":"Wang","sequence":"additional","affiliation":[{"name":"Wuhan University, Wuhan, China"}]},{"given":"Wu","family":"Liu","sequence":"additional","affiliation":[{"name":"JD Explore Academy, Beijing, China"}]},{"given":"Xin","family":"Xu","sequence":"additional","affiliation":[{"name":"Wuhan University of Science and Technology, Wuhan, China"}]},{"given":"Qijun","family":"Zhao","sequence":"additional","affiliation":[{"name":"Sichuan University, Chengdu, China"}]},{"given":"Shin'ichi","family":"Satoh","sequence":"additional","affiliation":[{"name":"National Institute of Informatics, Tokyo, Japan"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","first-page":"234","volume-title":"2018 13th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2018","author":"Li Wei-Hong","year":"2018","unstructured":"Wei-Hong Li , Benchao Li , and Wei-Shi Zheng . Personrank : Detecting important people in images . In 2018 13th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2018 ), pages 234 -- 241 . IEEE, 2018 . Wei-Hong Li, Benchao Li, and Wei-Shi Zheng. Personrank: Detecting important people in images. In 2018 13th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2018), pages 234--241. IEEE, 2018."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299119"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i4.16386"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3417332"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3422852.3423485"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3479207"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00514"},{"key":"e_1_3_2_2_8_1","first-page":"4145","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Hong Fa-Ting","year":"2020","unstructured":"Fa-Ting Hong , Weihong Li , and Wei-Shi Zheng . Learning to detect important people in unlabelled images for semi-supervise important people detection . In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pages 4145 -- 4153 , 2020 . Fa-Ting Hong, Weihong Li, and Wei-Shi Zheng. Learning to detect important people in unlabelled images for semi-supervise important people detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 4145--4153, 2020."},{"key":"e_1_3_2_2_9_1","volume-title":"Causality: Models, reasoning and inference","author":"Judea Pearl","year":"2000","unstructured":"Judea Pearl et al. Causality: Models, reasoning and inference . Cambridge, UK : Cambridge University Press , 19, 2000 . Judea Pearl et al. Causality: Models, reasoning and inference. Cambridge, UK: Cambridge University Press, 19, 2000."},{"key":"e_1_3_2_2_10_1","volume-title":"Yolov4: Optimal speed and accuracy of object detection. CoRR, abs\/2004.10934","author":"Bochkovskiy Alexey","year":"2020","unstructured":"Alexey Bochkovskiy , Chien-Yao Wang , and Hong-Yuan Mark Liao . Yolov4: Optimal speed and accuracy of object detection. CoRR, abs\/2004.10934 , 2020 . Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. Yolov4: Optimal speed and accuracy of object detection. CoRR, abs\/2004.10934, 2020."},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6248100"},{"key":"e_1_3_2_2_12_1","first-page":"1346","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Lee Yong Jae","year":"2012","unstructured":"Yong Jae Lee , Joydeep Ghosh , and Kristen Grauman . Discovering important people and objects for egocentric video summarization . In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pages 1346 -- 1353 , 2012 . Yong Jae Lee, Joydeep Ghosh, and Kristen Grauman. Discovering important people and objects for egocentric video summarization. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1346--1353, 2012."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0794-5"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.332"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3524497"},{"key":"e_1_3_2_2_16_1","first-page":"518","volume-title":"European Conference on Computer Vision (ECCV)","author":"Ghosh Shreya","year":"2018","unstructured":"Shreya Ghosh and Abhinav Dhall . Role of group level affect to find the most influential person in images . In European Conference on Computer Vision (ECCV) , pages 518 -- 533 , 2018 . Shreya Ghosh and Abhinav Dhall. Role of group level affect to find the most influential person in images. In European Conference on Computer Vision (ECCV), pages 518--533, 2018."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2019.00279"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00368"},{"key":"e_1_3_2_2_19_1","volume-title":"Putting people in their place: Monocular regression of 3d people in depth. CoRR, abs\/2112.08274","author":"Sun Yu","year":"2021","unstructured":"Yu Sun , Wu Liu , Qian Bao , Yili Fu , Tao Mei , and Michael J. Black . Putting people in their place: Monocular regression of 3d people in depth. CoRR, abs\/2112.08274 , 2021 . Yu Sun, Wu Liu, Qian Bao, Yili Fu, Tao Mei, and Michael J. Black. Putting people in their place: Monocular regression of 3d people in depth. CoRR, abs\/2112.08274, 2021."},{"key":"e_1_3_2_2_20_1","volume-title":"Causal inference in statistics: A primer","author":"Pearl Judea","year":"2016","unstructured":"Judea Pearl , Madelyn Glymour , and Nicholas P Jewell . Causal inference in statistics: A primer . John Wiley & Sons , 2016 . Judea Pearl, Madelyn Glymour, and Nicholas P Jewell. Causal inference in statistics: A primer. John Wiley & Sons, 2016."},{"key":"e_1_3_2_2_21_1","volume-title":"Causality for machine learning. arXiv preprint arXiv:1911.10500","author":"Sch\u00f6lkopf Bernhard","year":"2019","unstructured":"Bernhard Sch\u00f6lkopf . Causality for machine learning. arXiv preprint arXiv:1911.10500 , 2019 . Bernhard Sch\u00f6lkopf. Causality for machine learning. arXiv preprint arXiv:1911.10500, 2019."},{"key":"e_1_3_2_2_22_1","volume-title":"Visual causal feature learning. arXiv preprint arXiv:1412.2309","author":"Chalupka Krzysztof","year":"2014","unstructured":"Krzysztof Chalupka , Pietro Perona , and Frederick Eberhardt . Visual causal feature learning. arXiv preprint arXiv:1412.2309 , 2014 . Krzysztof Chalupka, Pietro Perona, and Frederick Eberhardt. Visual causal feature learning. arXiv preprint arXiv:1412.2309, 2014."},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01077"},{"key":"e_1_3_2_2_24_1","volume-title":"Interventional few-shot learning. arXiv preprint arXiv:2009.13000","author":"Yue Zhongqi","year":"2020","unstructured":"Zhongqi Yue , Hanwang Zhang , Qianru Sun , and Xian-Sheng Hua . Interventional few-shot learning. arXiv preprint arXiv:2009.13000 , 2020 . Zhongqi Yue, Hanwang Zhang, Qianru Sun, and Xian-Sheng Hua. Interventional few-shot learning. arXiv preprint arXiv:2009.13000, 2020."},{"key":"e_1_3_2_2_25_1","volume-title":"Long-tailed classification by keeping the good and removing the bad momentum causal effect. arXiv preprint arXiv:2009.12991","author":"Tang Kaihua","year":"2020","unstructured":"Kaihua Tang , Jianqiang Huang , and Hanwang Zhang . Long-tailed classification by keeping the good and removing the bad momentum causal effect. arXiv preprint arXiv:2009.12991 , 2020 . Kaihua Tang, Jianqiang Huang, and Hanwang Zhang. Long-tailed classification by keeping the good and removing the bad momentum causal effect. arXiv preprint arXiv:2009.12991, 2020."},{"key":"e_1_3_2_2_26_1","volume-title":"Causal intervention for weakly-supervised semantic segmentation. arXiv preprint arXiv:2009.12547","author":"Zhang Dong","year":"2020","unstructured":"Dong Zhang , Hanwang Zhang , Jinhui Tang , Xiansheng Hua , and Qianru Sun . Causal intervention for weakly-supervised semantic segmentation. arXiv preprint arXiv:2009.12547 , 2020 . Dong Zhang, Hanwang Zhang, Jinhui Tang, Xiansheng Hua, and Qianru Sun. Causal intervention for weakly-supervised semantic segmentation. arXiv preprint arXiv:2009.12547, 2020."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00377"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01087"},{"key":"e_1_3_2_2_29_1","volume-title":"Counterfactuals uncover the modular structure of deep generative models. arXiv preprint arXiv:1812.03253","author":"Besserve Michel","year":"2018","unstructured":"Michel Besserve , Arash Mehrjou , R\u00e9my Sun , and Bernhard Sch\u00f6lkopf . Counterfactuals uncover the modular structure of deep generative models. arXiv preprint arXiv:1812.03253 , 2018 . Michel Besserve, Arash Mehrjou, R\u00e9my Sun, and Bernhard Sch\u00f6lkopf. Counterfactuals uncover the modular structure of deep generative models. arXiv preprint arXiv:1812.03253, 2018."},{"key":"e_1_3_2_2_30_1","first-page":"4036","volume-title":"International Conference on Machine Learning","author":"Parascandolo Giambattista","year":"2018","unstructured":"Giambattista Parascandolo , Niki Kilbertus , Mateo Rojas-Carulla , and Bernhard Sch\u00f6lkopf . Learning independent causal mechanisms . In International Conference on Machine Learning , pages 4036 -- 4044 . PMLR, 2018 . Giambattista Parascandolo, Niki Kilbertus, Mateo Rojas-Carulla, and Bernhard Sch\u00f6lkopf. Learning independent causal mechanisms. In International Conference on Machine Learning, pages 4036--4044. PMLR, 2018."},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00471"},{"key":"e_1_3_2_2_32_1","first-page":"1209","volume-title":"CVPR","author":"Caesar Holger","year":"2018","unstructured":"Holger Caesar , Jasper R. R. Uijlings , and Vittorio Ferrari . Coco-stuff : Thing and stuff classes in context . In CVPR , pages 1209 -- 1218 , 2018 . Holger Caesar, Jasper R. R. Uijlings, and Vittorio Ferrari. Coco-stuff: Thing and stuff classes in context. In CVPR, pages 1209--1218, 2018."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3065386"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/BTAS.2017.8272675"},{"key":"e_1_3_2_2_36_1","volume-title":"Crowdhuman: A benchmark for detecting human in a crowd. CoRR, abs\/1805.00123","author":"Shao Shuai","year":"2018","unstructured":"Shuai Shao , Zijian Zhao , Boxun Li , Tete Xiao , Gang Yu , Xiangyu Zhang , and Jian Sun . Crowdhuman: A benchmark for detecting human in a crowd. CoRR, abs\/1805.00123 , 2018 . Shuai Shao, Zijian Zhao, Boxun Li, Tete Xiao, Gang Yu, Xiangyu Zhang, and Jian Sun. Crowdhuman: A benchmark for detecting human in a crowd. CoRR, abs\/1805.00123, 2018."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00238"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2013.59"},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.667"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413566"},{"key":"e_1_3_2_2_41_1","first-page":"1928","volume-title":"Proceedings of the 33nd International Conference on Machine Learning, ICML 2016","volume":"48","author":"Mnih Volodymyr","year":"2016","unstructured":"Volodymyr Mnih , Adri\u00e0 Puigdom\u00e8nech Badia , Mehdi Mirza , Alex Graves , Timothy P. Lillicrap , Tim Harley , David Silver , and Koray Kavukcuoglu . Asynchronous methods for deep reinforcement learning . In Proceedings of the 33nd International Conference on Machine Learning, ICML 2016 , New York City, NY, USA, June 19--24 , 2016 , volume 48 , pages 1928 -- 1937 , 2016. Volodymyr Mnih, Adri\u00e0 Puigdom\u00e8nech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. Asynchronous methods for deep reinforcement learning. In Proceedings of the 33nd International Conference on Machine Learning, ICML 2016, New York City, NY, USA, June 19--24, 2016, volume 48, pages 1928--1937, 2016."},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475382"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3380549"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Lisboa Portugal","acronym":"MM '22"},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548014","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3548014","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:29Z","timestamp":1750186949000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548014"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":43,"alternative-id":["10.1145\/3503161.3548014","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3548014","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}