{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T21:44:00Z","timestamp":1774993440356,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":32,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,12,13]],"date-time":"2022-12-13T00:00:00Z","timestamp":1670889600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"JSPS KAKENJI","award":["21H05812, 22H00540, 22H00548, and 22K19808"],"award-info":[{"award-number":["21H05812, 22H00540, 22H00548, and 22K19808"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,12,13]]},"DOI":"10.1145\/3551626.3564944","type":"proceedings-article","created":{"date-parts":[[2022,12,7]],"date-time":"2022-12-07T00:55:45Z","timestamp":1670374545000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Parallel Queries for Human-Object Interaction Detection"],"prefix":"10.1145","author":[{"given":"Junwen","family":"Chen","sequence":"first","affiliation":[{"name":"The University of Electro-Communications, Tokyo, Japan"}]},{"given":"Keiji","family":"Yanai","sequence":"additional","affiliation":[{"name":"The University of Electro-Communications, Tokyo, Japan"}]}],"member":"320","published-online":{"date-parts":[[2022,12,13]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Jamie Ryan Kiros, and Geoffrey E Hinton","author":"Ba Jimmy Lei","year":"2016","unstructured":"Jimmy Lei Ba , Jamie Ryan Kiros, and Geoffrey E Hinton . 2016 . Layer normalization. arXiv preprint arXiv:1607.06450 (2016). Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E Hinton. 2016. Layer normalization. arXiv preprint arXiv:1607.06450 (2016)."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"crossref","unstructured":"Nicolas Carion Francisco Massa Gabriel Synnaeve Nicolas Usunier Alexander Kirillov and Sergey Zagoruyko. 2020. End-to-end object detection with transformers. In ECCV.  Nicolas Carion Francisco Massa Gabriel Synnaeve Nicolas Usunier Alexander Kirillov and Sergey Zagoruyko. 2020. End-to-end object detection with transformers. In ECCV.","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"crossref","unstructured":"Yu-Wei Chao Yunfan Liu Xieyang Liu Huayi Zeng and Jia Deng. 2018. Learning to detect human-object interactions. In WACV.  Yu-Wei Chao Yunfan Liu Xieyang Liu Huayi Zeng and Jia Deng. 2018. Learning to detect human-object interactions. In WACV.","DOI":"10.1109\/WACV.2018.00048"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"crossref","unstructured":"Mingfei Chen Yue Liao Si Liu Zhiyuan Chen Fei Wang and Chen Qian. 2021. Reformulating HOI detection as adaptive set prediction. In CVPR.  Mingfei Chen Yue Liao Si Liu Zhiyuan Chen Fei Wang and Chen Qian. 2021. Reformulating HOI detection as adaptive set prediction. In CVPR.","DOI":"10.1109\/CVPR46437.2021.00889"},{"key":"e_1_3_2_2_5_1","volume-title":"DRG: Dual relation graph for human-object interaction detection. In ECCV.","author":"Gao Chen","year":"2020","unstructured":"Chen Gao , Jiarui Xu , Yuliang Zou , and Jia-Bin Huang . 2020 . DRG: Dual relation graph for human-object interaction detection. In ECCV. Chen Gao, Jiarui Xu, Yuliang Zou, and Jia-Bin Huang. 2020. DRG: Dual relation graph for human-object interaction detection. In ECCV."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"crossref","unstructured":"Ross Girshick. 2015. Fast R-CNN. In ICCV.  Ross Girshick. 2015. Fast R-CNN. In ICCV.","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"crossref","unstructured":"Georgia Gkioxari Ross Girshick Piotr Doll\u00e1r and Kaiming He. 2018. Detecting and recognizing human-object interactions. In CVPR.  Georgia Gkioxari Ross Girshick Piotr Doll\u00e1r and Kaiming He. 2018. Detecting and recognizing human-object interactions. In CVPR.","DOI":"10.1109\/CVPR.2018.00872"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"crossref","unstructured":"Tanmay Gupta Alexander Schwing and Derek Hoiem. 2019. No-frills human-object interaction detection: Factorization layout encodings and training techniques. In ICCV.  Tanmay Gupta Alexander Schwing and Derek Hoiem. 2019. No-frills human-object interaction detection: Factorization layout encodings and training techniques. In ICCV.","DOI":"10.1109\/ICCV.2019.00977"},{"key":"e_1_3_2_2_9_1","unstructured":"Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR.  Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"crossref","unstructured":"Zhi Hou Baosheng Yu Yu Qiao Xiaojiang Peng and Dacheng Tao. 2021. Affordance transfer learning for human-object interaction detection. In CVPR.  Zhi Hou Baosheng Yu Yu Qiao Xiaojiang Peng and Dacheng Tao. 2021. Affordance transfer learning for human-object interaction detection. In CVPR.","DOI":"10.1109\/CVPR46437.2021.00056"},{"key":"e_1_3_2_2_11_1","volume":"202","author":"Kim Bumsoo","unstructured":"Bumsoo Kim , Taeho Choi , Jaewoo Kang , and Hyunwoo J Kim. 202 0. UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection. In ECCV. Bumsoo Kim, Taeho Choi, Jaewoo Kang, and Hyunwoo J Kim. 2020. UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection. In ECCV.","journal-title":"Hyunwoo J Kim."},{"key":"e_1_3_2_2_12_1","volume":"202","author":"Kim Bumsoo","unstructured":"Bumsoo Kim , Junhyun Lee , Jaewoo Kang , Eun-Sol Kim , and Hyunwoo J Kim. 202 1. HOTR: End-to-end human-object interaction detection with transformers. In CVPR. Bumsoo Kim, Junhyun Lee, Jaewoo Kang, Eun-Sol Kim, and Hyunwoo J Kim. 2021. HOTR: End-to-end human-object interaction detection with transformers. In CVPR.","journal-title":"Hyunwoo J Kim."},{"key":"e_1_3_2_2_13_1","unstructured":"Dong-Jin Kim Xiao Sun Jinsoo Choi Stephen Lin and In So Kweon. 2020. Detecting human-object interactions with action co-occurrence priors. In ECCV.  Dong-Jin Kim Xiao Sun Jinsoo Choi Stephen Lin and In So Kweon. 2020. Detecting human-object interactions with action co-occurrence priors. In ECCV."},{"key":"e_1_3_2_2_14_1","volume-title":"The Hungarian method for the assignment problem. Naval Res. Logist. Quart","author":"Kuhn Harold W","year":"1955","unstructured":"Harold W Kuhn . 1955. The Hungarian method for the assignment problem. Naval Res. Logist. Quart ( 1955 ), 83--97. Harold W Kuhn. 1955. The Hungarian method for the assignment problem. Naval Res. Logist. Quart (1955), 83--97."},{"key":"e_1_3_2_2_15_1","first-page":"2d","volume":"202","author":"Li Yong-Lu","unstructured":"Yong-Lu Li , Xinpeng Liu , Han Lu , Shiyi Wang , Junqi Liu , Jiefeng Li , and Cewu Lu. 202 0. Detailed 2d - 23 d joint representation for human-object interaction. In CVPR. Yong-Lu Li, Xinpeng Liu, Han Lu, Shiyi Wang, Junqi Liu, Jiefeng Li, and Cewu Lu. 2020. Detailed 2d-3d joint representation for human-object interaction. In CVPR.","journal-title":"Cewu Lu."},{"key":"e_1_3_2_2_16_1","volume-title":"PPDM: Parallel point detection and matching for real-time human-object interaction detection. In CVPR.","author":"Liao Yue","year":"2020","unstructured":"Yue Liao , Si Liu , Fei Wang , Yanjie Chen , Chen Qian , and Jiashi Feng . 2020 . PPDM: Parallel point detection and matching for real-time human-object interaction detection. In CVPR. Yue Liao, Si Liu, Fei Wang, Yanjie Chen, Chen Qian, and Jiashi Feng. 2020. PPDM: Parallel point detection and matching for real-time human-object interaction detection. In CVPR."},{"key":"e_1_3_2_2_17_1","unstructured":"Tsung-Yi Lin Michael Maire Serge Belongie James Hays Pietro Perona Deva Ramanan Piotr Doll\u00e1r and C Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In ECCV.  Tsung-Yi Lin Michael Maire Serge Belongie James Hays Pietro Perona Deva Ramanan Piotr Doll\u00e1r and C Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In ECCV."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"crossref","unstructured":"Yang Liu Qingchao Chen and Andrew Zisserman. 2020. Amplifying key cues for human-object-interaction detection. In ECCV.  Yang Liu Qingchao Chen and Andrew Zisserman. 2020. Amplifying key cues for human-object-interaction detection. In ECCV.","DOI":"10.1007\/978-3-030-58568-6_15"},{"key":"e_1_3_2_2_19_1","unstructured":"Ilya Loshchilov and Frank Hutter. 2018. Decoupled Weight Decay Regularization. In ICLR.  Ilya Loshchilov and Frank Hutter. 2018. Decoupled Weight Decay Regularization. In ICLR."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"crossref","unstructured":"Alejandro Newell Kaiyu Yang and Jia Deng. 2016. Stacked hourglass networks for human pose estimation. In ECCV.  Alejandro Newell Kaiyu Yang and Jia Deng. 2016. Stacked hourglass networks for human pose estimation. In ECCV.","DOI":"10.1007\/978-3-319-46484-8_29"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"crossref","unstructured":"Hamid Rezatofighi Nathan Tsoi JunYoung Gwak Amir Sadeghian Ian Reid and Silvio Savarese. 2019. Generalized intersection over union: A metric and a loss for bounding box regression. In CVPR.  Hamid Rezatofighi Nathan Tsoi JunYoung Gwak Amir Sadeghian Ian Reid and Silvio Savarese. 2019. Generalized intersection over union: A metric and a loss for bounding box regression. In CVPR.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"e_1_3_2_2_22_1","volume-title":"QPIC: Query-based pairwise human-object interaction detection with image-wide contextual information. In CVPR.","author":"Tamura Masato","year":"2021","unstructured":"Masato Tamura , Hiroki Ohashi , and Tomoaki Yoshinaga . 2021 . QPIC: Query-based pairwise human-object interaction detection with image-wide contextual information. In CVPR. Masato Tamura, Hiroki Ohashi, and Tomoaki Yoshinaga. 2021. QPIC: Query-based pairwise human-object interaction detection with image-wide contextual information. In CVPR."},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"crossref","unstructured":"Oytun Ulutan ASM Iftekhar and Bangalore S Manjunath. 2020. VSGNet: Spatial attention network for detecting human object interactions using graph convolutions. In CVPR.  Oytun Ulutan ASM Iftekhar and Bangalore S Manjunath. 2020. VSGNet: Spatial attention network for detecting human object interactions using graph convolutions. In CVPR.","DOI":"10.1109\/CVPR42600.2020.01363"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"crossref","unstructured":"Bo Wan Desen Zhou Yongfei Liu Rongjie Li and Xuming He. 2019. Pose-aware multi-level feature network for human object interaction detection. In ICCV.  Bo Wan Desen Zhou Yongfei Liu Rongjie Li and Xuming He. 2019. Pose-aware multi-level feature network for human object interaction detection. In ICCV.","DOI":"10.1109\/ICCV.2019.00956"},{"key":"e_1_3_2_2_25_1","volume-title":"Xiangyu Zhang, and Jian Sun.","author":"Wang Tiancai","year":"2020","unstructured":"Tiancai Wang , Tong Yang , Martin Danelljan , Fahad Shahbaz Khan , Xiangyu Zhang, and Jian Sun. 2020 . Learning human-object interaction detection using interaction points. In CVPR. Tiancai Wang, Tong Yang, Martin Danelljan, Fahad Shahbaz Khan, Xiangyu Zhang, and Jian Sun. 2020. Learning human-object interaction detection using interaction points. In CVPR."},{"key":"e_1_3_2_2_26_1","unstructured":"Aixi Zhang Yue Liao Si Liu Miao Lu Yongliang Wang Chen Gao and Xiaobo Li. 2021. Mining the Benefits of Two-stage and One-stage HOI Detection. In NeurIPS.  Aixi Zhang Yue Liao Si Liu Miao Lu Yongliang Wang Chen Gao and Xiaobo Li. 2021. Mining the Benefits of Two-stage and One-stage HOI Detection. In NeurIPS."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"crossref","unstructured":"Frederic Z Zhang Dylan Campbell and Stephen Gould. 2021. Spatially conditioned graphs for detecting human-object interactions. In ICCV.  Frederic Z Zhang Dylan Campbell and Stephen Gould. 2021. Spatially conditioned graphs for detecting human-object interactions. In ICCV.","DOI":"10.1109\/ICCV48922.2021.01307"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"crossref","unstructured":"Xubin Zhong Changxing Ding Xian Qu and Dacheng Tao. 2021. Polysemy deciphering network for robust human-object interaction detection. In IJCV.  Xubin Zhong Changxing Ding Xian Qu and Dacheng Tao. 2021. Polysemy deciphering network for robust human-object interaction detection. In IJCV.","DOI":"10.1007\/s11263-021-01458-8"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"crossref","unstructured":"Xubin Zhong Xian Qu Changxing Ding and Dacheng Tao. 2021. Glance and Gaze: Inferring action-aware points for one-stage human-object interaction detection. In CVPR.  Xubin Zhong Xian Qu Changxing Ding and Dacheng Tao. 2021. Glance and Gaze: Inferring action-aware points for one-stage human-object interaction detection. In CVPR.","DOI":"10.1109\/CVPR46437.2021.01303"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"crossref","unstructured":"Penghao Zhou and Mingmin Chi. 2019. Relation parsing neural network for human-object interaction detection. In ICCV.  Penghao Zhou and Mingmin Chi. 2019. Relation parsing neural network for human-object interaction detection. In ICCV.","DOI":"10.1109\/ICCV.2019.00093"},{"key":"e_1_3_2_2_31_1","unstructured":"Xizhou Zhu Weijie Su Lewei Lu Bin Li Xiaogang Wang and Jifeng Dai. 2020. Deformable DETR: Deformable transformers for end-to-end object detection. In ICLR.  Xizhou Zhu Weijie Su Lewei Lu Bin Li Xiaogang Wang and Jifeng Dai. 2020. Deformable DETR: Deformable transformers for end-to-end object detection. In ICLR."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"crossref","unstructured":"Cheng Zou Bohan Wang Yue Hu Junqi Liu Qian Wu Yu Zhao Boxun Li Chenguang Zhang Chi Zhang Yichen Wei etal 2021. End-to-end human object interaction detection with hoi transformer. In CVPR.  Cheng Zou Bohan Wang Yue Hu Junqi Liu Qian Wu Yu Zhao Boxun Li Chenguang Zhang Chi Zhang Yichen Wei et al. 2021. End-to-end human object interaction detection with hoi transformer. In CVPR.","DOI":"10.1109\/CVPR46437.2021.01165"}],"event":{"name":"MMAsia '22: ACM Multimedia Asia","location":"Tokyo Japan","acronym":"MMAsia '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 4th ACM International Conference on Multimedia in Asia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3551626.3564944","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3551626.3564944","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:25Z","timestamp":1750186825000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3551626.3564944"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,13]]},"references-count":32,"alternative-id":["10.1145\/3551626.3564944","10.1145\/3551626"],"URL":"https:\/\/doi.org\/10.1145\/3551626.3564944","relation":{},"subject":[],"published":{"date-parts":[[2022,12,13]]},"assertion":[{"value":"2022-12-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}