{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,23]],"date-time":"2025-08-23T05:23:40Z","timestamp":1755926620243,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":50,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,10,17]],"date-time":"2021-10-17T00:00:00Z","timestamp":1634428800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Key R&D Plan of the Ministry of Science and Technology","award":["2020AAA0104400"],"award-info":[{"award-number":["2020AAA0104400"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,17]]},"DOI":"10.1145\/3474085.3475449","type":"proceedings-article","created":{"date-parts":[[2021,10,18]],"date-time":"2021-10-18T05:04:15Z","timestamp":1634533455000},"page":"3088-3096","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Implicit Feature Refinement for Instance Segmentation"],"prefix":"10.1145","author":[{"given":"Lufan","family":"Ma","sequence":"first","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tiancai","family":"Wang","sequence":"additional","affiliation":[{"name":"MEGVII Technology, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bin","family":"Dong","sequence":"additional","affiliation":[{"name":"MEGVII Technology, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiangpeng","family":"Yan","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiu","family":"Li","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiangyu","family":"Zhang","sequence":"additional","affiliation":[{"name":"MEGVII Technology, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,10,17]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/104134.104145"},{"key":"e_1_3_2_2_2_1","volume-title":"STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos. In ECCV.","author":"Athar Ali","year":"2020","unstructured":"Ali Athar, Sabarinath Mahadevan, Aljo\u0161a O\u0161ep, Laura Leal-Taix\u00e9, and Bastian Leibe. 2020. STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos. In ECCV."},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.5555\/3454287.3454350"},{"key":"e_1_3_2_2_4_1","unstructured":"Shaojie Bai J Zico Kolter and Vladlen Koltun. 2019b. Trellis networks for sequence modeling. In ICLR."},{"key":"e_1_3_2_2_5_1","volume-title":"Multiscale deep equilibrium models. In NeurIPS.","author":"Bai Shaojie","year":"2020","unstructured":"Shaojie Bai, Vladlen Koltun, and J Zico Kolter. 2020. Multiscale deep equilibrium models. In NeurIPS."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"crossref","unstructured":"Gedas Bertasius and Lorenzo Torresani. 2020. Classifying segmenting and tracking object instances in video with mask propagation. 
In CVPR.","DOI":"10.1109\/CVPR42600.2020.00976"},{"key":"e_1_3_2_2_7_1","volume-title":"Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934","author":"Bochkovskiy Alexey","year":"2020","unstructured":"Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. 2020. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)."},{"key":"e_1_3_2_2_8_1","unstructured":"Zhaowei Cai and Nuno Vasconcelos. 2018. Cascade r-cnn: Delving into high quality object detection. In CVPR."},{"key":"e_1_3_2_2_9_1","volume-title":"SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation. In ECCV.","author":"Cao Jiale","year":"2020","unstructured":"Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, and Ling Shao. 2020. SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation. In ECCV."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"crossref","unstructured":"Nicolas Carion Francisco Massa Gabriel Synnaeve Nicolas Usunier Alexander Kirillov and Sergey Zagoruyko. 2020. End-to-end object detection with transformers. 
In ECCV.","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"crossref","unstructured":"Kai Chen Jiangmiao Pang Jiaqi Wang Yu Xiong Xiaoxiao Li Shuyang Sun Wansen Feng Ziwei Liu Jianping Shi Wanli Ouyang et al. 2019. Hybrid task cascade for instance segmentation. In CVPR.","DOI":"10.1109\/CVPR.2019.00511"},{"key":"e_1_3_2_2_12_1","volume-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. TPAMI","author":"Chen Liang-Chieh","year":"2017","unstructured":"Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. 2017. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. TPAMI (2017)."},{"key":"e_1_3_2_2_13_1","unstructured":"Ricky TQ Chen Yulia Rubanova Jesse Bettencourt and David Duvenaud. 2018. Neural ordinary differential equations. In NeurIPS."},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"crossref","unstructured":"Tianheng Cheng Xinggang Wang Lichao Huang and Wenyu Liu. 2020. Boundary-preserving mask r-cnn. In ECCV.","DOI":"10.1007\/978-3-030-58568-6_39"},{"key":"e_1_3_2_2_15_1","volume-title":"Centernet: Keypoint triplets for object detection. 
In ICCV.","author":"Duan Kaiwen","year":"2019","unstructured":"Kaiwen Duan, Song Bai, Lingxi Xie, Honggang Qi, Qingming Huang, and Qi Tian. 2019. Centernet: Keypoint triplets for object detection. In ICCV."},{"key":"e_1_3_2_2_16_1","volume-title":"Implicit deep learning. arXiv preprint arXiv:1908.06315","author":"Ghaoui Laurent El","year":"2019","unstructured":"Laurent El Ghaoui, Fangda Gu, Bertrand Travacca, Armin Askari, and Alicia Y Tsai. 2019. Implicit deep learning. arXiv preprint arXiv:1908.06315 (2019)."},{"key":"e_1_3_2_2_17_1","volume-title":"Stable architectures for deep neural networks. Inverse Problems","author":"Haber Eldad","year":"2017","unstructured":"Eldad Haber and Lars Ruthotto. 2017. Stable architectures for deep neural networks. Inverse Problems (2017)."},{"key":"e_1_3_2_2_18_1","unstructured":"Kaiming He Georgia Gkioxari Piotr Doll\u00e1r and Ross Girshick. 2017. Mask r-cnn. In ICCV."},{"key":"e_1_3_2_2_19_1","unstructured":"Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"crossref","unstructured":"Zhaojin Huang Lichao Huang Yongchao Gong Chang Huang and Xinggang Wang. 2019. Mask scoring r-cnn. 
In CVPR.","DOI":"10.1109\/CVPR.2019.00657"},{"key":"e_1_3_2_2_21_1","unstructured":"Myungchul Kim Sanghyun Woo Dahun Kim and In So Kweon. 2021. The devil is in the boundary: Exploiting boundary representation for basis-based instance segmentation. In WACV."},{"key":"e_1_3_2_2_22_1","volume-title":"Pointrend: Image segmentation as rendering. In CVPR.","author":"Kirillov Alexander","year":"2020","unstructured":"Alexander Kirillov, Yuxin Wu, Kaiming He, and Ross Girshick. 2020. Pointrend: Image segmentation as rendering. In CVPR."},{"key":"e_1_3_2_2_23_1","unstructured":"Renjie Liao Yuwen Xiong Ethan Fetaya Lisa Zhang KiJung Yoon Xaq Pitkow Raquel Urtasun and Richard Zemel. 2018. Reviving and improving recurrent back-propagation. In ICML."},{"key":"e_1_3_2_2_24_1","unstructured":"Tsung-Yi Lin Piotr Doll\u00e1r Ross Girshick Kaiming He Bharath Hariharan and Serge Belongie. 2017a. Feature pyramid networks for object detection. In CVPR."},{"key":"e_1_3_2_2_25_1","unstructured":"Tsung-Yi Lin Priya Goyal Ross Girshick Kaiming He and Piotr Doll\u00e1r. 2017b. Focal loss for dense object detection. 
In ICCV."},{"key":"e_1_3_2_2_26_1","unstructured":"Tsung-Yi Lin Michael Maire Serge Belongie James Hays Pietro Perona Deva Ramanan Piotr Doll\u00e1r and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In ECCV."},{"key":"e_1_3_2_2_27_1","volume-title":"Ssd: Single shot multibox detector. In ECCV.","author":"Liu Wei","year":"2016","unstructured":"Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In ECCV."},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"crossref","unstructured":"Jonathan Long Evan Shelhamer and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In CVPR.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"crossref","unstructured":"Feng Luo Bin-Bin Gao Jiangpeng Yan and Xiu Li. 2021. A Coarse-to-Fine Instance Segmentation Network with Learning Boundary Representation. In IJCNN.","DOI":"10.1109\/IJCNN52387.2021.9533399"},{"key":"e_1_3_2_2_30_1","unstructured":"Lufan Ma Bin Dong Jiangpeng Yan and Xiu Li. 2021. Matting Enhanced Mask R-CNN. 
In ICME."},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.5555\/2969644.2969707"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"crossref","unstructured":"Joseph Redmon Santosh Divvala Ross Girshick and Ali Farhadi. 2016. You only look once: Unified real-time object detection. In CVPR.","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.5555\/2969239.2969250"},{"key":"e_1_3_2_2_34_1","volume-title":"Weight normalization: A simple reparameterization to accelerate training of deep neural networks. arXiv preprint arXiv:1602.07868","author":"Salimans Tim","year":"2016","unstructured":"Tim Salimans and Diederik P Kingma. 2016. Weight normalization: A simple reparameterization to accelerate training of deep neural networks. arXiv preprint arXiv:1602.07868 (2016)."},{"key":"e_1_3_2_2_35_1","unstructured":"Patrice Y Simard Mary B Ottaway and Dana H Ballard. 1988. Fixed Point Analysis for Recurrent Networks. In NeurIPS."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"crossref","unstructured":"Bharat Singh and Larry S Davis. 2018. An analysis of scale invariance in object detection snip. In CVPR.","DOI":"10.1109\/CVPR.2018.00377"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"crossref","unstructured":"Peize Sun Rufeng Zhang Yi Jiang Tao Kong Chenfeng Xu Wei Zhan Masayoshi Tomizuka Lei Li Zehuan Yuan Changhu Wang et al. 2020. Sparse r-cnn: End-to-end object detection with learnable proposals. 
arXiv preprint arXiv:2011.12450 (2020).","DOI":"10.1109\/CVPR46437.2021.01422"},{"key":"e_1_3_2_2_38_1","volume-title":"Raft: Recurrent all-pairs field transforms for optical flow. In ECCV.","author":"Teed Zachary","year":"2020","unstructured":"Zachary Teed and Jia Deng. 2020. Raft: Recurrent all-pairs field transforms for optical flow. In ECCV."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"crossref","unstructured":"Zhi Tian Chunhua Shen and Hao Chen. 2020. Conditional convolutions for instance segmentation. In ECCV.","DOI":"10.1007\/978-3-030-58452-8_17"},{"key":"e_1_3_2_2_40_1","volume-title":"Fcos: Fully convolutional one-stage object detection. In ICCV.","author":"Tian Zhi","year":"2019","unstructured":"Zhi Tian, Chunhua Shen, Hao Chen, and Tong He. 2019. Fcos: Fully convolutional one-stage object detection. In ICCV."},{"key":"e_1_3_2_2_41_1","volume-title":"Implicit Feature Pyramid Network for Object Detection. arXiv preprint arXiv:2012.13563","author":"Wang Tiancai","year":"2020","unstructured":"Tiancai Wang, Xiangyu Zhang, and Jian Sun. 2020c. Implicit Feature Pyramid Network for Object Detection. 
arXiv preprint arXiv:2012.13563 (2020)."},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"crossref","unstructured":"Xinlong Wang Tao Kong Chunhua Shen Yuning Jiang and Lei Li. 2020a. Solo: Segmenting objects by locations. In ECCV.","DOI":"10.1007\/978-3-030-58523-5_38"},{"key":"e_1_3_2_2_43_1","unstructured":"Xinlong Wang Rufeng Zhang Tao Kong Lei Li and Chunhua Shen. 2020d. Solov2: Dynamic faster and stronger. In NeurIPS."},{"key":"e_1_3_2_2_44_1","volume-title":"End-to-End Video Instance Segmentation with Transformers. arXiv preprint arXiv:2011.14503","author":"Wang Yuqing","year":"2020","unstructured":"Yuqing Wang, Zhaoliang Xu, Xinlong Wang, Chunhua Shen, Baoshan Cheng, Hao Shen, and Huaxia Xia. 2020b. End-to-End Video Instance Segmentation with Transformers. arXiv preprint arXiv:2011.14503 (2020)."},{"key":"e_1_3_2_2_45_1","unstructured":"Yuxin Wu and Kaiming He. 2018. Group normalization. In ECCV."},{"key":"e_1_3_2_2_46_1","volume-title":"Polarmask: Single shot instance segmentation with polar representation. In CVPR.","author":"Xie Enze","year":"2020","unstructured":"Enze Xie, Peize Sun, Xiaoge Song, Wenhai Wang, Xuebo Liu, Ding Liang, Chunhua Shen, and Ping Luo. 2020. Polarmask: Single shot instance segmentation with polar representation. 
In CVPR."},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"crossref","unstructured":"Linjie Yang Yuchen Fan and Ning Xu. 2019. Video instance segmentation. In ICCV.","DOI":"10.1109\/ICCV.2019.00529"},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"crossref","unstructured":"Rufeng Zhang Zhi Tian Chunhua Shen Mingyu You and Youliang Yan. 2020b. Mask encoding for single shot instance segmentation. In CVPR.","DOI":"10.1109\/CVPR42600.2020.01024"},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"crossref","unstructured":"Shifeng Zhang Cheng Chi Yongqiang Yao Zhen Lei and Stan Z Li. 2020a. Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In CVPR.","DOI":"10.1109\/CVPR42600.2020.00978"},{"key":"e_1_3_2_2_50_1","volume-title":"Deformable DETR: Deformable Transformers for End-to-End Object Detection. arXiv preprint arXiv:2010.04159","author":"Zhu Xizhou","year":"2020","unstructured":"Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, and Jifeng Dai. 2020. Deformable DETR: Deformable Transformers for End-to-End Object Detection. 
arXiv preprint arXiv:2010.04159 (2020)."}],"event":{"name":"MM '21: ACM Multimedia Conference","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Virtual Event China","acronym":"MM '21"},"container-title":["Proceedings of the 29th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474085.3475449","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3474085.3475449","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:48:33Z","timestamp":1750193313000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474085.3475449"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,17]]},"references-count":50,"alternative-id":["10.1145\/3474085.3475449","10.1145\/3474085"],"URL":"https:\/\/doi.org\/10.1145\/3474085.3475449","relation":{},"subject":[],"published":{"date-parts":[[2021,10,17]]},"assertion":[{"value":"2021-10-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}