{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,9]],"date-time":"2026-04-09T14:45:25Z","timestamp":1775745925201,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":56,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T00:00:00Z","timestamp":1602460800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NSFC","award":["61672521"],"award-info":[{"award-number":["61672521"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,10,12]]},"DOI":"10.1145\/3394171.3413989","type":"proceedings-article","created":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T13:10:44Z","timestamp":1602508244000},"page":"1615-1624","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":25,"title":["Box Guided Convolution for Pedestrian Detection"],"prefix":"10.1145","author":[{"given":"Jinpeng","family":"Li","sequence":"first","affiliation":[{"name":"Inception Institute of Artificial Intelligence (IIAI), Abu Dhabi, UAE"}]},{"given":"Shengcai","family":"Liao","sequence":"additional","affiliation":[{"name":"Inception Institute of Artificial Intelligence (IIAI), Abu Dhabi, UAE"}]},{"given":"Hangzhi","family":"Jiang","sequence":"additional","affiliation":[{"name":"University of Chinese Academy of Sciences &amp; Institute of Automation, Chinese Academy of Sciences, Beijing, China"}]},{"given":"Ling","family":"Shao","sequence":"additional","affiliation":[{"name":"Inception Institute of Artificial Intelligence (IIAI) &amp; Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, UAE"}]}],"member":"320","published-online":{"date-parts":[[2020,10,12]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Proceedings, Part IV. 354--370","author":"Cai Zhaowei","year":"2016","unstructured":"Zhaowei Cai , Quanfu Fan , Rog\u00e9rio Schmidt Feris , and Nuno Vasconcelos . 2016 . A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection. In Computer Vision - ECCV 2016 - 14th European Conference, Amsterdam, The Netherlands, October 11--14, 2016 , Proceedings, Part IV. 354--370 . https:\/\/doi.org\/10.1007\/978--3--319--46493-0_22 10.1007\/978--3--319--46493-0_22 Zhaowei Cai, Quanfu Fan, Rog\u00e9rio Schmidt Feris, and Nuno Vasconcelos. 2016. A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection. In Computer Vision - ECCV 2016 - 14th European Conference, Amsterdam, The Netherlands, October 11--14, 2016, Proceedings, Part IV. 354--370. https:\/\/doi.org\/10.1007\/978--3--319--46493-0_22"},{"key":"e_1_3_2_2_2_1","volume-title":"Learning Complexity-Aware Cascades for Deep Pedestrian Detection. In 2015 IEEE International Conference on Computer Vision, ICCV 2015","author":"Cai Zhaowei","year":"2015","unstructured":"Zhaowei Cai , Mohammad J. Saberian , and Nuno Vasconcelos . 2015 . Learning Complexity-Aware Cascades for Deep Pedestrian Detection. In 2015 IEEE International Conference on Computer Vision, ICCV 2015 , Santiago, Chile, December 7--13 , 2015. 3361--3369. https:\/\/doi.org\/10.1109\/ICCV.2015.384 10.1109\/ICCV.2015.384 Zhaowei Cai, Mohammad J. Saberian, and Nuno Vasconcelos. 2015. Learning Complexity-Aware Cascades for Deep Pedestrian Detection. In 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7--13, 2015. 3361--3369. https:\/\/doi.org\/10.1109\/ICCV.2015.384"},{"key":"e_1_3_2_2_3_1","volume-title":"Cascade R-CNN: Delving Into High Quality Object Detection. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018","author":"Cai Zhaowei","year":"2018","unstructured":"Zhaowei Cai and Nuno Vasconcelos . 2018 . Cascade R-CNN: Delving Into High Quality Object Detection. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018 , Salt Lake City, UT, USA, June 18--22 , 2018. 6154--6162. https:\/\/doi.org\/10.1109\/CVPR.2018.00644 10.1109\/CVPR.2018.00644 Zhaowei Cai and Nuno Vasconcelos. 2018. Cascade R-CNN: Delving Into High Quality Object Detection. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18--22, 2018. 6154--6162. https:\/\/doi.org\/10.1109\/CVPR.2018.00644"},{"key":"e_1_3_2_2_4_1","volume-title":"Hierarchical Shot Detector. In 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019","author":"Cao Jiale","year":"2019","unstructured":"Jiale Cao , Yanwei Pang , Jungong Han , and Xuelong Li . 2019 b . Hierarchical Shot Detector. In 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019 , Seoul, Korea (South), October 27 - November 2, 2019. IEEE, 9704--9713. https:\/\/doi.org\/10.1109\/ICCV.2019.00980 10.1109\/ICCV.2019.00980 Jiale Cao, Yanwei Pang, Jungong Han, and Xuelong Li. 2019 b. Hierarchical Shot Detector. In 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019. IEEE, 9704--9713. https:\/\/doi.org\/10.1109\/ICCV.2019.00980"},{"key":"e_1_3_2_2_5_1","volume-title":"Triply Supervised Decoder Networks for Joint Detection and Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019","author":"Cao Jiale","year":"2019","unstructured":"Jiale Cao , Yanwei Pang , and Xuelong Li . 2019 a . Triply Supervised Decoder Networks for Joint Detection and Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 , Long Beach, CA, USA, June 16--20 , 2019. Computer Vision Foundation \/ IEEE, 7392--7401. https:\/\/doi.org\/10.1109\/CVPR.2019.00757 10.1109\/CVPR.2019.00757 Jiale Cao, Yanwei Pang, and Xuelong Li. 2019 a. Triply Supervised Decoder Networks for Joint Detection and Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16--20, 2019. Computer Vision Foundation \/ IEEE, 7392--7401. https:\/\/doi.org\/10.1109\/CVPR.2019.00757"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2012.6466893"},{"key":"e_1_3_2_2_7_1","volume-title":"Yuille","author":"Chen Liang-Chieh","year":"2015","unstructured":"Liang-Chieh Chen , George Papandreou , Iasonas Kokkinos , Kevin Murphy , and Alan L . Yuille . 2015 . Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds .). http:\/\/arxiv.org\/abs\/1412.7062 Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. 2015. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). http:\/\/arxiv.org\/abs\/1412.7062"},{"key":"e_1_3_2_2_8_1","volume-title":"The Cityscapes Dataset for Semantic Urban Scene Understanding. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016","author":"Cordts Marius","year":"2016","unstructured":"Marius Cordts , Mohamed Omran , Sebastian Ramos , Timo Rehfeld , Markus Enzweiler , Rodrigo Benenson , Uwe Franke , Stefan Roth , and Bernt Schiele . 2016 . The Cityscapes Dataset for Semantic Urban Scene Understanding. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 , Las Vegas, NV, USA, June 27--30 , 2016. 3213--3223. https:\/\/doi.org\/10.1109\/CVPR.2016.350 10.1109\/CVPR.2016.350 Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, and Bernt Schiele. 2016. The Cityscapes Dataset for Semantic Urban Scene Understanding. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27--30, 2016. 3213--3223. https:\/\/doi.org\/10.1109\/CVPR.2016.350"},{"key":"e_1_3_2_2_9_1","volume-title":"Deformable Convolutional Networks. In IEEE International Conference on Computer Vision, ICCV 2017","author":"Dai Jifeng","year":"2017","unstructured":"Jifeng Dai , Haozhi Qi , Yuwen Xiong , Yi Li , Guodong Zhang , Han Hu , and Yichen Wei . 2017 . Deformable Convolutional Networks. In IEEE International Conference on Computer Vision, ICCV 2017 , Venice, Italy, October 22--29 , 2017. 764--773. https:\/\/doi.org\/10.1109\/ICCV.2017.89 10.1109\/ICCV.2017.89 Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, and Yichen Wei. 2017. Deformable Convolutional Networks. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22--29, 2017. 764--773. https:\/\/doi.org\/10.1109\/ICCV.2017.89"},{"key":"e_1_3_2_2_10_1","volume-title":"Histograms of Oriented Gradients for Human Detection. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005","author":"Dalal Navneet","year":"2005","unstructured":"Navneet Dalal and Bill Triggs . 2005 . Histograms of Oriented Gradients for Human Detection. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005 ), 20--26 June 2005, San Diego, CA, USA. 886--893. https:\/\/doi.org\/10.1109\/CVPR. 2005.177 10.1109\/CVPR.2005.177 Navneet Dalal and Bill Triggs. 2005. Histograms of Oriented Gradients for Human Detection. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 20--26 June 2005, San Diego, CA, USA. 886--893. https:\/\/doi.org\/10.1109\/CVPR.2005.177"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2300479"},{"key":"e_1_3_2_2_13_1","volume-title":"2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009","author":"Piotr Doll\u00e1","year":"2009","unstructured":"Piotr Doll\u00e1 r, Christian Wojek , Bernt Schiele , and Pietro Perona . 2009 . Pedestrian detection: A benchmark . In 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009 ), 20--25 June 2009, Miami, Florida, USA. 304--311. https:\/\/doi.org\/10.1109\/CVPRW. 2009.5206631 10.1109\/CVPRW.2009.5206631 Piotr Doll\u00e1 r, Christian Wojek, Bernt Schiele, and Pietro Perona. 2009. Pedestrian detection: A benchmark. In 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 20--25 June 2009, Miami, Florida, USA. 304--311. https:\/\/doi.org\/10.1109\/CVPRW.2009.5206631"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123356"},{"key":"e_1_3_2_2_15_1","volume-title":"ISVC 2010, Las Vegas, NV, USA, November 29-December 1, 2010. Proceedings, Part I. 243--252","author":"Geismann Philip","year":"2010","unstructured":"Philip Geismann and Alois Knoll . 2010 . Speeding Up HOG and LBP Features for Pedestrian Detection by Multiresolution Techniques. In Advances in Visual Computing - 6th International Symposium , ISVC 2010, Las Vegas, NV, USA, November 29-December 1, 2010. Proceedings, Part I. 243--252 . https:\/\/doi.org\/10.1007\/978--3--642--17289--2_24 10.1007\/978--3--642--17289--2_24 Philip Geismann and Alois Knoll. 2010. Speeding Up HOG and LBP Features for Pedestrian Detection by Multiresolution Techniques. In Advances in Visual Computing - 6th International Symposium, ISVC 2010, Las Vegas, NV, USA, November 29-December 1, 2010. Proceedings, Part I. 243--252. https:\/\/doi.org\/10.1007\/978--3--642--17289--2_24"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.30.90"},{"key":"e_1_3_2_2_17_1","volume-title":"Fast R-CNN. In 2015 IEEE International Conference on Computer Vision, ICCV 2015","author":"Girshick Ross B.","year":"2015","unstructured":"Ross B. Girshick . 2015 . Fast R-CNN. In 2015 IEEE International Conference on Computer Vision, ICCV 2015 , Santiago, Chile, December 7--13 , 2015. 1440--1448. https:\/\/doi.org\/10.1109\/ICCV.2015.169 10.1109\/ICCV.2015.169 Ross B. Girshick. 2015. Fast R-CNN. In 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7--13, 2015. 1440--1448. https:\/\/doi.org\/10.1109\/ICCV.2015.169"},{"key":"e_1_3_2_2_18_1","volume-title":"Ross B. Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He.","author":"Goyal Priya","year":"2017","unstructured":"Priya Goyal , Piotr Doll\u00e1 r , Ross B. Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. 2017 . Accurate, Large Minibatch SGD : Training ImageNet in 1 Hour. CoRR , Vol. abs\/ 1706 .02677 (2017). arxiv: 1706.02677 http:\/\/arxiv.org\/abs\/1706.02677 Priya Goyal, Piotr Doll\u00e1 r, Ross B. Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. 2017. Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour. CoRR, Vol. abs\/1706.02677 (2017). arxiv: 1706.02677 http:\/\/arxiv.org\/abs\/1706.02677"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.123"},{"key":"e_1_3_2_2_20_1","volume-title":"Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016","author":"He Kaiming","year":"2016","unstructured":"Kaiming He , Xiangyu Zhang , Shaoqing Ren , and Jian Sun . 2016 . Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 , Las Vegas, NV, USA, June 27--30 , 2016. 770--778. https:\/\/doi.org\/10.1109\/CVPR.2016.90 10.1109\/CVPR.2016.90 Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27--30, 2016. 770--778. https:\/\/doi.org\/10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/0031-3203(90)90135-8"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2013.12.017"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299034"},{"key":"e_1_3_2_2_24_1","volume-title":"Proceedings, Part XIV. 765--781","author":"Law Hei","year":"2018","unstructured":"Hei Law and Jia Deng . 2018 . CornerNet: Detecting Objects as Paired Keypoints. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018 , Proceedings, Part XIV. 765--781 . https:\/\/doi.org\/10.1007\/978--3-030-01264--9_45 10.1007\/978--3-030-01264--9_45 Hei Law and Jia Deng. 2018. CornerNet: Detecting Objects as Paired Keypoints. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018, Proceedings, Part XIV. 765--781. https:\/\/doi.org\/10.1007\/978--3-030-01264--9_45"},{"key":"e_1_3_2_2_25_1","volume-title":"Focal Loss for Dense Object Detection. In IEEE International Conference on Computer Vision, ICCV 2017","author":"Lin Tsung-Yi","year":"2017","unstructured":"Tsung-Yi Lin , Priya Goyal , Ross B. Girshick , Kaiming He , and Piotr Doll\u00e1 r. 2017 . Focal Loss for Dense Object Detection. In IEEE International Conference on Computer Vision, ICCV 2017 , Venice, Italy, October 22--29 , 2017. 2999--3007. https:\/\/doi.org\/10.1109\/ICCV.2017.324 10.1109\/ICCV.2017.324 Tsung-Yi Lin, Priya Goyal, Ross B. Girshick, Kaiming He, and Piotr Doll\u00e1 r. 2017. Focal Loss for Dense Object Detection. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22--29, 2017. 2999--3007. https:\/\/doi.org\/10.1109\/ICCV.2017.324"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00662"},{"key":"e_1_3_2_2_27_1","volume-title":"Proceedings, Part I. 21--37","author":"Liu Wei","unstructured":"Wei Liu , Dragomir Anguelov , Dumitru Erhan , Christian Szegedy , Scott E. Reed , Cheng-Yang Fu , and Alexander C. Berg . 2016. SSD: Single Shot MultiBox Detector. In Computer Vision - ECCV 2016 - 14th European Conference, Amsterdam, The Netherlands, October 11--14, 2016 , Proceedings, Part I. 21--37 . https:\/\/doi.org\/10.1007\/978--3--319--46448-0_2 10.1007\/978--3--319--46448-0_2 Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott E. Reed, Cheng-Yang Fu, and Alexander C. Berg. 2016. SSD: Single Shot MultiBox Detector. In Computer Vision - ECCV 2016 - 14th European Conference, Amsterdam, The Netherlands, October 11--14, 2016, Proceedings, Part I. 21--37. https:\/\/doi.org\/10.1007\/978--3--319--46448-0_2"},{"key":"e_1_3_2_2_28_1","volume-title":"Proceedings, Part XIV. 643--659","author":"Liu Wei","year":"2018","unstructured":"Wei Liu , Shengcai Liao , Weidong Hu , Xuezhi Liang , and Xiao Chen . 2018 . Learning Efficient Single-Stage Pedestrian Detectors by Asymptotic Localization Fitting. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018 , Proceedings, Part XIV. 643--659 . https:\/\/doi.org\/10.1007\/978--3-030-01264--9_38 10.1007\/978--3-030-01264--9_38 Wei Liu, Shengcai Liao, Weidong Hu, Xuezhi Liang, and Xiao Chen. 2018. Learning Efficient Single-Stage Pedestrian Detectors by Asymptotic Localization Fitting. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018, Proceedings, Part XIV. 643--659. https:\/\/doi.org\/10.1007\/978--3-030-01264--9_38"},{"key":"e_1_3_2_2_29_1","volume-title":"High-Level Semantic Feature Detection: A New Perspective for Pedestrian Detection. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019","author":"Liu Wei","year":"2019","unstructured":"Wei Liu , Shengcai Liao , Weiqiang Ren , Weidong Hu , and Yinan Yu . 2019 b . High-Level Semantic Feature Detection: A New Perspective for Pedestrian Detection. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 , Long Beach, CA, USA, June 16--20 , 2019. 5187--5196. http:\/\/openaccess.thecvf.com\/content_CVPR_2019\/html\/Liu_High-Level_Semantic_Feature_Detection_A_New_Perspective_for_Pedestrian_Detection_CVPR_2019_paper.html Wei Liu, Shengcai Liao, Weiqiang Ren, Weidong Hu, and Yinan Yu. 2019 b. High-Level Semantic Feature Detection: A New Perspective for Pedestrian Detection. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16--20, 2019. 5187--5196. http:\/\/openaccess.thecvf.com\/content_CVPR_2019\/html\/Liu_High-Level_Semantic_Feature_Detection_A_New_Perspective_for_Pedestrian_Detection_CVPR_2019_paper.html"},{"key":"e_1_3_2_2_30_1","volume-title":"Handling Occlusions with Franken-Classifiers. In IEEE International Conference on Computer Vision, ICCV 2013","author":"Mathias Markus","year":"2013","unstructured":"Markus Mathias , Rodrigo Benenson , Radu Timofte , and Luc Van Gool . 2013 . Handling Occlusions with Franken-Classifiers. In IEEE International Conference on Computer Vision, ICCV 2013 , Sydney, Australia, December 1--8 , 2013. 1505--1512. https:\/\/doi.org\/10.1109\/ICCV.2013.190 10.1109\/ICCV.2013.190 Markus Mathias, Rodrigo Benenson, Radu Timofte, and Luc Van Gool. 2013. Handling Occlusions with Franken-Classifiers. In IEEE International Conference on Computer Vision, ICCV 2013, Sydney, Australia, December 1--8, 2013. 1505--1512. https:\/\/doi.org\/10.1109\/ICCV.2013.190"},{"key":"e_1_3_2_2_31_1","volume-title":"Local Decorrelation For Improved Pedestrian Detection. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014","author":"Nam Woonhyun","year":"2014","unstructured":"Woonhyun Nam , Piotr Doll\u00e1 r, and Joon Hee Han . 2014 . Local Decorrelation For Improved Pedestrian Detection. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014 , December 8 --13 2014, Montreal, Quebec, Canada. 424--432. http:\/\/papers.nips.cc\/paper\/5419-local-decorrelation-for-improved-pedestrian-detection Woonhyun Nam, Piotr Doll\u00e1 r, and Joon Hee Han. 2014. Local Decorrelation For Improved Pedestrian Detection. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8--13 2014, Montreal, Quebec, Canada. 424--432. http:\/\/papers.nips.cc\/paper\/5419-local-decorrelation-for-improved-pedestrian-detection"},{"key":"e_1_3_2_2_32_1","volume-title":"Joint Deep Learning for Pedestrian Detection. In IEEE International Conference on Computer Vision, ICCV 2013","author":"Ouyang Wanli","year":"2013","unstructured":"Wanli Ouyang and Xiaogang Wang . 2013 . Joint Deep Learning for Pedestrian Detection. In IEEE International Conference on Computer Vision, ICCV 2013 , Sydney, Australia, December 1--8 , 2013. 2056--2063. https:\/\/doi.org\/10.1109\/ICCV.2013.257 10.1109\/ICCV.2013.257 Wanli Ouyang and Xiaogang Wang. 2013. Joint Deep Learning for Pedestrian Detection. In IEEE International Conference on Computer Vision, ICCV 2013, Sydney, Australia, December 1--8, 2013. 2056--2063. https:\/\/doi.org\/10.1109\/ICCV.2013.257"},{"key":"e_1_3_2_2_33_1","unstructured":"Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in PyTorch. (2017).  Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in PyTorch. (2017)."},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPTA.2016.7821024"},{"key":"e_1_3_2_2_35_1","volume-title":"Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016","author":"Redmon Joseph","year":"2016","unstructured":"Joseph Redmon , Santosh Kumar Divvala , Ross B. Girshick , and Ali Farhadi . 2016 . You Only Look Once: Unified , Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 , Las Vegas, NV, USA, June 27--30 , 2016. 779--788. https:\/\/doi.org\/10.1109\/CVPR.2016.91 10.1109\/CVPR.2016.91 Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, and Ali Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27--30, 2016. 779--788. https:\/\/doi.org\/10.1109\/CVPR.2016.91"},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"e_1_3_2_2_37_1","volume-title":"CrowdHuman: A Benchmark for Detecting Human in a Crowd. CoRR","author":"Shao Shuai","year":"2018","unstructured":"Shuai Shao , Zijian Zhao , Boxun Li , Tete Xiao , Gang Yu , Xiangyu Zhang , and Jian Sun . 2018. CrowdHuman: A Benchmark for Detecting Human in a Crowd. CoRR , Vol. abs\/ 1805 .00123 ( 2018 ). arxiv: 1805.00123 http:\/\/arxiv.org\/abs\/1805.00123 Shuai Shao, Zijian Zhao, Boxun Li, Tete Xiao, Gang Yu, Xiangyu Zhang, and Jian Sun. 2018. CrowdHuman: A Benchmark for Detecting Human in a Crowd. CoRR, Vol. abs\/1805.00123 (2018). arxiv: 1805.00123 http:\/\/arxiv.org\/abs\/1805.00123"},{"key":"e_1_3_2_2_38_1","volume-title":"Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature Aggregation. CoRR","author":"Song Tao","year":"2018","unstructured":"Tao Song , Leiyu Sun , Di Xie , Haiming Sun , and Shiliang Pu. 2018a. Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature Aggregation. CoRR , Vol. abs\/ 1807 .01438 ( 2018 ). arxiv: 1807.01438 http:\/\/arxiv.org\/abs\/1807.01438 Tao Song, Leiyu Sun, Di Xie, Haiming Sun, and Shiliang Pu. 2018a. Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature Aggregation. CoRR, Vol. abs\/1807.01438 (2018). arxiv: 1807.01438 http:\/\/arxiv.org\/abs\/1807.01438"},{"key":"e_1_3_2_2_39_1","volume-title":"Computer Vision - ECCV 2018 - 15th European Conference","author":"Song Tao","year":"2018","unstructured":"Tao Song , Leiyu Sun , Di Xie , Haiming Sun , and Shiliang Pu. 2018b. Small-Scale Pedestrian Detection Based on Topological Line Localization and Temporal Feature Aggregation . In Computer Vision - ECCV 2018 - 15th European Conference , Munich, Germany , September 8--14, 2018 , Proceedings, Part VII. 554--569. https:\/\/doi.org\/10.1007\/978--3-030-01234--2_33 10.1007\/978--3-030-01234--2_33 Tao Song, Leiyu Sun, Di Xie, Haiming Sun, and Shiliang Pu. 2018b. Small-Scale Pedestrian Detection Based on Topological Line Localization and Temporal Feature Aggregation. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018, Proceedings, Part VII. 554--569. https:\/\/doi.org\/10.1007\/978--3-030-01234--2_33"},{"key":"e_1_3_2_2_40_1","volume-title":"Deep High-Resolution Representation Learning for Human Pose Estimation. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019","author":"Sun Ke","year":"2019","unstructured":"Ke Sun , Bin Xiao , Dong Liu , and Jingdong Wang . 2019 . Deep High-Resolution Representation Learning for Human Pose Estimation. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 , Long Beach, CA, USA, June 16--20 , 2019. 5693--5703. http:\/\/openaccess.thecvf.com\/content_CVPR_2019\/html\/Sun_Deep_High-Resolution_Representation_Learning_for_Human_Pose_Estimation_CVPR_2019_paper.html Ke Sun, Bin Xiao, Dong Liu, and Jingdong Wang. 2019. Deep High-Resolution Representation Learning for Human Pose Estimation. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16--20, 2019. 5693--5703. http:\/\/openaccess.thecvf.com\/content_CVPR_2019\/html\/Sun_Deep_High-Resolution_Representation_Learning_for_Human_Pose_Estimation_CVPR_2019_paper.html"},{"key":"e_1_3_2_2_41_1","volume-title":"5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24--26, 2017, Workshop Track Proceedings. https:\/\/openreview.net\/forum?id=ry8u21rtl","author":"Tarvainen Antti","year":"2017","unstructured":"Antti Tarvainen and Harri Valpola . 2017 . Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results . In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24--26, 2017, Workshop Track Proceedings. https:\/\/openreview.net\/forum?id=ry8u21rtl Antti Tarvainen and Harri Valpola. 2017. Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24--26, 2017, Workshop Track Proceedings. https:\/\/openreview.net\/forum?id=ry8u21rtl"},{"key":"e_1_3_2_2_42_1","volume-title":"Deep Learning Strong Parts for Pedestrian Detection. In 2015 IEEE International Conference on Computer Vision, ICCV 2015","author":"Tian Yonglong","year":"2015","unstructured":"Yonglong Tian , Ping Luo , Xiaogang Wang , and Xiaoou Tang . 2015 . Deep Learning Strong Parts for Pedestrian Detection. In 2015 IEEE International Conference on Computer Vision, ICCV 2015 , Santiago, Chile, December 7--13 , 2015. 1904--1912. https:\/\/doi.org\/10.1109\/ICCV.2015.221 10.1109\/ICCV.2015.221 Yonglong Tian, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2015. Deep Learning Strong Parts for Pedestrian Detection. In 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7--13, 2015. 1904--1912. https:\/\/doi.org\/10.1109\/ICCV.2015.221"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00972"},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-013-0620-5"},{"key":"e_1_3_2_2_45_1","volume-title":"2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001","author":"Paul","year":"2001","unstructured":"Paul A. Viola and Michael J. Jones. 2001. Rapid Object Detection using a Boosted Cascade of Simple Features . In 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001 ), with CD-ROM, 8--14 December 2001 , Kauai, HI, USA. 511--518. https:\/\/doi.org\/10.1109\/CVPR. 2001.990517 10.1109\/CVPR.2001.990517 Paul A. Viola and Michael J. Jones. 2001. Rapid Object Detection using a Boosted Cascade of Simple Features. In 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), with CD-ROM, 8--14 December 2001, Kauai, HI, USA. 511--518. https:\/\/doi.org\/10.1109\/CVPR.2001.990517"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00811"},{"key":"e_1_3_2_2_47_1","volume-title":"ASCNet: Adaptive-Scale Convolutional Neural Networks for Multi-Scale Feature Learning. CoRR","author":"Zhang Mo","year":"2019","unstructured":"Mo Zhang , Jie Zhao , Xiang Li , Li Zhang , and Quanzheng Li. 2019. ASCNet: Adaptive-Scale Convolutional Neural Networks for Multi-Scale Feature Learning. CoRR , Vol. abs\/ 1907 .03241 ( 2019 ). arxiv: 1907.03241 http:\/\/arxiv.org\/abs\/1907.03241 Mo Zhang, Jie Zhao, Xiang Li, Li Zhang, and Quanzheng Li. 2019. ASCNet: Adaptive-Scale Convolutional Neural Networks for Multi-Scale Feature Learning. CoRR, Vol. abs\/1907.03241 (2019). arxiv: 1907.03241 http:\/\/arxiv.org\/abs\/1907.03241"},{"key":"e_1_3_2_2_48_1","volume-title":"Scale-Adaptive Convolutions for Scene Parsing. In IEEE International Conference on Computer Vision, ICCV 2017","author":"Zhang Rui","year":"2017","unstructured":"Rui Zhang , Sheng Tang , Yongdong Zhang , Jintao Li , and Shuicheng Yan . 2017 b. Scale-Adaptive Convolutions for Scene Parsing. In IEEE International Conference on Computer Vision, ICCV 2017 , Venice, Italy, October 22--29 , 2017. IEEE Computer Society, 2050--2058. https:\/\/doi.org\/10.1109\/ICCV.2017.224 10.1109\/ICCV.2017.224 Rui Zhang, Sheng Tang, Yongdong Zhang, Jintao Li, and Shuicheng Yan. 2017b. Scale-Adaptive Convolutions for Scene Parsing. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22--29, 2017. IEEE Computer Society, 2050--2058. https:\/\/doi.org\/10.1109\/ICCV.2017.224"},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.141"},{"key":"e_1_3_2_2_50_1","volume-title":"CityPersons: A Diverse Dataset for Pedestrian Detection. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017","author":"Zhang Shanshan","year":"2017","unstructured":"Shanshan Zhang , Rodrigo Benenson , and Bernt Schiele . 2017 a. CityPersons: A Diverse Dataset for Pedestrian Detection. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 , Honolulu, HI, USA, July 21--26 , 2017. 4457--4465. https:\/\/doi.org\/10.1109\/CVPR.2017.474 10.1109\/CVPR.2017.474 Shanshan Zhang, Rodrigo Benenson, and Bernt Schiele. 2017a. CityPersons: A Diverse Dataset for Pedestrian Detection. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21--26, 2017. 4457--4465. https:\/\/doi.org\/10.1109\/CVPR.2017.474"},{"key":"e_1_3_2_2_51_1","volume-title":"Proceedings, Part III. 657--674","author":"Zhang Shifeng","unstructured":"Shifeng Zhang , Longyin Wen , Xiao Bian , Zhen Lei , and Stan Z. Li . 2018a. Occlusion-Aware R-CNN: Detecting Pedestrians in a Crowd. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018 , Proceedings, Part III. 657--674 . https:\/\/doi.org\/10.1007\/978--3-030-01219--9_39 10.1007\/978--3-030-01219--9_39 Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, and Stan Z. Li. 2018a. Occlusion-Aware R-CNN: Detecting Pedestrians in a Crowd. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018, Proceedings, Part III. 657--674. https:\/\/doi.org\/10.1007\/978--3-030-01219--9_39"},{"key":"e_1_3_2_2_52_1","volume-title":"Single-Shot Refinement Neural Network for Object Detection. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018","author":"Zhang Shifeng","year":"2018","unstructured":"Shifeng Zhang , Longyin Wen , Xiao Bian , Zhen Lei , and Stan Z. Li . 2018b . Single-Shot Refinement Neural Network for Object Detection. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018 , Salt Lake City, UT, USA, June 18--22 , 2018 . IEEE Computer Society, 4203--4212. https:\/\/doi.org\/10.1109\/CVPR.2018.00442 10.1109\/CVPR.2018.00442 Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, and Stan Z. Li. 2018b. Single-Shot Refinement Neural Network for Object Detection. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18--22, 2018. IEEE Computer Society, 4203--4212. https:\/\/doi.org\/10.1109\/CVPR.2018.00442"},{"key":"e_1_3_2_2_53_1","volume-title":"Computer Vision - ACCV 2016 - 13th Asian Conference on Computer Vision, Taipei, Taiwan, November 20--24","author":"Zhou Chunluan","year":"2016","unstructured":"Chunluan Zhou and Junsong Yuan . 2016. Learning to Integrate Occlusion-Specific Detectors for Heavily Occluded Pedestrian Detection . In Computer Vision - ACCV 2016 - 13th Asian Conference on Computer Vision, Taipei, Taiwan, November 20--24 , 2016 , Revised Selected Papers , Part II. 305--320. https:\/\/doi.org\/10.1007\/978--3--319--54184--6_19 10.1007\/978--3--319--54184--6_19 Chunluan Zhou and Junsong Yuan. 2016. Learning to Integrate Occlusion-Specific Detectors for Heavily Occluded Pedestrian Detection. In Computer Vision - ACCV 2016 - 13th Asian Conference on Computer Vision, Taipei, Taiwan, November 20--24, 2016, Revised Selected Papers, Part II. 305--320. https:\/\/doi.org\/10.1007\/978--3--319--54184--6_19"},{"key":"e_1_3_2_2_54_1","volume-title":"Proceedings, Part I. 138--154","author":"Zhou Chunluan","year":"2018","unstructured":"Chunluan Zhou and Junsong Yuan . 2018 . Bi-box Regression for Pedestrian Detection and Occlusion Estimation. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018 , Proceedings, Part I. 138--154 . https:\/\/doi.org\/10.1007\/978--3-030-01246--5_9 10.1007\/978--3-030-01246--5_9 Chunluan Zhou and Junsong Yuan. 2018. Bi-box Regression for Pedestrian Detection and Occlusion Estimation. In Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018, Proceedings, Part I. 138--154. https:\/\/doi.org\/10.1007\/978--3-030-01246--5_9"},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00094"},{"key":"e_1_3_2_2_56_1","volume-title":"Feature Selective Anchor-Free Module for Single-Shot Object Detection. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019","author":"Zhu Chenchen","year":"2019","unstructured":"Chenchen Zhu , Yihui He , and Marios Savvides . 2019 . Feature Selective Anchor-Free Module for Single-Shot Object Detection. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 , Long Beach, CA, USA, June 16--20 , 2019. 840--849. http:\/\/openaccess.thecvf.com\/content_CVPR_2019\/html\/Zhu_Feature_Selective_Anchor-Free_Module_for_Single-Shot_Object_Detection_CVPR_2019_paper.html Chenchen Zhu, Yihui He, and Marios Savvides. 2019. Feature Selective Anchor-Free Module for Single-Shot Object Detection. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16--20, 2019. 840--849. http:\/\/openaccess.thecvf.com\/content_CVPR_2019\/html\/Zhu_Feature_Selective_Anchor-Free_Module_for_Single-Shot_Object_Detection_CVPR_2019_paper.html"}],"event":{"name":"MM '20: The 28th ACM International Conference on Multimedia","location":"Seattle WA USA","acronym":"MM '20","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 28th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394171.3413989","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3394171.3413989","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:32:07Z","timestamp":1750195927000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394171.3413989"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,12]]},"references-count":56,"alternative-id":["10.1145\/3394171.3413989","10.1145\/3394171"],"URL":"https:\/\/doi.org\/10.1145\/3394171.3413989","relation":{},"subject":[],"published":{"date-parts":[[2020,10,12]]},"assertion":[{"value":"2020-10-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}