{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T10:58:15Z","timestamp":1761562695416,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":52,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T00:00:00Z","timestamp":1602460800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,10,12]]},"DOI":"10.1145\/3422852.3423477","type":"proceedings-article","created":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T08:40:35Z","timestamp":1602492035000},"page":"73-82","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Online Video Object Detection via Local and Mid-Range Feature Propagation"],"prefix":"10.1145","author":[{"given":"Zhifan","family":"Zhu","sequence":"first","affiliation":[{"name":"Nanjing University of Science and Technology, Nanjing, China"}]},{"given":"Zechao","family":"Li","sequence":"additional","affiliation":[{"name":"Nanjing University of Science and Technology, Nanjing, China"}]}],"member":"320","published-online":{"date-parts":[[2020,10,12]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01309"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01258-8_21"},{"key":"e_1_3_2_1_3_1","unstructured":"Zhaowei Cai and Nuno Vasconcelos. 2019. Cascade R-CNN: high quality object detection and instance segmentation. 
IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00351"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Yu-Wei Chao Yunfan Liu Xieyang Liu Huayi Zeng and Jia Deng. 2018. Learning to detect human-object interactions. In 2018 ieee winter conference on applications of computer vision (wacv). IEEE 381--389.","DOI":"10.1109\/WACV.2018.00048"},{"key":"e_1_3_2_1_6_1","unstructured":"Kai Chen Jiaqi Wang Jiangmiao Pang Yuhang Cao Yu Xiong Xiaoxiao Li Shuyang Sun Wansen Feng Ziwei Liu Jiarui Xu Zheng Zhang Dazhi Cheng Chenchen Zhu Tianheng Cheng Qijie Zhao Buyu Li Xin Lu Rui Zhu Yue Wu Jifeng Dai Jingdong Wang Jianping Shi Wanli Ouyang Chen Change Loy and Dahua Lin. 2019. MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv preprint arXiv:1906.07155 (2019)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00815"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01035"},{"volume-title":"Adascale: Towards real-time video object detection using adaptive scaling. arXiv preprint arXiv:1902.02910","year":"2019","author":"Chin Ting-Wu","key":"e_1_3_2_1_9_1"},{"volume-title":"R-fcn: Object detection via region-based fully convolutional networks. In Advances in neural information processing systems. 
379--387.","year":"2016","author":"Dai Jifeng","key":"e_1_3_2_1_10_1"},{"volume-title":"Object Guided External Memory Network for Video Object Detection. 2019 IEEE\/CVF International Conference on Computer Vision (ICCV)","year":"2019","author":"Deng Hanming","key":"e_1_3_2_1_11_1"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00712"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.316"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.330"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Ross Girshick Jeff Donahue Trevor Darrell and Jitendra Malik. 2016. Region-based convolutional networks for accurate object detection and segmentation. IEEE transactions on pattern analysis and machine intelligence Vol. 38 1 (2016) 142--158.","DOI":"10.1109\/TPAMI.2015.2437384"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00401"},{"key":"e_1_3_2_1_18_1","unstructured":"Wei Han Pooya Khorrami Tom Le Paine Prajit Ramachandran Mohammad Babaeizadeh Honghui Shi Jianan Li Shuicheng Yan and Thomas S Huang. 2016. Seq-nms for video object detection. arXiv preprint arXiv:1602.08465 (2016)."},{"volume-title":"Proceedings of the IEEE International Conference on Computer Vision. 
8450--8459","year":"2019","author":"He Lingxiao","key":"e_1_3_2_1_19_1"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"crossref","unstructured":"Sepp Hochreiter and J\u00fcrgen Schmidhuber. 1997. Long short-term memory. Neural computation Vol. 9 8 (1997) 1735--1780.","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2017.2736553"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.95"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3065386"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"crossref","unstructured":"Zechao Li Jing Liu Jinhui Tang and Hanqing Lu. 2015. Robust structured subspace learning for data representation. IEEE transactions on pattern analysis and machine intelligence Vol. 37 10 (2015) 2085--2098.","DOI":"10.1109\/TPAMI.2015.2400461"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.106"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.324"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5686--5695","year":"2018","author":"Liu Mason","key":"e_1_3_2_1_28_1"},{"key":"e_1_3_2_1_29_1","unstructured":"Mason Liu Menglong Zhu Marie White Yinxiao Li and Dmitry Kalenichenko. 2019. 
Looking fast and slow: Memory-guided mobile video object detection. arXiv preprint arXiv:1903.10172 (2019)."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.257"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00932"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00053"},{"key":"e_1_3_2_1_34_1","unstructured":"Joseph Redmon Santosh Kumar Divvala Ross B. Girshick and Ali Farhadi. 2015. You Only Look Once: Unified Real-Time Object Detection. CoRR Vol. abs\/1506.02640 (2015) 1--10. arxiv: 1506.02640"},{"key":"e_1_3_2_1_35_1","unstructured":"Joseph Redmon and Ali Farhadi. 2016. YOLO9000: better faster stronger. arXiv preprint arXiv:1612.08242 (2016) 1--9."},{"key":"e_1_3_2_1_36_1","unstructured":"Joseph Redmon and Ali Farhadi. 2018. YOLOv3: An Incremental Improvement. CoRR Vol. abs\/1804.02767 (2018) 1 -- 6. arxiv: 1804.02767"},{"key":"e_1_3_2_1_37_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 
91--99."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3343031.3350984"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"crossref","unstructured":"Olga Russakovsky Jia Deng Hao Su Jonathan Krause Sanjeev Satheesh Sean Ma Zhiheng Huang Andrej Karpathy Aditya Khosla Michael Bernstein et al. 2015. Imagenet large scale visual recognition challenge. International journal of computer vision Vol. 115 3 (2015) 211--252.","DOI":"10.1007\/s11263-015-0816-y"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00474"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00985"},{"key":"e_1_3_2_1_42_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 
5998--6008."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2020.3004249"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00720"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01261-8_33"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00813"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00931"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01237-3_30"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2017.95"},{"key":"e_1_3_2_1_50_1","unstructured":"Xizhou Zhu Jifeng Dai Xingchi Zhu Yichen Wei and Lu Yuan. 2018. Towards high performance video object detection for mobiles. arXiv preprint arXiv:1804.05830 (2018)."},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.52"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.441"}],"event":{"name":"MM '20: The 28th ACM International Conference on Multimedia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Seattle WA USA","acronym":"MM '20"},"container-title":["Proceedings of the 1st International Workshop on Human-centric Multimedia 
Analysis"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3422852.3423477","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3422852.3423477","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:56Z","timestamp":1750195496000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3422852.3423477"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,12]]},"references-count":52,"alternative-id":["10.1145\/3422852.3423477","10.1145\/3422852"],"URL":"https:\/\/doi.org\/10.1145\/3422852.3423477","relation":{},"subject":[],"published":{"date-parts":[[2020,10,12]]},"assertion":[{"value":"2020-10-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}