{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T20:14:11Z","timestamp":1772482451584,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NSFC","award":["U1936205"],"award-info":[{"award-number":["U1936205"]}]},{"name":"Key R&D Projects of the Ministry of Science and Technology of China","award":["2021YFC3300300"],"award-info":[{"award-number":["2021YFC3300300"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3548412","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:43:01Z","timestamp":1665416581000},"page":"2002-2011","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["Hierarchical Few-Shot Object Detection"],"prefix":"10.1145","author":[{"given":"Lu","family":"Zhang","sequence":"first","affiliation":[{"name":"Fudan University, Shanghai, China"}]},{"given":"Yang","family":"Wang","sequence":"additional","affiliation":[{"name":"Tongji University, Shanghai, China"}]},{"given":"Jiaogen","family":"Zhou","sequence":"additional","affiliation":[{"name":"Huaiyin Normal University, Huaian, China"}]},{"given":"Chenbo","family":"Zhang","sequence":"additional","affiliation":[{"name":"Fudan University, Shanghai, China"}]},{"given":"Yinglu","family":"Zhang","sequence":"additional","affiliation":[{"name":"Fudan University, Shanghai, China"}]},{"given":"Jihong","family":"Guan","sequence":"additional","affiliation":[{"name":"Tongji University, Shanghai, China"}]},{"given":"Yatao","family":"Bian","sequence":"additional","affiliation":[{"name":"Tencent AI Lab, Shenzhen, China"}]},{"given":"Shuigeng","family":"Zhou","sequence":"additional","affiliation":[{"name":"Fudan University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Hierarchy-Based Image Embeddings for Semantic Image Retrieval. In 2019 IEEE Winter Conference on Applications of Computer Vision. 638--647","author":"Barz Bj\u00f6rn","year":"2019","unstructured":"Bj\u00f6rn Barz and Joachim Denzler . 2019 . Hierarchy-Based Image Embeddings for Semantic Image Retrieval. In 2019 IEEE Winter Conference on Applications of Computer Vision. 638--647 . https:\/\/doi.org\/10.1109\/WACV.2019.00073 Bj\u00f6rn Barz and Joachim Denzler. 2019. Hierarchy-Based Image Embeddings for Semantic Image Retrieval. In 2019 IEEE Winter Conference on Applications of Computer Vision. 638--647. https:\/\/doi.org\/10.1109\/WACV.2019.00073"},{"key":"e_1_3_2_2_2_1","volume-title":"Culotta (Eds.)","volume":"23","author":"Bengio Samy","year":"2010","unstructured":"Samy Bengio , Jason Weston , and David Grangier . 2010 . Label Embedding Trees for Large Multi-Class Tasks. In Advances in Neural Information Processing Systems, J. Lafferty, C. Williams, J. Shawe-Taylor, R. Zemel, and A . Culotta (Eds.) , Vol. 23 . Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper\/ 2010\/file\/06138bc5af6023646ede0e1f7c1eac75-Paper.pdf Samy Bengio, Jason Weston, and David Grangier. 2010. Label Embedding Trees for Large Multi-Class Tasks. In Advances in Neural Information Processing Systems, J. Lafferty, C. Williams, J. Shawe-Taylor, R. Zemel, and A. Culotta (Eds.), Vol. 23. Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper\/2010\/file\/06138bc5af6023646ede0e1f7c1eac75-Paper.pdf"},{"key":"e_1_3_2_2_3_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Bertinetto Luca","unstructured":"Luca Bertinetto , Romain Mueller , Konstantinos Tertikas , Sina Samangooei , and Nicholas A. Lord . 2020. Making Better Mistakes: Leveraging Class Hierarchies With Deep Networks . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Luca Bertinetto, Romain Mueller, Konstantinos Tertikas, Sina Samangooei, and Nicholas A. Lord. 2020. Making Better Mistakes: Leveraging Class Hierarchies With Deep Networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_2_4_1","volume-title":"Weinberger (Eds.)","volume":"24","author":"Deng Jia","year":"2011","unstructured":"Jia Deng , Sanjeev Satheesh , Alexander Berg , and Fei Li . 2011 . Fast and Balanced: Efficient Label Tree Learning for Large Scale Object Recognition. In Advances in Neural Information Processing Systems,, J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, and K. Q . Weinberger (Eds.) , Vol. 24 . Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper\/ 2011\/file\/5a4b25aaed25c2ee1b74de72dc03c14e-Paper.pdf Jia Deng, Sanjeev Satheesh, Alexander Berg, and Fei Li. 2011. Fast and Balanced: Efficient Label Tree Learning for Large Scale Object Recognition. In Advances in Neural Information Processing Systems,, J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, and K. Q. Weinberger (Eds.), Vol. 24. Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper\/2011\/file\/5a4b25aaed25c2ee1b74de72dc03c14e-Paper.pdf"},{"key":"e_1_3_2_2_5_1","volume-title":"Garnett (Eds.)","volume":"31","author":"Dubey Abhimanyu","year":"2018","unstructured":"Abhimanyu Dubey , Otkrist Gupta , Ramesh Raskar , and Nikhil Naik . 2018 . Maximum-Entropy Fine Grained Classification. In Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R . Garnett (Eds.) , Vol. 31 . Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper\/ 2018\/file\/0c74b7f78409a4022a2c4c5a5ca3ee19-Paper.pdf Abhimanyu Dubey, Otkrist Gupta, Ramesh Raskar, and Nikhil Naik. 2018. Maximum-Entropy Fine Grained Classification. In Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), Vol. 31. Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper\/2018\/file\/0c74b7f78409a4022a2c4c5a5ca3ee19-Paper.pdf"},{"key":"e_1_3_2_2_6_1","volume-title":"Christopher KI Williams, John Winn, and Andrew Zisserman.","author":"Everingham Mark","year":"2010","unstructured":"Mark Everingham , Luc Van Gool , Christopher KI Williams, John Winn, and Andrew Zisserman. 2010 . The pascal visual object classes (voc) challenge. International journal of computer vision, Vol. 88 , 2 (2010), 303--338. Mark Everingham, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. 2010. The pascal visual object classes (voc) challenge. International journal of computer vision, Vol. 88, 2 (2010), 303--338."},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00407"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00450"},{"key":"e_1_3_2_2_9_1","volume-title":"Marctextquotesingle Aurelio Ranzato, and Tomas Mikolov","author":"Frome Andrea","year":"2013","unstructured":"Andrea Frome , Greg S Corrado , Jon Shlens , Samy Bengio , Jeff Dean , Marctextquotesingle Aurelio Ranzato, and Tomas Mikolov . 2013 . DeViSE: A Deep Visual-Semantic Embedding Model. In Advances in Neural Information Processing Systems, C. J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Q. Weinberger (Eds.), Vol. 26 . Curran Associates, Inc . https:\/\/proceedings.neurips.cc\/paper\/2013\/file\/7cce53cf90577442771720a370c3c723-Paper.pdf Andrea Frome, Greg S Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Marctextquotesingle Aurelio Ranzato, and Tomas Mikolov. 2013. DeViSE: A Deep Visual-Semantic Embedding Model. In Advances in Neural Information Processing Systems, C. J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Q. Weinberger (Eds.), Vol. 26. Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper\/2013\/file\/7cce53cf90577442771720a370c3c723-Paper.pdf"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.476"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6712"},{"key":"e_1_3_2_2_12_1","volume-title":"Lin (Eds.)","volume":"33","author":"Ge Yixiao","year":"2020","unstructured":"Yixiao Ge , Feng Zhu , Dapeng Chen , Rui Zhao , and hongsheng Li. 2020 . Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID. In Advances in Neural Information Processing Systems,, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H . Lin (Eds.) , Vol. 33 . Curran Associates, Inc., 11309--11321. https:\/\/proceedings.neurips.cc\/paper\/ 2020\/file\/821fa74b50ba3f7cba1e6c53e8fa6845-Paper.pdf Yixiao Ge, Feng Zhu, Dapeng Chen, Rui Zhao, and hongsheng Li. 2020. Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID. In Advances in Neural Information Processing Systems,, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 11309--11321. https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/821fa74b50ba3f7cba1e6c53e8fa6845-Paper.pdf"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2008.4587410"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00525"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.132"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"crossref","unstructured":"Bingyi Kang Zhuang Liu Xin Wang Fisher Yu Jiashi Feng and Trevor Darrell. 2019. Few-Shot Object Detection via Feature Reweighting. In ICCV.  Bingyi Kang Zhuang Liu Xin Wang Fisher Yu Jiashi Feng and Trevor Darrell. 2019. Few-Shot Object Detection via Feature Reweighting. In ICCV.","DOI":"10.1109\/ICCV.2019.00851"},{"key":"e_1_3_2_2_18_1","volume-title":"Lin (Eds.)","volume":"33","author":"Khosla Prannay","year":"2020","unstructured":"Prannay Khosla , Piotr Teterwak , Chen Wang , Aaron Sarna , Yonglong Tian , Phillip Isola , Aaron Maschinot , Ce Liu , and Dilip Krishnan . 2020 . Supervised Contrastive Learning. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H . Lin (Eds.) , Vol. 33 . Curran Associates, Inc. , 18661--18673. https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/d89a66c7c80a29b1bdbab0f2a1a94af8-Paper.pdf Prannay Khosla, Piotr Teterwak, Chen Wang, Aaron Sarna, Yonglong Tian, Phillip Isola, Aaron Maschinot, Ce Liu, and Dilip Krishnan. 2020. Supervised Contrastive Learning. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 18661--18673. https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/d89a66c7c80a29b1bdbab0f2a1a94af8-Paper.pdf"},{"key":"e_1_3_2_2_19_1","first-page":"1","article-title":"Exponential moving average versus moving exponential average","volume":"58","author":"Klinker Frank","year":"2010","unstructured":"Frank Klinker . 2010 . Exponential moving average versus moving exponential average . Mathematische Semesterberichte , Vol. 58 , 1 (dec 2010), 97--107. https:\/\/doi.org\/10.1007\/s00591-010-0080--8 Frank Klinker. 2010. Exponential moving average versus moving exponential average. Mathematische Semesterberichte, Vol. 58, 1 (dec 2010), 97--107. https:\/\/doi.org\/10.1007\/s00591-010-0080--8","journal-title":"Mathematische Semesterberichte"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.106"},{"key":"e_1_3_2_2_21_1","volume-title":"Piotr Dollar, and C Lawrence Zitnick","author":"Lin Tsung-Yi","year":"2014","unstructured":"Tsung-Yi Lin , Michael Maire , Serge Belongie , James Hays , Pietro Perona , Deva Ramanan , Piotr Dollar, and C Lawrence Zitnick . 2014 . Microsoft coco: Common objects in context. In ECCV. Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollar, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In ECCV."},{"key":"e_1_3_2_2_22_1","volume-title":"An Empirical Study and Comparison of Recent Few-Shot Object Detection Algorithms. arXiv preprint arXiv:2203.14205","author":"Liu Tianying","year":"2022","unstructured":"Tianying Liu , Lu Zhang , Yang Wang , Jihong Guan , Yanwei Fu , and Shuigeng Zhou . 2022. An Empirical Study and Comparison of Recent Few-Shot Object Detection Algorithms. arXiv preprint arXiv:2203.14205 ( 2022 ). Tianying Liu, Lu Zhang, Yang Wang, Jihong Guan, Yanwei Fu, and Shuigeng Zhou. 2022. An Empirical Study and Comparison of Recent Few-Shot Object Detection Algorithms. arXiv preprint arXiv:2203.14205 (2022)."},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2774041"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00856"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2016.2615423"},{"key":"e_1_3_2_2_26_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In NIPS.  Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In NIPS."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00727"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6882"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01270-0_49"},{"key":"e_1_3_2_2_30_1","unstructured":"C. Wah S. Branson P. Welinder P. Perona and S. Belongie. 2011. The Caltech-UCSD Birds-200--2011 Dataset. Technical Report.  C. Wah S. Branson P. Welinder P. Perona and S. Belongie. 2011. The Caltech-UCSD Birds-200--2011 Dataset. Technical Report."},{"key":"e_1_3_2_2_31_1","volume-title":"Proceedings of the 37th International Conference on Machine Learning (ICML) (Proceedings of Machine Learning Research), Hal Daum\u00e9 III and Aarti Singh (Eds.)","volume":"119","author":"Wang Xin","year":"2020","unstructured":"Xin Wang , Thomas Huang , Joseph Gonzalez , Trevor Darrell , and Fisher Yu . 2020 . Frustratingly Simple Few-Shot Object Detection . In Proceedings of the 37th International Conference on Machine Learning (ICML) (Proceedings of Machine Learning Research), Hal Daum\u00e9 III and Aarti Singh (Eds.) , Vol. 119 . PMLR, 9919--9928. https:\/\/proceedings.mlr.press\/v119\/wang20j.html Xin Wang, Thomas Huang, Joseph Gonzalez, Trevor Darrell, and Fisher Yu. 2020. Frustratingly Simple Few-Shot Object Detection. In Proceedings of the 37th International Conference on Machine Learning (ICML) (Proceedings of Machine Learning Research), Hal Daum\u00e9 III and Aarti Singh (Eds.), Vol. 119. PMLR, 9919--9928. https:\/\/proceedings.mlr.press\/v119\/wang20j.html"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2017.10.002"},{"key":"e_1_3_2_2_33_1","volume-title":"Rush","author":"Wiseman Sam","year":"2016","unstructured":"Sam Wiseman and Alexander M . Rush . 2016 . Sequence-to-Sequence Learning as Beam-Search Optimization. In EMNLP. 1296--1306. http:\/\/aclweb.org\/anthology\/D\/D16\/D16--1137.pdf Sam Wiseman and Alexander M. Rush. 2016. Sequence-to-Sequence Learning as Beam-Search Optimization. In EMNLP. 1296--1306. http:\/\/aclweb.org\/anthology\/D\/D16\/D16--1137.pdf"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2964284.2967205"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58517-4_27"},{"key":"e_1_3_2_2_36_1","volume-title":"Meta-RCNN: Meta Learning for Few-Shot Object Detection","author":"Wu Xiongwei","unstructured":"Xiongwei Wu , Doyen Sahoo , and Steven Hoi . 2020b. Meta-RCNN: Meta Learning for Few-Shot Object Detection . Association for Computing Machinery , New York, NY, USA , 1679--1687. https:\/\/doi.org\/10.1145\/3394171.3413832 Xiongwei Wu, Doyen Sahoo, and Steven Hoi. 2020b. Meta-RCNN: Meta Learning for Few-Shot Object Detection. Association for Computing Machinery, New York, NY, USA, 1679--1687. https:\/\/doi.org\/10.1145\/3394171.3413832"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2857768"},{"key":"e_1_3_2_2_38_1","volume-title":"Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2). https:\/\/openreview.net\/forum?id=rH8yliN6C83","author":"Yu Hang","year":"2021","unstructured":"Hang Yu , Yufei Xu , Jing Zhang , Wei Zhao , Ziyu Guan , and Dacheng Tao . 2021 . AP-10K: A Benchmark for Animal Pose Estimation in the Wild . In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2). https:\/\/openreview.net\/forum?id=rH8yliN6C83 Hang Yu, Yufei Xu, Jing Zhang, Wei Zhao, Ziyu Guan, and Dacheng Tao. 2021. AP-10K: A Benchmark for Animal Pose Estimation in the Wild. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2). https:\/\/openreview.net\/forum?id=rH8yliN6C83"},{"key":"e_1_3_2_2_39_1","unstructured":"Gongjie Zhang Zhipeng Luo Kaiwen Cui and Shijian Lu. 2021a. Meta-DETR: Image-Level Few-Shot Object Detection with Inter-Class Correlation Exploitation. https:\/\/doi.org\/10.48550\/ARXIV.2103.11731  Gongjie Zhang Zhipeng Luo Kaiwen Cui and Shijian Lu. 2021a. Meta-DETR: Image-Level Few-Shot Object Detection with Inter-Class Correlation Exploitation. https:\/\/doi.org\/10.48550\/ARXIV.2103.11731"},{"key":"e_1_3_2_2_40_1","volume-title":"Accurate Few-shot Object Detection with Support-Query Mutual Guidance and Hybrid Loss. In 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 14419--14427","author":"Zhang Lu","year":"2021","unstructured":"Lu Zhang , Shuigeng Zhou , Jihong Guan , and Ji Zhang . 2021 b. Accurate Few-shot Object Detection with Support-Query Mutual Guidance and Hybrid Loss. In 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 14419--14427 . https:\/\/doi.org\/10.1109\/CVPR46437.2021.01419 Lu Zhang, Shuigeng Zhou, Jihong Guan, and Ji Zhang. 2021b. Accurate Few-shot Object Detection with Support-Query Mutual Guidance and Hybrid Loss. In 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 14419--14427. https:\/\/doi.org\/10.1109\/CVPR46437.2021.01419"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.557"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00867"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00440"},{"key":"e_1_3_2_2_44_1","volume-title":"Deformable DETR: Deformable Transformers for End-to-End Object Detection. In International Conference on Learning Representations.","author":"Zhu Xizhou","year":"2021","unstructured":"Xizhou Zhu , Weijie Su , Lewei Lu , Bin Li , Xiaogang Wang , and Jifeng Dai . 2021 b. Deformable DETR: Deformable Transformers for End-to-End Object Detection. In International Conference on Learning Representations. Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, and Jifeng Dai. 2021b. Deformable DETR: Deformable Transformers for End-to-End Object Detection. In International Conference on Learning Representations."},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240616"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548412","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3548412","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:17Z","timestamp":1750182557000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548412"}},"subtitle":["Problem, Benchmark and Method"],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":45,"alternative-id":["10.1145\/3503161.3548412","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3548412","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}