{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,15]],"date-time":"2026-03-15T15:27:03Z","timestamp":1773588423361,"version":"3.50.1"},"reference-count":85,"publisher":"Association for Computing Machinery (ACM)","issue":"9","license":[{"start":{"date-parts":[[2027,2,23]],"date-time":"2027-02-23T00:00:00Z","timestamp":1803340800000},"content-version":"vor","delay-in-days":350,"URL":"http:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100006752","name":"U.S. Army Corps of Engineers","doi-asserted-by":"publisher","award":["W912HZ-23-2-0004"],"award-info":[{"award-number":["W912HZ-23-2-0004"]}],"id":[{"id":"10.13039\/100006752","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2026,7,31]]},"abstract":"<jats:p>Few-shot learning (FSL) and data-efficient learning paradigms enable object detection models to recognize novel classes from minimally annotated examples, addressing expensive data-labeling challenges. This systematic survey examines recent advances in few-shot, semi-supervised, sparsely-supervised, and weakly-supervised approaches for video and 3D object detection, focusing on developments through foundation models and vision-language model integration. For video object detection, techniques including tube proposals, temporal matching networks, motion-guided approaches, and temporal consistency-based semi-supervised methods utilize spatiotemporal relationships for efficient novel class adaptation, with recent architectures achieving substantial gains from 33 to 48 average precision in few-shot scenarios. For 3D object detection, specialized approaches address point cloud sparsity and texture limitations through uncertainty-aware methods, geometric learning, and multimodal fusion, with sparsely-supervised techniques achieving competitive performance using only 2% of annotations, enabling practical deployment in autonomous driving and robotics. The survey analyzes methodological advances including meta-learning, transfer learning, pseudo-label generation, contrastive instance mining, and foundation model integration across applications spanning autonomous driving, surveillance, robotics, industrial control, and medical imaging. By examining developments across multiple supervision paradigms, this work highlights data-efficient learning\u2019s potential for minimizing annotation requirements and enabling robust real-world deployment across temporal, spatial, and multimodal domains.<\/jats:p>","DOI":"10.1145\/3790093","type":"journal-article","created":{"date-parts":[[2026,2,23]],"date-time":"2026-02-23T11:21:14Z","timestamp":1771845674000},"page":"1-34","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Few-Shot Learning in Video and 3D Object Detection: A Survey"],"prefix":"10.1145","volume":"58","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8833-2274","authenticated-orcid":false,"given":"Md Meftahul","family":"Ferdaus","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of New Orleans","place":["New Orleans, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-2925-8856","authenticated-orcid":false,"given":"Kendall N.","family":"Niles","sequence":"additional","affiliation":[{"name":"US Army Corps of Engineers Mississippi Valley Division","place":["Vicksburg, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3782-2254","authenticated-orcid":false,"given":"Joe","family":"Tom","sequence":"additional","affiliation":[{"name":"US Army Corps of Engineers Mississippi Valley Division","place":["Vicksburg, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-0932-2009","authenticated-orcid":false,"given":"Mahdi","family":"Abdelguerfi","sequence":"additional","affiliation":[{"name":"Computer Science, University of New Orleans","place":["New Orleans, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-3699-6762","authenticated-orcid":false,"given":"Elias","family":"Ioup","sequence":"additional","affiliation":[{"name":"US Naval Research Laboratory","place":["Stennis Space Center, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2026,3,10]]},"reference":[{"key":"e_1_3_1_2_2","first-page":"CVPR 2025","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","year":"2025","unstructured":"Zhaochong An, Guolei Sun, Yun Liu, Runjia Li, Junlin Han, Ender Konukoglu, and Serge Belongie. 2025. Generalized few-shot 3D point cloud segmentation with vision-language model. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. CVPR 2025. Retrieved from https:\/\/openaccess.thecvf.com\/content\/CVPR2025\/papers\/An_Generalized_Few-shot_3D_Point_Cloud_Segmentation_with_Vision-Language_Model_CVPR_2025_paper.pdf"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52734.2025.01584"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","unstructured":"H. Ankarboina O. S. V. S. Buddha A. K. Singh and R. Pamula. 2025. Enhanced Object Detection System for Autonomous Driving. In 2025 6th International Conference on Recent Advances in Information Technology (RAIT). IEEE 1\u20136. DOI:10.1109\/RAIT65068.2025.11088899","DOI":"10.1109\/RAIT65068.2025.11088899"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","unstructured":"Simone Antonelli Danilo Avola Luigi Cinque Donato Crisostomi Gian Luca Foresti Fabio Galasso Marco Raoul Marini Alessio Mecca and Daniele Pannone. 2022. Few-shot object detection: A survey. Comput. Surveys 54 11s (2022) 1\u201337. DOI:10.1145\/3519022","DOI":"10.1145\/3519022"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","unstructured":"M. K\u00f6hler M. Eisenbach and H.-M. Gross. 2023. Few-shot object detection: A comprehensive survey. IEEE Transactions on Neural Networks and Learning Systems 35 9 (2023) 11958\u201311978. DOI:10.1109\/TNNLS.2023.3307752","DOI":"10.1109\/TNNLS.2023.3307752"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","unstructured":"Haigen Hu Xiaoyuan Wang Yan Zhang Qi Chen and Qiu Guan. 2024. A comprehensive survey on contrastive learning. Neurocomputing 610 (2024) 128645. DOI:10.1016\/j.neucom.2024.128645","DOI":"10.1016\/j.neucom.2024.128645"},{"key":"e_1_3_1_8_2","unstructured":"Zhengwei Yang Yuke Li Qiang Sun Basura Fernando Heng Huang and Zheng Wang. 2024. Cross-modal few-shot learning: A generative transfer learning approach. arXiv:2410.10663. Retrieved from https:\/\/arxiv.org\/abs\/2410.10663"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","unstructured":"Jing Zhang Zhaolong Hong Xu Chen and Yunsong Li. 2024. Few-shot object detection for remote sensing imagery using meta-learning. Remote Sensing 16 19(2024) 3630. DOI:10.3390\/rs16193630","DOI":"10.3390\/rs16193630"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","unstructured":"Sathishkumar Moorthy Sachin Sakthi K.S. Sathiyamoorthi Arthanari Jae Hoon Jeong and Young Hoon Joo. 2024. Hybrid multi-attention transformer for robust video object detection. Engineering Applications of Artificial Intelligence 139 Part B (2025) 109606. DOI:10.1016\/j.engappai.2024.109606","DOI":"10.1016\/j.engappai.2024.109606"},{"key":"e_1_3_1_11_2","unstructured":"Zhaochong An Guolei Sun Yun Liu Runjia Li Min Wu Ming-Ming Cheng Ender Konukoglu and Serge Belongie. 2025. Multimodality Helps few-shot 3D point cloud semantic segmentation. In Proceedings of the International Conference on Learning Representations (ICLR 2025). arXiv:2410.22489. Retrieved from https:\/\/arxiv.org\/abs\/2410.22489"},{"key":"e_1_3_1_12_2","unstructured":"Chuhan Zhang Chaoyang Zhu Pingcheng Dong Long Chen and Dong Zhang. 2025. Cyclic contrastive knowledge transfer for open-vocabulary object detection. In Proceedings of the International Conference on Learning Representations (ICLR 2025). arXiv:2503.11005. Retrieved from https:\/\/arxiv.org\/abs\/2503.11005"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","unstructured":"Yifan Zhuang Pei Liu Hao Yang Kai Zhang Yinhai Wang and Ziyuan Pu. 2025. Few-shot learning for novel object detection in autonomous driving. Communications in Transportation Research 5 (2025) 100194. DOI:10.1016\/j.commtr.2025.100194","DOI":"10.1016\/j.commtr.2025.100194"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","unstructured":"Junchi Su Xin Gao Heping Lu Baofeng Li Feng Zhai Xiao Fang Taizhi Wang and Qiangwei Li. 2025. Generalized few-shot object detection: Challenges and solutions. IEEE Transactions on Circuits and Systems for Video Technology 35 7 (2025) 6979\u20136992. DOI:10.1109\/TCSVT.2025.3542292","DOI":"10.1109\/TCSVT.2025.3542292"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","unstructured":"Ruoyu Chen Hua Zhang Jingzhi Li Li Liu Zhen Huang and Xiaochun Cao. 2025. Generalized semantic contrastive learning via embedding side information. IEEE Transactions on Pattern Analysis and Machine Intelligence 47 (2025) 6496\u20136513. DOI:10.1109\/TPAMI.2025.3560033","DOI":"10.1109\/TPAMI.2025.3560033"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","unstructured":"Yunqing Jiang Sunyuan Qiang Wuchen Li and Yanyan Liang. 2025. LLM-DiffAug: Enhancing few-shot object detection via LLM-guided data augmentation. Knowledge-Based Systems 308 (2025) 111116. DOI:10.1016\/j.knosys.2025.111116","DOI":"10.1016\/j.knosys.2025.111116"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","unstructured":"Hanwen Zhang Houze Guo Haojie Bai Chuanfang Zhang and Linlin Li. 2025. MAML-based temporal supervised information maximizing GAN for few-shot learning. Expert Systems with Applications. 129342. DOI:10.1016\/j.eswa.2025.029574","DOI":"10.1016\/j.eswa.2025.029574"},{"key":"e_1_3_1_18_2","unstructured":"Kaining Ying Hengrui Hu and Henghui Ding. 2025. MOVE: Motion-guided few-shot video object segmentation. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV 2025). 11632\u201311642."},{"key":"e_1_3_1_19_2","unstructured":"J. Chen J. Mei L. Chen F. Zhao Y. Xing and Y. Hu. 2024. Proto-OOD: Enhancing OOD object detection with prototype feature similarity. arXiv:2409.05466. Retrieved from https:\/\/arxiv.org\/abs\/2409.05466"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","unstructured":"X. Chen W. Jiang H. Qi M. Liu H. Ma P. L. H. Yu Y. Wen Z. Han S. Zhang and G. Cao. 2024. Adaptive meta-knowledge transfer network for few-shot object detection in very high resolution remote sensing images. International Journal of Applied Earth Observation and Geoinformation 127 (2024) 103675. DOI:10.1016\/j.jag.2024.000293","DOI":"10.1016\/j.jag.2024.000293"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","unstructured":"Y. Chen X. Xu and C. Liu. 2024. Few-shot meta transfer learning-based damage detection of composite structures. Smart Materials and Structures 33 2 (2024) 025027. DOI:10.1088\/1361-665X\/ad1ded","DOI":"10.1088\/1361-665X\/ad1ded"},{"key":"e_1_3_1_22_2","unstructured":"Vishal Chudasama Hiran Sarkar Pratik Wasnik M. Tanveer and Aruna Tiwari. 2024. Beyond few-shot object detection: A detailed survey. arXiv:2408.14249. Retrieved from https:\/\/arxiv.org\/abs\/2408.14249"},{"key":"e_1_3_1_23_2","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Collins L.","year":"2020","unstructured":"L. Collins, A. Mokhtari, and S. Shakkottai. 2020. Task-robust model-agnostic meta-learning. In Proceedings of the Advances in Neural Information Processing Systems. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2020\/hash\/da8ce53cf0240070ce6c69c48cd588ee-Abstract.html"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","unstructured":"Daniel Cores Lorenzo Seidenari Alberto Del Bimbo V\u00edctor M. Brea and Manuel Mucientes. 2025. A fine-tuning approach based on spatio-temporal features for few-shot video object detection. Engineering Applications of Artificial Intelligence 146 (2025) 110198. DOI:10.1016\/j.engappai.2025.110198","DOI":"10.1016\/j.engappai.2025.110198"},{"key":"e_1_3_1_25_2","volume-title":"Proceedings of the 2025 IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV 2025)","year":"2025","unstructured":"Arghya De, Vrisha Sengar, Dulam Thapar, Maneesh Chandran, and Maneesh Kaul. 2025. Elemental composite prototypical network: Few-shot object detection on outdoor 3D point cloud scenes. In Proceedings of the 2025 IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV 2025). Retrieved from https:\/\/openaccess.thecvf.com\/content\/WACV2025\/papers\/De_Elemental_Composite_Prototypical_Network_Few-Shot_Object_Detection_on_Outdoor_3D_WACV_2025_paper.pdf"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","unstructured":"Enerst Edozie Aliyu Nuhu Shuaibu Ukagwu Kelechi John and Bashir Olaniyi Sadiq. 2025. Comprehensive review of recent developments in visual object detection based on deep learning. Artificial Intelligence Review 58 277(2025). DOI:10.1007\/s10462-025-11284-w","DOI":"10.1007\/s10462-025-11284-w"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-20044-1_5"},{"key":"e_1_3_1_28_2","doi-asserted-by":"crossref","unstructured":"S. R. Fatema and S. Maradithaya. 2024. Meta learning approach based on episodic learning for few-shot image classification. Journal of Image and Graphics 12 2 (2024) 139\u2013148. Retrieved from https:\/\/www.joig.net\/2024\/JOIG-V12N2-205.pdf","DOI":"10.18178\/joig.12.2.205-214"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW67362.2025.00103"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","unstructured":"J. E. Gallagher and E. J. Oughton. 2025. Surveying YOLO multispectral object detection: Applications advancements and challenges. IEEE Access 13 (2025) 12456\u201312478. DOI:10.1109\/ACCESS.2025.3526458","DOI":"10.1109\/ACCESS.2025.3526458"},{"key":"e_1_3_1_31_2","unstructured":"Wenbin Guan Zijiu Yang Xiaohong Wu Liqiong Chen Feng Huang Xiaohai He and Honggang Chen. 2024. Efficient meta-learning enabled lightweight multiscale few-shot object detection in remote sensing images. arXiv preprint arXiv:2404.18426 (2024)."},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","unstructured":"Guangxing Han and Ser-Nam Lim. 2024. Few-Shot Object detection with foundation models. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024). 28608\u201328618.","DOI":"10.1109\/CVPR52733.2024.02703"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/SMC53992.2023.10394197"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","unstructured":"S. Hao T. Li W. Li T. Qi and X. Ma. 2025. Few-shot object detection in unmanned aerial vehicles based transmission line inspection: A method based on transfer learning and attention mechanism. IEEE Sensors Journal (2025). DOI:10.1109\/JSEN.2025.3558229","DOI":"10.1109\/JSEN.2025.3558229"},{"key":"e_1_3_1_35_2","doi-asserted-by":"crossref","unstructured":"Cheng-Ju Ho Chen-Hsuan Tai Yen-Yu Lin Ming-Hsuan Yang and Yi-Hsuan Tsai. 2023. Diffusion-SS3D: Diffusion model for semi-supervised 3D object detection. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) 36 (2023) 49100\u201349112.","DOI":"10.52202\/075280-2134"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","unstructured":"Timothy Hospedales Antreas Antoniou Paul Micaelli and Amos Storkey. 2022. Meta-learning in neural networks: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 44 9 (2022) 5149\u20135169. DOI:10.1109\/TPAMI.2021.3079209","DOI":"10.1109\/TPAMI.2021.3079209"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","unstructured":"Gabriel Huang Issam Laradji David V\u00e1zquez Simon Lacoste-Julien and Pau Rodr\u00edguez. 2023. A survey of self-supervised and few-shot object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 45 4 (2023) 4071\u20134089. DOI:10.1109\/TPAMI.2022.3199617","DOI":"10.1109\/TPAMI.2022.3199617"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","unstructured":"Licheng Jiao Ruohan Zhang Fang Liu Shuyuan Yang Biao Hou Lingling Li and Xu Tang. 2022. New generation deep learning for video object detection: A survey. IEEE Transactions on Neural Networks and Learning Systems 33 8 (2022) 3195\u20133215. DOI:10.1109\/TNNLS.2021.3053249","DOI":"10.1109\/TNNLS.2021.3053249"},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","unstructured":"K. J. Joseph Jathushan Rajasegaran Salman Khan Fahad Shahbaz Khan and Vineeth N. Balasubramanian. 2022. Incremental object detection via meta-learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 44 12 (2022) 9209\u20139216. DOI:10.1109\/TPAMI.2021.3124133","DOI":"10.1109\/TPAMI.2021.3124133"},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","unstructured":"Mona K\u00f6hler Markus Eisenbach and Horst-Michael Gross. 2024. Few-shot object detection: A comprehensive survey. IEEE Transactions on Neural Networks and Learning Systems 35 9 (2024) 11958\u201311978. DOI:10.1109\/TNNLS.2023.3265051","DOI":"10.1109\/TNNLS.2023.3265051"},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","unstructured":"Akash Kumar and Yogesh Singh Rawat. 2022. End-to-end semi-supervised learning for video action detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022). 14680\u201314690. DOI:10.1109\/CVPR52688.2022.01429","DOI":"10.1109\/CVPR52688.2022.01429"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-19842-7_25"},{"key":"e_1_3_1_43_2","unstructured":"Shuangzhi Li Junlong Shen Lei Ma and Xingyu Li. 2025. From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning. arXiv:2503.06282. Retrieved from https:\/\/arxiv.org\/abs\/2503.06282"},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","unstructured":"Chuandong Liu Chenqiang Gao Fangcen Liu Pengcheng Li Deyu Meng and Xinbo Gao. 2023. Hierarchical supervision and shuffle data augmentation for 3D semi-supervised object detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023). 23819\u201323828. DOI:10.1109\/CVPR52729.2023.02281","DOI":"10.1109\/CVPR52729.2023.02281"},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","unstructured":"Chuandong Liu Chenqiang Gao Fangcen Liu Jiang Liu Deyu Meng and Xinbo Gao. 2022. SS3D: Sparsely-supervised 3D object detection from point cloud. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022). 8418\u20138427. DOI:10.1109\/CVPR52688.2022.00824","DOI":"10.1109\/CVPR52688.2022.00824"},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","unstructured":"Tianyi Lyu Dian Gu Peiyuan Chen Yaoting Jiang Zhenhong Zhang Huadong Pang Li Zhou and Yiping Dong. 2024. Optimized CNNs for rapid 3D point cloud object recognition. 10.48550\/arXiv.2412.02855. Retrieved from https:\/\/arxiv.org\/abs\/2412.02855","DOI":"10.48550\/arXiv.2412.02855"},{"key":"e_1_3_1_47_2","doi-asserted-by":"crossref","unstructured":"Anish Madan Neehar Peri Shu Kong and Deva Ramanan. 2024. Revisiting few-shot object detection with vision-language models. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) 37 (2024) 19547\u201319560.","DOI":"10.52202\/079017-0617"},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","unstructured":"Tanvir Mahmud Chun-Hao Liu Burhaneddin Yaman and Diana Marculescu. 2024. SSVOD: Semi-supervised video object detection with sparse annotations. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV 2024). 6759\u20136768. DOI:10.1109\/WACV57701.2024.00663","DOI":"10.1109\/WACV57701.2024.00663"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","unstructured":"Jiageng Mao Shaoshuai Shi Xiaogang Wang and Hongsheng Li. 2023. 3D object detection for autonomous driving: A comprehensive survey. International Journal of Computer Vision 131 8 (2023) 1909\u20131963. DOI:10.1007\/s11263-023-01790-1","DOI":"10.1007\/s11263-023-01790-1"},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","unstructured":"Qinghao Meng Wenguan Wang Tianfei Zhou Jianbing Shen Yunde Jia and Luc Van Gool. 2022. Towards a weakly supervised framework for 3D point cloud object detection and annotation. IEEE Transactions on Pattern Analysis and Machine Intelligence 44 8 (2022) 4454\u20134468. DOI:10.1109\/TPAMI.2021.3063611","DOI":"10.1109\/TPAMI.2021.3063611"},{"key":"e_1_3_1_51_2","doi-asserted-by":"crossref","unstructured":"Qinghao Meng Wenguan Wang Tianfei Zhou Jianbing Shen Luc Van Gool and Dengxin Dai. 2020. Weakly supervised 3D object detection from lidar point cloud. In Proceedings of the European Conference on Computer Vision (ECCV 2020). Springer 515\u2013531.","DOI":"10.1007\/978-3-030-58601-0_31"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","unstructured":"Samiyaa Yaseen Mohammed. 2025. Architecture review: Two-stage and one-stage object detection. Franklin Open 12 (2025) 100322. DOI:10.1016\/j.fraope.2025.100322","DOI":"10.1016\/j.fraope.2025.100322"},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","unstructured":"Narendra Mohan and Manoj Kumar. 2025. Tuned YOLOv4 model for indoor 3D object detection from point cloud data. In Proceedings of the 2025 International Conference on Intelligent Computing and Control Systems (ICICCS). IEEE 815\u2013820. DOI:10.1109\/ICCCS2025.10985198","DOI":"10.1109\/ICCCS2025.10985198"},{"key":"e_1_3_1_54_2","doi-asserted-by":"publisher","unstructured":"Divya Nimma Omaia Al-Omari Rahul Pradhan Zoirov Ulmas R. V. V. Krishna Ts Yousef A. Baker El-Ebiary and Vuda Sreenivasa Rao. 2025. Object detection in real-time video surveillance using attention based transformer-YOLOv8 model. Alexandria Engineering Journal 118 (2025) 482\u2013495. DOI:10.1016\/j.aej.2024.12.468","DOI":"10.1016\/j.aej.2024.12.468"},{"key":"e_1_3_1_55_2","unstructured":"OpenMMLab Contributors. 2021. MMFewShot: OpenMMLab Few Shot Learning Toolbox and Benchmark. Retrieved from https:\/\/github.com\/open-mmlab\/mmfewshot. Accessed: 2025."},{"key":"e_1_3_1_56_2","doi-asserted-by":"publisher","unstructured":"Z. Ouardirhi S. A. Mahmoudi and M. Zbakh. 2024. Enhancing object detection in smart video surveillance: A survey of occlusion-handling approaches. Electronics 13 3 (2024) 541. DOI:10.3390\/electronics13030541","DOI":"10.3390\/electronics13030541"},{"key":"e_1_3_1_57_2","doi-asserted-by":"crossref","unstructured":"Jinhyung Park Chenfeng Xu Yiyang Zhou Masayoshi Tomizuka and Wei Zhan. 2022. Detmatch: Two teachers are better than one for joint 2D and 3D semi-supervised object detection. In Proceedings of the European Conference on Computer Vision (ECCV 2022). Springer 370\u2013389.","DOI":"10.1007\/978-3-031-20080-9_22"},{"key":"e_1_3_1_58_2","doi-asserted-by":"crossref","unstructured":"Yongri Piao Chenyang Lu Miao Zhang and Huchuan Lu. 2022. Semi-supervised video salient object detection based on uncertainty-guided pseudo labels. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) 35 (2022) 5614\u20135627.","DOI":"10.52202\/068431-0406"},{"key":"e_1_3_1_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00937"},{"key":"e_1_3_1_60_2","doi-asserted-by":"crossref","unstructured":"Zengyi Qin Jinglu Wang and Yan Lu. 2020. Weakly supervised 3D object detection from point clouds. In Proceedings of the 28th ACM International Conference on Multimedia. 4144\u20134152.","DOI":"10.1145\/3394171.3413805"},{"key":"e_1_3_1_61_2","doi-asserted-by":"crossref","unstructured":"Yuhan Gao Peng Wang Xiaoyan Li Bo Sun Mengyu Sun Liangliang Li and Ruohai Di. 2025. PillarFocusNet for 3D object detection with perceptual diffusion and key feature understanding. Scientific Reports 15 1 (2025) 8776.","DOI":"10.1038\/s41598-025-92338-5"},{"key":"e_1_3_1_62_2","unstructured":"Roboflow. 2025. Foundational Few-Shot Object Detection Challenge [CVPR 2025]. Retrieved May 13 2025 from https:\/\/blog.roboflow.com\/foundational-few-shot-object-detection-challenge-cvpr-2025\/"},{"key":"e_1_3_1_63_2","unstructured":"Ranjan Sapkota Konstantinos I. Roumeliotis Rahul Harsha Cheppally Marco Flores Calero and Manoj Karkee. 2025. A Review of 3D Object Detection with Vision-Language Models. arXiv:2504.18738. Retrieved from https:\/\/arxiv.org\/abs\/2504.18738"},{"key":"e_1_3_1_64_2","doi-asserted-by":"publisher","unstructured":"Mohammadreza Saraei Mehrshad Lalinia and Eung-Joo Lee. 2025. Deep learning-based medical object detection: A survey. IEEE Access 13 (2025) 53019\u201353038. DOI:10.1109\/ACCESS.2025.3553087","DOI":"10.1109\/ACCESS.2025.3553087"},{"key":"e_1_3_1_65_2","doi-asserted-by":"crossref","unstructured":"Deepak Kumar Singh Dibakar Raj Pant Ganesh Gautam and Bhanu Shrestha. 2025. Meta-learning approach for adaptive anomaly detection from multi-scenario video surveillance. Applied Sciences 15 12 (2025) 6687.","DOI":"10.3390\/app15126687"},{"key":"e_1_3_1_66_2","doi-asserted-by":"publisher","unstructured":"Mohana Singh B. S. Vivek Jayavardhana Gubbi and R. Venkatesh Babu. 2025. ProtoPatchNet: An interpretable patch-based prototypical network. In Proceedings of the 2025 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2025). 712\u2013719. DOI:10.1109\/CVPRW67362.2025.00076","DOI":"10.1109\/CVPRW67362.2025.00076"},{"key":"e_1_3_1_67_2","unstructured":"Jake Snell Kevin Swersky and Richard Zemel. 2017. Prototypical networks for few-shot learning. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS) 30 (2017)."},{"key":"e_1_3_1_68_2","doi-asserted-by":"publisher","unstructured":"Jiadai Sun Yuxin Mao Yuchao Dai Yiran Zhong and Jianyuan Wang. 2023. MUNet: Motion uncertainty-aware semi-supervised video object segmentation. Pattern Recognition 138 (2023) 109399. DOI:10.1016\/j.patcog.2023.109399","DOI":"10.1016\/j.patcog.2023.109399"},{"key":"e_1_3_1_69_2","unstructured":"Jiawei Liu Xingping Dong Sanyuan Zhao and Jianbing Shen. 2023. Generalized few-shot 3D object detection of LiDAR point cloud for autonomous driving. arXiv:2302.03914. Retrieved from https:\/\/arxiv.org\/abs\/2302.03914"},{"key":"e_1_3_1_70_2","doi-asserted-by":"crossref","unstructured":"Tanuj Sur Samrat Mukherjee Kaizer Rahaman Subhasis Chaudhuri Muhammad Haris Khan and Biplab Banerjee. 2025. Hyperbolic uncertainty-aware few-shot incremental point cloud segmentation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025).","DOI":"10.1109\/CVPR52734.2025.01103"},{"key":"e_1_3_1_71_2","doi-asserted-by":"publisher","unstructured":"Ankit Kumar Tiwari and Gyanendra Kumar Sharma. 2024. FS-3DSSN: An efficient few-shot learning for single-stage 3D object detection on point clouds. The Visual Computer 40 (2024) 8125\u20138139. DOI:10.1007\/s00371-023-03228-8","DOI":"10.1007\/s00371-023-03228-8"},{"key":"e_1_3_1_72_2","unstructured":"Michelle Guo Edward Chou De-An Huang Shuran Song Serena Yeung and Li Fei-Fei. 2018. Neural graph matching networks for fewshot 3D action recognition. In Proceedings of the European Conference on Computer Vision (ECCV 2018)."},{"key":"e_1_3_1_73_2","unstructured":"Shuaihang Yuan Xiang Li Hao Huang and Yi Fang. 2022. Meta-Det3D: Learn to learn few-shot 3D object detection. In Proceedings of the Asian Conference on Computer Vision (ACCV 2022). 1761\u20131776."},{"key":"e_1_3_1_74_2","doi-asserted-by":"crossref","unstructured":"Yu-Xiong Wang Deva Ramanan and Martial Hebert. 2019. Meta-learning to detect rare objects. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV 2019).","DOI":"10.1109\/ICCV.2019.01002"},{"key":"e_1_3_1_75_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i3.25370"},{"key":"e_1_3_1_76_2","unstructured":"Yun Wang Long Zhang Jingren Liu Jiaqi Yan Zhanjie Zhang Jiahao Zheng Xun Yang Dapeng Wu Xiangyu Chen and Xuelong Li. 2025. Episodic memory representation for long-form video understanding. arXiv:2508.09486."},{"key":"e_1_3_1_77_2","doi-asserted-by":"publisher","unstructured":"Xiongwei Wu Doyen Sahoo and Steven C. H. Hoi. 2020. Meta-RCNN: Meta learning for few-shot object detection. In Proceedings of the 28th ACM International Conference on Multimedia. DOI:10.1145\/3394171.3413832","DOI":"10.1145\/3394171.3413832"},{"key":"e_1_3_1_78_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.00575"},{"key":"e_1_3_1_79_2","doi-asserted-by":"publisher","unstructured":"Zhimeng Xin Shiming Chen Tianxu Wu Yuanjie Shao Weiping Ding and Xinge You. 2024. Few-shot object detection: Research advances and challenges. Information Fusion 107 (2024) 102307. DOI:10.1016\/j.inffus.2024.102307","DOI":"10.1016\/j.inffus.2024.102307"},{"key":"e_1_3_1_80_2","doi-asserted-by":"publisher","unstructured":"Liang Yu Lin Tang and Lisha Mu. 2025. A review of detection transformer: From basic architecture to advanced developments and visual perception applications. Sensors 25 13 (2025) 3952. DOI:10.3390\/s25133952","DOI":"10.3390\/s25133952"},{"key":"e_1_3_1_81_2","doi-asserted-by":"publisher","unstructured":"Zhongjie Yu Gaoang Wang Lin Chen Sebastian Raschka and Jiebo Luo. 2022. When few-shot learning meets video object detection. In 2022 26th International Conference on Pattern Recognition (ICPR). IEEE 2986\u20132992. DOI:10.1109\/ICPR56361.2022.9956303","DOI":"10.1109\/ICPR56361.2022.9956303"},{"key":"e_1_3_1_82_2","doi-asserted-by":"publisher","unstructured":"Yuan-Zhi Feng Shing-Ho J. Lin Xuan Tang Mu-Yu Wang Jian-Zhang Zheng Zi-Yao He Zi-Yi Pang Jian Yang Ming-Song Chen and Xian Wei. 2025. Hyperbolic prototype rectification for few-shot 3D point cloud classification. Pattern Recognition 158 (2025) 111042. DOI:10.1016\/j.patcog.2024.111042","DOI":"10.1016\/j.patcog.2024.111042"},{"key":"e_1_3_1_83_2","doi-asserted-by":"publisher","unstructured":"Jinghua Zhang Li Liu Olli Silven Matti Pietikainen and Dewen Hu. 2025. Few-shot class-incremental learning for classification and object detection: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 47 4 (2025) 2924\u20132945. DOI:10.1109\/TPAMI.2025.3529038","DOI":"10.1109\/TPAMI.2025.3529038"},{"key":"e_1_3_1_84_2","doi-asserted-by":"publisher","unstructured":"Yibo Zhang and Suofei Zhang. 2025. YW-FSVOD: Open-vocabulary few-shot object detection method based on YOLO-world. Software Engineering and Applications 14 7 (2025) 456\u2013472. DOI:10.12677\/SEA.2025.147045","DOI":"10.12677\/SEA.2025.147045"},{"key":"e_1_3_1_85_2","doi-asserted-by":"crossref","unstructured":"Shijia Zhao Qiming Xia Xusheng Guo Pufan Zou Maoji Zheng Hai Wu Chenglu Wen and Cheng Wang. 2025. SP3D: Boosting sparsely-supervised 3D object detection via accurate cross-modal semantic prompts. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025). 29374\u201329384.","DOI":"10.1109\/CVPR52734.2025.02735"},{"key":"e_1_3_1_86_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01655"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3790093","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3790093","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,15]],"date-time":"2026-03-15T13:41:08Z","timestamp":1773582068000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3790093"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,10]]},"references-count":85,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2026,7,31]]}},"alternative-id":["10.1145\/3790093"],"URL":"https:\/\/doi.org\/10.1145\/3790093","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,10]]},"assertion":[{"value":"2023-10-31","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-01-04","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-03-10","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}