{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,28]],"date-time":"2026-06-28T06:03:47Z","timestamp":1782626627433,"version":"3.54.5"},"reference-count":67,"publisher":"Springer Science and Business Media LLC","issue":"8","license":[{"start":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:00:00Z","timestamp":1750204800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:00:00Z","timestamp":1750204800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["42271409"],"award-info":[{"award-number":["42271409"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2025,8]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>As an important branch of remote sensing technology, aerial image target detection plays an indispensable role in supporting urban planning, disaster assessment, and other fields. However, this task faces many challenges such as small object size and complex background, which increase the difficulty of detection. Existing methods usually use multi-scale feature fusion or attention mechanism to improve performance, but they often ignore the role of object feature perception in the image and have problems such as insufficient use of context information. To address these problems, we propose the VMC-Net framework to optimize the aerial image object detection task. The VHeat C2f module enhances the feature extraction capability and generates a clearer target feature map; the multi-scale feature aggregation and distribution module adds feature distribution technology on the basis of the multi-scale feature fusion strategy to achieve more effective scale interaction; the contextual attention guided fusion module uses attention mechanism and weighted fusion method to effectively utilize context information and significantly improve the performance of small object detection. We evaluate the VMC-Net framework on the AI-TOD, VisDrone-2019 and TinyPerson datasets. Experimental results show that our framework outperforms the mainstream target detection methods in the past three years in aerial object detection, with mAP50 scores of 45.6%, 45.9%, and 25.4% respectively.<\/jats:p>","DOI":"10.1007\/s40747-025-01888-8","type":"journal-article","created":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:05:38Z","timestamp":1750205138000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["VMC-Net: multi-scale feature aggregation and distribution with contextual attention guided fusion for aerial object detection"],"prefix":"10.1007","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-0178-9056","authenticated-orcid":false,"given":"Haodong","family":"Li","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2351-3154","authenticated-orcid":false,"given":"Haicheng","family":"Qu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,6,18]]},"reference":[{"issue":"1","key":"1888_CR1","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1007\/s40747-024-01652-4","volume":"11","author":"Dandan Liao","year":"2025","unstructured":"Liao Dandan, Zhang Jianxun, Tao Ye, Jin Xie (2025) Atbhc-yolo: aggregate transformer and bidirectional hybrid convolution for small object detection. Complex & Intelligent Systems 11(1):38","journal-title":"Complex & Intelligent Systems"},{"key":"1888_CR2","doi-asserted-by":"publisher","DOI":"10.1016\/j.displa.2022.102299","volume":"75","author":"Cong Hua","year":"2022","unstructured":"Hua Cong, Zhong Baojiang, Song Weigang, Yang Jianyu (2022) Circular coding: A technique for visual localization in urban areas. Displays 75:102299","journal-title":"Displays"},{"key":"1888_CR3","doi-asserted-by":"publisher","DOI":"10.1016\/j.displa.2023.102574","volume":"81","author":"YingHong Tian","year":"2024","unstructured":"Tian YingHong, Zhang Kun, Xingbo Hu, Yue Lu (2024) Crop type recognition of vgi road-side images via hierarchy structure based on semantic segmentation model deeplabv3+. Displays 81:102574","journal-title":"Displays"},{"issue":"6","key":"1888_CR4","doi-asserted-by":"publisher","first-page":"2797","DOI":"10.1007\/s40747-021-00457-z","volume":"7","author":"Khushbu Maurya","year":"2021","unstructured":"Maurya Khushbu, Mahajan Seema, Chaube Nilima (2021) Remote sensing techniques: Mapping and monitoring of mangrove ecosystem-a review. Complex & Intelligent Systems 7(6):2797\u20132818","journal-title":"Complex & Intelligent Systems"},{"key":"1888_CR5","first-page":"1","volume":"19","author":"Weiyang Chen","year":"2021","unstructured":"Chen Weiyang, Wang Haifeng, Li Hao, Li Quanjing, Yang Yang, Yang Kun (2021) Real-time garbage object detection with data augmentation and feature fusion using suav low-altitude remote sensing images. IEEE Geoscience and Remote Sensing Letters 19:1\u20135","journal-title":"IEEE Geoscience and Remote Sensing Letters"},{"issue":"12","key":"1888_CR6","doi-asserted-by":"publisher","first-page":"2046","DOI":"10.3390\/rs16122046","volume":"16","author":"Jasper Baur","year":"2024","unstructured":"Baur Jasper, Dewey Kyle, Steinberg Gabriel, Nitsche Frank O (2024) Modeling the effect of vegetation coverage on unmanned aerial vehicles-based object detection: A study in the minefield environment. Remote Sensing 16(12):2046","journal-title":"Remote Sensing"},{"key":"1888_CR7","doi-asserted-by":"crossref","unstructured":"Yue Ma, Lu Zhang, Ding Chen, Haolei Zhang, Yitong Zheng, Shiyong Yan (2024) Inversion of reservoir parameters for oil extraction based on deformation monitoring with insar. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","DOI":"10.1109\/JSTARS.2024.3406022"},{"key":"1888_CR8","doi-asserted-by":"crossref","unstructured":"Han Qinzhe, Yin Qian, Zheng Xin, Chen Ziyi (2021) Remote sensing image building detection method based on mask r-cnn. Complex & Intelligent Systems, pages 1\u20139","DOI":"10.1007\/s40747-021-00322-z"},{"issue":"12","key":"1888_CR9","first-page":"2688","volume":"27","author":"Yu Wenqi","year":"2024","unstructured":"Wenqi Yu, Gong Cheng, Meijun Wang, Yanqing Yao, Xingxing Xie, Xiwen Yao, Junwei Han (2024) Mar20: A benchmark for military aircraft recognition in remote sensing images. National Remote Sensing Bulletin 27(12):2688\u20132696","journal-title":"National Remote Sensing Bulletin"},{"key":"1888_CR10","doi-asserted-by":"crossref","unstructured":"Sree Soumya D, Aishwarya Ch, Vasavi S (2021) Fpga-based military vehicles detection and classification from drone videos using yolov5. In International Conference on Energy Systems, Drives and Automations, pages 265\u2013276. Springer","DOI":"10.1007\/978-981-99-3691-5_22"},{"key":"1888_CR11","unstructured":"Li Chuyi, Li Lulu, Geng Yifei, Jiang Hongliang, Cheng Meng, Zhang Bo, Ke Zaidan, Xu Xiaoming, Chu Xiangxiang (2023) Yolov6 v3. 0: A full-scale reloading. arXiv preprint arXiv:2301.05586"},{"key":"1888_CR12","doi-asserted-by":"crossref","unstructured":"Wang Chien-Yao, Bochkovskiy Alexey, Liao Hong-Yuan Mark (2023) Yolov7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pages 7464\u20137475","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"1888_CR13","doi-asserted-by":"crossref","unstructured":"Zhang Shifeng, Chi Cheng, Yao Yongqiang, Lei Zhen, Li Stan Z (2020) Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pages 9759\u20139768","DOI":"10.1109\/CVPR42600.2020.00978"},{"key":"1888_CR14","unstructured":"Ge Z (2021) Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430"},{"key":"1888_CR15","unstructured":"Zhang Hongyi (2017) mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412"},{"key":"1888_CR16","doi-asserted-by":"crossref","unstructured":"Yun Sangdoo, Han Dongyoon, Oh Seong Joon, Chun Sanghyuk, Choe Junsuk, Yoo Youngjoon (2019) Cutmix: Regularization strategy to train strong classifiers with localizable features. In Proceedings of the IEEE\/CVF international conference on computer vision, pages 6023\u20136032","DOI":"10.1109\/ICCV.2019.00612"},{"key":"1888_CR17","doi-asserted-by":"publisher","first-page":"12993","DOI":"10.1609\/aaai.v34i07.6999","volume":"34","author":"Zheng Zhaohui","year":"2020","unstructured":"Zhaohui Zheng, Ping Wang, Wei Liu, Jinze Li, Rongguang Ye, Dongwei Ren (2020) Distance-iou loss: Faster and better learning for bounding box regression. In Proceedings of the AAAI conference on artificial intelligence 34:12993\u201313000","journal-title":"In Proceedings of the AAAI conference on artificial intelligence"},{"key":"1888_CR18","doi-asserted-by":"crossref","unstructured":"Bodla Navaneeth, Singh Bharat, Chellappa Rama, Davis Larry S (2017) Soft-nms\u2013improving object detection with one line of code. In Proceedings of the IEEE international conference on computer vision, pages 5561\u20135569","DOI":"10.1109\/ICCV.2017.593"},{"key":"1888_CR19","unstructured":"Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N, Kaiser Lukasz, Polosukhin Illia (2017) Attention is all you need. In Proceedings of the 31st International Conference on Neural Information Processing Systems, pages 6000\u20136010"},{"key":"1888_CR20","doi-asserted-by":"crossref","unstructured":"Carion Nicolas, Massa Francisco, Synnaeve Gabriel, Usunier Nicolas, Kirillov Alexander, Zagoruyko Sergey (2020) End-to-end object detection with transformers. In European conference on computer vision, pages 213\u2013229. Springer","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"1888_CR21","doi-asserted-by":"crossref","unstructured":"Sun Peize, Zhang Rufeng, Jiang Yi, Kong Tao, Xu Chenfeng, Zhan Wei, Tomizuka Masayoshi, Li Lei, Yuan Zehuan, Wang Changhu et al (2021) Sparse r-cnn: End-to-end object detection with learnable proposals. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pages 14454\u201314463","DOI":"10.1109\/CVPR46437.2021.01422"},{"key":"1888_CR22","doi-asserted-by":"crossref","unstructured":"Meng Depu, Chen Xiaokang, Fan Zejia, Zeng Gang, Li Hao, Yuan Yuwen, Sun Lei, Wang Jian, Zhu Xinggang (2021) Conditional detr for fast training convergence. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), pages 3651\u20133660","DOI":"10.1109\/ICCV48922.2021.00363"},{"key":"1888_CR23","unstructured":"Zhang Hao, Li Feng, Liu Shilong, Zhang Lei, Su Hang, Zhu Jun, Ni Lionel, Shum Heung-Yeung Dino: Detr with improved denoising anchor boxes for end-to-end object detection. In The Eleventh International Conference on Learning Representations"},{"key":"1888_CR24","doi-asserted-by":"crossref","unstructured":"Jia Ding, Yuan Yuhui, He Haodi, Wu Xiaopei, Yu Haojun, Lin Weihong, Sun Lei, Zhang Chao, Hu Han (2023) Detrs with hybrid matching. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pages 19702\u201319712","DOI":"10.1109\/CVPR52729.2023.01887"},{"key":"1888_CR25","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2024.124848","volume":"256","author":"Chen Xue","year":"2024","unstructured":"Xue Chen, Xia Yuelong, Mingjie Wu, Chen Zaiqing, Cheng Feiyan, Yun Lijun (2024) El-yolo: An efficient and lightweight low-altitude aerial objects detector for onboard applications. Expert Systems with Applications 256:124848","journal-title":"Expert Systems with Applications"},{"key":"1888_CR26","unstructured":"Jocher Glenn YOLOv5 by Ultralytics, May 2020"},{"issue":"23","key":"1888_CR27","doi-asserted-by":"publisher","first-page":"5499","DOI":"10.3390\/rs15235499","volume":"15","author":"Zhonghua Li","year":"2023","unstructured":"Li Zhonghua, Hou Biao, Zitong Wu, Ren Bo, Yang Chen (2023) Fcosr: A simple anchor-free rotated detector for aerial object detection. Remote Sensing 15(23):5499","journal-title":"Remote Sensing"},{"key":"1888_CR28","doi-asserted-by":"publisher","first-page":"41999","DOI":"10.1109\/ACCESS.2024.3378248","volume":"12","author":"Ou Kaitong","year":"2024","unstructured":"Kaitong Ou, Dong Chaojun, Liu Xiankun, Zhai Yikui, Li Ye, Huang Wanxia, Qiu Wenkang, Wang Yizhi, Wang Chengxuan (2024) Drone-tood: A lightweight task-aligned object detection algorithm for vehicle detection in uav images. IEEE Access 12:41999\u201342016","journal-title":"IEEE Access"},{"key":"1888_CR29","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2024.104898","volume":"142","author":"Jinping Liu","year":"2024","unstructured":"Liu Jinping, Zheng Kunyi, Liu Xianyi, Pengfei Xu, Zhou Ying (2024) Sdsdet: A real-time object detector for small, dense, multi-scale remote sensing objects. Image and Vision Computing 142:104898","journal-title":"Image and Vision Computing"},{"key":"1888_CR30","doi-asserted-by":"crossref","unstructured":"Dai Longgang, Chen Hongming, Li Yufeng, Kong Caihua, Fan Zhentao, Lu Jiyang, Chen Xiang (2022) Tardet: two-stage anchor-free rotating object detector in aerial images. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pages 4267\u20134275","DOI":"10.1109\/CVPRW56347.2022.00472"},{"issue":"1","key":"1888_CR31","doi-asserted-by":"publisher","first-page":"20230105","DOI":"10.1515\/comp-2023-0105","volume":"14","author":"Fu Liangrui","year":"2024","unstructured":"Liangrui Fu, Deng Jinqiu, Zhu Baoliang, Li Zengyan, Liao Xudong (2024) Afod: Two-stage object detection based on anchor-free remote sensing photos. Open Computer Science 14(1):20230105","journal-title":"Open Computer Science"},{"key":"1888_CR32","doi-asserted-by":"crossref","unstructured":"Wang Hongmei, Zhang Jiahe (2024) Enhancing object detection for remote sensing with dynamic heads in oriented r-cnn. In International Conference on Remote Sensing, Mapping, and Image Processing (RSMIP 2024), volume 13167, pages 150\u2013156. SPIE","DOI":"10.1117\/12.3029658"},{"key":"1888_CR33","doi-asserted-by":"crossref","unstructured":"ASM\u00a0Sharifuzzaman Sagar, Yu\u00a0Chen, YaKun Xie, and Hyung\u00a0Seok Kim. Msa r-cnn: A comprehensive approach to remote sensing object detection and scene understanding. Expert Systems with Applications, 241:122788, 2024","DOI":"10.1016\/j.eswa.2023.122788"},{"key":"1888_CR34","unstructured":"Wei Guoting, Yuan Xia, Liu Yu, Shang Zhenhao, Yao Kelu, Li Chao, Yan Qingsen, Zhao Chunxia, Zhang Haokui, Xiao Rong (2024) Ova-detr: Open vocabulary aerial object detection using image-text alignment and fusion. CoRR"},{"key":"1888_CR35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/LGRS.2024.3490732","volume":"21","author":"Xinyu Ma","year":"2024","unstructured":"Ma Xinyu, Lv Pengyuan, Zhong Yanfei (2024) Qetr: A query-enhanced transformer for remote sensing image object detection. IEEE Geoscience and Remote Sensing Letters 21:1\u20135","journal-title":"IEEE Geoscience and Remote Sensing Letters"},{"issue":"19","key":"1888_CR36","doi-asserted-by":"publisher","first-page":"4740","DOI":"10.3390\/rs15194740","volume":"15","author":"Lu Wanjie","year":"2023","unstructured":"Wanjie Lu, Niu Chaoyang, Lan Chaozhen, Liu Wei, Wang Shiju, Junming Yu, Tao Hu (2023) High-quality object detection method for uav images based on improved dino and masked image modeling. Remote Sensing 15(19):4740","journal-title":"Remote Sensing"},{"key":"1888_CR37","doi-asserted-by":"crossref","unstructured":"Wang Xiaoming, Chen Hao, Chu Xiangxiang, Wang Peng (2024) Aodet: Aerial object detection using transformers for foreground regions. IEEE Transactions on Geoscience and Remote Sensing","DOI":"10.1109\/TGRS.2024.3407815"},{"key":"1888_CR38","doi-asserted-by":"crossref","unstructured":"He Kaiming, Zhang Xiangyu, Ren Shaoqing, Sun Jian (2016) Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pages 770\u2013778","DOI":"10.1109\/CVPR.2016.90"},{"key":"1888_CR39","unstructured":"Jocher Glenn, Chaurasia Ayush, Qiu Jing Ultralytics YOLO, January 2023"},{"key":"1888_CR40","doi-asserted-by":"crossref","unstructured":"Wang Zhaozhi, Liu Yue, Liu Yunfan, Yu Hongtian, Wang Yaowei, Ye Qixiang, Tian Yunjie (2024) vheat: Building vision models upon heat conduction. arXiv preprint arXiv:2405.16555","DOI":"10.1109\/CVPR52734.2025.00907"},{"key":"1888_CR41","unstructured":"Howard Andrew G, Zhu Menglong, Chen Bo, Kalenichenko Dmitry, Wang Weijun, Weyand Tobias, Andreetto Marco, Adam Hartwig (2017) Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861"},{"key":"1888_CR42","unstructured":"Dosovitskiy Alexey (2020) An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929"},{"key":"1888_CR43","doi-asserted-by":"crossref","unstructured":"Wang Chien-Yao, Yeh I-Hau, Mark Liao Hong-Yuan (2024) Yolov9: Learning what you want to learn using programmable gradient information. In European conference on computer vision, pages 1\u201321. Springer","DOI":"10.1007\/978-3-031-72751-1_1"},{"key":"1888_CR44","doi-asserted-by":"crossref","unstructured":"Cai Xinhao, Lai Qiuxia, Wang Yuwei, Wang Wenguan, Sun Zeren, Yao Yazhou (2024) Poly kernel inception network for remote sensing detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pages 27706\u201327716","DOI":"10.1109\/CVPR52733.2024.02617"},{"key":"1888_CR45","doi-asserted-by":"crossref","unstructured":"Lee Youngwan, Park Jongyoul (2020) Centermask: Real-time anchor-free instance segmentation. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pages 13906\u201313915","DOI":"10.1109\/CVPR42600.2020.01392"},{"key":"1888_CR46","doi-asserted-by":"crossref","unstructured":"Hu Jie, Shen Li, Sun Gang (2018) Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 7132\u20137141","DOI":"10.1109\/CVPR.2018.00745"},{"key":"1888_CR47","doi-asserted-by":"crossref","unstructured":"Zhao Yian, Lv Wenyu, Xu Shangliang, Wei Jinman, Wang Guanzhong, Dang Qingqing, Liu Yi, Chen Jie (2024) Detrs beat yolos on real-time object detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pages 16965\u201316974","DOI":"10.1109\/CVPR52733.2024.01605"},{"key":"1888_CR48","doi-asserted-by":"crossref","unstructured":"Lin Tsung-Yi, Maire Michael, Belongie Serge, Hays James, Perona Pietro, Ramanan Deva, Doll\u00e1r Piotr, Lawrence Zitnick C (2014) Microsoft coco: Common objects in context. In Computer Vision\u2013ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, pages 740\u2013755. Springer","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"1888_CR49","doi-asserted-by":"crossref","unstructured":"Wang Jinwang, Yang Wen, Guo Haowen, Zhang Ruixiang, Xia Gui-Song (2021) Tiny object detection in aerial images. In 2020 25th international conference on pattern recognition (ICPR), pages 3791\u20133798. IEEE","DOI":"10.1109\/ICPR48806.2021.9413340"},{"key":"1888_CR50","unstructured":"Du Dawei, Zhu Pengfei, Wen Longyin, Bian Xiao, Lin Haibin, Hu Qinghua, Peng Tao, Zheng Jiayu, Wang Xinyao, Zhang Yue, et al (2019) Visdrone-det2019: The vision meets drone object detection in image challenge results. In Proceedings of the IEEE\/CVF international conference on computer vision workshops, pages 0\u20130"},{"key":"1888_CR51","doi-asserted-by":"crossref","unstructured":"Yu Xuehui, Gong Yuqi, Jiang Nan, Ye Qixiang, Han Zhenjun (2020) Scale match for tiny person detection. In Proceedings of the IEEE\/CVF winter conference on applications of computer vision, pages 1257\u20131265","DOI":"10.1109\/WACV45572.2020.9093394"},{"key":"1888_CR52","doi-asserted-by":"crossref","unstructured":"Chen Zehui, Yang Chenhongyi, Li Qiaofei, Zhao Feng, Zha Zheng-Jun, Wu Feng (2021) Disentangle your dense object detector. In Proceedings of the 29th ACM international conference on multimedia, pages 4939\u20134948","DOI":"10.1145\/3474085.3475351"},{"key":"1888_CR53","doi-asserted-by":"crossref","unstructured":"Feng Chengjian, Zhong Yujie, Gao Yu, Scott Matthew R, Huang Weilin (2021) Tood: Task-aligned one-stage object detection. In 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), pages 3490\u20133499. IEEE Computer Society","DOI":"10.1109\/ICCV48922.2021.00349"},{"key":"1888_CR54","unstructured":"Liu Shilong, Li Feng, Zhang Hao, Yang Xiao, Qi Xianbiao, Su Hang, Zhu Jun, Zhang Lei Dab-detr: Dynamic anchor boxes are better queries for detr. In International Conference on Learning Representations"},{"key":"1888_CR55","unstructured":"Lyu Chengqi, Zhang Wenwei, Huang Haian, Zhou Yue, Wang Yudong, Liu Yanyi, Zhang Shilong, Chen Kai (2022) Rtmdet: An empirical study of designing real-time object detectors. arXiv preprint arXiv:2212.07784"},{"key":"1888_CR56","doi-asserted-by":"crossref","unstructured":"Zheng Zhaohui, Ye Rongguang, Wang Ping, Ren Dongwei, Zuo Wangmeng, Hou Qibin, Cheng Ming-Ming (2022) Localization distillation for dense object detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pages 9407\u20139416","DOI":"10.1109\/CVPR52688.2022.00919"},{"key":"1888_CR57","doi-asserted-by":"crossref","unstructured":"Liu Zhuang, Mao Hanzi, Wu Chao-Yuan, Feichtenhofer Christoph, Darrell Trevor, Xie Saining (2022) A convnet for the 2020s. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR52688.2022.01167"},{"key":"1888_CR58","unstructured":"Wang Chengcheng, He Wei, Nie Ying, Guo Jianyuan, Liu Chuanjian, Wang Yunhe, Han Kai (2024) Gold-yolo: Efficient object detector via gather-and-distribute mechanism. Advances in Neural Information Processing Systems, 36"},{"key":"1888_CR59","unstructured":"Wang Ao, Chen Hui, Liu Lihao, Chen Kai, Lin Zijia, Han Jungong, Ding Guiguang (2024) Yolov10: Real-time end-to-end object detection. arXiv preprint arXiv:2405.14458"},{"key":"1888_CR60","doi-asserted-by":"crossref","unstructured":"Hou Qibin, Zhou Daquan, Feng Jiashi (2021) Coordinate attention for efficient mobile network design. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pages 13713\u201313722","DOI":"10.1109\/CVPR46437.2021.01350"},{"key":"1888_CR61","unstructured":"Yang Lingxiao, Zhang Ru-Yuan, Li Lida, Xie Xiaohua (2021) Simam: A simple, parameter-free attention module for convolutional neural networks. In International conference on machine learning, pages 11863\u201311874. PMLR"},{"key":"1888_CR62","doi-asserted-by":"crossref","unstructured":"Woo Sanghyun, Park Jongchan, Lee Joon-Young, Kweon In So (2018) Cbam: Convolutional block attention module. In Proceedings of the European conference on computer vision (ECCV), pages 3\u201319","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"1888_CR63","unstructured":"Xu Wei, Wan Yi (2024) Ela: Efficient local attention for deep convolutional neural networks. arXiv preprint arXiv:2403.01123"},{"issue":"5","key":"1888_CR64","doi-asserted-by":"publisher","first-page":"5185","DOI":"10.1007\/s40747-023-00999-4","volume":"9","author":"Yu Sun","year":"2023","unstructured":"Sun Yu, Feng Jian (2023) Fire and smoke precise detection method based on the attention mechanism and anchor-free mechanism. Complex & Intelligent Systems 9(5):5185\u20135198","journal-title":"Complex & Intelligent Systems"},{"issue":"6","key":"1888_CR65","doi-asserted-by":"publisher","first-page":"8095","DOI":"10.1007\/s40747-024-01580-3","volume":"10","author":"Lang Zhang","year":"2024","unstructured":"Zhang Lang, Huang Zhan Ao, Shi Canghong, Ma Hongjiang, Li Xiaojie, Xi Wu (2024) Mfpidet: improved yolov7 architecture based on multi-scale feature fusion for prohibited item detection in complex environment. Complex & Intelligent Systems 10(6):8095\u20138108","journal-title":"Complex & Intelligent Systems"},{"key":"1888_CR66","unstructured":"Shao Rui, Zhou Mingle, Li Min, Han Delong, Li Gang (2024) Td-net: tiny defect detection network for industrial products. Complex & Intelligent Systems, pages 1\u201312"},{"key":"1888_CR67","doi-asserted-by":"crossref","unstructured":"Zhao Langyue, Wu Yiquan, Yuan Yubin (2024) Pd-detr: towards efficient parallel hybrid matching with transformer for photovoltaic cell defects detection. Complex & Intelligent Systems, pages 1\u201314","DOI":"10.1007\/s40747-024-01559-0"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-025-01888-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-025-01888-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-025-01888-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,6]],"date-time":"2025-09-06T20:53:36Z","timestamp":1757192016000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-025-01888-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,18]]},"references-count":67,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2025,8]]}},"alternative-id":["1888"],"URL":"https:\/\/doi.org\/10.1007\/s40747-025-01888-8","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,6,18]]},"assertion":[{"value":"1 December 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 March 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 June 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"350"}}