{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,9]],"date-time":"2026-04-09T18:48:51Z","timestamp":1775760531728,"version":"3.50.1"},"reference-count":41,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,12,20]],"date-time":"2022-12-20T00:00:00Z","timestamp":1671494400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment","award":["APKLSATE2021x004"],"award-info":[{"award-number":["APKLSATE2021x004"]}]},{"name":"the Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment","award":["2008085MF203"],"award-info":[{"award-number":["2008085MF203"]}]},{"name":"the Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment","award":["201904a06020056"],"award-info":[{"award-number":["201904a06020056"]}]},{"name":"the Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment","award":["202104a06020012"],"award-info":[{"award-number":["202104a06020012"]}]},{"name":"the Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment","award":["2022AH040125"],"award-info":[{"award-number":["2022AH040125"]}]},{"name":"the Natural Science Foundation of Anhui Province","award":["APKLSATE2021x004"],"award-info":[{"award-number":["APKLSATE2021x004"]}]},{"name":"the Natural Science Foundation of Anhui Province","award":["2008085MF203"],"award-info":[{"award-number":["2008085MF203"]}]},{"name":"the Natural Science Foundation of Anhui Province","award":["201904a06020056"],"award-info":[{"award-number":["201904a06020056"]}]},{"name":"the Natural Science Foundation of Anhui Province","award":["202104a06020012"],"award-info":[{"award-number":["202104a06020012"]}]},{"name":"the Natural Science Foundation of Anhui Province","award":["2022AH040125"],"award-info":[{"award-number":["2022AH040125"]}]},{"name":"the Key Research and Development Plan of Anhui Province","award":["APKLSATE2021x004"],"award-info":[{"award-number":["APKLSATE2021x004"]}]},{"name":"the Key Research and Development Plan of Anhui Province","award":["2008085MF203"],"award-info":[{"award-number":["2008085MF203"]}]},{"name":"the Key Research and Development Plan of Anhui Province","award":["201904a06020056"],"award-info":[{"award-number":["201904a06020056"]}]},{"name":"the Key Research and Development Plan of Anhui Province","award":["202104a06020012"],"award-info":[{"award-number":["202104a06020012"]}]},{"name":"the Key Research and Development Plan of Anhui Province","award":["2022AH040125"],"award-info":[{"award-number":["2022AH040125"]}]},{"name":"the Natural Science Major Project for Anhui Provincial University","award":["APKLSATE2021x004"],"award-info":[{"award-number":["APKLSATE2021x004"]}]},{"name":"the Natural Science Major Project for Anhui Provincial University","award":["2008085MF203"],"award-info":[{"award-number":["2008085MF203"]}]},{"name":"the Natural Science Major Project for Anhui Provincial University","award":["201904a06020056"],"award-info":[{"award-number":["201904a06020056"]}]},{"name":"the Natural Science Major Project for Anhui Provincial University","award":["202104a06020012"],"award-info":[{"award-number":["202104a06020012"]}]},{"name":"the Natural Science Major Project for Anhui Provincial University","award":["2022AH040125"],"award-info":[{"award-number":["2022AH040125"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Precise pear detection and recognition is an essential step toward modernizing orchard management. However, due to the ubiquitous occlusion in orchards and various locations of image acquisition, the pears in the acquired images may be quite small and occluded, causing high false detection and object loss rate. In this paper, a multi-scale collaborative perception network YOLOv5s-FP (Fusion and Perception) was proposed for pear detection, which coupled local and global features. Specifically, a pear dataset with a high proportion of small and occluded pears was proposed, comprising 3680 images acquired with cameras mounted on a ground tripod and a UAV platform. The cross-stage partial (CSP) module was optimized to extract global features through a transformer encoder, which was then fused with local features by an attentional feature fusion mechanism. Subsequently, a modified path aggregation network oriented to collaboration perception of multi-scale features was proposed by incorporating a transformer encoder, the optimized CSP, and new skip connections. The quantitative results of utilizing the YOLOv5s-FP for pear detection were compared with other typical object detection networks of the YOLO series, recording the highest average precision of 96.12% with less detection time and computational cost. In qualitative experiments, the proposed network achieved superior visual performance with stronger robustness to the changes in occlusion and illumination conditions, particularly providing the ability to detect pears with different sizes in highly dense, overlapping environments and non-normal illumination areas. Therefore, the proposed YOLOv5s-FP network was practicable for detecting in-field pears in a real-time and accurate way, which could be an advantageous component of the technology for monitoring pear growth status and implementing automated harvesting in unmanned orchards.<\/jats:p>","DOI":"10.3390\/s23010030","type":"journal-article","created":{"date-parts":[[2022,12,21]],"date-time":"2022-12-21T02:31:33Z","timestamp":1671589893000},"page":"30","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":21,"title":["YOLOv5s-FP: A Novel Method for In-Field Pear Detection Using a Transformer Encoder and Multi-Scale Collaboration Perception"],"prefix":"10.3390","volume":"23","author":[{"given":"Yipu","family":"Li","sequence":"first","affiliation":[{"name":"College of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China"},{"name":"Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4386-2490","authenticated-orcid":false,"given":"Yuan","family":"Rao","sequence":"additional","affiliation":[{"name":"College of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China"},{"name":"Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0827-2962","authenticated-orcid":false,"given":"Xiu","family":"Jin","sequence":"additional","affiliation":[{"name":"College of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China"},{"name":"Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhaohui","family":"Jiang","sequence":"additional","affiliation":[{"name":"College of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China"},{"name":"Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4282-9821","authenticated-orcid":false,"given":"Yuwei","family":"Wang","sequence":"additional","affiliation":[{"name":"Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tan","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China"},{"name":"Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fengyi","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qing","family":"Luo","sequence":"additional","affiliation":[{"name":"College of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8137-671X","authenticated-orcid":false,"given":"Lu","family":"Liu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei 230036, China"},{"name":"Anhui Provincial Key Laboratory of Smart Agricultural Technology and Equipment, Hefei 230036, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,20]]},"reference":[{"key":"ref_1","first-page":"1067","article-title":"Current Status, Trends, Main Problems and the Suggestions on Development of Pear Industry in China","volume":"36","author":"Zhang","year":"2019","journal-title":"J. Fruit Sci."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"106092","DOI":"10.1016\/j.compag.2021.106092","article-title":"Estimation of Corn Yield Based on Hyperspectral Imagery and Convolutional Neural Network","volume":"184","author":"Yang","year":"2021","journal-title":"Comput. Electron. Agric."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Li, G., Chen, L., Zhao, S., and Liu, Y. (2022, January 8\u201310). Efficient Aircraft Object Detection Based on Faster R-CNN in Remote Sensing Images. Proceedings of the Second International Conference on Digital Signal and Computer Communications (DSCC 2022), SPIE, Changchun, China.","DOI":"10.1117\/12.2641804"},{"key":"ref_4","first-page":"1","article-title":"A Moveable Harvest: An Investigation into Using Mobile Platforms for Picking Apples","volume":"17","author":"Sinnetta","year":"2020","journal-title":"Aust. Farm Bus. Manag. J."},{"key":"ref_5","first-page":"7","article-title":"Labour Trends and Mechanisation: The Productivity Sum","volume":"5","author":"Lombard","year":"2019","journal-title":"FarmBiz"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Yan, Q., Yang, B., Wang, W., Wang, B., Chen, P., and Zhang, J. (2020). Apple Leaf Diseases Recognition Based on an Improved Convolutional Neural Network. Sensors, 20.","DOI":"10.3390\/s20123535"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K.Q. (2017, January 21\u201326). Densely Connected Convolutional Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"81943","DOI":"10.1109\/ACCESS.2020.2991552","article-title":"Agricultural Pest Super-Resolution and Identification with Attention Enhanced Residual and Dense Fusion Generative and Adversarial Network","volume":"8","author":"Dai","year":"2020","journal-title":"IEEE Access"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zhang, W., Ma, H., Li, X., Liu, X., Jiao, J., Zhang, P., Gu, L., Wang, Q., Bao, W., and Cao, S. (2021). Imperfect Wheat Grain Recognition Combined with an Attention Mechanism and Residual Network. Appl. Sci., 11.","DOI":"10.3390\/app11115139"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"105742","DOI":"10.1016\/j.compag.2020.105742","article-title":"Using Channel Pruning-Based YOLO v4 Deep Learning Algorithm for the Real-Time and Accurate Detection of Apple Flowers in Natural Environments","volume":"178","author":"Wu","year":"2020","journal-title":"Comput. Electron. Agric."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"106054","DOI":"10.1016\/j.compag.2021.106054","article-title":"A High-Precision Detection Method of Hydroponic Lettuce Seedlings Status Based on Improved Faster RCNN","volume":"182","author":"Li","year":"2021","journal-title":"Comput. Electron. Agric."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/j.biosystemseng.2022.02.011","article-title":"A Segmentation Algorithm Incorporating Superpixel Block and Holistically Nested Edge for Sugarcane Aphids Images under Natural Light Conditions","volume":"216","author":"Xu","year":"2022","journal-title":"Biosyst. Eng."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1016\/j.biosystemseng.2021.11.011","article-title":"A Real-Time Table Grape Detection Method Based on Improved YOLOv4-Tiny Network in Complex Background","volume":"212","author":"Li","year":"2021","journal-title":"Biosyst. Eng."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Xia, H., Yang, B., Li, Y., and Wang, B. (2022). An Improved CenterNet Model for Insulator Defect Detection Using Aerial Imagery. Sensors, 22.","DOI":"10.3390\/s22082850"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A.C. (2016). Ssd: Single Shot Multibox Detector. Proceedings of the European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_16","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). Fcos: Fully Convolutional One-Stage Object Detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_17","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An Incremental Improvement. arXiv."},{"key":"ref_18","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020). Yolov4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Bochkovskiy, A., and Liao, H.-Y.M. (2022). YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors. arXiv.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"106503","DOI":"10.1016\/j.compag.2021.106503","article-title":"Fast and Accurate Green Pepper Detection in Complex Backgrounds via an Improved Yolov4-Tiny Model","volume":"191","author":"Li","year":"2021","journal-title":"Comput. Electron. Agric."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"106696","DOI":"10.1016\/j.compag.2022.106696","article-title":"Canopy-Attention-YOLOv4-Based Immature\/Mature Apple Fruit Detection on Dense-Foliage Tree Architectures for Early Crop Load Estimation","volume":"193","author":"Lu","year":"2022","journal-title":"Comput. Electron. Agric."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"754","DOI":"10.1007\/s11119-020-09754-y","article-title":"Fast and Accurate Detection of Kiwifruit in Orchard Using Improved YOLOv3-Tiny Model","volume":"22","author":"Fu","year":"2021","journal-title":"Precis. Agric."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/j.biosystemseng.2019.08.017","article-title":"Fruit Detection in an Apple Orchard Using a Mobile Terrestrial Laser Scanner","volume":"187","author":"Gregorio","year":"2019","journal-title":"Biosyst. Eng."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.compag.2016.09.014","article-title":"Mapping Almond Orchard Canopy Volume, Flowers, Fruit and Yield Using Lidar and Vision Sensors","volume":"130","author":"Underwood","year":"2016","journal-title":"Comput. Electron. Agric."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"3173","DOI":"10.1109\/TGRS.2018.2794326","article-title":"Hyperspectral Image Classification with Deep Feature Fusion Network","volume":"56","author":"Song","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention Is All You Need. Proceedings of the Advances in neural information processing systems, Long Beach, CA, USA."},{"key":"ref_27","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Wang, W., Xie, E., Li, X., Fan, D.-P., Song, K., Liang, D., Lu, T., Luo, P., and Shao, L. (2021, January 11\u201317). Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Virtual Event.","DOI":"10.1109\/ICCV48922.2021.00061"},{"key":"ref_29","unstructured":"Yang, Y., Jiao, L., Liu, X., Liu, F., Yang, S., Feng, Z., and Tang, X. (2022). Transformers Meet Visual Learning Understanding: A Comprehensive Review. arXiv."},{"key":"ref_30","unstructured":"D\u2019Ascoli, S., Touvron, H., Leavitt, M.L., Morcos, A.S., Biroli, G., and Sagun, L. (2021, January 18\u221224). Convit: Improving Vision Transformers with Soft Convolutional Inductive Biases. Proceedings of the International Conference on Machine Learning, PMLR, Virtual Event."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Srinivas, A., Lin, T.-Y., Parmar, N., Shlens, J., Abbeel, P., and Vaswani, A. (2021, January 11\u201317). Bottleneck Transformers for Visual Recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Virtual Event.","DOI":"10.1109\/CVPR46437.2021.01625"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_33","unstructured":"Tan, M., and Le, Q. (2019, January 9\u201315). Efficientnet: Rethinking Model Scaling for Convolutional Neural Networks. Proceedings of the International Conference on Machine Learning, PMLR, Long Bench, CA, USA."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Zhang, Z., Lu, X., Cao, G., Yang, Y., Jiao, L., and Liu, F. (2021, January 11\u201317). ViT-YOLO: Transformer-Based YOLO for Object Detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Virtual Event.","DOI":"10.1109\/ICCVW54120.2021.00314"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Rublee, E., Rabaud, V., Konolige, K., and Bradski, G. (2011, January 6\u201313). ORB: An Efficient Alternative to SIFT or SURF. Proceedings of the 2011 International Conference on Computer Vision, IEEE, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"ref_36","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., and Ren, D. (2020, January 7\u201312). Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. Proceedings of the AAAI Conference on Artificial Intelligence, New York, New York, USA.","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhang, Y.-F., Ren, W., Zhang, Z., Jia, Z., Wang, L., and Tan, T. (2021). Focal and Efficient IOU Loss for Accurate Bounding Box Regression. arXiv.","DOI":"10.1016\/j.neucom.2022.07.042"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Chen, Z., Xie, L., Niu, J., Liu, X., Wei, L., and Tian, Q. (2021). Visformer: The Vision-Friendly Transformer. arXiv.","DOI":"10.1109\/ICCV48922.2021.00063"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Dai, Y., Gieseke, F., Oehmcke, S., Wu, Y., and Barnard, K. (2021, January 3\u20138). Attentional Feature Fusion. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV48630.2021.00360"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Silva, L.A., Sanchez San Blas, H., Peral Garc\u00eda, D., Sales Mendes, A., and Villarubia Gonz\u00e1lez, G. (2020). An Architectural Multi-Agent System for a Pavement Monitoring System with Pothole Recognition in UAV Images. Sensors, 20.","DOI":"10.3390\/s20216205"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/1\/30\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:44:52Z","timestamp":1760147092000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/1\/30"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,20]]},"references-count":41,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["s23010030"],"URL":"https:\/\/doi.org\/10.3390\/s23010030","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,20]]}}}