{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T05:23:36Z","timestamp":1780637016684,"version":"3.54.1"},"reference-count":42,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2023,3,22]],"date-time":"2023-03-22T00:00:00Z","timestamp":1679443200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Key Science and Technology Program of Henan Province","award":["212102110234"],"award-info":[{"award-number":["212102110234"]}]},{"name":"Key Science and Technology Program of Henan Province","award":["222102320080"],"award-info":[{"award-number":["222102320080"]}]},{"name":"Key Science and Technology Program of Henan Province","award":["232102111124"],"award-info":[{"award-number":["232102111124"]}]},{"name":"Key Science and Technology Program of Henan Province","award":["22A210013"],"award-info":[{"award-number":["22A210013"]}]},{"name":"Key Science and Technology Program of Henan Province","award":["21ZD003"],"award-info":[{"award-number":["21ZD003"]}]},{"name":"Colleges and Universities Key Research Project of Henan Province","award":["212102110234"],"award-info":[{"award-number":["212102110234"]}]},{"name":"Colleges and Universities Key Research Project of Henan Province","award":["222102320080"],"award-info":[{"award-number":["222102320080"]}]},{"name":"Colleges and Universities Key Research Project of Henan Province","award":["232102111124"],"award-info":[{"award-number":["232102111124"]}]},{"name":"Colleges and Universities Key Research Project of Henan Province","award":["22A210013"],"award-info":[{"award-number":["22A210013"]}]},{"name":"Colleges and Universities Key Research Project of Henan Province","award":["21ZD003"],"award-info":[{"award-number":["21ZD003"]}]},{"name":"Major Science and Technology Projects in Xinxiang City, Henan Province","award":["212102110234"],"award-info":[{"award-number":["212102110234"]}]},{"name":"Major Science and Technology Projects in Xinxiang City, Henan Province","award":["222102320080"],"award-info":[{"award-number":["222102320080"]}]},{"name":"Major Science and Technology Projects in Xinxiang City, Henan Province","award":["232102111124"],"award-info":[{"award-number":["232102111124"]}]},{"name":"Major Science and Technology Projects in Xinxiang City, Henan Province","award":["22A210013"],"award-info":[{"award-number":["22A210013"]}]},{"name":"Major Science and Technology Projects in Xinxiang City, Henan Province","award":["21ZD003"],"award-info":[{"award-number":["21ZD003"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Due to their rapid development and wide application in modern agriculture, robots, mobile terminals, and intelligent devices have become vital technologies and fundamental research topics for the development of intelligent and precision agriculture. Accurate and efficient target detection technology is required for mobile inspection terminals, picking robots, and intelligent sorting equipment in tomato production and management in plant factories. However, due to the limitations of computer power, storage capacity, and the complexity of the plant factory (PF) environment, the precision of small-target detection for tomatoes in real-world applications is inadequate. Therefore, we propose an improved Small MobileNet YOLOv5 (SM-YOLOv5) detection algorithm and model based on YOLOv5 for target detection by tomato-picking robots in plant factories. Firstly, MobileNetV3-Large was used as the backbone network to make the model structure lightweight and improve its running performance. Secondly, a small-target detection layer was added to improve the accuracy of small-target detection for tomatoes. The constructed PF tomato dataset was used for training. Compared with the YOLOv5 baseline model, the mAP of the improved SM-YOLOv5 model was increased by 1.4%, reaching 98.8%. The model size was only 6.33 MB, which was 42.48% that of YOLOv5, and it required only 7.6 GFLOPs, which was half that required by YOLOv5. The experiment showed that the improved SM-YOLOv5 model had a precision of 97.8% and a recall rate of 96.7%. The model is lightweight and has excellent detection performance, and so it can meet the real-time detection requirements of tomato-picking robots in plant factories.<\/jats:p>","DOI":"10.3390\/s23063336","type":"journal-article","created":{"date-parts":[[2023,3,22]],"date-time":"2023-03-22T06:35:28Z","timestamp":1679466928000},"page":"3336","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":55,"title":["Lightweight SM-YOLOv5 Tomato Fruit Detection Algorithm for Plant Factory"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6293-5624","authenticated-orcid":false,"given":"Xinfa","family":"Wang","sequence":"first","affiliation":[{"name":"School of Information Engineering, Henan Institute of Science and Technology, Xinxiang 453003, China"},{"name":"Faculty of Engineering and Technology, Sumy National Agrarian University, 40000 Sumy, Ukraine"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3635-7576","authenticated-orcid":false,"given":"Zhenwei","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Henan Institute of Science and Technology, Xinxiang 453003, China"},{"name":"College of Mechanical and Electrical Engineering, Xinxiang University, Xinxiang 453003, China"},{"name":"Institute of Farmland Irrigation, Chinese Academy of Agricultural Sciences, Xinxiang 453002, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Meng","family":"Jia","sequence":"additional","affiliation":[{"name":"College of Mechanical and Electrical Engineering, Xinxiang University, Xinxiang 453003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8821-4550","authenticated-orcid":false,"given":"Tao","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Henan Institute of Science and Technology, Xinxiang 453003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Canlin","family":"Pan","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Henan Institute of Science and Technology, Xinxiang 453003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xuebin","family":"Qi","sequence":"additional","affiliation":[{"name":"Institute of Farmland Irrigation, Chinese Academy of Agricultural Sciences, Xinxiang 453002, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3163-2110","authenticated-orcid":false,"given":"Mingfu","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Henan Institute of Science and Technology, Xinxiang 453003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,3,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"2105009","DOI":"10.1002\/adma.202105009","article-title":"Novel Materials for Urban Farming","volume":"34","author":"Xi","year":"2022","journal-title":"Adv. Mater."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"110811","DOI":"10.1016\/j.foodres.2021.110811","article-title":"Consumer Attitudes to Vertical Farming (Indoor Plant Factory with Artificial Lighting) in China, Singapore, UK, and USA: A Multi-Method Study","volume":"150","author":"Ares","year":"2021","journal-title":"Food Res. Int."},{"key":"ref_3","unstructured":"Food and Agriculture Organisation (2023, January 04). Food and Agriculture Organisation of the United Nations (FAOSTAT). Available online: https:\/\/www.fao.org\/faostat\/en\/#data\/QCL\/."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"10491","DOI":"10.4249\/scholarpedia.10491","article-title":"Scale Invariant Feature Transform","volume":"7","author":"Lindeberg","year":"2012","journal-title":"Scholarpedia"},{"key":"ref_5","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of Oriented Gradients for Human Detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905), San Diego, CA, USA."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1109\/5254.708428","article-title":"Support Vector Machines","volume":"13","author":"Hearst","year":"1998","journal-title":"IEEE Intell. Syst. Their Appl."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective Search for Object Recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_8","unstructured":"Iwasaki, F., and Imamura, H. (2014, January 17\u201319). A Robust Recognition Method for Occlusion of Mini Tomatoes Based on Hue Information and Shape of Edge. Proceedings of the International Conference on Computer Graphics, Multimedia and Image Processing, Kuala Lumpur, Malaysia."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1016\/j.compag.2011.11.007","article-title":"Determination of the Number of Green Apples in RGB Images Recorded in Orchards","volume":"81","author":"Linker","year":"2012","journal-title":"Comput. Electron. Agric."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"5684","DOI":"10.1016\/j.ijleo.2014.07.001","article-title":"Automatic Method of Fruit Object Extraction under Complex Agricultural Background for Vision System of Fruit Picking Robot","volume":"125","author":"Wei","year":"2014","journal-title":"Optik"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"7","DOI":"10.4028\/www.scientific.net\/AMR.485.7","article-title":"An Effective Flame Segmentation Method Based on Ohta Color Space","volume":"485","author":"Wu","year":"2012","journal-title":"Adv. Mater. Res."},{"key":"ref_12","first-page":"328","article-title":"Green Ripe Tomato Detection Method Based on Machine Vision in Greenhouse","volume":"33","author":"Li","year":"2017","journal-title":"Trans. Chin. Soc. Agric. Eng."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"184","DOI":"10.1016\/j.biosystemseng.2019.04.024","article-title":"A Novel Image Processing Algorithm to Separate Linearly Clustered Kiwifruits","volume":"183","author":"Fu","year":"2019","journal-title":"Biosyst. Eng."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Girshick, R.B., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-Cnn. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_16","first-page":"1","article-title":"Faster R-Cnn: Towards Real-Time Object Detection with Region Proposal Networks","volume":"28","author":"Ren","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Leibe, B., Matas, J., Sebe, N., and Welling, M. (2016). Proceedings of the Computer Vision\u2014ECCV 2016, Springer International Publishing. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-319-46487-9"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_20","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). Yolov3: An Incremental Improvement. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA."},{"key":"ref_21","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). Yolov4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_22","unstructured":"Ge, Z., Liu, S., Wang, F., Li, Z., and Sun, J. (2021). Yolox: Exceeding Yolo Series in 2021. arXiv."},{"key":"ref_23","unstructured":"Wang, C.Y., Bochkovskiy, A., and Liao, H.Y.M. (2022). YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors. arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_25","unstructured":"Mnih, V., Heess, N., Graves, A., and Kavukcuoglu, K. (2014, January 8\u201313). Recurrent Models of Visual Attention. Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-Excitation Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional Block Attention Module. Proceedings of the 15th European Conference Computer Vision (ECCV 2018), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_28","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Zhang, C., Kang, F., and Wang, Y. (2022). An Improved Apple Object Detection Method Based on Lightweight YOLOv4 in Complex Backgrounds. Remote Sens., 14.","DOI":"10.3390\/rs14174150"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Xu, Z., Huang, X., Huang, Y., Sun, H., and Wan, F. (2022). A Real-Time Zanthoxylum Target Detection Method for an Intelligent Picking Robot under a Complex Background, Based on an Improved YOLOv5s Architecture. Sensors, 22.","DOI":"10.3390\/s22020682"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1016\/j.compag.2019.01.012","article-title":"Apple Detection during Different Growth Stages in Orchards Using the Improved YOLO-V3 Model","volume":"157","author":"Tian","year":"2019","journal-title":"Comput. Electron. Agric."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Su, F., Zhao, Y., Wang, G., Liu, P., Yan, Y., and Zu, L. (2022). Tomato Maturity Classification Based on SE-YOLOv3-MobileNetV1 Network under Nature Greenhouse Environment. Agronomy, 12.","DOI":"10.3390\/agronomy12071638"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"8686","DOI":"10.1038\/s41598-022-12732-1","article-title":"Online Recognition and Yield Estimation of Tomato in Plant Factory Based on YOLOv3","volume":"12","author":"Wang","year":"2022","journal-title":"Sci. Rep."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The Pascal Visual Object Classes (Voc) Challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Taha, A.A., and Hanbury, A. (2015). Metrics for Evaluating 3D Medical Image Segmentation: Analysis, Selection, and Tool. BMC Med. Imaging, 15.","DOI":"10.1186\/s12880-015-0068-x"},{"key":"ref_36","first-page":"012003","article-title":"Summary of Target Detection Algorithms","volume":"1757","author":"Li","year":"2021","journal-title":"Proceedings of the Journal of Physics: Conference Series"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Neubeck, A., and Van Gool, L. (2006, January 20\u201324). Efficient Non-Maximum Suppression. Proceedings of the 18th International Conference on Pattern Recognition (ICPR\u201906), Hong Kong, China.","DOI":"10.1109\/ICPR.2006.479"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft Coco: Common Objects in Context. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Howard, A., Sandler, M., Chu, G., Chen, L.C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., and Vasudevan, V. (2019, January 27\u201328). Searching for Mobilenetv3. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00140"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Ju, M., Luo, H., Wang, Z., Hui, B., and Chang, Z. (2019). The Application of Improved YOLO V3 in Multi-Scale Target Detection. Appl. Sci., 9.","DOI":"10.3390\/app9183775"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Zhong, Y., Wang, J., Peng, J., and Zhang, L. (2020, January 1\u20135). Anchor Box Optimization for Object Detection. Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (WACV), Snowmass Village, CO, USA.","DOI":"10.1109\/WACV45572.2020.9093498"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1872","DOI":"10.1007\/s11431-020-1647-3","article-title":"Pre-Trained Models for Natural Language Processing: A Survey","volume":"63","author":"Qiu","year":"2020","journal-title":"Sci. China Technol. Sci."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/6\/3336\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:00:25Z","timestamp":1760122825000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/6\/3336"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,22]]},"references-count":42,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2023,3]]}},"alternative-id":["s23063336"],"URL":"https:\/\/doi.org\/10.3390\/s23063336","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,22]]}}}