{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T03:11:35Z","timestamp":1774840295698,"version":"3.50.1"},"reference-count":29,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2020,12,30]],"date-time":"2020-12-30T00:00:00Z","timestamp":1609286400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100018927","name":"Southwest Minzu University","doi-asserted-by":"publisher","award":["No. CX2019SZ16"],"award-info":[{"award-number":["No. CX2019SZ16"]}],"id":[{"id":"10.13039\/501100018927","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Wheat head detection can estimate various wheat traits, such as density, health, and the presence of wheat head. However, traditional detection methods have a huge array of problems, including low efficiency, strong subjectivity, and poor accuracy. In this paper, a method of wheat-head detection based on a deep neural network is proposed to enhance the speed and accuracy of detection. The YOLOv4 is taken as the basic network. The backbone part in the basic network is enhanced by adding dual spatial pyramid pooling (SPP) networks to improve the ability of feature learning and increase the receptive field of the convolutional network. Multilevel features are obtained by a multipath neck part using a top-down to bottom-up strategy. Finally, YOLOv3\u2032s head structures are used to predict the boxes of wheat heads. For training images, some data augmentation technologies are used. The experimental results demonstrate that the proposed method has a significant advantage in accuracy and speed. The mean average precision of our method is 94.5%, and the detection speed is 71 FPS that can achieve the effect of real-time detection.<\/jats:p>","DOI":"10.3390\/s21010191","type":"journal-article","created":{"date-parts":[[2020,12,30]],"date-time":"2020-12-30T20:13:41Z","timestamp":1609359221000},"page":"191","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":70,"title":["Real-Time Detection for Wheat Head Applying Deep Neural Network"],"prefix":"10.3390","volume":"21","author":[{"given":"Bo","family":"Gong","sequence":"first","affiliation":[{"name":"Key Laboratory of Electronic and Information Engineering (Southwest Minzu University), State Ethnic Affairs Commission, Chengdu 610041, China"}]},{"given":"Daji","family":"Ergu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Electronic and Information Engineering (Southwest Minzu University), State Ethnic Affairs Commission, Chengdu 610041, China"}]},{"given":"Ying","family":"Cai","sequence":"additional","affiliation":[{"name":"Key Laboratory of Electronic and Information Engineering (Southwest Minzu University), State Ethnic Affairs Commission, Chengdu 610041, China"}]},{"given":"Bo","family":"Ma","sequence":"additional","affiliation":[{"name":"Key Laboratory of Electronic and Information Engineering (Southwest Minzu University), State Ethnic Affairs Commission, Chengdu 610041, China"}]}],"member":"1968","published-online":{"date-parts":[[2020,12,30]]},"reference":[{"key":"ref_1","unstructured":"(2020, December 29). FAOSTAT. Available online: http:\/\/www.fao.org\/faostat\/zh\/#data\/QC."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1016\/j.agrformet.2018.10.013","article-title":"Ear density estimation from high resolution RGB imagery using deep learning technique","volume":"264","author":"Madec","year":"2019","journal-title":"Agric. For. Meteorol."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.biosystemseng.2015.12.015","article-title":"In-field automatic observation of wheat heading stage using computer vision","volume":"143","author":"Zhu","year":"2016","journal-title":"Biosyst. Eng."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1016\/j.cropro.2010.12.015","article-title":"Detection of head blight (Fusarium ssp.) in winter wheat by color and multispectral image analyses","volume":"30","author":"Dammer","year":"2011","journal-title":"Crop Prot."},{"key":"ref_5","first-page":"212","article-title":"Non-destructive measurement of wheat spike characteristics based on morphological image processing","volume":"26","author":"Bi","year":"2010","journal-title":"Trans. Chin. Soc. Agric. Eng."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"Lecun","year":"2015","journal-title":"Nature"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Zhang, Z., Flores, P., Igathinathane, C., Naik, D.L., Kiran, R., and Ransom, J.K. (2020). Wheat Lodging Detection from UAS Imagery Using Machine Learning Algorithms. Remote Sens., 12.","DOI":"10.3390\/rs12111838"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Pound, M.P., Atkinson, J.A., Wells, D.M., Pridmore, T.P., and French, A.P. (2017, January 22\u201329). Deep learning for multi-task plant phenotyping. Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCVW), Venice, Italy.","DOI":"10.1109\/ICCVW.2017.241"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Buzzy, M., Thesma, V., Davoodi, M., and Velni, J.M. (2020). Real-time plant leaf counting using deep object detection networks. Sensors, 20.","DOI":"10.3390\/s20236896"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Luo, Z., Yu, H., and Zhang, Y. (2020). Pine cone detection using boundary equilibrium generative adversarial networks and improved YOLOv3 model. Sensors, 20.","DOI":"10.3390\/s20164430"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13007-018-0366-8","article-title":"Detection and analysis of wheat spikes using Convolutional Neural Networks","volume":"14","author":"Hasan","year":"2018","journal-title":"Plant Methods"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13007-020-00582-9","article-title":"SpikeSegNet-a deep learning approach utilizing encoder-decoder network with hourglass for spike segmentation and counting in wheat plant from visual imaging","volume":"16","author":"Misra","year":"2020","journal-title":"Plant Methods"},{"key":"ref_13","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"David, E., Madec, S., Sadeghi-Tehran, P., Aasen, H., Zheng, B., Liu, S., Kirchgessner, N., Ishikawa, G., Nagasawa, K., and Badhon, M.A. (2020). Global Wheat Head Detection (GWHD) dataset: A large and diverse dataset of high resolution RGB labelled images to develop and benchmark wheat head detection methods. arXiv.","DOI":"10.34133\/2020\/3521852"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 24\u201327). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (2019, January 27\u201328). FCOS: Fully Convolutional One-Stage Object Detection. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00972"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective search for object recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_22","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv, Available online: http:\/\/arxiv.org\/abs\/1804.02767."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Lin, T., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Wang, C., Liao, H.M., Wu, Y., Chen, P., Hsieh, J., and Yeh, I. (2020, January 14\u201319). CSPNet: A New Backbone that can Enhance Learning Capability of CNN. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201322). Path Aggregation Network for Instance Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2018, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., and Savarese, S. (2019, January 16\u201320). Generalized intersection over union: A metric and a loss for bounding box regression. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2019, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"ref_29","unstructured":"Brooks, J. (2020, December 29). COCO Annotator 2019. Available online: https:\/\/github.com\/jsbroks\/coco-annotator."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/1\/191\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:47:52Z","timestamp":1760179672000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/1\/191"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,30]]},"references-count":29,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,1]]}},"alternative-id":["s21010191"],"URL":"https:\/\/doi.org\/10.3390\/s21010191","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12,30]]}}}