{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T02:48:36Z","timestamp":1780627716567,"version":"3.54.1"},"reference-count":31,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,12,21]],"date-time":"2022-12-21T00:00:00Z","timestamp":1671580800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"(NRF-2021R1A2C2006895) Project"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Weed control is among the most challenging issues for crop cultivation and turf grass management. In addition to hosting various insects and plant pathogens, weeds compete with crop for nutrients, water and sunlight. This results in problems such as the loss of crop yield, the contamination of food crops and disruption in the field aesthetics and practicality. Therefore, effective and efficient weed detection and mapping methods are indispensable. Deep learning (DL) techniques for the rapid recognition and localization of objects from images or videos have shown promising results in various areas of interest, including the agricultural sector. Attention-based Transformer models are a promising alternative to traditional constitutional neural networks (CNNs) and offer state-of-the-art results for multiple tasks in the natural language processing (NLP) domain. To this end, we exploited these models to address the aforementioned weed detection problem with potential applications in automated robots. Our weed dataset comprised of 1006 images for 10 weed classes, which allowed us to develop deep learning-based semantic segmentation models for the localization of these weed classes. The dataset was further augmented to cater for the need of a large sample set of the Transformer models. A study was conducted to evaluate the results of three types of Transformer architectures, which included Swin Transformer, SegFormer and Segmenter, on the dataset, with SegFormer achieving final Mean Accuracy (mAcc) and Mean Intersection of Union (mIoU) of 75.18% and 65.74%, while also being the least computationally expensive, with just 3.7 M parameters.<\/jats:p>","DOI":"10.3390\/s23010065","type":"journal-article","created":{"date-parts":[[2022,12,22]],"date-time":"2022-12-22T02:31:11Z","timestamp":1671676271000},"page":"65","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":42,"title":["Transformer-Based Weed Segmentation for Grass Management"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1610-1442","authenticated-orcid":false,"given":"Kan","family":"Jiang","sequence":"first","affiliation":[{"name":"Artificial Intelligence Lab, Department of Computer Science and Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0238-0505","authenticated-orcid":false,"given":"Usman","family":"Afzaal","sequence":"additional","affiliation":[{"name":"Division of Computer Science and Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Joonwhoan","family":"Lee","sequence":"additional","affiliation":[{"name":"Artificial Intelligence Lab, Department of Computer Science and Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"107148","DOI":"10.1016\/j.comnet.2020.107148","article-title":"A compilation of UAV applications for precision agriculture","volume":"172","author":"Sarigiannidis","year":"2020","journal-title":"Comput. Netw."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"184","DOI":"10.1016\/j.compag.2016.04.024","article-title":"A survey of image processing techniques for plant extraction and segmentation in the field","volume":"125","author":"Hamuda","year":"2016","journal-title":"Comput. Electron. Agric."},{"key":"ref_3","unstructured":"Sakyi, L.L.S. (2019, January 01). 2019. Available online: https:\/\/greenrootltd.com\/2019\/02\/19\/five-general-categories-of-weed-control-methods\/."},{"key":"ref_4","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017). Attention is all you need. Adv. Neural Inf. Process. Syst., 30."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"105750","DOI":"10.1016\/j.compag.2020.105750","article-title":"A DNN-based semantic segmentation for detecting weed and crop","volume":"178","author":"You","year":"2020","journal-title":"Comput. Electron. Agric."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11\u201317). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_7","first-page":"12077","article-title":"SegFormer: Simple and efficient design for semantic segmentation with transformers","volume":"34","author":"Xie","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Strudel, R., Garcia, R., Laptev, I., and Schmid, C. (2021, January 11\u201317). Segmenter: Transformer for semantic segmentation. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00717"},{"key":"ref_9","unstructured":"Bello, I., Zoph, B., Vaswani, A., Shlens, J., and Le, Q.V. (November, January 27). Attention augmented convolutional networks. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_10","unstructured":"Ramachandran, P., Parmar, N., Vaswani, A., Bello, I., Levskaya, A., and Shlens, J. (2019). Stand-alone self-attention in vision models. Adv. Neural Inf. Process. Syst., 32."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhao, H., Jia, J., and Koltun, V. (2020, January 13\u201319). Exploring self-attention for image recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01009"},{"key":"ref_12","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An image is worth 16x16 words: Transformers for image recognition at scale. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Vaswani, A., Ramachandran, P., Srinivas, A., Parmar, N., Hechtman, B., and Shlens, J. (2021, January 20\u201325). Scaling local self-attention for parameter efficient visual backbones. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01270"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wang, H., Zhu, Y., Green, B., Adam, H., Yuille, A., and Chen, L.-C. (2020, January 23\u201328). Axial-deeplab: Stand-alone axial-attention for panoptic segmentation. Proceedings of the European Conference on Computer Vision, Glasgow, UK.","DOI":"10.1007\/978-3-030-58548-8_7"},{"key":"ref_15","unstructured":"Ho, J., Kalchbrenner, N., Weissenborn, D., and Salimans, T. (2019). Axial attention in multidimensional transformers. arXiv."},{"key":"ref_16","unstructured":"Child, R., Gray, S., Radford, A., and Sutskever, I. (2019). Generating long sequences with sparse transformers. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"106067","DOI":"10.1016\/j.compag.2021.106067","article-title":"A survey of deep learning techniques for weed detection from images","volume":"184","author":"Hasan","year":"2021","journal-title":"Comput. Electron. Agric."},{"key":"ref_18","first-page":"1","article-title":"Image processing for plant determination using the Hough transform and clustering methods","volume":"67","author":"Hemming","year":"2002","journal-title":"Gartenbauwissenschaft"},{"key":"ref_19","first-page":"1761","article-title":"Machine vision identification of tomato seedlings for automated weed control","volume":"40","author":"Tian","year":"2000","journal-title":"Trans. ASAE"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"105306","DOI":"10.1016\/j.compag.2020.105306","article-title":"Towards weeds identification assistance through transfer learning","volume":"171","author":"Mylonas","year":"2020","journal-title":"Comput. Electron. Agric."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"10940","DOI":"10.1109\/ACCESS.2021.3050296","article-title":"Weed identification using deep learning and image processing in vegetable plantation","volume":"9","author":"Jin","year":"2021","journal-title":"IEEE Access"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"81724","DOI":"10.1109\/ACCESS.2020.2991354","article-title":"Semantic segmentation of crop and weed using an encoder-decoder network and image enhancement method under uncontrolled outdoor illumination","volume":"8","author":"Wang","year":"2020","journal-title":"IEEE Access"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Reedha, R., Dericquebourg, E., Canals, R., and Hafiane, A. (2022). Transformer Neural Network for Weed and Crop Classification of High Resolution UAV Images. Remote Sens., 14.","DOI":"10.3390\/rs14030592"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"012068","DOI":"10.1088\/1742-6596\/2002\/1\/012068","article-title":"Image Classification for Soybean and Weeds Based on ViT","volume":"2002","author":"Liang","year":"2021","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Thakur, P.S., Khanna, P., Sheorey, T., and Ojha, A. (2022). Explainable vision transformer enabled convolutional neural network for plant disease identification: PlantXViT. arXiv.","DOI":"10.1016\/j.ecoinf.2023.102245"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Zhu, W., Sun, J., Wang, S., Shen, J., Yang, K., and Zhou, X. (2022). Identifying Field Crop Diseases Using Transformer-Embedded Convolutional Neural Network. Agriculture, 12.","DOI":"10.3390\/agriculture12081083"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Shen, Y., Wang, L., and Jin, Y. (2022, January 19\u201320). AAFormer: A Multi-Modal Transformer Network for Aerial Agricultural Images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPRW56347.2022.00177"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Li, X., and Li, S. (2022). Transformer Help CNN See Better: A Lightweight Hybrid Apple Disease Identification Model Based on Transformers. Agriculture, 12.","DOI":"10.3390\/agriculture12060884"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"107258","DOI":"10.1016\/j.compag.2022.107258","article-title":"Transformer helps identify kiwifruit diseases in complex natural environments","volume":"200","author":"Li","year":"2022","journal-title":"Comput. Electron. Agric."},{"key":"ref_30","unstructured":"Loshchilov, I., and Hutter, F. (2019, January 6\u20139). Decoupled Weight Decay Regularization. Proceedings of the 7th International Conference on Learning Representations, ICLR, New Orleans, LA, USA."},{"key":"ref_31","unstructured":"Contributors, M. (2020, January 01). MMSegmentation. Available online: https:\/\/github.com\/open-mmlab\/mmsegmentation."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/1\/65\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:45:35Z","timestamp":1760147135000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/1\/65"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,21]]},"references-count":31,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["s23010065"],"URL":"https:\/\/doi.org\/10.3390\/s23010065","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,21]]}}}