{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T17:31:25Z","timestamp":1773768685363,"version":"3.50.1"},"reference-count":39,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2023,7,17]],"date-time":"2023-07-17T00:00:00Z","timestamp":1689552000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Information Management Center of the Council of Agriculture, Executive Yuan, R.O.C.","award":["110AS-9.1.1-I-i2."],"award-info":[{"award-number":["110AS-9.1.1-I-i2."]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Detecting and mapping paddy fields in Taiwan\u2019s agriculture is crucial for managing agricultural production, predicting yields, and assessing damages. Although researchers at the Taiwan Agricultural Research Institute currently use site surveys to identify rice planting areas, this method is time-consuming. This study aimed to determine the optimal band combinations and vegetation index for accurately detecting paddy fields during various phenological stages. Additionally, the Mask RCNN instance segmentation model in the ArcGIS Pro software was employed to enhance the effectiveness of detecting and segmenting paddy fields in aerial images. This study utilized aerial images collected from 2018 to 2019 covering Changhua, Yunlin, Chiayi, and Tainan in central and southern Taiwan, with a label file comprising four categories of rice growing, ripening, harvested stage, and other crops. To create different image datasets, the image pre-processing stage involved modifying band information using different vegetation indices, including NDVI, CMFI, DVI, RVI, and GRVI. The resolution of the training image chips was cropped to 550 by 550 pixels. After the model training process, the study found that the ResNet-50 backbone performed better than the ResNet-101, and the RGB + DVI image dataset achieved the highest mean average precision of 74.01%. In addition, the model trained on the RGB + CMFI image dataset was recommended for detecting paddy fields in the rice growing stage, RGB + NIR for the rice ripening stage, and RGB + GRVI for the rice harvested stage. These models exhibit Dice coefficients of 79.59%, 89.71%, and 87.94%, respectively. The detection and segmentation results can improve the efficiency of rice production management by using different band combinations according to different rice phenological stages. Furthermore, this method can be applied to large-scale detection of other crops, improving land use survey efficiency and reducing the burden on researchers.<\/jats:p>","DOI":"10.3390\/rs15143575","type":"journal-article","created":{"date-parts":[[2023,7,18]],"date-time":"2023-07-18T01:35:16Z","timestamp":1689644116000},"page":"3575","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Deep Learning Approach for Paddy Field Detection Using Labeled Aerial Images: The Case of Detecting and Staging Paddy Fields in Central and Southern Taiwan"],"prefix":"10.3390","volume":"15","author":[{"given":"Yi-Shin","family":"Chou","sequence":"first","affiliation":[{"name":"Department of Biomechatronics Engineering, National Taiwan University, Taipei 106, Taiwan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5737-6960","authenticated-orcid":false,"given":"Cheng-Ying","family":"Chou","sequence":"additional","affiliation":[{"name":"Department of Biomechatronics Engineering, National Taiwan University, Taipei 106, Taiwan"}]}],"member":"1968","published-online":{"date-parts":[[2023,7,17]]},"reference":[{"key":"ref_1","unstructured":"Council of Agriculture, Executive Yuan, T (2022, November 05). Agricultural Statistics, Available online: https:\/\/agrstat.coa.gov.tw\/sdweb\/public\/indicator\/Indicator.aspx."},{"key":"ref_2","first-page":"7","article-title":"Rice growth and development","volume":"192","author":"Moldenhauer","year":"2001","journal-title":"Rice Prod. Handb."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"7589","DOI":"10.1109\/TGRS.2020.2981671","article-title":"Deep learning applications on multitemporal SAR (Sentinel-1) image classification using confined labeled data: The case of detecting rice paddy in South Korea","volume":"58","author":"Jo","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.isprsjprs.2019.04.015","article-title":"Deep learning in remote sensing applications: A meta-analysis and review","volume":"152","author":"Ma","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 13\u201323). DOTA: A large-scale dataset for object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_6","unstructured":"Waqas Zamir, S., Arora, A., Gupta, A., Khan, S., Sun, G., Shahbaz Khan, F., Zhu, F., Shao, L., Xia, G.S., and Bai, X. (2019, January 16\u201317). isaid: A large-scale dataset for instance segmentation in aerial images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Long Beach, CA, USA."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"2217","DOI":"10.1109\/JSTARS.2019.2918242","article-title":"Eurosat: A novel dataset and deep learning benchmark for land use and land cover classification","volume":"12","author":"Helber","year":"2019","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"3053","DOI":"10.3390\/rs12183053","article-title":"Object detection and image segmentation with deep learning on Earth observation data: Areview\u2014Part 2: Applications","volume":"12","author":"Hoeser","year":"2020","journal-title":"Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"368","DOI":"10.3390\/rs9040368","article-title":"Segment-before-detect: Vehicle detection and classification through semantic segmentation of aerial images","volume":"9","author":"Audebert","year":"2017","journal-title":"Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"Imagenet classification with deep convolutional neural networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_12","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1007\/s11220-020-00314-2","article-title":"Airport detection based on improved faster RCNN in large scale remote sensing images","volume":"21","author":"Yin","year":"2020","journal-title":"Sens. Imaging"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Khasawneh, N., Fraiwan, M., and Fraiwan, L. (2022). Detection of K-complexes in EEG waveform images using faster R-CNN and deep transfer learning. BMC Med. Inform. Decis. Mak., 22.","DOI":"10.1186\/s12911-022-02042-x"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1070","DOI":"10.3390\/rs13061070","article-title":"A novel framework based on mask R-CNN and Histogram thresholding for scalable segmentation of new and old rural buildings","volume":"13","author":"Li","year":"2021","journal-title":"Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Pai, M.M., Mehrotra, V., Aiyar, S., Verma, U., and Pai, R.M. (2019, January 3\u20135). Automatic segmentation of river and land in sar images: A deep learning approach. Proceedings of the 2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), Sardinia, Italy.","DOI":"10.1109\/AIKE.2019.00011"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"397","DOI":"10.3390\/s20020397","article-title":"Intelligent object recognition of urban water bodies based on deep learning for multi-source and multi-temporal high spatial resolution remote sensing imagery","volume":"20","author":"Song","year":"2020","journal-title":"Sensors"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"6687799","DOI":"10.1155\/2021\/6687799","article-title":"Depth Semantic Segmentation of Tobacco Planting Areas from Unmanned Aerial Vehicle Remote Sensing Images in Plateau Mountains","volume":"2021","author":"Huang","year":"2021","journal-title":"J. Spectrosc."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","volume":"40","author":"Chen","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhu, X., Liang, J., and Hauptmann, A. (2021, January 5\u20139). Msnet: A multilevel instance segmentation network for natural disaster damage assessment in aerial videos. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Virtual.","DOI":"10.1109\/WACV48630.2021.00207"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1016\/j.rse.2015.08.004","article-title":"Mapping rice paddy extent and intensification in the Vietnamese Mekong River Delta with dense time stacks of Landsat data","volume":"169","author":"Kontgis","year":"2015","journal-title":"Remote Sens. Environ."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1042","DOI":"10.1080\/01431161.2017.1395969","article-title":"Mapping paddy rice fields by applying machine learning algorithms to multi-temporal Sentinel-1A and Landsat data","volume":"39","author":"Onojeghuo","year":"2018","journal-title":"Int. J. Remote Sens."},{"key":"ref_26","first-page":"13","article-title":"The Evaluation of Image Classification Methods for Rice Paddy Interpretation","volume":"9","author":"Hsiao","year":"2004","journal-title":"J. Photogramm. Remote Sens."},{"key":"ref_27","first-page":"212","article-title":"The Eco-friendly Evaluation Model: The Paddy Rice Image Classification through SOM and Logistic Regression by Remote Sensing Data","volume":"7","author":"Chang","year":"2012","journal-title":"J. Soil Water Conserv. Technol."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/15481603.2014.1001666","article-title":"Sensitivity of vegetation indices to spatial degradation of RapidEye imagery for paddy rice detection: A case study of South Korea","volume":"52","author":"Kim","year":"2015","journal-title":"GIScience Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1080\/13658810802587709","article-title":"An enhanced supervised spatial decision support system of image classification: Consideration on the ancillary information of paddy rice area","volume":"24","author":"Wan","year":"2010","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_30","unstructured":"Lei, T.C., Huang, S.W.C.Y., and Li, J.Y. (2023, April 07). The Comparison Study of Paddy Rice Thematic Maps Based on Parameter Classifier (mlc) and Regional Object of Knowledge Classifier (RG + ROSE). Available online: https:\/\/a-a-r-s.org\/proceeding\/ACRS2011\/Session\/Paper\/P_89_9-6-16.pdf."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"633","DOI":"10.3390\/rs12040633","article-title":"Semantic segmentation using deep learning with vegetation indices for rice lodging identification in multi-date UAV visible images","volume":"12","author":"Yang","year":"2020","journal-title":"Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"2342","DOI":"10.3390\/agronomy12102342","article-title":"Agricultural Field Boundary Delineation with Satellite Image Segmentation for High-Resolution Crop Mapping: A Case Study of Rice Paddy","volume":"12","author":"Wang","year":"2022","journal-title":"Agronomy"},{"key":"ref_33","unstructured":"Yenchia, H. (2019). Prediction of Paddy Field Area in Satellite Images Using Deep Learning Neural Networks. [Master\u2019s Thesis, Chung Yuan Christian University]."},{"key":"ref_34","unstructured":"(2022, June 19). Environmental Systems Research Institute, ArcGIS Pro Desktop: Version 3.0.2. Available online: https:\/\/www.esri.com\/en-us\/arcgis\/products\/arcgis-pro\/overview."},{"key":"ref_35","unstructured":"Council of Agriculture, Executive Yuan, T (2022, August 23). Aerial Photography Information, Available online: https:\/\/www.afasi.gov.tw\/aerial_search."},{"key":"ref_36","unstructured":"GmbH, V.I. (2022, October 28). UltraCam-XP Technical Specifications. Available online: http:\/\/coello.ujaen.es\/Asignaturas\/fotodigital\/descargas\/UCXP-specs.pdf."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"1026","DOI":"10.3390\/agriculture11101026","article-title":"Estimation and forecasting of rice yield using phenology-based algorithm and linear regression model on sentinel-2 satellite data","volume":"11","author":"Nazir","year":"2021","journal-title":"Agriculture"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1007\/s11069-009-9421-0","article-title":"Vegetation recovery and landscape change assessment at Chiufenershan landslide area caused by Chichi earthquake in central Taiwan","volume":"53","author":"Lin","year":"2010","journal-title":"Nat. Hazards"},{"key":"ref_39","first-page":"91","article-title":"Faster R-CNN: Towards real-time object detection with region proposal networks","volume":"28","author":"Ren","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/14\/3575\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:13:33Z","timestamp":1760127213000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/14\/3575"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,17]]},"references-count":39,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2023,7]]}},"alternative-id":["rs15143575"],"URL":"https:\/\/doi.org\/10.3390\/rs15143575","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,17]]}}}