{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:51:05Z","timestamp":1760161865928,"version":"build-2065373602"},"reference-count":38,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2021,1,1]],"date-time":"2021-01-01T00:00:00Z","timestamp":1609459200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Due to the large data volume, the UAV image stitching and matching suffers from high computational cost. The traditional feature extraction algorithms\u2014such as Scale-Invariant Feature Transform (SIFT), Speeded Up Robust Features (SURF), and Oriented FAST Rotated BRIEF (ORB)\u2014require heavy computation to extract and describe features in high-resolution UAV images. To overcome this issue, You Only Look Once version 3 (YOLOv3) combined with the traditional feature point matching algorithms is utilized to extract descriptive features from the drone dataset of residential areas for roof detection. Unlike the traditional feature extraction algorithms, YOLOv3 performs the feature extraction solely on the proposed candidate regions instead of the entire image, thus the complexity of the image matching is reduced significantly. Then, all the extracted features are fed into Structural Similarity Index Measure (SSIM) to identify the corresponding roof region pair between consecutive image sequences. In addition, the candidate corresponding roof pair by our architecture serves as the coarse matching region pair and limits the search range of features matching to only the detected roof region. This further improves the feature matching consistency and reduces the chances of wrong feature matching. Analytical results show that the proposed method is 13\u00d7 faster than the traditional image matching methods with comparable performance.<\/jats:p>","DOI":"10.3390\/rs13010127","type":"journal-article","created":{"date-parts":[[2021,1,1]],"date-time":"2021-01-01T22:35:48Z","timestamp":1609540548000},"page":"127","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["YOLOv3-Based Matching Approach for Roof Region Detection from Drone Images"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1711-3074","authenticated-orcid":false,"given":"Chia-Cheng","family":"Yeh","sequence":"first","affiliation":[{"name":"National Science and Technology Center for Disaster Reduction, New Taipei 23143, Taiwan"},{"name":"Department of Electrical Engineering, National Taipei University of Technology, Taipei 10608, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yang-Lang","family":"Chang","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, National Taipei University of Technology, Taipei 10608, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8933-7483","authenticated-orcid":false,"given":"Mohammad","family":"Alkhaleefah","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, National Taipei University of Technology, Taipei 10608, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4052-8323","authenticated-orcid":false,"given":"Pai-Hui","family":"Hsu","sequence":"additional","affiliation":[{"name":"Department of Civil Engineering, National Taiwan University, Taipei 10617, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Weiyong","family":"Eng","sequence":"additional","affiliation":[{"name":"Faculty of Engineering and Technology, Multimedia University, Melaka 76450, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Voon-Chet","family":"Koo","sequence":"additional","affiliation":[{"name":"Faculty of Engineering and Technology, Multimedia University, Melaka 76450, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bormin","family":"Huang","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, National Taipei University of Technology, Taipei 10608, Taiwan"},{"name":"The School of Information Science and Technology Southwest Jiaotong University, Chengdu 611756, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lena","family":"Chang","sequence":"additional","affiliation":[{"name":"Department of Communications, Navigation and Control Engineering, National Taiwan Ocean University, Keelung 20248, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,1,1]]},"reference":[{"key":"ref_1","first-page":"326","article-title":"A Survey of Image Registration Techniques","volume":"24","author":"Brown","year":"1992","journal-title":"ACM"},{"key":"ref_2","first-page":"1150","article-title":"Object Recognition from Local Scale-Invariant Features","volume":"99","author":"Lowe","year":"1999","journal-title":"ICCV"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive Image Features from Scale-Invariant Keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Bay, H., Tuytelaars, T., and Van Gool, L. (2006, January 7\u201313). Surf: Speeded up robust features. European Conference on Computer Vision, Graz, Austria.","DOI":"10.1007\/11744023_32"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.cviu.2007.09.014","article-title":"Speeded-Up Robust Features (SURF)","volume":"110","author":"Bay","year":"2008","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Rublee, E., Rabaud, V., Konolige, K., and Bradski, G.R. (2011, January 6\u201313). ORB: An efficient alternative to SIFT or SURF. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"ref_7","unstructured":"Harris, C.G., and Stephens, M.J. (September, January 31). A combined corner and edge detector. Proceedings of the Fourth Alvey Vision Conference, Manchester, UK."},{"key":"ref_8","unstructured":"Shi, J., and Tomasi, C. (1994, January 21\u201323). Good features to track. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Seattle, DC, USA."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1145\/358669.358692","article-title":"Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography","volume":"24","author":"Fischler","year":"1981","journal-title":"Commun. ACM"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/j.compag.2018.02.016","article-title":"Deep Learning in Agriculture: A Survey","volume":"147","author":"Kamilaris","year":"2018","journal-title":"Comput. Electron. Agric."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1007\/s11263-019-01247-4","article-title":"Deep Learning for Generic Object Detection: A Survey","volume":"128","author":"Liu","year":"2020","journal-title":"Int. J. Comput. Vis."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-Based Learning Applied to Document Recognition","volume":"86","author":"LeCun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_13","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). Imagenet classification with deep convolutional neural networks. Proceedings of the 25th Conference on Advances in Neural Information Processing Systems (NIPS), Lake Tahoe, NV, USA."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1146\/annurev-vision-091718-014951","article-title":"Deep Learning: The Good, the bad, and the Ugly","volume":"5","author":"Serre","year":"2019","journal-title":"Annu. Rev. Vis. Sci."},{"key":"ref_15","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the 2016 IEEE Conference on Computer, Vision, Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_17","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An Incremental Improvement. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Raina, R., Madhavan, A., and Ng, A.Y. (2009, January 14\u201318). Large-scale deep unsupervised learning using graphics processors. Proceedings of the 26th Annual International Conference on Machine Learning, Montreal, QC, Canada.","DOI":"10.1145\/1553374.1553486"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"3207","DOI":"10.1162\/NECO_a_00052","article-title":"Deep, Big, Simple Neural Nets for Handwritten Digit Recognition","volume":"22","author":"Meier","year":"2010","journal-title":"Neural Comput."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1016\/S1007-0214(08)70176-7","article-title":"Automatic Generation of 3D Building Models with Multiple Roofs","volume":"13","author":"Sugihara","year":"2008","journal-title":"Tsinghua Sci. Technol."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1109\/TASL.2011.2134090","article-title":"Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition","volume":"20","author":"Dahl","year":"2011","journal-title":"IEEE Trans. Audio Speech Lang. Process."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"75","DOI":"10.46604\/ijeti.2020.4354","article-title":"Application of Recent Developments in Deep Learning to ANN-Based Automatic Berthing Systems","volume":"10","author":"Lee","year":"2020","journal-title":"Int. J. Eng. Technol. Innov."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 13\u201316). Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Mikolajczyk, K., and Schmid, C. (2002, January 28\u201331). An affine invariant interest point detector. Proceedings of the European Conference on Computer Vision, Copenhagen, Denmark.","DOI":"10.1007\/3-540-47969-4_9"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"49:1","DOI":"10.1145\/3178115","article-title":"Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments","volume":"9","author":"Zhang","year":"2018","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 8\u201316). Ssd: Single shot multibox detector. Proceedings of the European Conference on Computer Vision\u2014ECCV, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_29","unstructured":"Tzutalin (2019, May 30). Available online: https:\/\/github.com\/tzutalin\/labelImg."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"600","DOI":"10.1109\/TIP.2003.819861","article-title":"Image Quality Assessment: From Error Visibility to Structural Similarity","volume":"13","author":"Wang","year":"2004","journal-title":"IEEE Trans. Image Process."},{"key":"ref_31","unstructured":"Alhwarin, F. (2011). Fast and Robust Image Feature Matching Methods for Computer Vision Applications, Shaker Verlag."},{"key":"ref_32","unstructured":"Karami, E., Prasad, S., and Shehata, M. (2017). Image Matching Using SIFT, SURF, BRIEF and ORB: Performance Comparison for Distorted Images. arXiv."},{"key":"ref_33","first-page":"462","article-title":"An Advanced Technique of Image Matching Using SIFT and SURF","volume":"5","author":"Preeti","year":"2016","journal-title":"Int. J. Adv. Res. Comput. Commun. Eng."},{"key":"ref_34","first-page":"277","article-title":"Automatic Fast Feature-Level Image Registration for High-Resolution Remote Sensing Images","volume":"2","author":"He","year":"2018","journal-title":"J. Remote Sens."},{"key":"ref_35","first-page":"4016025","article-title":"Accuracy of Digital Surface Models and Orthophotos Derived from Unmanned Aerial Vehicle Photogrammetry","volume":"143","year":"2016","journal-title":"J. Surv. Eng."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Manfreda, S., Dvorak, P., Mullerova, J., Herban, S., Vuono, P., Arranz Justel, J., and Perks, M. (2019). Assessing the Accuracy of Digital Surface Models Derived from Optical Imagery Acquired with Unmanned Aerial Systems. Drones, 3.","DOI":"10.3390\/drones3010015"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"419","DOI":"10.14358\/PERS.82.6.419","article-title":"A Statistical Examination of Image Stitching Software Packages or Use with Unmanned Aerial Systems","volume":"82","author":"Gross","year":"2016","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Oniga, V.-E., Breaban, A.-I., and Statescu, F. (2018). Determining the Optimum Number of Ground Control Points for Obtaining High Precision Results Based on UAS Images. Proceedings, 2.","DOI":"10.3390\/ecrs-2-05165"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/1\/127\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:06:13Z","timestamp":1760159173000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/1\/127"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,1]]},"references-count":38,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,1]]}},"alternative-id":["rs13010127"],"URL":"https:\/\/doi.org\/10.3390\/rs13010127","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2021,1,1]]}}}