{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T01:04:57Z","timestamp":1775783097605,"version":"3.50.1"},"reference-count":35,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2018,2,24]],"date-time":"2018-02-24T00:00:00Z","timestamp":1519430400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2016YFB0502603"],"award-info":[{"award-number":["2016YFB0502603"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41401526 and 41501492"],"award-info":[{"award-number":["41401526 and 41501492"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Jiangxi Natural Science Foundation of China","award":["20171BAB213025"],"award-info":[{"award-number":["20171BAB213025"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Feature-based matching methods have been widely used in remote sensing image matching given their capability to achieve excellent performance despite image geometric and radiometric distortions. However, most of the feature-based methods are unreliable for complex background variations, because the gradient or other image grayscale information used to construct the feature descriptor is sensitive to image background variations. Recently, deep learning-based methods have been proven suitable for high-level feature representation and comparison in image matching. Inspired by the progresses made in deep learning, a new technical framework for remote sensing image matching based on the Siamese convolutional neural network is presented in this paper. First, a Siamese-type network architecture is designed to simultaneously learn the features and the corresponding similarity metric from labeled training examples of matching and non-matching true-color patch pairs. In the proposed network, two streams of convolutional and pooling layers sharing identical weights are arranged without the manually designed features. The number of convolutional layers is determined based on the factors that affect image matching. The sigmoid function is employed to compute the matching and non-matching probabilities in the output layer. Second, a gridding sub-pixel Harris algorithm is used to obtain the accurate localization of candidate matches. Third, a Gaussian pyramid coupling quadtree is adopted to gradually narrow down the searching space of the candidate matches, and multiscale patches are compared synchronously. Subsequently, a similarity measure based on the output of the sigmoid is adopted to find the initial matches. Finally, the random sample consensus algorithm and the whole-to-local quadratic polynomial constraints are used to remove false matches. In the experiments, different types of satellite datasets, such as ZY3, GF1, IKONOS, and Google Earth images, with complex background variations are used to evaluate the performance of the proposed method. The experimental results demonstrate that the proposed method, which can significantly improve the matching performance of multi-temporal remote sensing images with complex background variations, is better than the state-of-the-art matching methods. In our experiments, the proposed method obtained a large number of evenly distributed matches (at least 10 times more than other methods) and achieved a high accuracy (less than 1 pixel in terms of root mean square error).<\/jats:p>","DOI":"10.3390\/rs10020355","type":"journal-article","created":{"date-parts":[[2018,2,27]],"date-time":"2018-02-27T03:36:12Z","timestamp":1519702572000},"page":"355","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":72,"title":["Matching of Remote Sensing Images with Complex Background Variations via Siamese Convolutional Neural Network"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9361-0219","authenticated-orcid":false,"given":"Haiqing","family":"He","sequence":"first","affiliation":[{"name":"School of Geomatics, East China University of Technology, Nanchang 330013, China"}]},{"given":"Min","family":"Chen","sequence":"additional","affiliation":[{"name":"Faculty of Geosciences and Environmental Engineering, Southwest Jiaotong University, Chengdu 611756, China"}]},{"given":"Ting","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Water Resources &amp; Environmental Engineering, East China University of Technology, Nanchang 330013, China"}]},{"given":"Dajun","family":"Li","sequence":"additional","affiliation":[{"name":"School of Geomatics, East China University of Technology, Nanchang 330013, China"}]}],"member":"1968","published-online":{"date-parts":[[2018,2,24]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1145\/146370.146374","article-title":"A survey of image registration techniques","volume":"24","author":"Brown","year":"1992","journal-title":"ACM Comput. Surv."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S1361-8415(01)80026-8","article-title":"A survey of medical image registration","volume":"2","author":"Maintz","year":"1998","journal-title":"Med. Image Anal."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"977","DOI":"10.1016\/S0262-8856(03)00137-9","article-title":"Image registration methods: A survey","volume":"21","author":"Flusser","year":"2003","journal-title":"Image Vis. Comput."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1007\/978-3-642-13681-8_13","article-title":"Remote sensing image registration techniques: A survey","volume":"Volume 6134","author":"Elmoataz","year":"2010","journal-title":"International Conference on Image and Signal Processing"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1117\/1.JRS.9.095092","article-title":"Rotation and scale invariant shape context registration for remote sensing images with background variations","volume":"9","author":"Jiang","year":"2015","journal-title":"J. Appl. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Yang, K., Pan, A., Yang, Y., Zhang, S., Ong, S.H., and Tang, H. (2017). Remote sensing image registration using multiple image features. Remote Sens., 9.","DOI":"10.20944\/preprints201705.0027.v2"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Chen, M., Habib, A., He, H., Zhu, Q., and Zhang, W. (2017). Robust feature matching method for SAR and optical images by using Gaussian-Gamma-shaped bi-windows-based descriptor and geometric constraint. Remote Sens., 9.","DOI":"10.3390\/rs9090882"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Lowe, D. (1999, January 20\u201327). Object recognition from local scale-invariant features. Proceedings of the 7th IEEE International Conference on Computer Vision, Kerkyra, Greece.","DOI":"10.1109\/ICCV.1999.790410"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.cviu.2007.09.014","article-title":"Speeded-up robust features (SURF)","volume":"110","author":"Bay","year":"2008","journal-title":"Comput. Vis. Image Und."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2076","DOI":"10.3390\/rs3092076","article-title":"Improved feature detection in fused intensity-range image with complex SIFT","volume":"3","author":"Bradley","year":"2011","journal-title":"Remote Sens."},{"key":"ref_11","unstructured":"Ke, Y., and Sukthankar, R. (July, January 27). PCA-SIFT: A more distinctive representation for local image descriptors. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Washington, DC, USA."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1615","DOI":"10.1109\/TPAMI.2005.188","article-title":"A performance evaluation of local descriptors","volume":"27","author":"Mikolajczyk","year":"2005","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"438","DOI":"10.1137\/080732730","article-title":"ASIFT: A new framework for fully affine invariant image comparison","volume":"2","author":"Morel","year":"2009","journal-title":"SIAM J. Imaging Sci."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1109\/LGRS.2008.2011751","article-title":"Robust scale-invariant feature matching for remote sensing image registration","volume":"6","author":"Li","year":"2009","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"65","DOI":"10.3390\/rs3010065","article-title":"Automatic registration of airborne and spaceborne images by topology map matching with SURF processor algorithm","volume":"3","author":"Brook","year":"2011","journal-title":"Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"157","DOI":"10.3390\/rs6010157","article-title":"Automatic registration method for fusion of ZY-1-02C satellite images","volume":"6","author":"Chen","year":"2013","journal-title":"Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3088","DOI":"10.1016\/j.sigpro.2013.04.008","article-title":"Perspective-SIFT: An efficient tool for low-altitude remote sensing image registration","volume":"93","author":"Cai","year":"2013","journal-title":"Signal Process."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1989","DOI":"10.1109\/LGRS.2016.2620147","article-title":"Robust feature matching for remote sensing image registration based on lq-estimator","volume":"13","author":"Li","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Zagoruyko, S., and Komodakis, N. (2015, January 7\u201312). Learning to compare image patches via convolutional neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299064"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Shi, X., and Jiang, J. (2016). Automatic registration method for optical remote sensing images with large background variations using line segments. Remote Sens., 8.","DOI":"10.3390\/rs8050426"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Altwaijry, H., Trulls, E., Hays, J., Fua, P., and Belongie, S. (2016, January 27\u201330). Learning to match aerial images with deep attentive architecture. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR.2016.385"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Chen, L., Rottensteiner, F., and Heipke, C. (2016, January 12\u201319). Invariant descriptor learning using a Siamese convolutional neural network. Proceedings of the ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Prague, Czech Republic.","DOI":"10.5194\/isprs-annals-III-3-11-2016"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Simo-Serra, E., Trulls, E., Ferraz, L., Kokkinos, I., Fua, P., and Moreno-Noguer, F. (2015, January 7\u201313). Discriminative learning of deep convolutional feature point descriptors. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.22"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Ahmed, E., Jones, M., and Marks, T.K. (2015, January 7\u201312). An improved deep learning architecture for person re-identificaiton. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299016"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Melekhov, I., Kannala, J., and Rahtu, E. (2016, January 4\u20138). Siamese Network Features for Image Matching. Proceedings of the 23rd International Conference on Pattern Recognition, Cancun, Mexico.","DOI":"10.1109\/ICPR.2016.7899663"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1145\/358669.358692","article-title":"Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography","volume":"24","author":"Fischler","year":"1981","journal-title":"ACM"},{"key":"ref_27","unstructured":"Brum, A.G.V., Pilchowski, H.U., and Faria, S.D. (2010, January 7\u201311). Attitude determination of spacecraft with use of surface imaging. Proceedings of the 9th Brazilian Conference on Dynamics Control and their Applications (DICON\u201910), Serra Negra, Brazil."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Kouyama, T., Kanemura, A., Kato, S., Imamoglu, N., Fukuhara, T., and Nakamura, R. (2017). Satellite attitude determination and map projection based on robust image matching. Remote Sens., 9.","DOI":"10.3390\/rs9010090"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"6972","DOI":"10.1109\/TGRS.2014.2306233","article-title":"Object-oriented shadow detection and removal from urban high-resolution remote sensing images","volume":"52","author":"Zhang","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1109\/TGRS.2012.2237521","article-title":"Inpainting for remotely sensed images with a multichannel nonlocal total variation model","volume":"52","author":"Cheng","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_31","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning (ICML\u201315), Lille, France."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Merkle, N., Luo, W., Auer, S., M\u00fcller, R., and Urtasun, R. (2017). Exploiting deep matching and SAR data for the geo-localization accuracy improvement of optical satellite images. Remote Sens., 9.","DOI":"10.3390\/rs9060586"},{"key":"ref_33","unstructured":"Harris, C. (September, January 31). A combined corner and edge detector. Proceedings of the 4th Alvey Vision Conference, Manchester, UK."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1109\/TPAMI.2010.54","article-title":"Discriminative learning of local image descriptors","volume":"33","author":"Brown","year":"2011","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Matas, J., Chum, O., Urban, M., and Pajdla, T. (2002, January 2\u20135). Robust wide baseline stereo from maximally stable extremal regions. Proceedings of the British Machine Vision Conference, Cardiff, UK.","DOI":"10.5244\/C.16.36"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/2\/355\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T14:56:16Z","timestamp":1760194576000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/2\/355"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,2,24]]},"references-count":35,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2018,2]]}},"alternative-id":["rs10020355"],"URL":"https:\/\/doi.org\/10.3390\/rs10020355","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,2,24]]}}}