{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T04:01:03Z","timestamp":1781064063883,"version":"3.54.1"},"reference-count":38,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2022,2,10]],"date-time":"2022-02-10T00:00:00Z","timestamp":1644451200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41771457"],"award-info":[{"award-number":["41771457"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41601443"],"award-info":[{"award-number":["41601443"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Matching aerial and satellite optical images with large dip angles is a core technology and is essential for target positioning and dynamic monitoring in sensitive areas. However, due to the long distances and large dip angle observations of the aerial platform, there are significant perspective, radiation, and scale differences between heterologous space-sky images, which seriously affect the accuracy and robustness of feature matching. In this paper, a multiview satellite and unmanned aerial vehicle (UAV) image matching method based on deep learning is proposed to solve this problem. The main innovation of this approach is to propose a joint descriptor consisting of soft descriptions and hard descriptions. Hard descriptions are used as the main description to ensure matching accuracy. Soft descriptions are used not only as auxiliary descriptions but also for the process of network training. Experiments on several problems show that the proposed method ensures matching efficiency and achieves better matching accuracy for multiview satellite and UAV images than other traditional methods. In addition, the matching accuracy of our method in optical satellite and UAV images is within 3 pixels, and can nearly reach 2 pixels, which meets the requirements of relevant UAV missions.<\/jats:p>","DOI":"10.3390\/rs14040838","type":"journal-article","created":{"date-parts":[[2022,2,11]],"date-time":"2022-02-11T02:40:17Z","timestamp":1644547217000},"page":"838","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Multiview Image Matching of Optical Satellite and UAV Based on a Joint Description Neural Network"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7099-7833","authenticated-orcid":false,"given":"Chuan","family":"Xu","sequence":"first","affiliation":[{"name":"School of Computer Science, Hubei University of Technology, Wuhan 430068, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0580-7017","authenticated-orcid":false,"given":"Chang","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Computer Science, Hubei University of Technology, Wuhan 430068, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hongli","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430070, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhiwei","family":"Ye","sequence":"additional","affiliation":[{"name":"School of Computer Science, Hubei University of Technology, Wuhan 430068, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Haigang","family":"Sui","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2014-8120","authenticated-orcid":false,"given":"Wei","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Wuchang Shouyi University, Wuhan 430064, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,2,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"112045","DOI":"10.1016\/j.rse.2020.112045","article-title":"Accurate cloud detection in high-resolution remote sensing imagery by weakly supervised deep learning","volume":"250","author":"Li","year":"2020","journal-title":"Remote Sens. Environ."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"5388","DOI":"10.1080\/01431161.2017.1339926","article-title":"Dynamic monitoring of land-use\/land-cover change and urban expansion in Shenzhen using Landsat imagery from 1988 to 2015","volume":"38","author":"Dou","year":"2017","journal-title":"Int. J. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Shi, W., Zhang, M., Zhang, R., Chen, S., and Zhan, Z. (2020). Change detection based on artificial intelligence: State-of-the-art and challenges. Remote Sens., 12.","DOI":"10.3390\/rs12101688"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"716","DOI":"10.1109\/JSTARS.2020.3039235","article-title":"Robust SAR Automatic Target Recognition via Adversarial Learning","volume":"14","author":"Guo","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Guerra, E., Mungu\u00eda, R., and Grau, A. (2018). UAV visual and laser sensors fusion for detection and positioning in industrial applications. Sensors, 18.","DOI":"10.3390\/s18072071"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/s11263-020-01359-2","article-title":"Image matching from handcrafted to deep features: A survey","volume":"129","author":"Ma","year":"2021","journal-title":"Int. J. Comput. Vis."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1016\/j.isprsjprs.2018.06.010","article-title":"A local phase based invariant feature for remote sensing image matching","volume":"142","author":"Ye","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Manzo, M. (2020). Attributed relational sift-based regions graph: Concepts and applications. Mach. Learn. Knowl. Extr., 2.","DOI":"10.3390\/make2030013"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zhao, X., Li, H., Wang, P., and Jing, L. (2021). An Image Registration Method Using Deep Residual Network Features for Multisource High-Resolution Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13173425"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1821","DOI":"10.1109\/JSTARS.2020.3047656","article-title":"A Novel Region-Based Image Registration Method for Multisource Remote Sensing Images via CNN","volume":"14","author":"Zeng","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1016\/j.isprsjprs.2017.12.012","article-title":"A deep learning framework for remote sensing image registration","volume":"145","author":"Wang","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1175\/1520-0450(1971)010<0118:AATFOC>2.0.CO;2","article-title":"An automated technique for obtaining cloud motion from geosynchronous satellite data using cross correlation","volume":"10","author":"Leese","year":"1971","journal-title":"J. Appl. Meteorol. Climatol."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1109\/TC.1972.5008923","article-title":"A class of algorithms for fast digital image registration","volume":"100","author":"Barnea","year":"1972","journal-title":"IEEE Trans. Comput."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"977","DOI":"10.1016\/S0262-8856(03)00137-9","article-title":"Image registration methods: A survey","volume":"21","author":"Zitova","year":"2003","journal-title":"Image Vis. Comput."},{"key":"ref_15","unstructured":"Harris, C.G., and Stephens, M. (September, January 31). A combined corner and edge detector. Proceedings of the Alvey Vision Conference, Manchester, UK."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1023\/A:1007963824710","article-title":"SUSAN\u2014A new approach to low level image processing","volume":"23","author":"Smith","year":"1997","journal-title":"Int. J. Comput. Vis."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1150","DOI":"10.1109\/ICCV.1999.790410","article-title":"Object recognition from local scale-invariant features","volume":"Volume 2","author":"Lowe","year":"1999","journal-title":"Proceedings of the Seventh IEEE International Conference on Computer Vision"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1109\/TPAMI.2007.70716","article-title":"Scene classification using a hybrid generative\/discriminative approach","volume":"30","author":"Bosch","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_19","first-page":"2","article-title":"PCA-SIFT: A more distinctive representation for local image descriptors","volume":"Volume 2","author":"Ke","year":"2004","journal-title":"Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"438","DOI":"10.1137\/080732730","article-title":"ASIFT: A new framework for fully affine invariant image comparison","volume":"2","author":"Morel","year":"2009","journal-title":"SIAM J. Imaging Sci."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"5254","DOI":"10.1109\/TGRS.2019.2959606","article-title":"A New Sample Consensus Based on Sparse Coding for Improved Matching of SIFT Features on Remote Sensing Images","volume":"58","author":"Etezadifar","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2019.04.006","article-title":"Reliable image matching via photometric and geometric constraints structured by Delaunay triangulation","volume":"153","author":"Jiang","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.isprsjprs.2019.05.006","article-title":"LAM: Locality affine-invariant feature matching","volume":"154","author":"Li","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2020.10.019","article-title":"Universal SAR and optical image registration via a novel SIFT framework based on nonlinear diffusion and a polar spatial-frequency descriptor","volume":"171","author":"Yu","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1016\/j.isprsjprs.2018.04.023","article-title":"Ancient Chinese Architecture 3D Preservation by Merging Ground and Aerial Point Clouds","volume":"143","author":"Gao","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"49","DOI":"10.14358\/PERS.81.1.49","article-title":"Reliable spatial relationship constrained feature point matching of oblique aerial images","volume":"81","author":"Hu","year":"2015","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Jiang, S., and Jiang, W. (2017). On-Board GNSS\/IMU Assisted Feature Extraction and Matching for Oblique UAV Images. Remote Sens., 9.","DOI":"10.3390\/rs9080813"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Yi, K.M., Trulls, E., Lepetit, V., and Fua, P. (2016). Lift: Learned invariant feature transform. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-46466-4_28"},{"key":"ref_29","unstructured":"Balntas, V., Johns, E., Tang, L., and Mikolajczyk, K. (2016). PN-Net: Conjoined triple deep network for learning local image descriptors. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"DeTone, D., Malisiewicz, T., and Rabinovich, A. (2018, January 18\u201322). Superpoint: Self-supervised interest point detection and description. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00060"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Bhowmik, A., Gumhold, S., Rother, C., and Brachmann, E. (2020, January 14\u201319). Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00500"},{"key":"ref_32","unstructured":"Ono, Y., Trulls, E., Fua, P., and Yi, K.M. (2018). LF-Net: Learning local features from images. arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Sun, J., Shen, Z., Wang, Y., Bao, H., and Zhou, X. (2021, January 19\u201325). LoFTR: Detector-free local feature matching with transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00881"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.isprsjprs.2020.09.012","article-title":"A deep learning framework for matching of SAR and optical imagery","volume":"169","author":"Lhh","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Dusmanu, M., Rocco, I., Pajdla, T., Pollefeys, M., Sivic, J., Torii, A., and Sattler, T. (2019, January 16\u201320). D2-net: A trainable cnn for joint description and detection of local features. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00828"},{"key":"ref_36","first-page":"2605","article-title":"Performance Evaluation of SIFT & FLANN and HAAR Cascade Image Processing Algorithms for Object Identification in Robotic Applications","volume":"118","author":"Megalingam","year":"2018","journal-title":"Int. J. Pure Appl. Math."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"66963","DOI":"10.1109\/ACCESS.2018.2878147","article-title":"An efficient image matching algorithm based on adaptive threshold and RANSAC","volume":"6","author":"Li","year":"2018","journal-title":"IEEE Access"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Yang, T.Y., Hsu, J.H., Lin, Y.Y., and Chuang, Y.Y. (2017, January 22\u201329). Deepcd: Learning deep complementary descriptors for patch representations. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.359"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/4\/838\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:17:54Z","timestamp":1760134674000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/4\/838"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,10]]},"references-count":38,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2022,2]]}},"alternative-id":["rs14040838"],"URL":"https:\/\/doi.org\/10.3390\/rs14040838","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,10]]}}}