{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T01:25:16Z","timestamp":1775784316022,"version":"3.50.1"},"reference-count":28,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2023,4,14]],"date-time":"2023-04-14T00:00:00Z","timestamp":1681430400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Science Foundation of China","doi-asserted-by":"publisher","award":["42201443"],"award-info":[{"award-number":["42201443"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Most cross-view image matching algorithms focus on designing network structures with excellent performance, ignoring the content information of the image. At the same time, there are non-fixed targets such as cars, ships, and pedestrians in ground perspective images and aerial perspective images. Differences in perspective, direction, and scale cause serious interference with the cross-view matching process. This paper proposes a cross-view image matching method with feature enhancement, which first transforms the empty image to generate a transformation image aligned with the ground\u2013aerial image domain to establish a preliminary geometric correspondence between the ground-space image. Then, the rich feature information of the deep network and the edge information of the cross-convolution layer are used to establish the feature correspondence between the ground-space images. The feature fusion module enhances the tolerance of the network model to scale differences, improving the interference problem of transient non-fixed targets on the matching performance in the images. Finally, the maximum pooling and feature aggregation strategies are adopted to aggregate local features with obvious distinguishability into global features to complete the accurate matching between ground images. The experimental results show that the proposed method has good advance and high accuracy on CVUSA, which is commonly used in public datasets, reaching 92.23%, 98.47%, and 99.74% on the top 1, top 5 and top 10 indicators, respectively, outperforming the original method in the dataset with a limited field of view and image center, better completing the cross-perspective image matching task.<\/jats:p>","DOI":"10.3390\/rs15082083","type":"journal-article","created":{"date-parts":[[2023,4,14]],"date-time":"2023-04-14T10:29:16Z","timestamp":1681468156000},"page":"2083","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["A Cross-View Image Matching Method with Feature Enhancement"],"prefix":"10.3390","volume":"15","author":[{"given":"Ziyu","family":"Rao","sequence":"first","affiliation":[{"name":"Institute of Geospatial Information, PLA Strategic Support Force Information Engineering University, Zhengzhou 450001, China"}]},{"given":"Jun","family":"Lu","sequence":"additional","affiliation":[{"name":"Institute of Geospatial Information, PLA Strategic Support Force Information Engineering University, Zhengzhou 450001, China"}]},{"given":"Chuan","family":"Li","sequence":"additional","affiliation":[{"name":"Naval Academy, Dalian 116000, China"}]},{"given":"Haitao","family":"Guo","sequence":"additional","affiliation":[{"name":"Institute of Geospatial Information, PLA Strategic Support Force Information Engineering University, Zhengzhou 450001, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,4,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Choi, J., and Friedland, G. (2014). Multimodal Location Estimation of Videos and Images, Springer.","DOI":"10.1007\/978-3-319-09861-6"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Hays, J., and Efros, A.A. (2008, January 23\u201328). IM2GPS: Estimating geographic information from a single image. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA.","DOI":"10.1109\/CVPR.2008.4587784"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Cui, Y., Belongie, S., and Hays, J. (2015, January 7\u201312). Learning deep representations for ground-to-aerial geolocalization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299135"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Leibe, B., Matas, J., Sebe, N., and Welling, M. (2016). European Conference on Computer Vision (ECCV), Springer International Publishing.","DOI":"10.1007\/978-3-319-46478-7"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Hu, S., Feng, M., Nguyen, R.M.H., and Lee, G.H. (2018, January 18\u201323). CVM-net: Cross-view matching network for image-based ground-to-aerial geo-localization. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00758"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1437","DOI":"10.1109\/TPAMI.2017.2711011","article-title":"NetVLAD: CNN architecture for weakly supervised place recognition","volume":"40","author":"Arandjelovic","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_7","first-page":"11990","article-title":"Optimal feature transport for cross-view image geo-localization","volume":"34","author":"Shi","year":"2020","journal-title":"Proc. Conf. AAAI Artif. Intell."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"867","DOI":"10.1109\/TCSVT.2021.3061265","article-title":"Each part matters: Local patterns facilitate cross-view geo-localization","volume":"32","author":"Wang","year":"2021","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Liu, L., and Li, H. (2019, January 20). Lending Orientation to Neural Networks for Cross-View Geo-Localization. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00577"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Zhu, S.J., Yang, T., and Chen, C. (2021, January 20\u201325). VIGOR: Cross-view image geo-localization beyond one-to-one retrieval. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00364"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhu, S.J., Yang, T., and Chen, C. (2021, January 3\u20138). Revisiting street-to-aerial view image geo-localization and orientation estimation. Proceedings of the IEEE Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV48630.2021.00080"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Shi, Y., Yu, X., Campbell, D., and Li, H. (2020, January 14\u201319). Where am I looking at? Joint location and orientation estimation by cross-view matching. Proceedings of the Institute of Electrical and Electronics Engineers (IEEE)\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00412"},{"key":"ref_13","first-page":"10090","article-title":"Spatial-aware feature aggregation for image based cross-view geo-localization","volume":"32","author":"Shi","year":"2019","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Toker, A., Zhou, Q., Maximov, M., and Leal-Taixe, L. (2021, January 20\u201325). Coming down to earth: Satellite-to-street view synthesis for geo-localization. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00642"},{"key":"ref_15","unstructured":"Mousavian, A., and Kosecka, J. (2016). Semantic image based geolocation given a map. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Regmi, K., and Borji, A. (2018, January 18\u201322). Cross-view image synthesis using conditional gans. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00369"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_18","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Sarlin, P.E., DeTone, D., Malisiewicz, T., and Rabinovich, A. (2020, January 13\u201319). Superglue: Learning feature matching with graph neural networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00499"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"4927","DOI":"10.1109\/TCSVT.2021.3138431","article-title":"Cross-SRN: Structure-Preserving Super-Resolution Network with Cross Convolution","volume":"32","author":"Liu","year":"2021","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Workman, S., Souvenir, R., and Jacobs, N. (2015, January 7\u201313). Wide-area image geolocalization with aerial reference imagery. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.451"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhai, M., Bessinger, Z., Workman, S., and Jacobs, N. (2017, January 21\u201326). Predicting ground-level scene layout from aerial imagery. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.440"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Sun, B., Chen, C., Zhu, Y., and Jiang, J. (2019). Geocapsnet: Aerial to ground view image geo-localization using capsule network. arXiv.","DOI":"10.1109\/ICME.2019.00133"},{"key":"ref_25","unstructured":"Regmi, K., and Shah, M. (November, January 27). Bridging the domain gap for ground-to-aerial image matching. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_26","unstructured":"Cai, S., Guo, Y., Khan, S., Hu, J., and Wen, G. (November, January 27). Ground-to-aerial image geo-localization with a hard exemplar reweighting triplet loss. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Republic of Korea."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"3780","DOI":"10.1109\/TIP.2022.3175601","article-title":"Joint Representation Learning and Keypoint Detection for Cross-view Geo-localization","volume":"31","author":"Lin","year":"2022","journal-title":"IEEE Trans. Image Process."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhu, S., Shah, M., and Chen, C. (2022, January 18\u201324). Transgeo: Transformer is all you need for cross-view image geo-localization. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00123"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/8\/2083\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:16:11Z","timestamp":1760123771000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/8\/2083"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,14]]},"references-count":28,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2023,4]]}},"alternative-id":["rs15082083"],"URL":"https:\/\/doi.org\/10.3390\/rs15082083","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,4,14]]}}}