{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,21]],"date-time":"2026-06-21T08:10:15Z","timestamp":1782029415721,"version":"3.54.5"},"reference-count":28,"publisher":"MDPI AG","issue":"15","license":[{"start":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T00:00:00Z","timestamp":1691107200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["61973314"],"award-info":[{"award-number":["61973314"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Visual positioning is a basic component for UAV operation. The structure-based methods are, widely applied in most literature, based on local feature matching between a query image that needs to be localized and a reference image with a known pose and feature points. However, the existing methods still struggle with the different illumination and seasonal changes. In outdoor regions, the feature points and descriptors are similar, and the number of mismatches will increase rapidly, leading to the visual positioning becoming unreliable. Moreover, with the database growing, the image retrieval and feature matching are time-consuming. Therefore, in this paper, we propose a novel hierarchical visual positioning method, which includes map construction, landmark matching and pose calculation. First, we combine brain-inspired mechanisms and landmarks to construct a cognitive map, which can make image retrieval efficient. Second, the graph neural network is utilized to learn the inner relations of the feature points. To improve matching accuracy, the network uses the semantic confidence in matching score calculations. Besides, the system can eliminate the mismatches by analyzing all the matching results in the same landmark. Finally, we calculate the pose by using a PnP solver. Furthermore, we evaluate both the matching algorithm and the visual positioning method experimentally in the simulation datasets, where the matching algorithm performs better in some scenes. The results demonstrate that the retrieval time can be shortened by three-thirds with an average positioning error of 10.8 m.<\/jats:p>","DOI":"10.3390\/s23156941","type":"journal-article","created":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T09:28:29Z","timestamp":1691141309000},"page":"6941","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["A Visual Positioning Method of UAV in a Large-Scale Outdoor Environment"],"prefix":"10.3390","volume":"23","author":[{"given":"Chenhao","family":"Zhao","sequence":"first","affiliation":[{"name":"Graduate School, Air Force Engineering University, Xi\u2019an 710077, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dewei","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Information and Navigation, Air Force Engineering University, Xi\u2019an 710077, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jing","family":"He","sequence":"additional","affiliation":[{"name":"School of Information and Navigation, Air Force Engineering University, Xi\u2019an 710077, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chuanjin","family":"Dai","sequence":"additional","affiliation":[{"name":"School of Information and Navigation, Air Force Engineering University, Xi\u2019an 710077, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,8,4]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1007\/s12204-017-1829-1","article-title":"Research on Visual Autonomous Navigation Indoor for Unmanned Aerial Vehicle","volume":"22","author":"Zhang","year":"2017","journal-title":"J. Shanghai Jiaotong Univ."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Xu, M., Wang, L., Ren, J., and Poslad, S. (November, January 30). Use of LSTM Regression and Rotation Classification to Improve Camera Pose Localization Estimation. Proceedings of the 2020 IEEE 14th International Conference on Anti-Counterfeiting, Security, and Identification (ASID), Xiamen, China.","DOI":"10.1109\/ASID50160.2020.9271762"},{"key":"ref_3","unstructured":"Zhang, Z., Xu, M., Zhou, W., Peng, T., Li, L., and Poslad, S. (2022). BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images. arXiv."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Sarlin, P.E., Cadena, C., Siegwart, R., and Dymczyk, M. (2019, January 16\u201320). From coarse to fine: Robust hierarchical localization at large scale. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01300"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Sattler, T., Weyand, T., Leibe, B., and Kobbelt, L. (2012, January 3\u20137). Image Retrieval for Image-Based Localization Revisited. Proceedings of the BMVC, Surrey, UK.","DOI":"10.5244\/C.26.76"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Sarlin, P.E., Unagar, A., Larsson, M., Germain, H., Toft, C., Larsson, V., Pollefeys, M., Lepetit, V., Hammarstrand, L., and Kahl, F. (2021, January 19\u201325). Back to the feature: Learning robust camera localization from pixels to pose. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Virtual.","DOI":"10.1109\/CVPR46437.2021.00326"},{"key":"ref_7","unstructured":"Humenberger, M., Cabon, Y., Guerin, N., Morat, J., Revaud, J., Rerole, P., Pion, N., de Souza, C., Leroy, V., and Csurka, G. (2020). Robust image retrieval-based visual localization using kapture. arXiv."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"930","DOI":"10.1109\/TPAMI.2003.1217599","article-title":"Complete solution classification for the perspective-three-point problem","volume":"25","author":"Gao","year":"2003","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Raguram, R., Frahm, J.-M., and Pollefeys, M. (2008, January 12\u201318). A comparative analysis of RANSAC techniques leading to adaptive real-time random sample consensus. Proceedings of the European Conference on Computer Vision 2008, Marseille, France.","DOI":"10.1007\/978-3-540-88688-4_37"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Kendall, A., Grimes, M., and Cipolla, R. (2015, January 7\u201313). Posenet: A convolutional network for real-time 6-dof camera relocalization. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile. Available online: https:\/\/openaccess.thecvf.com\/content_iccv_2015\/papers\/Kendall_PoseNet_A_Convolutional_ICCV_2015_paper.pdf.","DOI":"10.1109\/ICCV.2015.336"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Brahmbhatt, S., Gu, J., Kim, K., Hays, J., and Kautz, J. (2018, January 18\u201322). Geometry-aware learning of maps for camera localization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00277"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Kendall, A., and Cipolla, R. (2017, January 21\u201326). Geometric loss functions for camera pose regression with deep learning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.694"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Shavit, Y., Ferens, R., and Keller, Y. (2021, January 10\u201317). Learning multi-scene absolute pose regression with transformers. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00273"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Balntas, V., Li, S., and Prisacariu, V. (2018, January 8\u201314). Relocnet: Continuous metric learning relocalization using neural nets. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_46"},{"key":"ref_15","unstructured":"Ding, M., Wang, Z., Sun, J., Shi, J., and Luo, P. (November, January 27). CamNet: Coarse-to-fine retrieval for camera re-localization. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"44","DOI":"10.2197\/ipsjtcva.3.44","article-title":"Scene Reconstruction and Visualization from Internet Photo Collections: A Survey","volume":"3","author":"Snavely","year":"2011","journal-title":"IPSJ Trans. Comput. Vis. Appl."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Sattler, T., Leibe, B., and Kobbelt, L. (2012, January 7\u201313). Improving image-based localization by active correspondence search. Proceedings of the European Conference on Computer Vision, Florence, Italy.","DOI":"10.1007\/978-3-642-33718-5_54"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Li, Y., Snavely, N., Huttenlocher, D., and Fua, P. (2012, January 7\u201313). Worldwide pose estimation using 3d point clouds. Proceedings of the European Conference on Computer Vision, Florence, Italy.","DOI":"10.1007\/978-3-642-33718-5_2"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Boin, J.B., Bobkov, D., Steinbach, E., and Girod, B. (2019, January 4\u20136). Efficient panorama database indexing for indoor localization. Proceedings of the 2019 IEEE International Conference on Content-Based Multimedia Indexing, Dublin, Ireland.","DOI":"10.1109\/CBMI.2019.8877395"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1023\/A:1007963824710","article-title":"SUSAN\u2014A new approach to low level image processing","volume":"23","author":"Smith","year":"1997","journal-title":"Int. J. Comput. Vis."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"DeTone, D., Malisiewicz, T., and Rabinovich, A. (2018, January 18\u201322). Superpoint: Self-supervised interest point detection and description. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00060"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"2953","DOI":"10.1109\/TMI.2022.3175478","article-title":"D2-Net: Dual Disentanglement Network for Brain Tumor Segmentation with Missing Modalities","volume":"41","author":"Yang","year":"2022","journal-title":"IEEE Trans. Med. Imaging"},{"key":"ref_24","unstructured":"Du, J., Wang, R., and Cremers, D. (2020). European Conference on Computer Vision, Springer."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Sun, J., Shen, Z., Wang, Y., Bao, H., and Zhou, X. (2021). LoFTR: Detector-Free Local Feature Matching with Transformers. arXiv.","DOI":"10.1109\/CVPR46437.2021.00881"},{"key":"ref_26","unstructured":"Ono, Y., Trulls, E., Fua, P., and Yi, K.M. (2018). LF-Net: Learning local features from images. arXiv."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Sarlin, P.-E., DeTone, D., Malisiewicz, T., and Rabinovich, A. (2020, January 13\u201319). SuperGlue: Learning Feature Matching with Graph Neural Networks. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00499"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1142\/S0218001411008774","article-title":"A Stable Direct Solution of Perspective-Three-Point Problem","volume":"25","author":"Li","year":"2011","journal-title":"Int. J. Pattern Recognit. Artif. Intell."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/15\/6941\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:25:44Z","timestamp":1760127944000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/15\/6941"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,4]]},"references-count":28,"journal-issue":{"issue":"15","published-online":{"date-parts":[[2023,8]]}},"alternative-id":["s23156941"],"URL":"https:\/\/doi.org\/10.3390\/s23156941","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,4]]}}}