{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,12]],"date-time":"2026-02-12T17:28:52Z","timestamp":1770917332018,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,10,29]],"date-time":"2023-10-29T00:00:00Z","timestamp":1698537600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,11,2]]},"DOI":"10.1145\/3607834.3616567","type":"proceedings-article","created":{"date-parts":[[2023,10,25]],"date-time":"2023-10-25T00:09:37Z","timestamp":1698192577000},"page":"7-11","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Orientation-Guided Contrastive Learning for UAV-View Geo-Localisation"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4511-4223","authenticated-orcid":false,"given":"Fabian","family":"Deuser","sequence":"first","affiliation":[{"name":"University of the Bundeswehr Munich, Munich, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0638-276X","authenticated-orcid":false,"given":"Konrad","family":"Habel","sequence":"additional","affiliation":[{"name":"University of the Bundeswehr Munich, Munich, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6951-8022","authenticated-orcid":false,"given":"Martin","family":"Werner","sequence":"additional","affiliation":[{"name":"Technische Universit\u00e4t M\u00fcnchen, Munich, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-8361-2719","authenticated-orcid":false,"given":"Norbert","family":"Oswald","sequence":"additional","affiliation":[{"name":"University of the Bundeswehr Munich, Munich, Germany"}]}],"member":"320","published-online":{"date-parts":[[2023,10,29]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2019.00170"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2019.8794030"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2021.3135013"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2018.00060"},{"key":"e_1_3_2_1_5_1","volume-title":"Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation. arXiv preprint arXiv:2303.11851","author":"Deuser Fabian","year":"2023","unstructured":"Fabian Deuser , Konrad Habel , and Norbert Oswald . 2023. Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation. arXiv preprint arXiv:2303.11851 ( 2023 ). Fabian Deuser, Konrad Habel, and Norbert Oswald. 2023. Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation. arXiv preprint arXiv:2303.11851 (2023)."},{"key":"e_1_3_2_1_6_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASE.2022.3232025"},{"key":"e_1_3_2_1_8_1","volume-title":"International Conference on Machine Learning. PMLR, 3887--3896","author":"Guo Ruiqi","year":"2020","unstructured":"Ruiqi Guo , Philip Sun , Erik Lindgren , Quan Geng , David Simcha , Felix Chern , and Sanjiv Kumar . 2020 . Accelerating large-scale inference with anisotropic vector quantization . In International Conference on Machine Learning. PMLR, 3887--3896 . Ruiqi Guo, Philip Sun, Erik Lindgren, Quan Geng, David Simcha, Felix Chern, and Sanjiv Kumar. 2020. Accelerating large-scale inference with anisotropic vector quantization. In International Conference on Machine Learning. PMLR, 3887--3896."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00975"},{"key":"e_1_3_2_1_10_1","volume-title":"Multi-view drone-based geo-localization via style and spatial alignment. arXiv preprint arXiv:2006.13681","author":"Hu Siyi","year":"2020","unstructured":"Siyi Hu and Xiaojun Chang . 2020. Multi-view drone-based geo-localization via style and spatial alignment. arXiv preprint arXiv:2006.13681 ( 2020 ). Siyi Hu and Xiaojun Chang. 2020. Multi-view drone-based geo-localization via style and spatial alignment. arXiv preprint arXiv:2006.13681 (2020)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.5143773"},{"key":"e_1_3_2_1_12_1","volume-title":"Optimization of indexing based on k-nearest neighbor graph for proximity search in high-dimensional data. arXiv preprint arXiv:1810.07355","author":"Iwasaki Masajiro","year":"2018","unstructured":"Masajiro Iwasaki and Daisuke Miyazaki . 2018. Optimization of indexing based on k-nearest neighbor graph for proximity search in high-dimensional data. arXiv preprint arXiv:1810.07355 ( 2018 ). Masajiro Iwasaki and Daisuke Miyazaki. 2018. Optimization of indexing based on k-nearest neighbor graph for proximity search in high-dimensional data. arXiv preprint arXiv:1810.07355 (2018)."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995464"},{"key":"e_1_3_2_1_14_1","unstructured":"Gregory Koch Richard Zemel Ruslan Salakhutdinov etal 2015. Siamese neural networks for one-shot image recognition. In ICML deep learning workshop Vol. 2. Lille.  Gregory Koch Richard Zemel Ruslan Salakhutdinov et al. 2015. Siamese neural networks for one-shot image recognition. In ICML deep learning workshop Vol. 2. Lille."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.120"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00577"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00577"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01167"},{"key":"e_1_3_2_1_19_1","volume-title":"Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101","author":"Loshchilov Ilya","year":"2017","unstructured":"Ilya Loshchilov and Frank Hutter . 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 ( 2017 ). Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)."},{"key":"e_1_3_2_1_20_1","volume-title":"Proceedings of the Asian Conference on Computer Vision (ACCV). 4211--4224","author":"Lu Zeng","year":"2022","unstructured":"Zeng Lu , Tao Pu , Tianshui Chen , and Liang Lin . 2022 . Content-Aware Hierarchical Representation Selection for Cross-View Geo-Localization . In Proceedings of the Asian Conference on Computer Vision (ACCV). 4211--4224 . Zeng Lu, Tao Pu, Tianshui Chen, and Liang Lin. 2022. Content-Aware Hierarchical Representation Selection for Cross-View Geo-Localization. In Proceedings of the Asian Conference on Computer Vision (ACCV). 4211--4224."},{"key":"e_1_3_2_1_21_1","volume-title":"Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748","author":"van den Oord Aaron","year":"2018","unstructured":"Aaron van den Oord , Yazhe Li , and Oriol Vinyals . 2018. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 ( 2018 ). Aaron van den Oord, Yazhe Li, and Oriol Vinyals. 2018. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 (2018)."},{"key":"e_1_3_2_1_22_1","unstructured":"Maxime Oquab Timoth\u00e9e Darcet Th\u00e9o Moutakanni Huy Vo Marc Szafraniec Vasil Khalidov Pierre Fernandez Daniel Haziza Francisco Massa Alaaeldin El-Nouby etal 2023. Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193 (2023).  Maxime Oquab Timoth\u00e9e Darcet Th\u00e9o Moutakanni Huy Vo Marc Szafraniec Vasil Khalidov Pierre Fernandez Daniel Haziza Francisco Massa Alaaeldin El-Nouby et al. 2023. Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193 (2023)."},{"key":"e_1_3_2_1_23_1","volume-title":"International conference on machine learning. PMLR, 8748--8763","author":"Radford Alec","year":"2021","unstructured":"Alec Radford , Jong Wook Kim , Chris Hallacy , Aditya Ramesh , Gabriel Goh , Sandhini Agarwal , Girish Sastry , Amanda Askell , Pamela Mishkin , Jack Clark , 2021 . Learning transferable visual models from natural language supervision . In International conference on machine learning. PMLR, 8748--8763 . Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning. PMLR, 8748--8763."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01300"},{"key":"e_1_3_2_1_25_1","volume-title":"Structure-From-Motion Revisited. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Johannes","unstructured":"Johannes L. Schonberger and Jan-Michael Frahm. 2016 . Structure-From-Motion Revisited. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Johannes L. Schonberger and Jan-Michael Frahm. 2016. Structure-From-Motion Revisited. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_26_1","volume-title":"Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track. https:\/\/openreview.net\/forum?id=M3Y74vmsMcY","author":"Schuhmann Christoph","year":"2022","unstructured":"Christoph Schuhmann , Romain Beaumont , Richard Vencu , Cade W Gordon , Ross Wightman , Mehdi Cherti , Theo Coombes , Aarush Katta , Clayton Mullis , Mitchell Wortsman , Patrick Schramowski , Srivatsa R Kundurthy , Katherine Crowson , Ludwig Schmidt , Robert Kaczmarczyk , and Jenia Jitsev . 2022 . LAION-5B: An open large-scale dataset for training next generation image-text models . In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track. https:\/\/openreview.net\/forum?id=M3Y74vmsMcY Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade W Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman, Patrick Schramowski, Srivatsa R Kundurthy, Katherine Crowson, Ludwig Schmidt, Robert Kaczmarczyk, and Jenia Jitsev. 2022. LAION-5B: An open large-scale dataset for training next generation image-text models. In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track. https:\/\/openreview.net\/forum?id=M3Y74vmsMcY"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01650"},{"key":"e_1_3_2_1_28_1","volume-title":"Advances in Neural Information Processing Systems, , H. Wallach, H. Larochelle, A. Beygelzimer, F. dtextquotesingle Alch\u00e9-Buc","author":"Shi Yujiao","year":"2019","unstructured":"Yujiao Shi , Liu Liu , Xin Yu , and Hongdong Li. 2019. Spatial-Aware Feature Aggregation for Image based Cross-View Geo-Localization . In Advances in Neural Information Processing Systems, , H. Wallach, H. Larochelle, A. Beygelzimer, F. dtextquotesingle Alch\u00e9-Buc , E. Fox, and R. Garnett (Eds.), Vol. 32 . Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/ 2019 \/file\/ba2f0015122a5955f8b3a50240fb91b2-Paper.pdf Yujiao Shi, Liu Liu, Xin Yu, and Hongdong Li. 2019. Spatial-Aware Feature Aggregation for Image based Cross-View Geo-Localization. In Advances in Neural Information Processing Systems, , H. Wallach, H. Larochelle, A. Beygelzimer, F. dtextquotesingle Alch\u00e9-Buc, E. Fox, and R. Garnett (Eds.), Vol. 32. Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2019\/file\/ba2f0015122a5955f8b3a50240fb91b2-Paper.pdf"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00412"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TITS.2023.3234627"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00807"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01258"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2021.3061265"},{"key":"e_1_3_2_1_34_1","volume-title":"Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization. ArXiv","author":"Wang Ting","year":"2022","unstructured":"Ting Wang , Zhedong Zheng , Zunjie Zhu , Yu-Fei Gao , Yi Yang , and Chenggang Yan . 2022. Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization. ArXiv , Vol. abs\/ 2211 .05296 ( 2022 ). Ting Wang, Zhedong Zheng, Zunjie Zhu, Yu-Fei Gao, Yi Yang, and Chenggang Yan. 2022. Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization. ArXiv , Vol. abs\/2211.05296 (2022)."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.451"},{"key":"e_1_3_2_1_36_1","volume-title":"Cross-view Geo-localization via Learning Disentangled Geometric Layout Correspondence. arXiv preprint arXiv:2212.04074","author":"Zhang Xiaohan","year":"2022","unstructured":"Xiaohan Zhang , Xingyu Li , Waqas Sultani , Yi Zhou , and Safwan Wshah . 2022. Cross-view Geo-localization via Learning Disentangled Geometric Layout Correspondence. arXiv preprint arXiv:2212.04074 ( 2022 ). Xiaohan Zhang, Xingyu Li, Waqas Sultani, Yi Zhou, and Safwan Wshah. 2022. Cross-view Geo-localization via Learning Disentangled Geometric Layout Correspondence. arXiv preprint arXiv:2212.04074 (2022)."},{"key":"e_1_3_2_1_37_1","volume-title":"Proceedings of the 31th ACM International Conference on Multimedia Workshop.","author":"Zheng Zhedong","year":"2023","unstructured":"Zhedong Zheng , Yujiao Shi , Tingyu Wang , Jun Liu , Jianwu Fang , Yunchao Wei , and Tat-seng Chua. 2023 . UAVs in Multimedia: Capturing the World from a New Perspective . In Proceedings of the 31th ACM International Conference on Multimedia Workshop. Zhedong Zheng, Yujiao Shi, Tingyu Wang, Jun Liu, Jianwu Fang, Yunchao Wei, and Tat-seng Chua. 2023. UAVs in Multimedia: Capturing the World from a New Perspective. In Proceedings of the 31th ACM International Conference on Multimedia Workshop."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413896"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.3390\/s23020720"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.3390\/s23020720"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00123"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV48630.2021.00080"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00364"}],"event":{"name":"MM '23: The 31st ACM International Conference on Multimedia","location":"Ottawa ON Canada","acronym":"MM '23","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3607834.3616567","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3607834.3616567","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:05Z","timestamp":1750178225000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3607834.3616567"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,29]]},"references-count":43,"alternative-id":["10.1145\/3607834.3616567","10.1145\/3607834"],"URL":"https:\/\/doi.org\/10.1145\/3607834.3616567","relation":{},"subject":[],"published":{"date-parts":[[2023,10,29]]},"assertion":[{"value":"2023-10-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}