{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T13:05:12Z","timestamp":1777640712166,"version":"3.51.4"},"reference-count":69,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2023,10,25]],"date-time":"2023-10-25T00:00:00Z","timestamp":1698192000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,10,25]],"date-time":"2023-10-25T00:00:00Z","timestamp":1698192000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2024,4]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The existing methods for addressing visual navigation employ deep reinforcement learning as the standard tool for the task. However, they tend to be vulnerable to statistical shifts between the training and test data, resulting in poor generalization over novel environments that are out-of-distribution from the training data. In this study, we attempt to improve the generalization ability by utilizing the inductive biases available for the task. Employing the active neural SLAM that learns policies with the advantage actor-critic method as the base framework, we first point out that the mappings represented by the actor and the critic should satisfy specific symmetries. We then propose a network design for the actor and the critic to inherently attain these symmetries. Specifically, we use <jats:italic>G<\/jats:italic>-convolution instead of the standard convolution and insert the semi-global polar pooling layer, which we newly design in this study, in the last section of the critic network. 
Our method can be integrated into existing methods that utilize intermediate goals and 2D occupancy maps. Experimental results show that our method improves generalization ability by a good margin over visual exploration and object goal navigation, which are two main embodied visual navigation tasks.<\/jats:p>","DOI":"10.1007\/s11263-023-01909-4","type":"journal-article","created":{"date-parts":[[2023,10,25]],"date-time":"2023-10-25T07:02:21Z","timestamp":1698217341000},"page":"1091-1107","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Symmetry-aware Neural Architecture for Embodied Visual Navigation"],"prefix":"10.1007","volume":"132","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3528-8131","authenticated-orcid":false,"given":"Shuang","family":"Liu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Masanori","family":"Suganuma","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Takayuki","family":"Okatani","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,10,25]]},"reference":[{"key":"1909_CR1","unstructured":"Anderson, P., Chang, A., Chaplot, D. S., et\u00a0al. (2018). On evaluation of embodied navigation agents. arXiv preprint arXiv:1807.06757"},{"key":"1909_CR2","doi-asserted-by":"crossref","unstructured":"Beeching, E., Dibangoye, J., Simonin, O., et\u00a0al. (2020). Egomap: Projective mapping and structured egocentric memory for deep rl. 
In: Joint European conference on machine learning and knowledge discovery in databases, Springer, pp 525\u2013540","DOI":"10.1007\/978-3-030-67661-2_31"},{"issue":"3","key":"1909_CR3","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1007\/s10846-008-9235-4","volume":"53","author":"F Bonin-Font","year":"2008","unstructured":"Bonin-Font, F., Ortiz, A., & Oliver, G. (2008). Visual navigation for mobile robots: A survey. Journal of intelligent and robotic systems, 53(3), 263\u2013296.","journal-title":"Journal of intelligent and robotic systems"},{"issue":"6","key":"1909_CR4","doi-asserted-by":"publisher","first-page":"1309","DOI":"10.1109\/TRO.2016.2624754","volume":"32","author":"C Cadena","year":"2016","unstructured":"Cadena, C., Carlone, L., Carrillo, H., et al. (2016). Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age. IEEE Transactions on robotics, 32(6), 1309\u20131332.","journal-title":"IEEE Transactions on robotics"},{"key":"1909_CR5","doi-asserted-by":"crossref","unstructured":"Calimeri, F., Marzullo, A., Stamile, C., et\u00a0al. (2017). Biomedical data augmentation using generative adversarial neural networks. In: International conference on artificial neural networks, Springer, pp 626\u2013634","DOI":"10.1007\/978-3-319-68612-7_71"},{"key":"1909_CR6","doi-asserted-by":"crossref","unstructured":"Chang, A., Dai, A., Funkhouser, T., et\u00a0al. (2017). Matterport3d: Learning from rgb-d data in indoor environments. In: International conference on 3D vision (3DV).","DOI":"10.1109\/3DV.2017.00081"},{"key":"1909_CR7","unstructured":"Chaplot, D. S., Gandhi, D., Gupta, S., et\u00a0al. (2020a). Learning to explore using active neural slam. In: International conference on learning representations, URL https:\/\/openreview.net\/forum?id=HklXn1BKDH"},{"key":"1909_CR8","first-page":"4247","volume":"33","author":"DS Chaplot","year":"2020","unstructured":"Chaplot, D. S., Gandhi, D. P., Gupta, A., et al. (2020). 
Object goal navigation using goal-oriented semantic exploration. Advances in Neural Information Processing Systems, 33, 4247.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"1909_CR9","doi-asserted-by":"crossref","unstructured":"Chaplot, D. S., Jiang, H., Gupta, S., et\u00a0al. (2020c). Semantic curiosity for active visual learning. In: European conference on computer vision, Springer, pp 309\u2013326.","DOI":"10.1007\/978-3-030-58539-6_19"},{"key":"1909_CR10","unstructured":"Chen, C., Majumder, S., Al-Halah, Z., et\u00a0al. (2021). Learning to set waypoints for audio-visual navigation. In: International conference on learning representations, URL https:\/\/openreview.net\/forum?id=cR91FAodFMe"},{"key":"1909_CR11","unstructured":"Chen, T., Gupta, S., & Gupta, A. (2019). Learning exploration policies for navigation. In: International conference on learning representations, URL https:\/\/openreview.net\/forum?id=SyMWn05F7"},{"issue":"12","key":"1909_CR12","doi-asserted-by":"publisher","first-page":"7405","DOI":"10.1109\/TGRS.2016.2601622","volume":"54","author":"G Cheng","year":"2016","unstructured":"Cheng, G., Zhou, P., & Han, J. (2016). Learning rotation-invariant convolutional neural networks for object detection in vhr optical remote sensing images. IEEE Transactions on Geoscience and Remote Sensing, 54(12), 7405\u20137415.","journal-title":"IEEE Transactions on Geoscience and Remote Sensing"},{"key":"1909_CR13","doi-asserted-by":"crossref","unstructured":"Choi, Y., & Oh, S. (2021). Image-goal navigation via keypoint-based reinforcement learning. In: 2021 18th international conference on ubiquitous robots (UR), IEEE, pp 18\u201321.","DOI":"10.1109\/UR52253.2021.9494664"},{"key":"1909_CR14","unstructured":"Cohen, T., & Welling, M. (2016). Group equivariant convolutional networks. In: Balcan, M. F., Weinberger, K. Q. 
(eds) Proceedings of The 33rd international conference on machine learning, proceedings of machine learning research, vol\u00a048. PMLR, pp 2990\u20132999, URL https:\/\/proceedings.mlr.press\/v48\/cohenc16.html."},{"key":"1909_CR15","doi-asserted-by":"publisher","unstructured":"Dai, A., Papatheodorou, S., Funk, N., et\u00a0al. (2020). Fast frontier-based information-driven autonomous exploration with an mav. In: 2020 IEEE International conference on robotics and automation (ICRA), pp 9570\u20139576, https:\/\/doi.org\/10.1109\/ICRA40945.2020.9196707.","DOI":"10.1109\/ICRA40945.2020.9196707"},{"key":"1909_CR16","unstructured":"Dey, N., Chen, A., & Ghafurian, S. (2020). Group equivariant generative adversarial networks. CoRR arXiv:2005.01683."},{"key":"1909_CR17","unstructured":"Dieleman, S., Fauw, J. D., & Kavukcuoglu, K. (2016). Exploiting Cyclic Symmetry in Convolutional Neural Networks. In: Proceedings of the 33rd international conference on machine learning. JMLR, pp 1889\u20131898."},{"key":"1909_CR18","doi-asserted-by":"crossref","unstructured":"Du, H., Yu, X., & Zheng, L. (2020). Learning object relation graph and tentative policy for visual navigation. In: European conference on computer vision, Springer, pp 19\u201334","DOI":"10.1007\/978-3-030-58571-6_2"},{"key":"1909_CR19","doi-asserted-by":"crossref","unstructured":"Gan, C., Zhang, Y., Wu, J., et\u00a0al. (2020). Look, listen, and act: Towards audio-visual embodied navigation. In: 2020 IEEE International conference on robotics and automation (ICRA), IEEE, pp 9701\u20139707.","DOI":"10.1109\/ICRA40945.2020.9197008"},{"key":"1909_CR20","volume-title":"Deep Learning","author":"I Goodfellow","year":"2016","unstructured":"Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press."},{"key":"1909_CR21","doi-asserted-by":"crossref","unstructured":"Gupta, S., Davidson, J., Levine, S., et\u00a0al. (2017). Cognitive mapping and planning for visual navigation. 
In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2616\u20132625.","DOI":"10.1109\/CVPR.2017.769"},{"key":"1909_CR22","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., et\u00a0al. (2017). Mask r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 2961\u20132969.","DOI":"10.1109\/ICCV.2017.322"},{"key":"1909_CR23","doi-asserted-by":"crossref","unstructured":"Jayaraman, D., & Grauman, K. (2018). Learning to look around: Intelligently exploring unseen environments for unknown tasks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1238\u20131247.","DOI":"10.1109\/CVPR.2018.00135"},{"key":"1909_CR24","first-page":"19884","volume":"33","author":"M Laskin","year":"2020","unstructured":"Laskin, M., Lee, K., Stooke, A., et al. (2020). Reinforcement learning with augmented data. Advances in Neural Information Processing Systems, 33, 19884\u201319895.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"1909_CR25","doi-asserted-by":"crossref","unstructured":"Lin, T. Y., Maire, M., Belongie, S., et\u00a0al. (2014). Microsoft coco: Common objects in context. In: European conference on computer vision, Springer, pp 740\u2013755.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"1909_CR26","doi-asserted-by":"crossref","unstructured":"Lindeberg, T. (2021). Scale-covariant and scale-invariant gaussian derivative networks. In: International conference on scale space and variational methods in computer vision, Springer, pp 3\u201314.","DOI":"10.1007\/978-3-030-75549-2_1"},{"key":"1909_CR27","doi-asserted-by":"crossref","unstructured":"Liu, S., & Okatani, T. (2022). Symmetry-aware neural architecture for embodied visual exploration. 
In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 17,242\u201317,251.","DOI":"10.1109\/CVPR52688.2022.01673"},{"key":"1909_CR28","doi-asserted-by":"publisher","unstructured":"Liu, S., Ozay, M., Xu, H., et\u00a0al. (2019). A generative model of underwater images for active landmark detection and docking. In: 2019 IEEE\/RSJ International conference on intelligent robots and systems (IROS), pp 8034\u20138039, https:\/\/doi.org\/10.1109\/IROS40897.2019.8968146.","DOI":"10.1109\/IROS40897.2019.8968146"},{"key":"1909_CR29","unstructured":"Lv, Y., Xie, N., Shi, Y., et\u00a0al. (2020). Improving target-driven visual navigation with attention on 3d spatial relationships. CoRR arXiv:2005.02153."},{"key":"1909_CR30","doi-asserted-by":"crossref","unstructured":"Madani, A., Moradi, M., Karargyris, A., et\u00a0al. (2018). Chest x-ray generation and data augmentation for cardiovascular abnormality classification. In: Medical imaging 2018: Image processing, international society for optics and photonics, p 105741M.","DOI":"10.1117\/12.2293971"},{"key":"1909_CR31","doi-asserted-by":"crossref","unstructured":"Mayo, B., Hazan, T., & Tal, A. (2021). Visual navigation with spatial attention. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 16,898\u201316,907.","DOI":"10.1109\/CVPR46437.2021.01662"},{"key":"1909_CR32","unstructured":"Mezghani, L., Sukhbaatar, S., Szlam, A., et\u00a0al. (2020). Learning to visually navigate in photorealistic environments without any supervision. CoRR arXiv:2004.04954."},{"key":"1909_CR33","doi-asserted-by":"crossref","unstructured":"Mezghani, L., Sukhbaatar, S., Lavril, T., et\u00a0al. (2021). Memory-augmented reinforcement learning for image-goal navigation. CoRR arXiv:2101.05181.","DOI":"10.1109\/IROS47612.2022.9981090"},{"key":"1909_CR34","unstructured":"Mirowski, P., Pascanu, R., Viola, F., et\u00a0al. (2016). Learning to navigate in complex environments. 
arXiv preprint arXiv:1611.03673"},{"key":"1909_CR35","unstructured":"Mishkin, D., Dosovitskiy, A., & Koltun, V. (2019). Benchmarking classic and learned navigation in complex 3d environments. arXiv preprint arXiv:1901.10915"},{"key":"1909_CR36","unstructured":"Mnih, V., Badia, A. P., Mirza, M., et\u00a0al. (2016). Asynchronous methods for deep reinforcement learning. In: International conference on machine learning, PMLR, pp 1928\u20131937."},{"key":"1909_CR37","unstructured":"M\u00fcller P, Golkov V, Tomassini V, et\u00a0al (2021) Rotation-equivariant deep learning for diffusion MRI. CoRR arXiv:2102.06942."},{"key":"1909_CR38","unstructured":"Nachum, O., Gu, S. S., Lee, H., et\u00a0al. (2018). Data-efficient hierarchical reinforcement learning. Advances in Neural Information Processing Systems 31. https:\/\/dl.acm.org\/doi\/abs\/10.5555\/3327144.3327250"},{"key":"1909_CR39","first-page":"2005","volume":"33","author":"T Nagarajan","year":"2020","unstructured":"Nagarajan, T., & Grauman, K. (2020). Learning affordance landscapes for interaction exploration in 3d environments. Advances in Neural Information Processing Systems, 33, 2005.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"1909_CR40","unstructured":"Pal, A., Qiu, Y., & Christensen, H. (2021). Learning hierarchical relationships for object-goal navigation. In: Conference on robot learning, PMLR, pp 517\u2013528."},{"key":"1909_CR41","doi-asserted-by":"crossref","unstructured":"Pathak, D., Agrawal, P., Efros, A. A., et\u00a0al. (2017). Curiosity-driven exploration by self-supervised prediction. In: International conference on machine learning, PMLR, pp 2778\u20132787.","DOI":"10.1109\/CVPRW.2017.70"},{"key":"1909_CR42","unstructured":"Qi, W., Mullapudi, R. T., Gupta, S., et\u00a0al. (2020). Learning to move with affordance maps. 
In: International conference on learning representations, URL https:\/\/openreview.net\/forum?id=BJgMFxrYPB"},{"key":"1909_CR43","unstructured":"Raileanu, R., Goldstein, M., Yarats, D., et\u00a0al. (2021). Automatic data augmentation for generalization in deep reinforcement learning. arXiv:2006.12862"},{"key":"1909_CR44","doi-asserted-by":"crossref","unstructured":"Ramakrishnan, S. K., Al-Halah, Z., & Grauman, K. (2020). Occupancy anticipation for efficient exploration and navigation. In: European conference on computer vision, Springer, pp 400\u2013418.","DOI":"10.1007\/978-3-030-58558-7_24"},{"issue":"5","key":"1909_CR45","doi-asserted-by":"publisher","first-page":"1616","DOI":"10.1007\/s11263-021-01437-z","volume":"129","author":"SK Ramakrishnan","year":"2021","unstructured":"Ramakrishnan, S. K., Jayaraman, D., & Grauman, K. (2021). An exploration of embodied visual exploration. International Journal of Computer Vision, 129(5), 1616\u20131649.","journal-title":"International Journal of Computer Vision"},{"key":"1909_CR46","unstructured":"Savinov, N., Dosovitskiy, A., & Koltun, V. (2018). Semi-parametric topological memory for navigation. In: International conference on learning representations."},{"key":"1909_CR47","doi-asserted-by":"crossref","unstructured":"Savva, M., Kadian, A., Maksymets, O., et\u00a0al. (2019). Habitat: A Platform for Embodied AI Research. In: Proceedings of the IEEE\/CVF international conference on computer vision (ICCV).","DOI":"10.1109\/ICCV.2019.00943"},{"key":"1909_CR48","unstructured":"Seifi, S., & Tuytelaars, T. (2019). Where to look next: Unsupervised active visual exploration on $$360^{\\circ }$$ input. CoRR arXiv:1909.10304."},{"key":"1909_CR49","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9781107298019","volume-title":"Understanding machine learning: From theory to algorithms","author":"S Shalev-Shwartz","year":"2014","unstructured":"Shalev-Shwartz, S., & Ben-David, S. (2014). 
Understanding machine learning: From theory to algorithms. Cambridge: Cambridge University Press."},{"key":"1909_CR50","doi-asserted-by":"crossref","unstructured":"Shen, W. B., Xu, D., Zhu, Y., et\u00a0al. (2019). Situational fusion of visual representation for visual navigation. In: Proceedings of the IEEE\/CVF international conference on computer vision, pp 2881\u20132890.","DOI":"10.1109\/ICCV.2019.00297"},{"key":"1909_CR51","doi-asserted-by":"publisher","unstructured":"Singh\u00a0Chaplot, D., Salakhutdinov, R., Gupta, A., et\u00a0al. (2020). Neural topological slam for visual navigation. In: 2020 IEEE\/CVF conference on computer vision and pattern recognition (CVPR), pp 12,872\u201312,881, https:\/\/doi.org\/10.1109\/CVPR42600.2020.01289","DOI":"10.1109\/CVPR42600.2020.01289"},{"key":"1909_CR52","unstructured":"Sosnovik, I., Szmaja, M., & Smeulders, A. (2020). Scale-equivariant steerable networks. In: International conference on learning representations, URL https:\/\/openreview.net\/forum?id=HJgpugrKPS."},{"key":"1909_CR53","doi-asserted-by":"crossref","unstructured":"Sosnovik, I., Moskalev, A., & Smeulders, A. W. (2021). Scale equivariance improves siamese tracking. In: Proceedings of the IEEE\/CVF winter conference on applications of computer vision, pp 2765\u20132774.","DOI":"10.1109\/WACV48630.2021.00281"},{"key":"1909_CR54","first-page":"251","volume":"34","author":"A Szot","year":"2021","unstructured":"Szot, A., Clegg, A., Undersander, E., et al. (2021). Habitat 2.0: Training home assistants to rearrange their habitat. Advances in Neural Information Processing Systems, 34, 251\u2013266.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"1909_CR55","unstructured":"Thiede, E. H., Hy, T., & Kondor, R. (2020). The general theory of permutation equivarant neural networks and higher order graph variational encoders. 
CoRR arXiv:2004.03990."},{"key":"1909_CR56","doi-asserted-by":"crossref","unstructured":"Visser, A., Xingrui-Ji, van Ittersum M, et al. (2008). Beyond frontier exploration. In U. Visser, F. Ribeiro, T. Ohashi, et al. (Eds.), RoboCup 2007: Robot Soccer World Cup XI (pp. 113\u2013123). Berlin: Springer.","DOI":"10.1007\/978-3-540-68847-1_10"},{"key":"1909_CR57","unstructured":"Walters, R., Li, J., & Yu, R. (2021). Trajectory prediction using equivariant continuous convolution. In: International conference on learning representations, URL https:\/\/openreview.net\/forum?id=J8_GttYLFgr"},{"key":"1909_CR58","first-page":"9700","volume":"33","author":"S Wani","year":"2020","unstructured":"Wani, S., Patel, S., Jain, U., et al. (2020). Multion: Benchmarking semantic map memory using multi-object navigation. Advances in Neural Information Processing Systems, 33, 9700\u20139712.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"1909_CR59","unstructured":"Worrall, D. E., & Welling, M. (2019). Deep scale-spaces: Equivariance over scale. In: Advances in Neural Information Processing Systems, pp 7364\u20137376."},{"key":"1909_CR60","doi-asserted-by":"publisher","unstructured":"Wu, Y., Wu, Y., Tamar, A., et\u00a0al. (2019). Bayesian relational memory for semantic visual navigation. In: 2019 IEEE\/CVF International conference on computer vision (ICCV), pp 2769\u20132779, https:\/\/doi.org\/10.1109\/ICCV.2019.00286","DOI":"10.1109\/ICCV.2019.00286"},{"key":"1909_CR61","doi-asserted-by":"crossref","unstructured":"Xia, F., R.\u00a0Zamir, A., He, Z. Y., et\u00a0al. (2018). Gibson env: real-world perception for embodied agents. In: Computer vision and pattern recognition (CVPR), 2018 IEEE conference on, IEEE.","DOI":"10.1109\/CVPR.2018.00945"},{"key":"1909_CR62","doi-asserted-by":"publisher","unstructured":"Yamauchi, B. (1997). A frontier-based approach for autonomous exploration. 
In: Proceedings 1997 IEEE international symposium on computational intelligence in robotics and automation CIRA\u201997. \u2019Towards New Computational Principles for Robotics and Automation\u2019, pp 146\u2013151, https:\/\/doi.org\/10.1109\/CIRA.1997.613851","DOI":"10.1109\/CIRA.1997.613851"},{"key":"1909_CR63","unstructured":"Yarats, D., Kostrikov, I., & Fergus, R. (2021). Image augmentation is all you need: Regularizing deep reinforcement learning from pixels. In: International conference on learning representations, URL https:\/\/openreview.net\/forum?id=GY6-6sTvGaf"},{"key":"1909_CR64","doi-asserted-by":"crossref","unstructured":"Ye, J., Batra, D., Das, A., et\u00a0al. (2021a). Auxiliary tasks and exploration enable objectgoal navigation. In: Proceedings of the IEEE\/CVF international conference on computer vision, pp 16,117\u201316,126.","DOI":"10.1109\/ICCV48922.2021.01581"},{"key":"1909_CR65","unstructured":"Ye, J., Batra, D., Wijmans, E., et\u00a0al. (2021b). Auxiliary tasks speed up learning point goal navigation. In: Kober, J., Ramos, F., & Tomlin, C. (eds) Proceedings of the 2020 conference on robot learning, proceedings of machine learning research, vol 155. PMLR, pp 498\u2013516."},{"key":"1909_CR66","unstructured":"Yu, C., Yang, X., Gao, J., et\u00a0al. (2021). Learning efficient multi-agent cooperative visual exploration. In: Deep RL Workshop NeurIPS 2021, URL https:\/\/openreview.net\/forum?id=-4Yz4vU4uN5"},{"key":"1909_CR67","unstructured":"Zhang, R. (2019). Making convolutional networks shift-invariant again. In: International conference on machine learning, PMLR, pp 7324\u20137334."},{"key":"1909_CR68","doi-asserted-by":"crossref","unstructured":"Zhang, S., Song, X., Bai, Y., et\u00a0al. (2021). Hierarchical object-to-zone graph for object navigation. 
In: Proceedings of the IEEE\/CVF international conference on computer vision, pp 15,130\u201315,140.","DOI":"10.1109\/ICCV48922.2021.01485"},{"key":"1909_CR69","doi-asserted-by":"publisher","unstructured":"Zhu, Y., Mottaghi, R., Kolve, E., et\u00a0al. (2017). Target-driven visual navigation in indoor scenes using deep reinforcement learning. In: 2017 IEEE International conference on robotics and automation (ICRA), pp 3357\u20133364, https:\/\/doi.org\/10.1109\/ICRA.2017.7989381","DOI":"10.1109\/ICRA.2017.7989381"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-023-01909-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11263-023-01909-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-023-01909-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,26]],"date-time":"2024-03-26T11:11:48Z","timestamp":1711451508000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11263-023-01909-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,25]]},"references-count":69,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,4]]}},"alternative-id":["1909"],"URL":"https:\/\/doi.org\/10.1007\/s11263-023-01909-4","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"value":"0920-5691","type":"print"},{"value":"1573-1405","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,25]]},"assertion":[{"value":"8 November 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 September 
2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 October 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}