{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:24:31Z","timestamp":1750220671877,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":42,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,12,4]],"date-time":"2020-12-04T00:00:00Z","timestamp":1607040000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,12,4]]},"DOI":"10.1145\/3442705.3442706","type":"proceedings-article","created":{"date-parts":[[2021,3,22]],"date-time":"2021-03-22T02:02:26Z","timestamp":1616378546000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Self-Supervised Visual Odometry with Ego-Motion Sampling"],"prefix":"10.1145","author":[{"given":"Igor","family":"Slinko","sequence":"first","affiliation":[{"name":"Samsung AI Center Moscow, Russia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anna","family":"Vorontsova","sequence":"additional","affiliation":[{"name":"Samsung Research, Russia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dmitry","family":"Zhukov","sequence":"additional","affiliation":[{"name":"Samsung AI Center Moscow, Russia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Olga","family":"Barinova","sequence":"additional","affiliation":[{"name":"Samsung AI Center Moscow, Russia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anton","family":"Konushin","sequence":"additional","affiliation":[{"name":"Samsung AI Center Moscow, Russia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,3,21]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Pedro PB de Gusmao, Andrew Markham, and Niki Trigoni.","author":"Almalioglu","year":"2018","unstructured":"Almalioglu , Muhamad Risqi U Saputra , Pedro PB de Gusmao, Andrew Markham, and Niki Trigoni. 2018 . GANVO : Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks. arXiv preprint arXiv:1809.05786(2018). Almalioglu, Muhamad Risqi U Saputra, Pedro PB de Gusmao, Andrew Markham, and Niki Trigoni. 2018. GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks. arXiv preprint arXiv:1809.05786(2018)."},{"unstructured":"J.-W. Bian Z. Li N. Wang H.Zhan C. Shen M.-M. Cheng and Reid I. 2019. Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video. arXiv preprint arXiv:1908.10553(2019).  J.-W. Bian Z. Li N. Wang H.Zhan C. Shen M.-M. Cheng and Reid I. 2019. Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video. arXiv preprint arXiv:1908.10553(2019).","key":"e_1_3_2_1_2_1"},{"key":"e_1_3_2_1_3_1","volume-title":"The OpenCV Library. Dr. Dobb's Journal of Software Tools","author":"Bradski G.","year":"2000","unstructured":"G. Bradski . 2000. The OpenCV Library. Dr. Dobb's Journal of Software Tools ( 2000 ). G. Bradski. 2000. The OpenCV Library. Dr. Dobb's Journal of Software Tools (2000)."},{"unstructured":"Ting Chen Simon Kornblith Mohammad Norouzi and Geoffrey Hinton. 2020. A Simple Framework for Contrastive Learning of Visual Representations. arXiv:2002.05709 [cs.LG]  Ting Chen Simon Kornblith Mohammad Norouzi and Geoffrey Hinton. 2020. A Simple Framework for Contrastive Learning of Visual Representations. arXiv:2002.05709 [cs.LG]","key":"e_1_3_2_1_4_1"},{"key":"e_1_3_2_1_6_1","volume-title":"Ls-vo: Learning dense optical subspace for robust visual odometry estimation","author":"Costante Gabriele","year":"2018","unstructured":"Gabriele Costante and Thomas Alessandro Ciarfuglia . 2018 . Ls-vo: Learning dense optical subspace for robust visual odometry estimation . IEEE Robotics and Automation Letters 3, 3 (2018),1735\u20131742. Gabriele Costante and Thomas Alessandro Ciarfuglia. 2018. Ls-vo: Learning dense optical subspace for robust visual odometry estimation. IEEE Robotics and Automation Letters3, 3 (2018),1735\u20131742."},{"key":"e_1_3_2_1_7_1","volume-title":"ENG: End-to-end Neural Geometry for Robust Depth and PoseEstimation using CNNs. arXiv preprint arXiv:1807.05705(2018).","author":"Dharmasiri Thanuja","year":"2018","unstructured":"Thanuja Dharmasiri , Andrew Spek ,and Tom Drummond . 2018 . ENG: End-to-end Neural Geometry for Robust Depth and PoseEstimation using CNNs. arXiv preprint arXiv:1807.05705(2018). Thanuja Dharmasiri, Andrew Spek,and Tom Drummond. 2018. ENG: End-to-end Neural Geometry for Robust Depth and PoseEstimation using CNNs. arXiv preprint arXiv:1807.05705(2018)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_8_1","DOI":"10.1109\/ICCV.2015.316"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_9_1","DOI":"10.1007\/978-3-319-10605-2_54"},{"key":"e_1_3_2_1_10_1","volume-title":"SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation with Stacked Generative Adversarial Networks. CoRR abs\/1906.08889(2019)","author":"Feng Tuo","year":"2019","unstructured":"Tuo Feng and Dongbing Gu . 2019 . SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation with Stacked Generative Adversarial Networks. CoRR abs\/1906.08889(2019) Tuo Feng and Dongbing Gu. 2019. SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation with Stacked Generative Adversarial Networks. CoRR abs\/1906.08889(2019)"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_11_1","DOI":"10.5555\/2354409.2354978"},{"key":"e_1_3_2_1_12_1","volume-title":"Bo Ding, Huaimin Wang, Pengfei Zhang, and Lei Zhang.","author":"Geng Mingyang","year":"2019","unstructured":"Mingyang Geng , Su Ning Shang , Bo Ding, Huaimin Wang, Pengfei Zhang, and Lei Zhang. 2019 . Unsupervised Learning-based Depth Estimation aided Visual SLAM Approach. CoRR abs\/1901.07288(2019) Mingyang Geng, Su Ning Shang, Bo Ding, Huaimin Wang, Pengfei Zhang, and Lei Zhang. 2019. Unsupervised Learning-based Depth Estimation aided Visual SLAM Approach. CoRR abs\/1901.07288(2019)"},{"unstructured":"Spyros Gidaris Praveer Singh and Nikos Komodakis. 2018. Unsupervised representation learning by predicting image rotations. arXiv preprint arXiv:1803.07728(2018).  Spyros Gidaris Praveer Singh and Nikos Komodakis. 2018. Unsupervised representation learning by predicting image rotations. arXiv preprint arXiv:1803.07728(2018).","key":"e_1_3_2_1_13_1"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_14_1","DOI":"10.1109\/ICCV.2019.00907"},{"key":"e_1_3_2_1_15_1","volume-title":"Momentum contrast for unsupervised visual representation learning. arXiv preprint arXiv:1911.05722","author":"He Kaiming","year":"2019","unstructured":"Kaiming He , Haoqi Fan , Yuxin Wu , Saining Xie ,and Ross Girshick . 2019. Momentum contrast for unsupervised visual representation learning. arXiv preprint arXiv:1911.05722 ( 2019 ). Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie,and Ross Girshick. 2019. Momentum contrast for unsupervised visual representation learning. arXiv preprint arXiv:1911.05722 (2019)."},{"key":"e_1_3_2_1_16_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA)","author":"Kuemmerle R","year":"2011","unstructured":"R . Kuemmerle , G. Grisetti , H. Strasdat , K. Konolige , and W. Burgard . 2011. g2o: A General Framework for Graph Optimization . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) . Shanghai, China, 3607\u20133613. https:\/\/doi.org\/10.1109\/ICRA. 2011 .5979949 10.1109\/ICRA.2011.5979949 R .Kuemmerle, G. Grisetti, H. Strasdat, K. Konolige, and W. Burgard. 2011. g2o: A General Framework for Graph Optimization. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA). Shanghai, China, 3607\u20133613. https:\/\/doi.org\/10.1109\/ICRA.2011.5979949"},{"key":"e_1_3_2_1_17_1","volume-title":"UnDeepVO: Monocular Visual Odometry Through Unsupervised Deep Learning. 2018 IEEE International Conference on Robotics and Automation (ICRA)","author":"Li Ruihao","year":"2018","unstructured":"Ruihao Li , Sen Wang , Zhiqiang Long , and Dongbing Gu . 2018 . UnDeepVO: Monocular Visual Odometry Through Unsupervised Deep Learning. 2018 IEEE International Conference on Robotics and Automation (ICRA) (2018), 7286\u20137291. Ruihao Li, Sen Wang, Zhiqiang Long, and Dongbing Gu. 2018. UnDeepVO: Monocular Visual Odometry Through Unsupervised Deep Learning. 2018 IEEE International Conference on Robotics and Automation (ICRA) (2018), 7286\u20137291."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_18_1","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_19_1","DOI":"10.5555\/850924.851523"},{"unstructured":"Zhaoyang Lv Kihwan Kim Alejandro Troccoli Deqing Sun James Rehg and Jan Kautz. 2018. Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation. In ECCV.  Zhaoyang Lv Kihwan Kim Alejandro Troccoli Deqing Sun James Rehg and Jan Kautz. 2018. Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation. In ECCV.","key":"e_1_3_2_1_20_1"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_21_1","DOI":"10.1109\/CVPR.2018.00594"},{"key":"e_1_3_2_1_22_1","volume-title":"Tardos","author":"Mur-Artal Raul","year":"2016","unstructured":"Raul Mur-Artal and Juan D . Tardos . 2016 . Visual-Inertial Monocular SLAM with Map Reuse. CoRRabs\/ 1610.05949 (2016). Raul Mur-Artal and Juan D. Tardos. 2016. Visual-Inertial Monocular SLAM with Map Reuse. CoRRabs\/1610.05949 (2016)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_23_1","DOI":"10.1109\/TRO.2017.2705103"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_24_1","DOI":"10.1109\/ICCV.2011.6126513"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_25_1","DOI":"10.1109\/CVPR.2017.638"},{"volume-title":"Trajectory Alignment and Evaluation in SLAM : Horn","author":"Salas Marta","unstructured":"Marta Salas , Es, and Yasir Latif . 2010. Trajectory Alignment and Evaluation in SLAM : Horn \u2019 s Method vs Alignment on the Manifold . Marta Salas, Es, and Yasir Latif. 2010. Trajectory Alignment and Evaluation in SLAM : Horn \u2019 s Method vs Alignment on the Manifold.","key":"e_1_3_2_1_26_1"},{"doi-asserted-by":"crossref","unstructured":"Torsten Sattler Qunjie Zhou Marc Pollefeys and Laura Leal-Taixe. 2019. Understanding the Limitations of CNN-based Absolute Camera Pose Regression. arXiv:1903.07504 [cs.CV]  Torsten Sattler Qunjie Zhou Marc Pollefeys and Laura Leal-Taixe. 2019. Understanding the Limitations of CNN-based Absolute Camera Pose Regression. arXiv:1903.07504 [cs.CV]","key":"e_1_3_2_1_27_1","DOI":"10.1109\/CVPR.2019.00342"},{"key":"e_1_3_2_1_28_1","volume-title":"BAD SLAM: Bundle Adjusted Direct RGB-D SLAM. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Schops Thomas","year":"2019","unstructured":"Thomas Schops , Torsten Sattler , and Marc Pollefeys . 2019 . BAD SLAM: Bundle Adjusted Direct RGB-D SLAM. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Thomas Schops, Torsten Sattler, and Marc Pollefeys. 2019. BAD SLAM: Bundle Adjusted Direct RGB-D SLAM. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"unstructured":"Igor Slinko Anna Vorontsova Filipp Konokhov Olga Barinova and Anton Konushin. 2019. Scene Motion Decomposition for Learnable Visual Odometry. CoRR abs\/1907.07227 (2019)  Igor Slinko Anna Vorontsova Filipp Konokhov Olga Barinova and Anton Konushin. 2019. Scene Motion Decomposition for Learnable Visual Odometry. CoRR abs\/1907.07227 (2019)","key":"e_1_3_2_1_29_1"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_30_1","DOI":"10.1109\/CVPR.2018.00931"},{"key":"e_1_3_2_1_31_1","volume-title":"DeepV2D: Video to Depth with Differentiable Structure from Motion. CoRR abs\/1812.04605","author":"Teed Zachary","year":"2018","unstructured":"Zachary Teed and Jia Deng . 2018. DeepV2D: Video to Depth with Differentiable Structure from Motion. CoRR abs\/1812.04605 ( 2018 ). arXiv:1812.04605 Zachary Teed and Jia Deng. 2018. DeepV2D: Video to Depth with Differentiable Structure from Motion. CoRR abs\/1812.04605 (2018). arXiv:1812.04605"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_32_1","DOI":"10.1109\/CVPR.2017.596"},{"unstructured":"Sudheendra Vijayanarasimhan Susanna Ricco Cordelia Schmid Rahul Sukthankar and Katerina Fragkiadaki. 2017. SfM-Net: Learning of Structure and Motion from Video. CoRR abs\/1704.07804 (2017)  Sudheendra Vijayanarasimhan Susanna Ricco Cordelia Schmid Rahul Sukthankar and Katerina Fragkiadaki. 2017. SfM-Net: Learning of Structure and Motion from Video. CoRR abs\/1704.07804 (2017)","key":"e_1_3_2_1_33_1"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_34_1","DOI":"10.1109\/ICRA.2017.7989236"},{"key":"e_1_3_2_1_35_1","volume-title":"sequence-to-sequence probabilistic visual odometry through deep neural networks. The International Journal of Robotics Research, 4-5","author":"Wang Sen","year":"2018","unstructured":"Sen Wang , Ronald Clark , Hongkai Wen , and Niki Trigoni . 2018. End-to-end , sequence-to-sequence probabilistic visual odometry through deep neural networks. The International Journal of Robotics Research, 4-5 ( 2018 ), 513\u2013542. https:\/\/doi.org\/10.1177\/0278364917734298 10.1177\/0278364917734298 Sen Wang, Ronald Clark, Hongkai Wen, and Niki Trigoni. 2018. End-to-end, sequence-to-sequence probabilistic visual odometry through deep neural networks. The International Journal of Robotics Research, 4-5 (2018), 513\u2013542. https:\/\/doi.org\/10.1177\/0278364917734298"},{"key":"e_1_3_2_1_36_1","volume-title":"Guided Feature Selection for Deep Visual Odometry. CoRR abs\/1811.09935","author":"Xue Fei","year":"2018","unstructured":"Fei Xue , Qiuyuan Wang , Xin Wang , Wei Dong , Junqiu Wang , and Hongbin Zha . 2018. Guided Feature Selection for Deep Visual Odometry. CoRR abs\/1811.09935 ( 2018 ). Fei Xue, Qiuyuan Wang, Xin Wang,Wei Dong, Junqiu Wang, and Hongbin Zha. 2018. Guided Feature Selection for Deep Visual Odometry. CoRR abs\/1811.09935 (2018)."},{"key":"e_1_3_2_1_37_1","volume-title":"Beyond Tracking: Selecting Memory and Refining Poses for Deep Visual Odometry. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Xue Fei","year":"2019","unstructured":"Fei Xue , Xin Wang , Shunkai Li , Qiuyuan Wang , Junqiu Wang , and Hongbin Zha . 2019 . Beyond Tracking: Selecting Memory and Refining Poses for Deep Visual Odometry. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Fei Xue, Xin Wang, Shunkai Li, Qiuyuan Wang, Junqiu Wang, and Hongbin Zha. 2019. Beyond Tracking: Selecting Memory and Refining Poses for Deep Visual Odometry. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_38_1","volume-title":"Optical Flow and Camera Pose. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","volume":"2","author":"Yin Zhichao","year":"2018","unstructured":"Zhichao Yin and Jianping Shi . 2018 . GeoNet: Unsupervised Learning of Dense Depth , Optical Flow and Camera Pose. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , Vol. 2 . Zhichao Yin and Jianping Shi. 2018. GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vol. 2."},{"key":"e_1_3_2_1_39_1","volume-title":"Kejie Li, Harsh Agarwal, and Ian D. Reid.","author":"Zhan Huangying","year":"2018","unstructured":"Huangying Zhan , Ravi Garg , Chamara Saroj Weerasekera , Kejie Li, Harsh Agarwal, and Ian D. Reid. 2018 . Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction. CoRR abs\/1803.03893 (2018) Huangying Zhan, Ravi Garg, Chamara Saroj Weerasekera, Kejie Li, Harsh Agarwal, and Ian D. Reid. 2018. Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction. CoRR abs\/1803.03893 (2018)"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_40_1","DOI":"10.1007\/978-3-319-46487-9_40"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_41_1","DOI":"10.1109\/IROS.2018.8594151"},{"key":"e_1_3_2_1_42_1","volume-title":"DeepTAM: Deep Tracking and Mapping. In European Conference on Computer Vision (ECCV).","author":"Zhou Huizhong","year":"2018","unstructured":"Huizhong Zhou , Benjamin Ummenhofer , and Thomas Brox . 2018 . DeepTAM: Deep Tracking and Mapping. In European Conference on Computer Vision (ECCV). Huizhong Zhou, Benjamin Ummenhofer, and Thomas Brox. 2018. DeepTAM: Deep Tracking and Mapping. In European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_2_1_43_1","first-page":"7","article-title":"Unsupervised learning of depth and ego-motion from video","volume":"2","author":"Zhou Tinghui","year":"2017","unstructured":"Tinghui Zhou , Matthew Brown , Noah Snavely , and David G Lowe . 2017 . Unsupervised learning of depth and ego-motion from video . In CVPR , Vol. 2. 7 . Tinghui Zhou, Matthew Brown, Noah Snavely, and David G Lowe. 2017. Unsupervised learning of depth and ego-motion from video. In CVPR, Vol. 2. 7.","journal-title":"CVPR"}],"event":{"acronym":"VSIP '20","name":"VSIP '20: 2020 2nd International Conference on Video, Signal and Image Processing","location":"Jakarta Indonesia"},"container-title":["2020 2nd International Conference on Video, Signal and Image Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442705.3442706","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3442705.3442706","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:03:03Z","timestamp":1750197783000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442705.3442706"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,4]]},"references-count":42,"alternative-id":["10.1145\/3442705.3442706","10.1145\/3442705"],"URL":"https:\/\/doi.org\/10.1145\/3442705.3442706","relation":{},"subject":[],"published":{"date-parts":[[2020,12,4]]},"assertion":[{"value":"2021-03-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}