{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,8]],"date-time":"2026-07-08T03:28:01Z","timestamp":1783481281244,"version":"3.55.0"},"publisher-location":"New York, NY, USA","reference-count":47,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,12,18]],"date-time":"2018-12-18T00:00:00Z","timestamp":1545091200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,12,18]]},"DOI":"10.1145\/3293353.3293427","type":"proceedings-article","created":{"date-parts":[[2020,5,4]],"date-time":"2020-05-04T22:07:32Z","timestamp":1588630052000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Epipolar Geometry based Learning of Multi-view Depth and Ego-Motion from Monocular Sequences"],"prefix":"10.1145","author":[{"given":"Vignesh","family":"Prasad","sequence":"first","affiliation":[{"name":"Embedded Systems & Robotics, TCS Research & Innovation, Kolkata"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dipanjan","family":"Das","sequence":"additional","affiliation":[{"name":"Embedded Systems & Robotics, TCS Research & Innovation, Kolkata"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Brojeshwar","family":"Bhowmick","sequence":"additional","affiliation":[{"name":"Embedded Systems & Robotics, TCS Research & Innovation, Kolkata"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2020,5,3]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Mart\u00edn Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al. 2016. 
TensorFlow: A System for Large-Scale Machine Learning. In Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_3_2_1_2_1","volume-title":"Asian Conference on Computer Vision (ACCV).","author":"Alismail Hatem","year":"2016","unstructured":"Hatem Alismail, Brett Browning, and Simon Lucey. 2016. Enhancing direct camera tracking with dense feature descriptors. In Asian Conference on Computer Vision (ACCV)."},{"key":"e_1_3_2_1_3_1","volume-title":"Asian Conference on Computer Vision (ACCV).","author":"Bhowmick Brojeshwar","year":"2014","unstructured":"Brojeshwar Bhowmick, Suvam Patra, Avishek Chatterjee, Venu Madhav Govindu, and Subhashis Banerjee. 2014. Divide and conquer: Efficient large-scale structure from motion using graph partitioning. In Asian Conference on Computer Vision (ACCV)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989023"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3009977.3009990"},{"key":"e_1_3_2_1_6_1","unstructured":"David Eigen, Christian Puhrsch, and Rob Fergus. 2014. 
Depth map prediction from a single image using a multi-scale deep network. In Advances in Neural Information Processing Systems (NIPS)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2658577"},{"key":"e_1_3_2_1_8_1","volume-title":"LSD-SLAM: Large-scale Direct Monocular SLAM. In European Conference on Computer Vision (ECCV).","author":"Engel Jakob","year":"2014","unstructured":"Jakob Engel, Thomas Sch\u00f6ps, and Daniel Cremers. 2014. LSD-SLAM: Large-scale Direct Monocular SLAM. In European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/358669.358692"},{"key":"e_1_3_2_1_10_1","volume-title":"SVO: Fast Semi-Direct Monocular Visual Odometry. In IEEE International Conference on Robotics and Automation (ICRA).","author":"Forster Christian","year":"2014","unstructured":"Christian Forster, Matia Pizzoli, and Davide Scaramuzza. 2014. SVO: Fast Semi-Direct Monocular Visual Odometry. In IEEE International Conference on Robotics and Automation (ICRA)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5539802"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46484-8_45"},{"key":"e_1_3_2_1_13_1","volume-title":"Vision meets Robotics: The KITTI Dataset. The International Journal of Robotics Research (IJRR)","author":"Geiger Andreas","year":"2013","unstructured":"
Andreas Geiger, Philip Lenz, Christoph Stiller, and Raquel Urtasun. 2013. Vision meets Robotics: The KITTI Dataset. The International Journal of Robotics Research (IJRR) (2013)."},{"key":"e_1_3_2_1_14_1","volume-title":"The KITTI Vision Benchmark Suite. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Geiger Andreas","year":"2012","unstructured":"Andreas Geiger, Philip Lenz, and Raquel Urtasun. 2012. Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_15_1","volume-title":"Oisin Mac Aodha, and Gabriel Brostow","author":"Godard Cl\u00e9ment","year":"2018","unstructured":"Cl\u00e9ment Godard, Oisin Mac Aodha, and Gabriel Brostow. 2018. Digging Into Self-Supervised Monocular Depth Estimation. arXiv preprint arXiv:1806.01260 (2018)."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.699"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5220\/0006129200750085"},{"key":"e_1_3_2_1_18_1","volume-title":"Temporal Semantic Motion Segmentation Using Spatio Temporal Optimization. In International Workshop on Energy Minimization Methods in Computer Vision and Pattern Recognition (EMMCVPR).","author":"Haque Nazrul","year":"2017","unstructured":"Nazrul Haque, N Dinesh Reddy, and Madhava Krishna. 
2017. Temporal Semantic Motion Segmentation Using Spatio Temporal Optimization. In International Workshop on Energy Minimization Methods in Computer Vision and Pattern Recognition (EMMCVPR)."},{"key":"e_1_3_2_1_19_1","volume-title":"Multiple view geometry in computer vision","author":"Hartley Richard","unstructured":"Richard Hartley and Andrew Zisserman. 2003. Multiple view geometry in computer vision. Cambridge University Press."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"crossref","unstructured":"Derek Hoiem, Alexei A Efros, and Martial Hebert. 2005. Automatic photo pop-up. In ACM Transactions on Graphics (TOG).","DOI":"10.1145\/1186822.1073232"},{"key":"e_1_3_2_1_21_1","volume-title":"International Conference on Machine Learning(ICML)","author":"Ioffe Sergey","year":"2015","unstructured":"Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. International Conference on Machine Learning (ICML) (2015)."},{"key":"e_1_3_2_1_22_1","unstructured":"Max Jaderberg, Karen Simonyan, Andrew Zisserman, et al. 2015. Spatial transformer networks. In Advances in Neural Information Processing Systems (NIPS)."},{"key":"e_1_3_2_1_23_1","volume-title":"European Conference on Computer Vision (ECCV).","author":"Jason J Yu","year":"2016","unstructured":"J Yu Jason, Adam W Harley, and Konstantinos G Derpanis. 2016. 
Back to basics: Unsupervised learning of optical flow via brightness constancy and motion smoothness. In European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.494"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.87"},{"key":"e_1_3_2_1_26_1","volume-title":"International Conference on Learning Representations (ICLR).","author":"Kingma Diederik P","year":"2015","unstructured":"Diederik P Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_27_1","volume-title":"Parallel Tracking And Mapping for Small AR Workspaces. In International Symposium on Mixed and Augmented Reality (ISMAR).","author":"Klein Georg","year":"2007","unstructured":"Georg Klein and David Murray. 2007. Parallel Tracking And Mapping for Small AR Workspaces. In International Symposium on Mixed and Augmented Reality (ISMAR)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.238"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2505283"},{"key":"e_1_3_2_1_30_1","volume-title":"Edge SLAM: Edge Points Based Monocular Visual SLAM. 
In IEEE International Conference on Computer Vision (ICCV) Workshops (IEEE International Conference on Computer Vision (ICCV)W).","author":"Maity Soumyadip","year":"2017","unstructured":"Soumyadip Maity, Arindam Saha, Brojeshwar Bhowmick, Chanoh Park, Soohwan Kim, Peyman Moghadam, Clinton Fookes, Sridha Sridharan, Ivan Eichhardt, Levente Hajder, et al. 2017. Edge SLAM: Edge Points Based Monocular Visual SLAM. In IEEE International Conference on Computer Vision (ICCV) Workshops (IEEE International Conference on Computer Vision (ICCV)W)."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.438"},{"key":"e_1_3_2_1_32_1","volume-title":"Jose Maria Martinez Montiel, and Juan D Tardos","author":"Mur-Artal Raul","year":"2015","unstructured":"Raul Mur-Artal, Jose Maria Martinez Montiel, and Juan D Tardos. 2015. ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE Transactions on Robotics (TRO) (2015)."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126513"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2004.17"},{"key":"e_1_3_2_1_35_1","volume-title":"Spatio-temporal video autoencoder with differentiable memory. arXiv preprint arXiv:1511.06309","author":"Patraucean Viorica","year":"2015","unstructured":"Viorica Patraucean, Ankur Handa, and Roberto Cipolla. 2015. Spatio-temporal video autoencoder with differentiable memory. 
arXiv preprint arXiv:1511.06309 (2015)."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989442"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2007.4408828"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-61123-1_183"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.596"},{"key":"e_1_3_2_1_40_1","volume-title":"Sfm-net: Learning of structure and motion from video. arXiv:1704.07804","author":"Vijayanarasimhan Sudheendra","year":"2017","unstructured":"Sudheendra Vijayanarasimhan, Susanna Ricco, Cordelia Schmid, Rahul Sukthankar, and Katerina Fragkiadaki. 2017. Sfm-net: Learning of structure and motion from video. arXiv:1704.07804 (2017)."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00216"},{"key":"e_1_3_2_1_42_1","volume-title":"Image quality assessment: from error visibility to structural similarity","author":"Wang Zhou","year":"2004","unstructured":"Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing (2004)."},{"key":"e_1_3_2_1_43_1","unstructured":"Changchang Wu. 2007. SiftGPU: A GPU implementation of sift. (2007). 
http:\/\/cs.unc.edu\/~ccwu\/siftgpu"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2013.25"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"crossref","unstructured":"Zhenheng Yang, Peng Wang, Wei Xu, Liang Zhao, and Ramakant Nevatia. 2018. Unsupervised Learning of Geometry From Videos With Edge-Aware Depth-Normal Consistency. In AAAI.","DOI":"10.1609\/aaai.v32i1.12257"},{"key":"e_1_3_2_1_46_1","volume-title":"Optical Flow and Camera Pose. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Yin Zhichao","year":"2018","unstructured":"Zhichao Yin and Jianping Shi. 2018. GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose. 
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.700"}],"event":{"name":"ICVGIP 2018: 11th Indian Conference on Computer Vision, Graphics and Image Processing","location":"Hyderabad India","acronym":"ICVGIP 2018"},"container-title":["Proceedings of the 11th Indian Conference on Computer Vision, Graphics and Image Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3293353.3293427","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3293353.3293427","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:58:09Z","timestamp":1750208289000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3293353.3293427"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,12,18]]},"references-count":47,"alternative-id":["10.1145\/3293353.3293427","10.1145\/3293353"],"URL":"https:\/\/doi.org\/10.1145\/3293353.3293427","relation":{},"subject":[],"published":{"date-parts":[[2018,12,18]]},"assertion":[{"value":"2020-05-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}