{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T15:39:40Z","timestamp":1780501180249,"version":"3.54.1"},"reference-count":46,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,9,27]],"date-time":"2023-09-27T00:00:00Z","timestamp":1695772800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NITROAA"},{"name":"NVIDIA Corporation"},{"name":"Science & Engineering Research Board (SERB), Department of Science and Technology (DST), Government of India","award":["SRG\/2021\/002399"],"award-info":[{"award-number":["SRG\/2021\/002399"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,2,29]]},"abstract":"<jats:p>Monocular vision-based 3D scene understanding has been an integral part of many machine vision applications. Always, the objective is to measure the depth using a single RGB camera, which is at par with the depth cameras. In this regard, monocular vision-guided autonomous navigation of robots is rapidly gaining popularity among the research community. We propose an effective monocular vision-assisted method to measure the depth of an Unmanned Aerial Vehicle (UAV) from an impending frontal obstacle. This is followed by collision-free navigation in unknown GPS-denied environments. Our approach deals upon the fundamental principle of perspective vision that the size of an object relative to its field of view (FoV) increases as the center of projection moves closer towards the object. Our contribution involves modeling the depth followed by its realization through scale-invariant SURF features. Noisy depth measurements arising due to external wind, or the turbulence in the UAV, are rectified by employing a constant velocity-based Kalman filter model. Necessary control commands are then designed based on the rectified depth value to avoid the obstacle before collision. Rigorous experiments with SURF scale-invariant features reveal an overall accuracy of 88.6% with varying obstacles, in both indoor and outdoor environments.<\/jats:p>","DOI":"10.1145\/3550485","type":"journal-article","created":{"date-parts":[[2022,7,29]],"date-time":"2022-07-29T12:04:35Z","timestamp":1659096275000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":21,"title":["Monocular Vision-aided Depth Measurement from RGB Images for Autonomous UAV Navigation"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2930-8060","authenticated-orcid":false,"given":"Ram Prasad","family":"Padhy","sequence":"first","affiliation":[{"name":"Indian Institute of Information Technology, Design and Manufacturing (IIITDM), Kancheepuram, India"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8362-3873","authenticated-orcid":false,"given":"Pankaj Kumar","family":"Sa","sequence":"additional","affiliation":[{"name":"National Institute of Technology Rourkela, India"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4879-7138","authenticated-orcid":false,"given":"Fabio","family":"Narducci","sequence":"additional","affiliation":[{"name":"University of Salerno, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1358-006X","authenticated-orcid":false,"given":"Carmen","family":"Bisogni","sequence":"additional","affiliation":[{"name":"University of Salerno, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6107-114X","authenticated-orcid":false,"given":"Sambit","family":"Bakshi","sequence":"additional","affiliation":[{"name":"National Institute of Technology Rourkela, India"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2023,9,27]]},"reference":[{"key":"e_1_3_1_2_2","first-page":"3056","volume-title":"IEEE International Conference on Robotics and Automation (ICRA)","author":"Achtelik Markus","year":"2011","unstructured":"Markus Achtelik, Michael Achtelik, Stephan Weiss, and Roland Siegwart. 2011. Onboard IMU and monocular vision based control for MAVs in unknown in-and outdoor environments. In IEEE International Conference on Robotics and Automation (ICRA). IEEE, 3056\u20133063."},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1002\/rob.20400"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2020.3024011"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2007.09.014"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2608901"},{"key":"e_1_3_1_7_2","first-page":"213","volume-title":"Systems and Information Engineering Design Symposium","author":"Chen Michael Y.","year":"2013","unstructured":"Michael Y. Chen, Derrick H. Edwards, Erin L. Boehmer, Nathan M. Eller, James T. Slack, Christian R. Speck, Sean M. Brown, Hunter G. Williams, Samuel H. Wilson, Christopher S. Gillum, et\u00a0al. 2013. Designing a spatially aware and autonomous quadcopter. In Systems and Information Engineering Design Symposium. IEEE, 213\u2013218."},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10846-011-9587-z"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/TRA.2002.807557"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2019.2928615"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10846-017-0510-0"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2014.03.012"},{"key":"e_1_3_1_13_2","first-page":"443","volume-title":"International Conference on Image Analysis and Processing","author":"Falco Antonio De","year":"2019","unstructured":"Antonio De Falco, Fabio Narducci, and Alfredo Petrosino. 2019. An UAV autonomous warehouse inventorying by deep learning. In International Conference on Image Analysis and Processing. Springer, 443\u2013453."},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/358669.358692"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2015.2392531"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2013.2289091"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2015.2432455"},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1145\/3377876"},{"issue":"10","key":"e_1_3_1_19_2","article-title":"Robot path planning using analog circuit phase delay","volume":"50","author":"Garvey Sean","year":"2018","unstructured":"Sean Garvey and Scott Koziol. 2018. Robot path planning using analog circuit phase delay. IEEE Trans. Syst., Man, Cyber. Syst. 50, 10 (2018).","journal-title":"IEEE Trans. Syst., Man, Cyber. Syst."},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/3360050"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2014.2358131"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2020.3024405"},{"key":"e_1_3_1_23_2","first-page":"2166","volume-title":"American Control Conference","author":"He Zhihai","year":"2006","unstructured":"Zhihai He, Ram Venkataraman Iyer, and Phillip R. Chandler. 2006. Vision-based UAV flight control and obstacle avoidance. In American Control Conference. IEEE, 2166\u20132170."},{"key":"e_1_3_1_24_2","first-page":"448","volume-title":"AIAA Information Systems-AIAA Infotech @ Aerospace","author":"Hening Sebastian","year":"2017","unstructured":"Sebastian Hening, Corey A. Ippolito, Kalmanje S. Krishnakumar, Vahram Stepanyan, and Mircea Teodorescu. 2017. 3D LiDAR SLAM integration with GPS\/INS for UAVs in urban GPS-degraded environments. In AIAA Information Systems-AIAA Infotech @ Aerospace. AIAA, 448\u2013457."},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.3390\/drones5020052"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2021.3073687"},{"key":"e_1_3_1_27_2","first-page":"33","volume-title":"International Conference on Pervasive Technologies Related to Assistive Environments","author":"Lioulemes Alexandros","year":"2014","unstructured":"Alexandros Lioulemes, Georgios Galatas, Vangelis Metsis, Gian Luca Mariottini, and Fillia Makedon. 2014. Safety challenges in using AR.drone to collaborate with humans in indoor environments. In International Conference on Pervasive Technologies Related to Assistive Environments. ACM, 33\u201336."},{"issue":"3","key":"e_1_3_1_28_2","doi-asserted-by":"crossref","first-page":"710","DOI":"10.1109\/TPAMI.2017.2689007","article-title":"Single-view 3D scene reconstruction and parsing by attribute grammar","volume":"40","author":"Liu Xiaobai","year":"2018","unstructured":"Xiaobai Liu, Yibiao Zhao, and Song-Chun Zhu. 2018. Single-view 3D scene reconstruction and parsing by attribute grammar. IEEE Trans. Pattern Anal. Mach. Intell. 40, 3 (2018), 710\u2013725.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_3_1_30_2","doi-asserted-by":"crossref","first-page":"560","DOI":"10.1007\/978-3-319-27101-9_43","volume-title":"Advances in Artificial Intelligence and its Applications","author":"Mart\u00ednez-Carranza Jos\u00e9","year":"2015","unstructured":"Jos\u00e9 Mart\u00ednez-Carranza, Esteban Omar Garcia, Hugo Jair Escalante, and Walterio Mayol-Cuevas. 2015. Towards autonomous flight of low-cost MAVs by using a probabilistic visual odometry approach. In Advances in Artificial Intelligence and its Applications. Springer, 560\u2013573."},{"issue":"8","key":"e_1_3_1_31_2","doi-asserted-by":"crossref","first-page":"1321","DOI":"10.1109\/TSMC.2017.2668603","article-title":"Path planning for active SLAM based on the D* algorithm with negative edge weights","volume":"48","author":"Maurovi\u0107 Ivan","year":"2018","unstructured":"Ivan Maurovi\u0107, Marija Seder, Kruno Lenac, and Ivan Petrovi\u0107. 2018. Path planning for active SLAM based on the D* algorithm with negative edge weights. IEEE Trans. Syst., Man Cyber. Syst. 48, 8 (2018), 1321\u20131331.","journal-title":"IEEE Trans. Syst., Man Cyber. Syst."},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2008.926048"},{"issue":"4","key":"e_1_3_1_33_2","first-page":"7","article-title":"Comparison of feature detection and matching approaches: SIFT and SURF","volume":"2","author":"Mistry Darshana","year":"2017","unstructured":"Darshana Mistry and Asim Banerjee. 2017. Comparison of feature detection and matching approaches: SIFT and SURF. GRD J.-Glob. Res. Devel. J. Eng. 2, 4 (2017), 7\u201313.","journal-title":"GRD J.-Glob. Res. Devel. J. Eng."},{"key":"e_1_3_1_34_2","first-page":"1750","volume-title":"IEEE International Conference on Robotics and Automation","author":"Mori Takayoshi","year":"2013","unstructured":"Takayoshi Mori and Stefan Scherer. 2013. First results in detecting and avoiding frontal obstacles from a monocular camera for micro unmanned aerial vehicles. In IEEE International Conference on Robotics and Automation. IEEE, 1750\u20131757."},{"key":"e_1_3_1_35_2","doi-asserted-by":"crossref","first-page":"9423","DOI":"10.1109\/ICPR48806.2021.9412096","volume-title":"25th International Conference on Pattern Recognition (ICPR)","author":"Padhy Ram Prasad","year":"2021","unstructured":"Ram Prasad Padhy, Shahzad Ahmad, Sachin Verma, Sambit Bakshi, and Pankaj Kumar Sa. 2021. Localization of unmanned aerial vehicles in corridor environments using deep learning. In 25th International Conference on Pattern Recognition (ICPR). IEEE, 9423\u20139428."},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2018.07.012"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2018.07.099"},{"key":"e_1_3_1_38_2","article-title":"Monocular vision aided autonomous UAV navigation in indoor corridor environments","author":"Padhy Ram Prasad","year":"2018","unstructured":"Ram Prasad Padhy, Feng Xia, Suman Kumar Choudhury, Pankaj Kumar Sa, and Sambit Bakshi. 2018. Monocular vision aided autonomous UAV navigation in indoor corridor environments. IEEE Trans. Sustain. Comput. 4, 1 (2018).","journal-title":"IEEE Trans. Sustain. Comput."},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"e_1_3_1_40_2","first-page":"1355","volume-title":"IEEE\/ASME International Conference on Advanced Intelligent Mechatronics","author":"Sa Inkyu","year":"2013","unstructured":"Inkyu Sa, Hu He, Van Huynh, and Peter Corke. 2013. Monocular vision based autonomous navigation for a cost-effective MAV in GPS-denied environments. In IEEE\/ASME International Conference on Advanced Intelligent Mechatronics. IEEE, 1355\u20131360."},{"key":"e_1_3_1_41_2","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1109\/ICARES.2014.7024382","volume-title":"IEEE International Conference on Aerospace Electronics and Remote Sensing Technology","author":"Saha Simanto","year":"2014","unstructured":"Simanto Saha, Ashutosh Natraj, and Sonia Waharte. 2014. A real-time monocular vision-based frontal obstacle detection and avoidance for low cost UAVs in GPS denied environment. In IEEE International Conference on Aerospace Electronics and Remote Sensing Technology. IEEE, 189\u2013195."},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10846-017-0543-4"},{"key":"e_1_3_1_43_2","doi-asserted-by":"publisher","DOI":"10.1109\/MIM.2014.6825388"},{"key":"e_1_3_1_44_2","first-page":"2026","volume-title":"AIAA Infotech @ Aerospace","author":"Stubblebine Andrew","year":"2015","unstructured":"Andrew Stubblebine, Brennan Redmond, Brian Feie, and Elad Kivelevitch. 2015. Laser-guided quadrotor obstacle avoidance. In AIAA Infotech @ Aerospace. AIAA, 2026\u20132037."},{"key":"e_1_3_1_45_2","first-page":"1674","volume-title":"IEEE International Conference on Information and Automation","author":"Wang Chaoqun","year":"2015","unstructured":"Chaoqun Wang, Wei Liu, and Max Q.-H. Meng. 2015. Obstacle avoidance for quadrotor using improved method based on optical flow. In IEEE International Conference on Information and Automation. IEEE, 1674\u20131679."},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2020.3001816"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1145\/3486678"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3550485","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3550485","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:51:17Z","timestamp":1750182677000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3550485"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,27]]},"references-count":46,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,2,29]]}},"alternative-id":["10.1145\/3550485"],"URL":"https:\/\/doi.org\/10.1145\/3550485","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,27]]},"assertion":[{"value":"2022-02-15","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-07-08","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-09-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}