{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T00:18:38Z","timestamp":1771978718786,"version":"3.50.1"},"reference-count":28,"publisher":"Emerald","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,9,19]]},"abstract":"<jats:sec>\n                  <jats:title>Purpose<\/jats:title>\n                  <jats:p>To address the issue of large visual measurement errors caused by insufficient information collected by monocular vision when performing six-degree-of-freedom (6DOF) position measurements on metal castings, which hinders the robot\u2019s ability to visually guide grasping, this paper aims to propose a 6DOF position measurement method that integrates monocular vision with deep neural networks.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Design\/methodology\/approach<\/jats:title>\n                  <jats:p>This method enhances the robot\u2019s ability to visually grasp small-sample industrial objects with high accuracy. By establishing a mapping relationship between the two-dimensional (2D) position of the object\u2019s image and its three-dimensional (3D) position in space, the proposed approach achieves 6DOF position measurement of the target workpiece using monocular vision. An image enhancement algorithm based on a generative adversarial network (GAN) is introduced to improve robustness in industrial environments by addressing the challenge of acquiring image data for small-sample objects. Additionally, the method combines single-phase object detection using deep neural networks with 2D-3D affine transformation to achieve accurate 3D position measurements.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Findings<\/jats:title>\n                  <jats:p>The introduction of the GAN-based image enhancement algorithm significantly mitigates the robustness issues posed by the difficulties in obtaining image data for small-sample objects in industrial settings. The integration of single-phase object detection and 2D\u20133D affine transformation allows for precise 3D position measurement of the workpiece. Experimental results demonstrate that the proposed method provides high accuracy in 6DOF position measurements for industrial objects.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Originality\/value<\/jats:title>\n                  <jats:p>This approach overcomes the limitations of traditional vision algorithms for 3D position measurement of industrial objects, such as high cost and poor robustness. The experimental validation confirms that the proposed method achieves excellent 6DOF position measurement accuracy for industrial objects.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1108\/ir-04-2024-0167","type":"journal-article","created":{"date-parts":[[2025,2,28]],"date-time":"2025-02-28T01:05:52Z","timestamp":1740704752000},"page":"591-599","source":"Crossref","is-referenced-by-count":1,"title":["Research on robotic monocular 6DOF vision localization grasping strategy for industrial objects"],"prefix":"10.1108","volume":"52","author":[{"given":"Guoyang","family":"Wan","sequence":"first","affiliation":[{"name":"Anhui Polytechnic University , Wuhu,","place":["China"]}]},{"given":"Hanqi","family":"Li","sequence":"additional","affiliation":[{"name":"Anhui Polytechnic University , Wuhu,","place":["China"]}]},{"given":"Qianqian","family":"Wang","sequence":"additional","affiliation":[{"name":"Anhui Polytechnic University , Wuhu,","place":["China"]}]},{"given":"Chengwen","family":"Wang","sequence":"additional","affiliation":[{"name":"Anhui Polytechnic University , Wuhu,","place":["China"]}]},{"given":"Qin","family":"He","sequence":"additional","affiliation":[{"name":"Anhui Polytechnic University , Wuhu,","place":["China"]}]},{"given":"Xuna","family":"Li","sequence":"additional","affiliation":[{"name":"Anhui Polytechnic University , Wuhu,","place":["China"]}]}],"member":"140","published-online":{"date-parts":[[2025,3,3]]},"reference":[{"key":"2025091707171051800_ref001","first-page":"1","article-title":"Low-cost vision system for pick and place application using camera and ABB industrial robot","author":"Chakole","year":"2020"},{"issue":"4","key":"2025091707171051800_ref002","doi-asserted-by":"crossref","first-page":"1310","DOI":"10.1109\/TCE.2003.1261234","article-title":"Minimum mean brightness error bi-histogram equalization in contrast enhancement","volume":"49","author":"Chen","year":"2003","journal-title":"IEEE Transactions on Consumer Electronics"},{"key":"2025091707171051800_ref003","first-page":"834","volume-title":"European Conference on Computer Vision","author":"Engel","year":"2014"},{"issue":"3","key":"2025091707171051800_ref004","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1109\/TPAMI.2017.2658577","article-title":"Direct sparse odometry","volume":"40","author":"Engel","year":"2017","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2025091707171051800_ref005","first-page":"15","article-title":"SVO: fast semi-direct monocular visual odometry","author":"Forster","year":"2014"},{"issue":"4","key":"2025091707171051800_ref006","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1109\/LES.2022.3181892","article-title":"Lightweight robotic grasping model based on template matching and depth image","volume":"14","author":"Le","year":"2022","journal-title":"IEEE Embedded Systems Letters"},{"key":"2025091707171051800_ref007","first-page":"7286","article-title":"UnDeepVO: monocular visual odometry through unsupervised deep learning","author":"Li","year":"2018"},{"key":"2025091707171051800_ref008","doi-asserted-by":"crossref","DOI":"10.1109\/ICCV48922.2021.00986","article-title":"Swin transformer: hierarchical vision transformer using shifted windows","author":"Liu","year":"2021"},{"key":"2025091707171051800_ref009","first-page":"3121","article-title":"Enhancement of aerial images using threshold decomposition adaptive morphological filter","author":"Mahmoud","year":"2009"},{"key":"2025091707171051800_ref010","first-page":"2637","article-title":"Yolo-pose: enhancing yolo for multi person pose estation using object keypoint similarimity loss","author":"Maji","year":"2022"},{"issue":"5","key":"2025091707171051800_ref011","doi-asserted-by":"crossref","first-page":"1147","DOI":"10.1109\/TRO.2015.2463671","article-title":"ORB-SLAM: a versatile and accurate monocular SLAM system","volume":"31","author":"Mur-Artal","year":"2015","journal-title":"IEEE Transactions on Robotics"},{"key":"2025091707171051800_ref012","first-page":"1","article-title":"Adversarial context aggregation network for low-light image enhancement","author":"Shin","year":"2018"},{"issue":"2","key":"2025091707171051800_ref013","first-page":"238","article-title":"Monocular 6DOF SLAM: a survey","volume":"36","author":"Smith","year":"2020","journal-title":"IEEE Transactions on Robotics"},{"key":"2025091707171051800_ref014","first-page":"1","article-title":"LE-VINS: a robust solid-state-LiDAR-enhanced visual-inertial navigation system for Low-Speed robots","volume":"72","author":"Tang","year":"2023","journal-title":"IEEE Transactions on Instrumentation and Measurement"},{"key":"2025091707171051800_ref015","first-page":"6243","article-title":"'CNN-SLAM: real-time dense monocular SLAM with learned depth prediction","author":"Tateno","year":"2017"},{"key":"2025091707171051800_ref016","first-page":"1326","article-title":"Developing an approximation to the heidler function \u2013 with an analytical transformation into the frequency domain","author":"Terespolsky","year":"2014"},{"issue":"4","key":"2025091707171051800_ref017","doi-asserted-by":"crossref","first-page":"458","DOI":"10.1108\/AA-01-2022-0014","article-title":"A novel robotic 6DOF pose measurement strategy for large-size casts based on stereo vision","volume":"42","author":"Wan","year":"2022","journal-title":"Assembly Automation"},{"key":"2025091707171051800_ref018","first-page":"1250","article-title":"Intelligent operation of robot for handling and assembly based on vision guidance","author":"Xu","year":"2019"},{"key":"2025091707171051800_ref019","first-page":"6772","article-title":"Image restoration with deep generative models","author":"Yeh","year":"2018"},{"key":"2025091707171051800_ref020","first-page":"54","article-title":"Low-cost solution for vision-based robotic grasping","author":"Zhang","year":"2021"},{"key":"2025091707171051800_ref021","doi-asserted-by":"crossref","first-page":"13830","DOI":"10.1109\/ACCESS.2023.3243551","article-title":"CVML-Pose: convolutional VAE based multi-level network for object 3D pose estimation","volume":"11","author":"Zhao","year":"2023","journal-title":"IEEE Access"},{"key":"2025091707171051800_ref022","first-page":"2242","article-title":"Unpaired image-to-image translation using Cycle-Consistent adversarial networks","author":"Zhu","year":"2017"},{"key":"2025091707171051800_ref023","first-page":"587","article-title":"Dual attention based convolutional neural network with scattering transform features for hyperspectral unmixing","author":"Jin","year":"2022"},{"key":"2025091707171051800_ref024","article-title":"Attention is all you need","volume":"30","author":"Vaswani","year":"2017","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2025091707171051800_ref025","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/j.jfoodeng.2014.08.014","article-title":"Relationship between fermentation behavior, measured with a 3D vision structured light technique, and the internal structure of bread","volume":"146","author":"Verdu","year":"2015","journal-title":"Journal of Food Engineering"},{"key":"2025091707171051800_ref026","doi-asserted-by":"crossref","first-page":"109623","DOI":"10.1016\/j.measurement.2021.109623","article-title":"Simultaneous robot\u2013world and hand\u2013eye calibration based on a pair of dual equations","volume":"181","author":"Wang","year":"2021","journal-title":"Measurement"},{"key":"2025091707171051800_ref027","first-page":"1","article-title":"Generative image inpainting based on wavelet transform attention model","author":"Wang","year":"2020"},{"key":"2025091707171051800_ref028","doi-asserted-by":"crossref","first-page":"112076","DOI":"10.1016\/j.measurement.2022.112076","article-title":"Robot-world and hand\u2013eye calibration based on motion tensor with applications in uncalibrated robot","volume":"204","author":"Wang","year":"2022","journal-title":"Measurement"}],"container-title":["Industrial Robot: the international journal of robotics research and application"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/IR-04-2024-0167\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/ir\/article-pdf\/52\/4\/591\/10294019\/ir-04-2024-0167en.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/www.emerald.com\/ir\/article-pdf\/52\/4\/591\/10294019\/ir-04-2024-0167en.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,17]],"date-time":"2025-09-17T11:17:18Z","timestamp":1758107838000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/ir\/article\/52\/4\/591\/1251335\/Research-on-robotic-monocular-6DOF-vision"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,3]]},"references-count":28,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,9,19]]}},"URL":"https:\/\/doi.org\/10.1108\/ir-04-2024-0167","relation":{},"ISSN":["0143-991X","1758-5791"],"issn-type":[{"value":"0143-991X","type":"print"},{"value":"1758-5791","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,3,3]]}}}