{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,4]],"date-time":"2025-11-04T16:17:15Z","timestamp":1762273035481,"version":"3.41.2"},"reference-count":36,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2023,2,20]],"date-time":"2023-02-20T00:00:00Z","timestamp":1676851200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Comput. Neurosci."],"abstract":"<jats:p>Object detection and grasp detection are essential for unmanned systems working in cluttered real-world environments. Detecting grasp configurations for each object in the scene would enable reasoning manipulations. However, finding the relationships between objects and grasp configurations is still a challenging problem. To achieve this, we propose a novel neural learning approach, namely SOGD, to predict a best grasp configuration for each detected objects from an RGB-D image. The cluttered background is first filtered out via a 3D-plane-based approach. Then two separate branches are designed to detect objects and grasp candidates, respectively. The relationship between object proposals and grasp candidates are learned by an additional alignment module. A series of experiments are conducted on two public datasets (Cornell Grasp Dataset and Jacquard Dataset) and the results demonstrate the superior performance of our SOGD against SOTA methods in predicting reasonable grasp configurations \u201cfrom a cluttered scene.\u201d<\/jats:p>","DOI":"10.3389\/fncom.2023.1110889","type":"journal-article","created":{"date-parts":[[2023,2,20]],"date-time":"2023-02-20T05:28:11Z","timestamp":1676870891000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["A neural learning approach for simultaneous object detection and grasp detection in cluttered scenes"],"prefix":"10.3389","volume":"17","author":[{"given":"Yang","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Lihua","family":"Xie","sequence":"additional","affiliation":[]},{"given":"Yuheng","family":"Li","sequence":"additional","affiliation":[]},{"given":"Yuan","family":"Li","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2023,2,20]]},"reference":[{"key":"B1","doi-asserted-by":"crossref","first-page":"13452","DOI":"10.1109\/ICRA48506.2021.9561398","article-title":"\u201cEnd-to-end trainable deep neural network for robotic grasp detection and semantic segmentation from RGB,\u201d","volume-title":"Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA)","author":"Ainetter","year":"2021"},{"key":"B2","first-page":"8085","article-title":"\u201cDensely Supervised Grasp Detector (DSGD),\u201d","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Asif","year":"2019"},{"key":"B3","doi-asserted-by":"publisher","first-page":"1030707","DOI":"10.3389\/fncom.2022.1030707","article-title":"Invariance of object detection in untrained deep neural networks","volume":"16","author":"Cheon","year":"2022","journal-title":"Front. Comput. Neurosci."},{"key":"B4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s00521-022-07894-y","article-title":"Improving automated latent fingerprint detection and segmentation using deep convolutional neural network","volume":"2022","author":"Chhabra","year":"2022","journal-title":"Neural Comput. Appl"},{"key":"B5","doi-asserted-by":"publisher","first-page":"3355","DOI":"10.1109\/LRA.2018.2852777","article-title":"Real-world multiobject, multigrasp detection","volume":"3","author":"Chu","year":"2018","journal-title":"IEEE Robot. Autom. Lett"},{"key":"B6","first-page":"3511","article-title":"\u201cJacquard: A large scale dataset for robotic grasp detection,\u201d","author":"Depierre","year":"2018"},{"key":"B7","doi-asserted-by":"publisher","first-page":"124","DOI":"10.1016\/j.comcom.2021.07.012","article-title":"Mask-GD Segmentation Based Robotic Grasp Detection","volume":"178","author":"Dong","year":"2021","journal-title":"Comp. Commun"},{"key":"B8","article-title":"Yolox: exceeding yolo series in 2021","author":"Ge","year":"2021","journal-title":"arXiv preprint arXiv:2107.08430"},{"key":"B9","doi-asserted-by":"crossref","first-page":"8966","DOI":"10.1109\/ICCV.2019.00906","article-title":"\u201cLearning Local RGB-to-CAD Correspondences for Object Pose Estimation,\u201d","volume-title":"2019 IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"Georgakis","year":"2019"},{"key":"B10","doi-asserted-by":"crossref","first-page":"1063","DOI":"10.1109\/CVPR.2016.90","article-title":"\u201cDeep residual learning for image recognition,\u201d","volume-title":"Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"He","year":"2016"},{"key":"B11","doi-asserted-by":"publisher","first-page":"930827","DOI":"10.3389\/fncom.2022.930827","article-title":"An infrared sequence image generating method for target detection and tracking","volume":"16","author":"Huang","year":"2022","journal-title":"Front. Comput. Neurosci."},{"key":"B12","doi-asserted-by":"crossref","first-page":"3304","DOI":"10.1109\/ICRA.2011.5980145","article-title":"\u201cEfficient grasping from rgbd images: Learning using a new rectangle representation,\u201d","volume-title":"Proceedings of the 2011 IEEE International Conference on Robotics and Automation (ICRA)","author":"Jiang","year":"2011"},{"key":"B13","doi-asserted-by":"crossref","DOI":"10.15607\/RSS.2021.XVII.024","article-title":"\u201cSynergies between affordance and geometry: 6-DoF grasp detection via implicit representations,\u201d","volume-title":"Robotics: Science and Systems XVII","author":"Jiang","year":"2021"},{"key":"B14","doi-asserted-by":"publisher","first-page":"1001803","DOI":"10.3389\/fncom.2022.1001803","article-title":"A computational classification method of breast cancer images using the VGGNet model","volume":"16","author":"Khan","year":"2022","journal-title":"Front. Comput. Neurosci."},{"key":"B15","first-page":"9626","article-title":"\u201cAntipodal Robotic Grasping using Generative Residual Convolutional Neural Network,\u201d","volume-title":"Proceedings of the 2020 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)","author":"Kumra","year":"2020"},{"key":"B16","doi-asserted-by":"crossref","DOI":"10.15607\/RSS.2013.IX.012","volume-title":"Deep Learning for Detecting Robotic Grasps","author":"Lenz","year":"2013"},{"key":"B17","doi-asserted-by":"crossref","first-page":"3629","DOI":"10.1109\/ICRA.2019.8794435","article-title":"\u201cPointNetGPD: detecting grasp configurations from point sets,\u201d","volume-title":"Proceedings of the 2019 International Conference on Robotics and Automation (ICRA)","author":"Liang","year":"2019"},{"key":"B18","doi-asserted-by":"publisher","first-page":"318","DOI":"10.1109\/TPAMI.2018.2858826","article-title":"Focal loss for dense object detection","volume":"42","author":"Lin","year":"2017","journal-title":"IEEE Trans. Patt. Anal. Mach. Intell"},{"key":"B19","doi-asserted-by":"publisher","DOI":"10.1016\/j.compeleceng.2022.108479","article-title":"Enhanced framework for COVID-19 prediction with computed tomography scan images using dense convolutional neural network and novel loss function","author":"Motwani","year":"2022","journal-title":"Comput. Electr. Eng"},{"key":"B20","doi-asserted-by":"crossref","first-page":"7300","DOI":"10.1109\/ICRA40945.2020.9197179","article-title":"\u201cA single multi-task deep neural network with post-processing for object detection with reasoning and robotic grasp detection,\u201d","volume-title":"Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA)","author":"Park","year":"2020"},{"key":"B21","doi-asserted-by":"publisher","first-page":"1455","DOI":"10.1177\/0278364917735594","article-title":"Grasp pose detection in point clouds","volume":"36","author":"Pas","year":"2017","journal-title":"Int. J. Robot. Res."},{"key":"B22","article-title":"Yolov3: an incremental improvement","author":"Redmon","year":"2018","journal-title":"arXiv preprint arXiv:1804.02767"},{"key":"B23","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11063-022-10818-5","article-title":"An IoT and machine learning based intelligent system for the classification of therapeutic plants","volume":"2022","author":"Shailendra","year":"2022","journal-title":"Neural Process. Lett"},{"key":"B24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11042-022-14088-0","article-title":"Detection and classification of brain tumor using hybrid feature extraction technique","volume":"2022","author":"Singh","year":"2022","journal-title":"Multimedia Tools Appl"},{"key":"B25","doi-asserted-by":"publisher","first-page":"101963","DOI":"10.1016\/j.rcim.2020.101963","article-title":"A novel robotic grasp detection method based on region proposal networks","volume":"65","author":"Song","year":"2020","journal-title":"Robot. Cim-Int. Manuf."},{"key":"B26","doi-asserted-by":"crossref","first-page":"13438","DOI":"10.1109\/ICRA48506.2021.9561877","article-title":"\u201cContact-GraspNet: Efficient 6-DoF Grasp Generation in Cluttered Scenes,\u201d","volume-title":"2021 IEEE International Conference on Robotics and Automation (ICRA)","author":"Sundermeyer","year":"2021"},{"key":"B27","doi-asserted-by":"publisher","first-page":"11611","DOI":"10.1109\/TIE.2021.3120474","article-title":"High-performance pixel-level grasp detection based on adaptive grasping and grasp-aware network","volume":"69","author":"Wang","year":"2022","journal-title":"IEEE T. Ind. Electron."},{"key":"B28","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1109\/ROBIO49542.2019.8961711","article-title":"\u201cEfficient fully convolution neural network for generating pixel wise robotic grasps with high resolution images,\u201d","volume-title":"Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO)","author":"Wang","year":"2019"},{"key":"B29","first-page":"4654","article-title":"\u201cDouble-Dot Network for Antipodal Grasp Detection,\u201d","volume-title":"Proceedings of the 2021 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)","author":"Wang","year":"2021"},{"key":"B30","unstructured":"The Darknet: A digital copyright revolution1\n            WoodJ. A.\n          Rich. JL Tech.162009"},{"key":"B31","doi-asserted-by":"crossref","first-page":"6350","DOI":"10.1109\/ICRA48506.2021.9562046","article-title":"\u201cRobotic Grasping through Combined Image-Based Grasp Proposal and 3D Reconstruction,\u201d","volume-title":"2021 IEEE International Conference on Robotics and Automation (ICRA)","author":"Yang","year":"2021"},{"key":"B32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TMECH.2022.3209488","article-title":"EGNet: Efficient Robotic Grasp Detection Network","volume":"2022","author":"Yu","year":"","journal-title":"IEEE T. Ind. Electron"},{"key":"B33","doi-asserted-by":"publisher","first-page":"5238","DOI":"10.1109\/LRA.2022.3145064","article-title":"SE-ResUNet: A novel robotic grasp detection method","volume":"7","author":"Yu","year":"","journal-title":"IEEE Robot. Autom. Lett"},{"key":"B34","first-page":"4768","article-title":"\u201cROI-based Robotic Grasp Detection for Object Overlapping Scenes,\u201d","volume-title":"Proceedings of the 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)","author":"Zhang","year":"2019"},{"key":"B35","doi-asserted-by":"publisher","first-page":"1341","DOI":"10.1360\/N092018-00169","article-title":"Robotic grasping in multi-object stacking scenes based on visual reasoning","volume":"48","author":"Zhang","year":"2018","journal-title":"Scientia Sinica Technologica."},{"key":"B36","first-page":"7223","article-title":"\u201cFully convolutional grasp detection network with oriented anchor box,\u201d","volume-title":"Proceedings of the 2018 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)","author":"Zhou","year":"2018"}],"container-title":["Frontiers in Computational Neuroscience"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fncom.2023.1110889\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,20]],"date-time":"2023-02-20T05:28:23Z","timestamp":1676870903000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fncom.2023.1110889\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,20]]},"references-count":36,"alternative-id":["10.3389\/fncom.2023.1110889"],"URL":"https:\/\/doi.org\/10.3389\/fncom.2023.1110889","relation":{},"ISSN":["1662-5188"],"issn-type":[{"type":"electronic","value":"1662-5188"}],"subject":[],"published":{"date-parts":[[2023,2,20]]},"article-number":"1110889"}}