{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T16:31:34Z","timestamp":1772037094803,"version":"3.50.1"},"reference-count":49,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2024,6,16]],"date-time":"2024-06-16T00:00:00Z","timestamp":1718496000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,6,16]],"date-time":"2024-06-16T00:00:00Z","timestamp":1718496000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100007129","name":"Natural Science Foundation of Shandong Province","doi-asserted-by":"publisher","award":["ZR2021QF031"],"award-info":[{"award-number":["ZR2021QF031"]}],"id":[{"id":"10.13039\/501100007129","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100010029","name":"Taishan Scholar Foundation of Shandong Province","doi-asserted-by":"publisher","award":["tshw201502042"],"award-info":[{"award-number":["tshw201502042"]}],"id":[{"id":"10.13039\/501100010029","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002858","name":"China Postdoctoral Science Foundation","doi-asserted-by":"publisher","award":["2023M743757"],"award-info":[{"award-number":["2023M743757"]}],"id":[{"id":"10.13039\/501100002858","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2024,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This study proposes an innovative deep learning algorithm for pose estimation based on point clouds, aimed at addressing the challenges of pose estimation for objects affected by the environment. 
Previous research on using deep learning for pose estimation has primarily been conducted using RGB-D data. This paper introduces an algorithm that utilizes point cloud data for deep learning-based pose computation. The algorithm builds upon previous work by integrating PointNet++ technology and the classical Point Pair Features algorithm, achieving accurate pose estimation for objects across different scene scales. Additionally, an adaptive parameter-density clustering method suitable for point clouds is introduced, effectively segmenting clusters in varying point cloud density environments. This resolves the complex issue of parameter determination for density clustering in different point cloud environments and enhances the robustness of clustering. Furthermore, the LineMod dataset is transformed into a point cloud dataset, and experiments are conducted on the transformed dataset to achieve promising results with our algorithm. Finally, experiments under both strong and weak lighting conditions demonstrate the algorithm's robustness.<\/jats:p>","DOI":"10.1007\/s40747-024-01508-x","type":"journal-article","created":{"date-parts":[[2024,6,16]],"date-time":"2024-06-16T16:01:23Z","timestamp":1718553683000},"page":"6581-6595","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Pose estimation algorithm based on point pair features using 
PointNet++"],"prefix":"10.1007","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-5012-3658","authenticated-orcid":false,"given":"Yifan","family":"Chen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhenjian","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qingdang","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mingyue","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,6,16]]},"reference":[{"key":"1508_CR1","unstructured":"Rad M, Lepetit V. Bb8: a scalable, accurate, robust to partial occlusion method for predicting the 3d poses of challenging objects without using depth. Proceedings of the IEEE international conference on computer vision. 3828\u20133836"},{"key":"1508_CR2","unstructured":"Kehl W, Manhardt F, Tombari F et al. Ssd-6d: making rgb-based 3d detection and 6d pose estimation great again. Proceedings of the IEEE international conference on computer vision. 1521\u20131529"},{"key":"1508_CR3","doi-asserted-by":"crossref","unstructured":"Xiang Y, Schmidt T, Narayanan V et al (2017) Posecnn: a convolutional neural network for 6d object pose estimation in cluttered scenes. arXiv preprint arXiv:171100199","DOI":"10.15607\/RSS.2018.XIV.019"},{"key":"1508_CR4","unstructured":"Do T-T, Cai M, Pham T et al (2018) Deep-6dpose: recovering 6d object pose from a single rgb image. arXiv preprint arXiv:180210367"},{"key":"1508_CR5","unstructured":"He Y, Sun W, Huang H et al. Pvn3d: a deep point-wise 3d keypoints voting network for 6dof pose estimation. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 11632\u201311641"},{"key":"1508_CR6","unstructured":"Zakharov S, Shugurov I, Ilic S. 
Dpod: 6d pose object detector and refiner. Proceedings of the IEEE\/CVF international conference on computer vision. 1941\u20131950"},{"key":"1508_CR7","doi-asserted-by":"publisher","first-page":"1066","DOI":"10.1016\/j.procs.2022.01.135","volume":"199","author":"P Jiang","year":"2022","unstructured":"Jiang P, Ergu D, Liu F et al (2022) A review of Yolo algorithm developments. Procedia Comput Sci 199:1066\u20131073","journal-title":"Procedia Comput Sci"},{"key":"1508_CR8","doi-asserted-by":"crossref","unstructured":"Su H, Maji S, Kalogerakis E et al. Multi-view convolutional neural networks for 3d shape recognition. Proceedings of the IEEE international conference on computer vision. 945\u2013953","DOI":"10.1109\/ICCV.2015.114"},{"key":"1508_CR9","unstructured":"He K, Gkioxari G, Doll\u00e1r P et al. Mask r-cnn. Proceedings of the IEEE international conference on computer vision. 2961\u20132969"},{"key":"1508_CR10","doi-asserted-by":"crossref","unstructured":"Lowe DG. Object recognition from local scale-invariant features. Proceedings of the seventh IEEE international conference on computer vision. IEEE, 2: 1150\u20131157","DOI":"10.1109\/ICCV.1999.790410"},{"key":"1508_CR11","doi-asserted-by":"publisher","first-page":"404","DOI":"10.1007\/11744023_32","volume":"3951","author":"H Bay","year":"2006","unstructured":"Bay H, Tuytelaars T, van Gool L (2006) Surf: speeded up robust features. Lect Notes Comput Sci 3951:404\u2013417","journal-title":"Lect Notes Comput Sci"},{"issue":"5","key":"1508_CR12","doi-asserted-by":"publisher","first-page":"1147","DOI":"10.1109\/TRO.2015.2463671","volume":"31","author":"R Mur-Artal","year":"2015","unstructured":"Mur-Artal R, Montiel JMM, Tardos JD (2015) ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE Trans Rob 31(5):1147\u20131163","journal-title":"IEEE Trans Rob"},{"key":"1508_CR13","unstructured":"Johnson AE (1997) Spin-images: a representation for 3-D surface matching. 
https:\/\/citeseerx.ist.psu.edu\/document?repid=rep1&type=pdf&doi=4c09532c6ef9afd5f0dd1f3d2b0af313199a8520"},{"key":"1508_CR14","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1016\/j.cviu.2014.04.011","volume":"125","author":"S Salti","year":"2014","unstructured":"Salti S, Tombari F, di Stefano L (2014) SHOT: unique signatures of histograms for surface and texture description. Comput Vis Image Underst 125:251\u2013264","journal-title":"Comput Vis Image Underst"},{"issue":"10","key":"1508_CR15","doi-asserted-by":"publisher","first-page":"1385","DOI":"10.1109\/TPAMI.2004.92","volume":"26","author":"L Vacchetti","year":"2004","unstructured":"Vacchetti L, Lepetit V, Fua P (2004) Stable real-time 3d tracking using online and offline information. IEEE Trans Pattern Anal Mach Intell 26(10):1385\u20131391","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1508_CR16","doi-asserted-by":"crossref","unstructured":"Hoda\u0148 T, Zabulis X, Lourakis M et al (2015) Detection and fine 3D pose estimation of texture-less objects in RGB-D images. 2015 IEEE\/RSJ international conference on intelligent robots and systems (IROS). IEEE: 4421\u20134428","DOI":"10.1109\/IROS.2015.7354005"},{"key":"1508_CR17","doi-asserted-by":"crossref","unstructured":"Tong G, Liu R, Li H (2012) The monocular model-based 3D pose tracking. 2012 24th Chinese control and decision conference (CCDC). IEEE: 980\u2013985","DOI":"10.1109\/CCDC.2012.6244153"},{"key":"1508_CR18","doi-asserted-by":"crossref","unstructured":"Drost B, Ilic S (2012) 3d object detection and localization using multimodal point pair features. 2012 Second international conference on 3D imaging, modeling, processing, visualization & transmission. IEEE: 9\u201316","DOI":"10.1109\/3DIMPVT.2012.53"},{"key":"1508_CR19","unstructured":"Wang C, Xu D, Zhu Y et al. Densefusion: 6d object pose estimation by iterative dense fusion. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 
3343\u20133352"},{"key":"1508_CR20","doi-asserted-by":"publisher","DOI":"10.1016\/j.displa.2021.102077","volume":"70","author":"Y Wang","year":"2021","unstructured":"Wang Y, Wang C, Long P et al (2021) Recent advances in 3D object detection based on RGB-D: a survey. Displays 70:102077","journal-title":"Displays"},{"issue":"3","key":"1508_CR21","doi-asserted-by":"publisher","first-page":"222","DOI":"10.1016\/j.vrih.2020.05.002","volume":"2","author":"Z Zhang","year":"2020","unstructured":"Zhang Z, Dai Y, Sun J (2020) Deep learning based point cloud registration: an overview. Virtual Real Intell Hardw 2(3):222\u2013246","journal-title":"Virtual Real Intell Hardw"},{"key":"1508_CR22","unstructured":"Qi CR, Yi L, Su H, Guibas LJ (2017) Pointnet++: deep hierarchical feature learning on point sets in a metric space. In: Advances in neural information processing systems 30"},{"key":"1508_CR23","unstructured":"Qi CR, Su H, Mo K et al. Pointnet: deep learning on point sets for 3d classification and segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition. 652\u2013660"},{"key":"1508_CR24","doi-asserted-by":"crossref","unstructured":"Pham Q-H, Uy MA, Hua B-S et al. Lcd: learned cross-domain descriptors for 2d-3d matching. Proceedings of the AAAI conference on artificial intelligence. 34: 11856\u201311864","DOI":"10.1609\/aaai.v34i07.6859"},{"key":"1508_CR25","unstructured":"Chen H, Wang P, Wang F et al. Epro-pnp: generalized end-to-end probabilistic perspective-n-points for monocular object pose estimation. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 2781\u20132790"},{"key":"1508_CR26","doi-asserted-by":"crossref","unstructured":"Rusu RB, Blodow N, Beetz M (2009) Fast point feature histograms (FPFH) for 3D registration. 2009 IEEE international conference on robotics and automation. 
IEEE: 3212\u20133217","DOI":"10.1109\/ROBOT.2009.5152473"},{"key":"1508_CR27","doi-asserted-by":"crossref","unstructured":"Tejani A, Tang D, Kouskouridas R et al (2014) Latent-class hough forests for 3d object detection and pose estimation. Computer vision\u2013ECCV 2014: 13th European conference, Zurich, Switzerland, September 6\u201312, 2014, Proceedings, Part VI 13. Springer: 462\u2013477","DOI":"10.1007\/978-3-319-10599-4_30"},{"key":"1508_CR28","doi-asserted-by":"crossref","unstructured":"Drost B, Ulrich M, Navab N et al (2010) Model globally, match locally: efficient and robust 3D object recognition. 2010 IEEE computer society conference on computer vision and pattern recognition. IEEE: 998\u20131005","DOI":"10.1109\/CVPR.2010.5540108"},{"key":"1508_CR29","doi-asserted-by":"crossref","unstructured":"Birdal T, Ilic S (2015) Point pair features based object detection and pose estimation revisited. 2015 international conference on 3D vision. IEEE: 527\u2013535","DOI":"10.1109\/3DV.2015.65"},{"key":"1508_CR30","doi-asserted-by":"crossref","unstructured":"Hinterstoisser S, Lepetit V, Rajkumar N et al (2016) Going further with point pair features. Computer vision\u2013ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11\u201314, 2016, Proceedings, Part III 14. Springer: 834\u2013848","DOI":"10.1007\/978-3-319-46487-9_51"},{"key":"1508_CR31","unstructured":"Karunakaran V (2021) Deep learning based object detection using mask RCNN. 2021 6th international conference on communication and electronics systems (ICCES). IEEE: 1684\u20131690"},{"key":"1508_CR32","doi-asserted-by":"crossref","unstructured":"Tekin B, Sinha SN, Fua P. Real-time seamless single shot 6d object pose prediction. Proceedings of the IEEE conference on computer vision and pattern recognition. 
292\u2013301","DOI":"10.1109\/CVPR.2018.00038"},{"key":"1508_CR33","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1016\/j.neucom.2018.12.061","volume":"337","author":"F Liu","year":"2019","unstructured":"Liu F, Fang P, Yao Z et al (2019) Recovering 6D object pose from RGB indoor image based on two-stage detection network with multi-task loss. Neurocomputing 337:15\u201323","journal-title":"Neurocomputing"},{"key":"1508_CR34","unstructured":"Lin H, Liu Z, Cheang C et al. Sar-net: shape alignment and recovery network for category-level 6d object pose and size estimation. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 6707\u20136717"},{"key":"1508_CR35","unstructured":"Zeng A, Song S, Nie\u00dfner M et al. 3dmatch: learning local geometric descriptors from rgb-d reconstructions. Proceedings of the IEEE conference on computer vision and pattern recognition. 1802\u20131811"},{"key":"1508_CR36","unstructured":"Yew Z J, Lee GH. 3dfeat-net: weakly supervised local 3d features for point cloud registration. Proceedings of the European conference on computer vision (ECCV). 607\u2013623"},{"issue":"2","key":"1508_CR37","doi-asserted-by":"publisher","first-page":"486","DOI":"10.3390\/s21020486","volume":"21","author":"Y Yuan","year":"2021","unstructured":"Yuan Y, Borrmann D, Hou J et al (2021) Self-supervised point set local descriptors for point cloud registration. Sensors 21(2):486","journal-title":"Sensors"},{"key":"1508_CR38","doi-asserted-by":"crossref","unstructured":"Liu W, Anguelov D, Erhan D et al (2016) Ssd: single shot multibox detector. Computer vision\u2013ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11\u201314, 2016, proceedings, Part I 14. 
Springer: 21\u201337","DOI":"10.1007\/978-3-319-46448-0_2"},{"issue":"6","key":"1508_CR39","doi-asserted-by":"publisher","first-page":"1465","DOI":"10.1109\/TPAMI.2017.2708711","volume":"40","author":"A Crivellaro","year":"2017","unstructured":"Crivellaro A, Rad M, Verdie Y et al (2017) Robust 3D object tracking from monocular images using stable parts. IEEE Trans Pattern Anal Mach Intell 40(6):1465\u20131479","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1508_CR40","doi-asserted-by":"crossref","unstructured":"Hu Y, Hugonot J, Fua P et al. (2019) Segmentation-driven 6d object pose estimation. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 3385\u20133394","DOI":"10.1109\/CVPR.2019.00350"},{"key":"1508_CR41","doi-asserted-by":"crossref","unstructured":"Liang H, Ma X, Li S, et al (2019) Pointnetgpd: detecting grasp configurations from point sets. 2019 international conference on robotics and automation (ICRA). IEEE: 3629\u20133635","DOI":"10.1109\/ICRA.2019.8794435"},{"key":"1508_CR42","doi-asserted-by":"crossref","unstructured":"Aoki Y, Goforth H, Srivatsan RA et al (2019) Pointnetlk: robust & efficient point cloud registration using pointnet. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. 7163\u20137172","DOI":"10.1109\/CVPR.2019.00733"},{"key":"1508_CR43","unstructured":"Sarode V, Li X, Goforth H et al (2019) Pcrnet: point cloud registration network using pointnet encoding. arXiv preprint arXiv:190807906"},{"key":"1508_CR44","doi-asserted-by":"crossref","unstructured":"GRO\u00df J, O\u0161ep A, Leibe B. Alignnet-3d: fast point cloud registration of partially observed objects. 2019 international conference on 3d vision (3DV). IEEE: 623\u2013632","DOI":"10.1109\/3DV.2019.00074"},{"key":"1508_CR45","unstructured":"Besl P J, Mckay ND (1992) Method for registration of 3-D shapes. Sensor fusion IV: control paradigms and data structures. 
Spie, 1611: 586\u2013606"},{"key":"1508_CR46","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v091.i01","volume":"91","author":"M Hahsler","year":"2019","unstructured":"Hahsler M, Piekenbrock M, Doran D (2019) dbscan: fast density-based clustering with R. J Stat Softw 91:1\u201330","journal-title":"J Stat Softw"},{"key":"1508_CR47","doi-asserted-by":"crossref","unstructured":"Hinterstoisser S, Lepetit V, Ilic S et al (2012) Model based training, detection and pose estimation of texture-less 3d objects in heavily cluttered scenes. Computer vision\u2013ACCV 2012: 11th Asian conference on computer vision, Daejeon, Korea, November 5\u20139, 2012, revised selected papers, Part I 11. Springer: 548\u2013562","DOI":"10.1007\/978-3-642-37331-2_42"},{"key":"1508_CR48","doi-asserted-by":"crossref","unstructured":"Hoda\u0148 T, Matas J, Obdr\u017e\u00e1lek \u0160 (2016) On evaluation of 6D object pose estimation. Computer vision\u2013ECCV 2016 workshops: Amsterdam, The Netherlands, October 8\u201310 and 15\u201316, 2016, Proceedings, Part III 14. Springer: 606\u2013619","DOI":"10.1007\/978-3-319-49409-8_52"},{"key":"1508_CR49","doi-asserted-by":"crossref","unstructured":"Wu Y, Javaheri A, Zand M et al (2022) Keypoint cascade voting for point cloud based 6DoF pose estimation. 2022 international conference on 3D vision (3DV). 
IEEE: 176\u20131786","DOI":"10.1109\/3DV57658.2022.00030"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01508-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-024-01508-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01508-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,21]],"date-time":"2024-11-21T23:28:34Z","timestamp":1732231714000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-024-01508-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,16]]},"references-count":49,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,10]]}},"alternative-id":["1508"],"URL":"https:\/\/doi.org\/10.1007\/s40747-024-01508-x","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,6,16]]},"assertion":[{"value":"15 November 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 May 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 June 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to 
influence the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}