{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,15]],"date-time":"2026-02-15T08:56:34Z","timestamp":1771145794697,"version":"3.50.1"},"reference-count":36,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2024,4,5]],"date-time":"2024-04-05T00:00:00Z","timestamp":1712275200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Neurorobot."],"abstract":"<jats:sec><jats:title>Introduction<\/jats:title><jats:p>Service robot technology is increasingly gaining prominence in the field of artificial intelligence. However, persistent limitations continue to impede its widespread implementation. In this regard, human motion pose estimation emerges as a crucial challenge necessary for enhancing the perceptual and decision-making capacities of service robots.<\/jats:p><\/jats:sec><jats:sec><jats:title>Method<\/jats:title><jats:p>This paper introduces a groundbreaking model, YOLOv8-ApexNet, which integrates advanced technologies, including Bidirectional Routing Attention (BRA) and Generalized Feature Pyramid Network (GFPN). BRA facilitates the capture of inter-keypoint correlations within dynamic environments by introducing a bidirectional information propagation mechanism. Furthermore, GFPN adeptly extracts and integrates feature information across different scales, enabling the model to make more precise predictions for targets of various sizes and shapes.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Empirical research findings reveal significant performance enhancements of the YOLOv8-ApexNet model across the COCO and MPII datasets. Compared to existing methodologies, the model demonstrates pronounced advantages in keypoint localization accuracy and robustness.<\/jats:p><\/jats:sec><jats:sec><jats:title>Discussion<\/jats:title><jats:p>The significance of this research lies in providing an efficient and accurate solution tailored for the realm of service robotics, effectively mitigating the deficiencies inherent in current approaches. By bolstering the accuracy of perception and decision-making, our endeavors unequivocally endorse the widespread integration of service robots within practical applications.<\/jats:p><\/jats:sec>","DOI":"10.3389\/fnbot.2024.1374385","type":"journal-article","created":{"date-parts":[[2024,4,5]],"date-time":"2024-04-05T04:58:13Z","timestamp":1712293093000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["The application prospects of robot pose estimation technology: exploring new directions based on YOLOv8-ApexNet"],"prefix":"10.3389","volume":"18","author":[{"given":"XianFeng","family":"Tang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shuwei","family":"Zhao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1965","published-online":{"date-parts":[[2024,4,5]]},"reference":[{"key":"B1","first-page":"10843","article-title":"3D hand shape and pose from images in the wild","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Boukhayma","year":"2019"},{"key":"B2","doi-asserted-by":"publisher","first-page":"744","DOI":"10.3390\/sym12050744","article-title":"Fall detection based on key points of human-skeleton using openpose","volume":"12","author":"Chen","year":"2020","journal-title":"Symmetry"},{"key":"B3","first-page":"5386","article-title":"Higherhrnet: scale-aware representation learning for bottom-up human pose estimation","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Cheng","year":"2020"},{"key":"B4","doi-asserted-by":"publisher","first-page":"7157","DOI":"10.1109\/TPAMI.2022.3222784","article-title":"Alphapose: whole-body regional multi-person pose estimation and tracking in real-time","volume":"45","author":"Fang","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"B5","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2003.03522","article-title":"Mobilepose: real-time pose estimation for unseen objects with weak shape supervision","author":"Hou","year":"2020","journal-title":"arXiv"},{"key":"B6","first-page":"7718","article-title":"Learnable triangulation of human pose","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Iskakov","year":"2019"},{"key":"B7","first-page":"845","article-title":"Few-shot relation extraction model based on attention mechanism induction network","volume":"61","author":"Ji","year":"2023","journal-title":"J. Jilin Univ. Inf. Sci. Ed"},{"key":"B8","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1109\/ICIS51600.2021.9516598","article-title":"Face depth prediction by the scene depth","volume-title":"2021 IEEE\/ACIS 19th International Conference on Computer and Information Science (ICIS)","author":"Jin","year":"2021"},{"key":"B9","doi-asserted-by":"publisher","first-page":"21780","DOI":"10.1109\/JSEN.2022.3197235","article-title":"Pseudo RGB-D face recognition","volume":"22","author":"Jin","year":"2022","journal-title":"IEEE Sens. J"},{"key":"B10","first-page":"733","article-title":"Characterizations of weighted right core inverse and weighted right pseudo core inverse","volume":"61","author":"Ke","year":"2023","journal-title":"J. Jilin Univ. Sci. Ed"},{"key":"B11","first-page":"3122","article-title":"Multi-instance pose networks: rethinking top-down pose estimation","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Khirodkar","year":""},{"key":"B12","doi-asserted-by":"publisher","first-page":"11354","DOI":"10.1609\/aaai.v34i07.6797","article-title":"Simple pose: rethinking and improving a bottom-up approach for multi-person pose estimation","volume":"34","author":"Li","year":"","journal-title":"Proc. AAAI Conf. Artif. Intell"},{"key":"B13","doi-asserted-by":"publisher","first-page":"304","DOI":"10.3390\/drones7050304","article-title":"A modified yolov8 detection network for uav aerial image recognition","volume":"7","author":"Li","year":"2023","journal-title":"Drones"},{"key":"B14","first-page":"75","article-title":"A-hrnet: attention based high resolution network for human pose estimation","volume-title":"2020 Second International Conference on Transdisciplinary AI (TransAI)","author":"Li","year":""},{"key":"B15","doi-asserted-by":"publisher","first-page":"4970","DOI":"10.3390\/electronics12244970","article-title":"Revolutionizing target detection in intelligent traffic systems: Yolov8-snakevision","volume":"12","author":"Liu","year":"2023","journal-title":"Electronics"},{"key":"B16","doi-asserted-by":"publisher","first-page":"13264","DOI":"10.1109\/CVPR46437.2021.01306","article-title":"Rethinking the heatmap regression for bottom-up human pose estimation","author":"Luo","year":"2021","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"B17","doi-asserted-by":"crossref","first-page":"548","DOI":"10.1007\/978-3-030-58565-5_33","article-title":"Interhand2. 6m. a dataset and baseline for 3D interacting hand pose estimation from a single RGB image","volume-title":"Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XX 16","author":"Moon","year":"2020"},{"key":"B18","doi-asserted-by":"publisher","first-page":"122419","DOI":"10.1016\/j.eswa.2023.122419","article-title":"Occluded person re-identification with deep learning: a survey and perspectives","volume":"239","author":"Ning","year":"2023","journal-title":"Exp. Syst. Appl"},{"key":"B19","doi-asserted-by":"publisher","first-page":"102033","DOI":"10.1016\/j.inffus.2023.102033","article-title":"Dilf: differentiable rendering-based multi-view image-language fusion for zero-shot 3D shape understanding","volume":"102","author":"Ning","year":"2024","journal-title":"Inf. Fusion"},{"key":"B20","doi-asserted-by":"crossref","first-page":"9250","DOI":"10.1109\/ICRA.2019.8793621","article-title":"Superdepth: self-supervised, super-resolved monocular depth estimation","volume-title":"2019 International Conference on Robotics and Automation (ICRA)","author":"Pillai","year":"2019"},{"key":"B21","first-page":"3302","article-title":"Understanding the limitations of cnn-based absolute camera pose regression","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Sattler","year":"2019"},{"key":"B22","doi-asserted-by":"publisher","first-page":"3087","DOI":"10.3390\/rs13163087","article-title":"Semantic segmentation of urban buildings using a high-resolution network (hrnet) with channel and spatial attention gates","volume":"13","author":"Seong","year":"2021","journal-title":"Remote Sens"},{"key":"B23","doi-asserted-by":"publisher","first-page":"1439","DOI":"10.1109\/TMM.2022.3233251","article-title":"Depth-aware multi-person 3D pose estimation with multi-scale waterfall representations","volume":"25","author":"Shen","year":"2022","journal-title":"IEEE Trans. Multimedia"},{"key":"B24","first-page":"5693","article-title":"Deep high-resolution representation learning for human pose estimation","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Sun","year":"2019"},{"key":"B25","doi-asserted-by":"publisher","first-page":"20939","DOI":"10.1007\/s00521-023-08809-1","article-title":"An improved fire detection approach based on yolo-v8 for smart cities","volume":"35","author":"Talaat","year":"2023","journal-title":"Neural Comput. Appl"},{"key":"B26","doi-asserted-by":"publisher","first-page":"117784","DOI":"10.1109\/ACCESS.2021.3106350","article-title":"Integrated feature pyramid network with feature aggregation for traffic sign detection","volume":"9","author":"Tang","year":"2021","journal-title":"IEEE Access"},{"key":"B27","first-page":"2642","article-title":"Normalized object coordinate space for category-level 6D object pose and size estimation","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Wang","year":"2019"},{"key":"B28","doi-asserted-by":"publisher","first-page":"4644","DOI":"10.3390\/electronics12224644","article-title":"Single-stage pose estimation and joint angle extraction method for moving human body","volume":"12","author":"Wang","year":"2023","journal-title":"Electronics"},{"key":"B29","doi-asserted-by":"publisher","first-page":"16105","DOI":"10.1109\/CVPR46437.2021.01584","article-title":"Graph stacked hourglass networks for 3D human pose estimation","author":"Xu","year":"2021","journal-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"B30","doi-asserted-by":"publisher","first-page":"1824","DOI":"10.3390\/agronomy13071824","article-title":"A lightweight yolov8 tomato detection algorithm combining feature enhancement and attention","volume":"13","author":"Yang","year":"2023","journal-title":"Agronomy"},{"key":"B31","first-page":"853","article-title":"Graph embedding clustering based on heterogeneous fusion and discriminant loss","volume":"61","author":"Yao","year":"2023","journal-title":"J. Jilin Univ. Sci. Ed"},{"key":"B32","first-page":"607","article-title":"Deciwatch: a simple baseline for 10\u00d7 efficient 2D and 3D pose estimation","volume-title":"European Conference on Computer Vision","author":"Zeng","year":"2022"},{"key":"B33","first-page":"3517","article-title":"Fast human pose estimation","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhang","year":"2019"},{"key":"B34","doi-asserted-by":"publisher","first-page":"2639","DOI":"10.1007\/s11263-021-01482-8","article-title":"Towards high performance human keypoint detection","volume":"129","author":"Zhang","year":"2021","journal-title":"Int. J. Comput. Vis"},{"key":"B35","doi-asserted-by":"publisher","first-page":"046006","DOI":"10.1117\/1.JBO.28.4.046006","article-title":"Stable tissue-mimicking phantoms for longitudinal multimodality imaging studies that incorporate optical, CT, and MRI contrast","volume":"28","author":"Zhao","year":"2023","journal-title":"J. Biomed. Opt"},{"key":"B36","doi-asserted-by":"crossref","first-page":"681","DOI":"10.1109\/ICFTIC57696.2022.10075089","article-title":"Lightweight sit-ups recognition and counting method based on openpose","volume-title":"2022 4th International Conference on Frontiers Technology of Information and Computer (ICFTIC)","author":"Zhao","year":"2022"}],"container-title":["Frontiers in Neurorobotics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fnbot.2024.1374385\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,5]],"date-time":"2024-04-05T04:58:25Z","timestamp":1712293105000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fnbot.2024.1374385\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,5]]},"references-count":36,"alternative-id":["10.3389\/fnbot.2024.1374385"],"URL":"https:\/\/doi.org\/10.3389\/fnbot.2024.1374385","relation":{},"ISSN":["1662-5218"],"issn-type":[{"value":"1662-5218","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,5]]},"article-number":"1374385"}}