{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T15:07:59Z","timestamp":1764688079709,"version":"3.41.2"},"reference-count":27,"publisher":"World Scientific Pub Co Pte Ltd","issue":"02","funder":[{"DOI":"10.13039\/501100001809","name":"the National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61961037"],"award-info":[{"award-number":["61961037"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Industrial Support Plan Project of Gansu Provincial Department of Education","award":["2021CYZC-30"],"award-info":[{"award-number":["2021CYZC-30"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2024,2]]},"abstract":"<jats:p> Head Pose Estimation (HPE) has a wide range of applications in computer vision, but still faces challenges: (1) Existing studies commonly use Euler angles or quaternions as pose labels, which may lead to discontinuity problems. (2) HPE does not effectively address regression via rotated matrices. (3) There is a low recognition rate in complex scenes, high computational requirements, etc. This paper presents an improved unconstrained HPE model to address these challenges. First, a rotation matrix form is introduced to solve the problem of unclear rotation labels. Second, a continuous 6D rotation matrix representation is used for efficient and robust direct regression. The RepVGG-A2 lightweight framework is used for feature extraction, and by adding a multi-level feature fusion module and a coordinate attention mechanism with residual connection, to improve the network\u2019s ability to perceive contextual information and pay attention to features. The model\u2019s accuracy was further improved by replacing the network activation function and improving the loss function. Experiments on the BIWI dataset 7:3 dividing the training and test sets show that the average absolute error of HPE for the proposed network model is 2.41. Trained on the dataset 300W_LP and tested on the AFLW2000 and BIWI datasets, the average absolute errors of HPE of the proposed network model are 4.34 and 3.93. The experimental results demonstrate that the improved network has better HPE performance. <\/jats:p>","DOI":"10.1142\/s0218001424560020","type":"journal-article","created":{"date-parts":[[2024,1,27]],"date-time":"2024-01-27T04:00:15Z","timestamp":1706328015000},"source":"Crossref","is-referenced-by-count":4,"title":["Head Pose Estimation Based on Multi-Level Feature Fusion"],"prefix":"10.1142","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-9493-4384","authenticated-orcid":false,"given":"Chunman","family":"Yan","sequence":"first","affiliation":[{"name":"School of Physics and Electronic, Northwest Normal University, Lanzhou 730070, P. R. China"},{"name":"Engineering Research Center of Gansu Province for Intelligent Information Technology and Application, Lanzhou 730070, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-9982-1886","authenticated-orcid":false,"given":"Xiao","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Physics and Electronic, Northwest Normal University, Lanzhou 730070, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2024,2,28]]},"reference":[{"first-page":"1021","volume-title":"IEEE Int. Conf. Computer Vision (ICCV)","author":"Bulat A.","key":"S0218001424560020BIB001"},{"first-page":"1187","volume-title":"IEEE Winter Conf. Applications of Computer Vision","author":"Cao Z.","key":"S0218001424560020BIB002"},{"first-page":"123","volume-title":"Proc. 2019 3rd High Performance Computing and Cluster Technologies Conf.","author":"Chuan T.","key":"S0218001424560020BIB003"},{"first-page":"192","volume-title":"IEEE Int. Conf. Automatic Face and Gesture Recognition","author":"Dapogny A.","key":"S0218001424560020BIB004"},{"first-page":"1977","volume-title":"IEEE Int. Conf. Acoustics, Speech and Signal Processing","author":"Gupta A.","key":"S0218001424560020BIB005"},{"first-page":"2496","volume-title":"IEEE Int. Conf. Image Processing (ICIP)","author":"Hempel T.","key":"S0218001424560020BIB006"},{"first-page":"13708","volume-title":"IEEE\/CVF Conf. Computer Vision and Pattern Recognition (CVPR)","author":"Hou Q.","key":"S0218001424560020BIB008"},{"key":"S0218001424560020BIB009","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2018.2866770"},{"key":"S0218001424560020BIB010","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2019.11.005"},{"key":"S0218001424560020BIB011","doi-asserted-by":"publisher","DOI":"10.1561\/0600000001"},{"key":"S0218001424560020BIB012","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2021.3081873"},{"key":"S0218001424560020BIB013","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001419400068"},{"first-page":"641","volume-title":"Int. Conf. 3D Vision","author":"Martin M.","key":"S0218001424560020BIB014"},{"first-page":"402","volume-title":"Int. Seminar on Intelligent Technology and Its Applications","author":"Perdana M. I.","key":"S0218001424560020BIB015"},{"first-page":"2155","volume-title":"IEEE\/CVF Conf. Computer Vision and Pattern Recognition Workshops","author":"Ruiz N.","key":"S0218001424560020BIB016"},{"first-page":"519","volume-title":"IEEE Workshop on Applications of Computer Vision","author":"Sankaranarayanan K.","key":"S0218001424560020BIB017"},{"key":"S0218001424560020BIB018","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2011.2162740"},{"first-page":"789","volume-title":"2018 13th IEEE Int. Conf. Automatic Face & Gesture Recognition","author":"Wang K.","key":"S0218001424560020BIB019"},{"key":"S0218001424560020BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2909327"},{"first-page":"642","volume-title":"IEEE Int. Conf. Automatic Face & Gesture Recognition","author":"Xu X.","key":"S0218001424560020BIB021"},{"first-page":"1087","volume-title":"IEEE\/CVF Conf. Computer Vision and Pattern Recognition (CVPR)","author":"Yang T.-Y.","key":"S0218001424560020BIB022"},{"first-page":"7","volume-title":"Int. Joint Conf. Artificial Intelligence","author":"Yang T.-Y.","key":"S0218001424560020BIB023"},{"first-page":"826","volume-title":"Int. Conf. Pattern Recognition","author":"Zeng Z.","key":"S0218001424560020BIB024"},{"issue":"7","key":"S0218001424560020BIB025","first-page":"12789","volume":"34","author":"Zhang H.","year":"2020","journal-title":"Proc. AAAI Conf. Artif. Intell."},{"first-page":"5745","volume-title":"IEEE\/CVF Conf. Computer Vision and Pattern Recognition (CVPR)","author":"Zhou Y.","key":"S0218001424560020BIB026"},{"first-page":"146","volume-title":"IEEE Conf. Computer Vision and Pattern Recognition (CVPR)","author":"Zhu X.","key":"S0218001424560020BIB028"},{"first-page":"787","volume-title":"IEEE Conf. Computer Vision and Pattern Recognition (CVPR)","author":"Zhu X.","key":"S0218001424560020BIB029"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001424560020","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,5]],"date-time":"2024-04-05T06:25:16Z","timestamp":1712298316000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001424560020"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2]]},"references-count":27,"journal-issue":{"issue":"02","published-print":{"date-parts":[[2024,2]]}},"alternative-id":["10.1142\/S0218001424560020"],"URL":"https:\/\/doi.org\/10.1142\/s0218001424560020","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2024,2]]},"article-number":"2456002"}}