{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,11]],"date-time":"2026-06-11T21:27:41Z","timestamp":1781213261769,"version":"3.54.1"},"reference-count":41,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2022,11,3]],"date-time":"2022-11-03T00:00:00Z","timestamp":1667433600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>A real-time head pose and gaze estimation (HPGE) algorithm has excellent potential for technological advancements either in human\u2013machine or human\u2013robot interactions. For example, in high-accuracy advent applications such as Driver\u2019s Assistance System (DAS), HPGE plays a crucial role in omitting accidents and road hazards. In this paper, the authors propose a new hybrid framework for improved estimation by combining both the appearance and geometric-based conventional methods to extract local and global features. Therefore, the Zernike moments algorithm has been prominent in extracting rotation, scale, and illumination invariant features. Later, conventional discriminant algorithms were used to classify the head poses and gaze direction. Furthermore, the experiments were performed on standard datasets and real-time images to analyze the accuracy of the proposed algorithm. As a result, the proposed framework has immediately estimated the range of direction changes under different illumination conditions. We obtained an accuracy of ~85%; the average response time was 21.52 and 7.483 ms for estimating head poses and gaze, respectively, independent of illumination, background, and occlusion. The proposed method is promising for future developments of a robust system that is invariant even to blurring conditions and thus reaching much more significant performance enhancement.<\/jats:p>","DOI":"10.3390\/s22218449","type":"journal-article","created":{"date-parts":[[2022,11,3]],"date-time":"2022-11-03T04:49:20Z","timestamp":1667450960000},"page":"8449","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["A Novel Zernike Moment-Based Real-Time Head Pose and Gaze Estimation Framework for Accuracy-Sensitive Applications"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0825-4211","authenticated-orcid":false,"given":"Hima","family":"Vankayalapati","sequence":"first","affiliation":[{"name":"Department of Electronics and Communication Engineering, Kalasalingam Academy of Research and Education, Krishnankovil 626126, India"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Swarna","family":"Kuchibhotla","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram 522302, India"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2855-9071","authenticated-orcid":false,"given":"Mohan","family":"Chadalavada","sequence":"additional","affiliation":[{"name":"Department of Electronics and Communication Engineering, VelTech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai 600062, India"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8443-736X","authenticated-orcid":false,"given":"Shashi","family":"Dargar","sequence":"additional","affiliation":[{"name":"Department of Electronics and Communication Engineering, Kalasalingam Academy of Research and Education, Krishnankovil 626126, India"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Koteswara","family":"Anne","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Kalasalingam Academy of Research and Education, Krishnankovil 626126, India"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0773-9476","authenticated-orcid":false,"given":"Kyandoghere","family":"Kyamakya","sequence":"additional","affiliation":[{"name":"Institute for Smart Systems Technologies, University Klagenfurt, 9020 Klagenfurt am W\u00f6rthersee, Austria"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,11,3]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1109\/TCSVT.2008.2009261","article-title":"3-D Head Pose Estimation in Monocular Video Sequences Using Deformable Surfaces And Radial Basis Functions","volume":"19","author":"Krinidis","year":"2009","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_2","unstructured":"National Center for Statistics and Analysis (2022, August 12). (2021, April). Distracted driving 2019 (Research Note. Report No. DOT HS 813 111). National Highway Traffic Safety Administration, Available online: https:\/\/crashstats.nhtsa.dot.gov\/Api\/Public\/ViewPublication\/813111."},{"key":"ref_3","unstructured":"(2022, August 12). University of North Carolina Highway Safety Research Center. Available online: https:\/\/www.hsrc.unc.edu\/news\/announcements\/hsrc-to-lead-ncdot-center-of-excellence\/."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2033","DOI":"10.1109\/TPAMI.2014.2313123","article-title":"Adaptive Linear Regression for Appearance Based Gaze Estimation","volume":"36","author":"Lu","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_5","unstructured":"Zavan, F.H., Nascimento, A.C., Bellon, O.R., and Silva, L. (2016, January 4\u20137). Nose pose: A competitive, landmark-free methodology for head pose estimation in the wild. Proceedings of the Conference on Graphics, Patterns and Images-W. Face Processing, Sao Paulo, Brazil."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"3191","DOI":"10.1007\/s13369-019-04322-7","article-title":"An Enhanced Eye-Tracking Approach Using Pipeline Computation","volume":"45","author":"Hossain","year":"2020","journal-title":"Arab. J. Sci. Eng."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Svanera, M., Muhammad, U., Leonardi, R., and Benini, S. (2016, January 25\u201328). Figaro, hair detection and segmentation in the wild. Proceedings of the IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA.","DOI":"10.1109\/ICIP.2016.7532494"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1109\/TSMCB.2002.999809","article-title":"Study on Eye Gaze Estimation","volume":"32","author":"Wang","year":"2002","journal-title":"IEEE Trans. Syst. Man Cybern."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1109\/TPAMI.2017.2781233","article-title":"Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition","volume":"41","author":"Ranjan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"3219","DOI":"10.1109\/JSEN.2013.2268247","article-title":"A Calibration Method for Eye Gaze Estimation System Based on 3D Geometrical Optics","volume":"13","author":"Lee","year":"2013","journal-title":"IEEE Sens. J."},{"key":"ref_11","first-page":"123","article-title":"Automatic Head Pose Estimation with Synchronized Sub Manifold Embedding and Random Regression Forests","volume":"7","author":"Zhu","year":"2014","journal-title":"Int. J. Signal Process. Image Process. Pattern Recognit."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1109\/TPAMI.2008.106","article-title":"Head Pose Estimation in Computer Vision: A Survey","volume":"31","author":"Trivedi","year":"2009","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_13","unstructured":"Fu, Y., and Huang, T.S. (2006, January 10\u201312). Graph Embedded Analysis for Head Pose Estimation. Proceedings of the IEEE 7th International Conference on Automatic Face and Gesture Recognition (FGR\u201906), Southampton, UK."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"116479","DOI":"10.1016\/j.image.2021.116479","article-title":"Head pose estimation: A survey of the last ten years","volume":"99","author":"Khana","year":"2021","journal-title":"Signal Process. Image Commun."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"823","DOI":"10.1080\/01431160600746456","article-title":"A survey of image classification methods and techniques for improving classification performance","volume":"28","author":"Lu","year":"2007","journal-title":"Int. J. Remote Sens."},{"key":"ref_16","first-page":"197","article-title":"Estimating Driver Attentiveness Through Head Pose Using Hybrid Geometric-Based Method","volume":"1","author":"Vankayalapati","year":"2022","journal-title":"Smart Intell. Comput. Appl."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1016\/S0022-4375(03)00027-6","article-title":"Development of an algorithm for an EEG-based driver fatigue countermeasure","volume":"34","author":"Lal","year":"2003","journal-title":"J. Saf. Res."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"7777","DOI":"10.1007\/s13369-018-3189-z","article-title":"Modelling Human Body Pose for Action Recognition Using Deep Neural Networks","volume":"43","author":"Li","year":"2018","journal-title":"Arab. J. Sci. Eng."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1002\/eat.22998","article-title":"Eye-tracking research in eating disorders: A systematic review","volume":"52","author":"Harrison","year":"2019","journal-title":"Int. J. Eat. Disord."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"456","DOI":"10.1109\/TRA.2004.825269","article-title":"Road boundary detection and tracking using ladar sensing","volume":"20","author":"Wijesoma","year":"2004","journal-title":"IEEE Trans. Robot. Autom."},{"key":"ref_21","unstructured":"Zhang, X., Sugano, Y., and Bulling, A. (2019). CHI Conference on Human Factors in Computing Systems, ser. CHI \u201919, Association for Computing Machinery."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zhu, X., Lei, Z., Liu, X., Shi, H., and Li, S.Z. (2016, January 27\u201330). Face alignment across large poses: A 3D solution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.23"},{"key":"ref_23","unstructured":"Dinges, D.F., and Grace, R. (1999). PERCLOS: A Valid Psychophysiological Measure of Alertness as Assessed by Psychomotor Vigilance, FHWA-MCRT-98-006."},{"key":"ref_24","unstructured":"Hiraiwa, J., Vargas, E., and Toral, S. (2010, January 17\u201319). An FPGA based Embedded Vision System for Real-Time Motion Segmentation. Proceedings of the 17th International Conference on Systems, Signals and Image Processing, Rio de Janeiro, Brazil."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1007\/978-981-13-1921-1_13","article-title":"Deformable facial fitting using active appearance model for emotion recognition","volume":"Volume 104","author":"Videla","year":"2019","journal-title":"Smart Intelligent Computing and Applications; Smart Innovation, Systems and Technologies"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"108210","DOI":"10.1016\/j.patcog.2021.108210","article-title":"Head pose estimation using deep neural networks and 3D point clouds","volume":"121","author":"Xu","year":"2022","journal-title":"Pattern Recognit."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"107316","DOI":"10.1016\/j.patcog.2020.107316","article-title":"Single image-based head pose estimation with spherical parametrization and 3D morphing","volume":"103","author":"Yuan","year":"2020","journal-title":"Pattern Recognit."},{"key":"ref_28","unstructured":"Jiang, N., Yu, W., Tang, S., and Goto, S. (2011, January 4\u20136). Cascade Detector for Rapid Face Detection. Proceedings of the IEEE 7th International Colloquium on Signal Processing and its Applications, Penang, Malaysia."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"489","DOI":"10.1109\/34.55109","article-title":"Invariant Image Recognition by Zernike Moments","volume":"12","author":"Khotanzad","year":"1990","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"101731","DOI":"10.1016\/j.asej.2022.101731","article-title":"Novel eye-based features for head pose-free gaze estimation with web camera: New model and low-cost device","volume":"13","author":"Aunsri","year":"2022","journal-title":"Ain Shams Eng. J."},{"key":"ref_31","unstructured":"Svensson, U. (2004). Blink Behaviour-Based Drowsiness Detection, Linkoping University."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"5457","DOI":"10.1109\/TIP.2020.2984373","article-title":"Web-shaped model for head pose estimation: An approach for best exemplar selection","volume":"29","author":"Barra","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1052","DOI":"10.1109\/TVT.2004.830974","article-title":"Real Time Non-Intrusive Monitoring and Prediction of Driver Fatigue","volume":"53","author":"Ji","year":"2004","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_34","unstructured":"Vijayalakshmi, G.V.M., and Raj, A.N.J. (2016, January 19\u201321). Zernike Moments and Machine Learning Based Gender Classification Using Facial Images. Proceedings of the Eighth International Conference on Soft Computing and Pattern Recognition, Vellore, India."},{"key":"ref_35","unstructured":"Paul, V., and Jones, M. (2003, January 8\u201314). Rapid object detection using a boosted cascade of simple features. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Kauai, HI, USA."},{"key":"ref_36","first-page":"583","article-title":"Study of Zernike Moments Using Analytical Zernike Polynomials","volume":"3","author":"Hasan","year":"2012","journal-title":"Adv. Appl. Sci. Res."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Amayeh, G., Kasaei, S., Bebis, G., Tavakkoli, A., and Veropoul, K. (2007, January 12\u201315). Improvement of Zernike Moment Descriptors on Affine Transformed Shapes. Proceedings of the 9th International Symposium on Signal Processing and Its Applications ISSPA, Sharjah, United Arab Emirates.","DOI":"10.1109\/ISSPA.2007.4555333"},{"key":"ref_38","unstructured":"Fagertun, J., and Stegmann, M.B. (2020, February 13). Free Datasets for Statistical Models of Shape; Information and Mathematical Modeling, Technical University of Denmark, DTU. Available online: http:\/\/www.imm.dtu.dk\/~aam\/datasets\/datasets.html."},{"key":"ref_39","unstructured":"Weyrauch, B., Huang, J., Heisele, B., and Blanz, V. (July, January 27). Component-based Face Recognition with 3D Morphable Models. Proceedings of the IEEE Workshop on Face Processing in Video, Washington, DC, USA."},{"key":"ref_40","unstructured":"Gross, R., Li, S.Z., and Jain, A.K. (2020, February 13). Face Databases. Available online: https:\/\/www.face-rec.org\/databases\/."},{"key":"ref_41","unstructured":"Nordstrom, M.M., Larsen, M., Sierakowski, J., and Stegmann, M.B. (2004). The IMM Face Database\u2014An Annotated Dataset of 240 Face Images, Informatics and Mathematical Modelling, Technical University of Denmark."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/21\/8449\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:09:48Z","timestamp":1760144988000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/21\/8449"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,3]]},"references-count":41,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2022,11]]}},"alternative-id":["s22218449"],"URL":"https:\/\/doi.org\/10.3390\/s22218449","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,11,3]]}}}