{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,11]],"date-time":"2025-04-11T08:50:31Z","timestamp":1744361431781,"version":"3.37.3"},"reference-count":42,"publisher":"World Scientific Pub Co Pte Ltd","issue":"07","funder":[{"DOI":"10.13039\/501100001809","name":"the National Natural Science Foundation of China","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"the Beijing Natural Science Foundation of China"},{"name":"the Science and Technology Development Program of Beijing Municipal Education Commission"},{"name":"the Great Wall Scholar Reserved Talent Program of North China University of Technology"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2018,7]]},"abstract":"<jats:p> As a significant component of the Human Computer Interface (HCI), automatic lip reading is designed to understand the content of speech by interpreting the movements of the lips. Although the performance of automatic lip reading systems is easily affected by challenging conditions such as noise, illumination and low resolution, advances in the relevant fields, together with gains in computing capability, have improved the robustness of such systems, making them more adaptable to real environments. In this paper, we survey the field and give a detailed discussion of the current state and level of development of automatic lip reading. We focus on feature extraction and recognition model algorithms. We also compare and analyze the various visual speech databases for their characteristics and functions in speech recognition systems. In addition, we describe the challenges and offer our insights into future research directions for automatic lip reading. 
<\/jats:p>","DOI":"10.1142\/s0218001418560074","type":"journal-article","created":{"date-parts":[[2017,12,18]],"date-time":"2017-12-18T01:33:58Z","timestamp":1513560838000},"page":"1856007","source":"Crossref","is-referenced-by-count":13,"title":["Review on Automatic Lip Reading Techniques"],"prefix":"10.1142","volume":"32","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3494-3640","authenticated-orcid":false,"given":"Yuanyao","family":"Lu","sequence":"first","affiliation":[{"name":"School of Electronic and Information Engineering, North China University of Technology, Beijing, P. R. China"}]},{"given":"Jie","family":"Yan","sequence":"additional","affiliation":[{"name":"School of Electronic and Information Engineering, North China University of Technology, Beijing, P. R. China"}]},{"given":"Ke","family":"Gu","sequence":"additional","affiliation":[{"name":"School of Electronic and Information Engineering, North China University of Technology, Beijing, P. R. China"}]}],"member":"219","published-online":{"date-parts":[[2018,3,14]]},"reference":[{"issue":"1","key":"S0218001418560074BIB001","first-page":"1","volume":"10","author":"Alex J. S. R.","year":"2015","journal-title":"J. Eng. Appl. Sci."},{"key":"S0218001418560074BIB002","doi-asserted-by":"publisher","DOI":"10.1007\/s11760-014-0615-x"},{"key":"S0218001418560074BIB003","doi-asserted-by":"publisher","DOI":"10.1016\/j.jml.2015.06.008"},{"key":"S0218001418560074BIB005","first-page":"4945","author":"Bahdanau D.","year":"2016","journal-title":"Comput. Sci."},{"key":"S0218001418560074BIB006","doi-asserted-by":"publisher","DOI":"10.1214\/aoms\/1177699147"},{"issue":"10","key":"S0218001418560074BIB008","first-page":"1","volume":"3","author":"Bose A.","year":"2012","journal-title":"Int. J. Sci. Eng. 
Res."},{"key":"S0218001418560074BIB011","doi-asserted-by":"publisher","DOI":"10.1016\/j.rinp.2016.12.026"},{"key":"S0218001418560074BIB013","doi-asserted-by":"publisher","DOI":"10.1109\/83.605417"},{"key":"S0218001418560074BIB015","doi-asserted-by":"publisher","DOI":"10.1121\/1.2229005"},{"key":"S0218001418560074BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/34.927467"},{"key":"S0218001418560074BIB017","doi-asserted-by":"publisher","DOI":"10.12733\/jics20105482"},{"key":"S0218001418560074BIB018","first-page":"1","author":"Czyzewski A.","year":"2017","journal-title":"J. Inte. Inf. Syst."},{"issue":"1","key":"S0218001418560074BIB020","first-page":"648","volume":"23","author":"Fan X.","year":"2012","journal-title":"Control Decision Conf."},{"key":"S0218001418560074BIB022","doi-asserted-by":"crossref","unstructured":"A. J. Goldschen,  O. N. Garcia and  E. D. Petajan ,  Continuous Automatic Speech Recognition by Lipreading  (George Washington University,  1993), pp.  321\u2013343.","DOI":"10.1007\/978-94-015-8935-2_14"},{"issue":"2","key":"S0218001418560074BIB023","first-page":"275","volume":"26","author":"Haque S.","year":"2013","journal-title":"Appl. Comput. Vision"},{"issue":"3","key":"S0218001418560074BIB025","first-page":"174","volume":"41","author":"Hong X.","year":"2005","journal-title":"Comput. Eng. Appl."},{"key":"S0218001418560074BIB026","doi-asserted-by":"publisher","DOI":"10.1162\/NECO_a_00843"},{"key":"S0218001418560074BIB027","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2016.03.003"},{"key":"S0218001418560074BIB030","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvcir.2015.04.013"},{"key":"S0218001418560074BIB034","doi-asserted-by":"publisher","DOI":"10.1007\/BF00133570"},{"key":"S0218001418560074BIB035","doi-asserted-by":"crossref","unstructured":"T. H. N. Le and  M. Savvides ,  A Novel Shape Constrained Feature-based Active Contour Model for Lips\/Mouth Segmentation in the Wild  (Elsevier Science Inc.,  2016), pp.  
23\u201333.","DOI":"10.1016\/j.patcog.2015.11.009"},{"key":"S0218001418560074BIB036","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijleo.2017.02.017"},{"key":"S0218001418560074BIB037","doi-asserted-by":"publisher","DOI":"10.1587\/transinf.2015EDL8002"},{"key":"S0218001418560074BIB041","first-page":"1382","volume":"9","author":"Dong L.","year":"2005","journal-title":"EURASIP J. Appl. Signal Process."},{"key":"S0218001418560074BIB045","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2010.07.078"},{"issue":"6","key":"S0218001418560074BIB046","first-page":"564","volume":"22","author":"Mase K.","year":"1991","journal-title":"Comput. Japan"},{"key":"S0218001418560074BIB047","doi-asserted-by":"publisher","DOI":"10.1002\/scj.4690220607"},{"key":"S0218001418560074BIB049","doi-asserted-by":"publisher","DOI":"10.1109\/34.982900"},{"key":"S0218001418560074BIB055","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-74048-3"},{"key":"S0218001418560074BIB057","doi-asserted-by":"publisher","DOI":"10.1145\/2000824.2000827"},{"issue":"1","key":"S0218001418560074BIB064","first-page":"193","volume":"150","author":"Ribes R. J.","year":"2007","journal-title":"Speechreading by Humans and Machines"},{"key":"S0218001418560074BIB066","doi-asserted-by":"publisher","DOI":"10.1007\/BFb0016021"},{"key":"S0218001418560074BIB070","doi-asserted-by":"publisher","DOI":"10.1109\/CISP.2010.5646264"},{"key":"S0218001418560074BIB071","doi-asserted-by":"publisher","DOI":"10.1049\/iet-ipr.2014.1014"},{"key":"S0218001418560074BIB072","doi-asserted-by":"publisher","DOI":"10.1016\/j.cmpb.2014.02.017"},{"key":"S0218001418560074BIB074","doi-asserted-by":"publisher","DOI":"10.1121\/1.1907309"},{"key":"S0218001418560074BIB079","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2004.826773"},{"issue":"1","key":"S0218001418560074BIB081","first-page":"42","volume":"25","author":"Yanjun X. 
U.","year":"2000","journal-title":"Acta Acust."},{"issue":"1","key":"S0218001418560074BIB082","first-page":"117","volume":"2007","author":"Yoshinaga T.","year":"2004","journal-title":"Proe. RobuSt"},{"key":"S0218001418560074BIB083","doi-asserted-by":"publisher","DOI":"10.1016\/j.jfranklin.2013.12.021"},{"key":"S0218001418560074BIB086","doi-asserted-by":"publisher","DOI":"10.1007\/s11390-014-1491-0"},{"key":"S0218001418560074BIB087","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2009.2030637"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001418560074","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,7]],"date-time":"2019-08-07T13:45:33Z","timestamp":1565185533000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218001418560074"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,3,14]]},"references-count":42,"journal-issue":{"issue":"07","published-online":{"date-parts":[[2018,3,14]]},"published-print":{"date-parts":[[2018,7]]}},"alternative-id":["10.1142\/S0218001418560074"],"URL":"https:\/\/doi.org\/10.1142\/s0218001418560074","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2018,3,14]]}}}