{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,27]],"date-time":"2025-03-27T16:06:21Z","timestamp":1743091581994,"version":"3.40.3"},"publisher-location":"Berlin, Heidelberg","reference-count":42,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"type":"print","value":"9783540646136"},{"type":"electronic","value":"9783540692355"}],"license":[{"start":{"date-parts":[[1998,1,1]],"date-time":"1998-01-01T00:00:00Z","timestamp":883612800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[1998]]},"DOI":"10.1007\/bfb0054762","type":"book-chapter","created":{"date-parts":[[2006,8,1]],"date-time":"2006-08-01T12:09:53Z","timestamp":1154434193000},"page":"514-528","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["A comparison of active shape model and scale decomposition based features for visual speech recognition"],"prefix":"10.1007","author":[{"given":"Iain","family":"Matthews","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"J. 
Andrew","family":"Bangham","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Richard","family":"Harvey","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stephen","family":"Cox","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2006,5,26]]},"reference":[{"key":"33_CR1","series-title":"NATO ASI Series F: Computer and Systems Sciences","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1007\/978-3-662-13015-5_35","volume-title":"Speechreading by Humans and Machines: Models, Systems and Applications","author":"A. Adjoudani","year":"1996","unstructured":"A. Adjoudani and C. Beno\u00cet. On the Integration of Auditory and Visual Parameters in an HMM-based ASR, pages 461\u2013471. In Stork and Hennecke [38], 1996."},{"key":"33_CR2","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1016\/0165-1684(94)90156-2","volume":"38","author":"J. A. Bangham","year":"1994","unstructured":"J. A. Bangham, T. G. Campbell, and R. V. Aldridge. Multiscale median and morphological filters used for 2d pattern recognition. Signal Processing, 38:387\u2013415, 1994.","journal-title":"Signal Processing"},{"issue":"5","key":"33_CR3","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1109\/34.494642","volume":"18","author":"J. A. Bangham","year":"1996","unstructured":"J. A. Bangham, P. Chardaire, C. J. Pye, and P. Ling. Multiscale nonlinear decomposition: The sieve decomposition theorem. IEEE Trans. Pattern Analysis and Machine Intelligence, 18(5):529\u2013539, 1996.","journal-title":"IEEE Trans. Pattern Analysis and Machine Intelligence"},{"issue":"3","key":"33_CR4","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1117\/12.243349","volume":"5","author":"J. A. Bangham","year":"1996","unstructured":"J. A. Bangham, R. Harvey, P. Ling, and R. V. Aldridge. 
Morphological scale-space preserving transforms in many dimensions. Journal of Electronic Imaging, 5(3):283\u2013299, July 1996.","journal-title":"Journal of Electronic Imaging"},{"key":"33_CR5","first-page":"189","volume":"1","author":"J. A. Bangham","year":"1996","unstructured":"J. A. Bangham, R. Harvey, P. Ling, and R. V. Aldridge. Nonlinear scale-space from n-dimensional sieves. Proc. European Conference on Computer Vision, 1:189\u2013198, 1996.","journal-title":"Proc. European Conference on Computer Vision"},{"issue":"6","key":"33_CR6","doi-asserted-by":"publisher","first-page":"1043","DOI":"10.1109\/83.503918","volume":"5","author":"J. A. Bangham","year":"1996","unstructured":"J. A. Bangham, P. Ling, and R. Young. Multiscale recursive medians, scale-space and transforms with applications to image processing. IEEE Trans. Image Processing, 5(6):1043\u20131048, 1996.","journal-title":"IEEE Trans. Image Processing"},{"unstructured":"C. Beno\u00cet and R. Campbell, editors. Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, Rhodes, Sept. 1997.","key":"33_CR7"},{"key":"33_CR8","first-page":"11","volume":"1","author":"A. Bosson","year":"1997","unstructured":"A. Bosson, R. Harvey, and J. A. Bangham. Robustness of scale space filters. In BMVC, volume 1, pages 11\u201321, 1997.","journal-title":"BMVC"},{"doi-asserted-by":"crossref","unstructured":"C. Bregler and S. M. Omohundro. Learning visual models for lipreading. In M. Shah and R. Jain, editors, Motion-Based Recognition, volume 9 of Computational Imaging and Vision, chapter 13, pages 301\u2013320. Kluwer Academic, 1997.","key":"33_CR9","DOI":"10.1007\/978-94-015-8935-2_13"},{"key":"33_CR10","series-title":"NATO ASI Series F: Computer and Systems Sciences","doi-asserted-by":"crossref","first-page":"409","DOI":"10.1007\/978-3-662-13015-5_31","volume-title":"Speechreading by Humans and Machines: Models, Systems and Applications","author":"C. Bregler","year":"1996","unstructured":"C. Bregler, S. M. 
Omohundro, and J. Shi. Towards a Robust Speechreading Dialog System, pages 409\u2013423. In Stork and Hennecke [38], 1996."},{"issue":"5","key":"33_CR11","first-page":"15","volume":"16","author":"N. M. Brooke","year":"1994","unstructured":"N. M. Brooke, M. J. Tomlinson, and R. K. Moore. Automatic speech recognition that includes visual speech cues. Proc. Institute of Acoustics, 16(5):15\u201322, 1994.","journal-title":"Proc. Institute of Acoustics"},{"key":"33_CR12","first-page":"7\/1","volume":"number 1996\/213","author":"C. C. Chibelushi","year":"1996","unstructured":"C. C. Chibelushi, S. Gandon, J. S. D. Mason, F. Deravi, and R. D. Johnston. Design issues for a digital audio-visual integrated database. In IEE Colloquium on Integrated Audio-Visual Processing, number 1996\/213, pages 7\/1\u20137\/7, Savoy Place, London, Nov. 1996.","journal-title":"IEE Colloquium on Integrated Audio-Visual Processing"},{"key":"33_CR13","series-title":"NATO ASI Series F: Computer and Systems Sciences","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1007\/978-3-662-13015-5_29","volume-title":"Speechreading by Humans and Machines: Models, Systems and Applications","author":"T. Coianiz","year":"1996","unstructured":"T. Coianiz, L. Torresani, and B. Caprile. 2D Deformable Models for Visual Speech Analysis, pages 391\u2013398. In Stork and Hennecke [38], 1996."},{"issue":"6","key":"33_CR14","doi-asserted-by":"publisher","first-page":"355","DOI":"10.1016\/0262-8856(94)90060-4","volume":"12","author":"T. F. Cootes","year":"1994","unstructured":"T. F. Cootes, A. Hill, C. J. Taylor, and J. Haslam. The use of active shape models for locating structures in medical images. 
Image and Vision Computing, 12(6):355\u2013366, 1994.","journal-title":"Image and Vision Computing"},{"key":"33_CR15","series-title":"NATO ASI Series F: Computer and Systems Sciences","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1007\/978-3-662-13015-5_23","volume-title":"Speechreading by Humans and Machines: Models, Systems and Applications","author":"P. Cosi","year":"1996","unstructured":"P. Cosi and E. M. Caldognetto. Lips and Jaw Movements for Vowels and Consonants: Spatio-Temporal Characteristics and Bimodal Recognition Applications, pages 291\u2013313. In Stork and Hennecke [38], 1996."},{"unstructured":"S. Cox, I. Matthews, and A. Bangham. Combining noise compensation with visual information in speech recognition. In Beno\u00cet and Campbell [7], pages 53\u201356.","key":"33_CR16"},{"key":"33_CR17","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1044\/jshr.1202.423","volume":"12","author":"N. P. Erber","year":"1969","unstructured":"N. P. Erber. Interaction of audition and vision in the recognition of oral speech stimuli. Journal of Speech and Hearing Research, 12:423\u2013425, 1969.","journal-title":"Journal of Speech and Hearing Research"},{"unstructured":"A. J. Goldschen. Continuous Automatic Speech Recognition by Lipreading. PhD thesis, George Washington University, 1993.","key":"33_CR18"},{"key":"33_CR19","first-page":"582","volume-title":"Lip reading from scale-space measurements","author":"R. Harvey","year":"1997","unstructured":"R. Harvey, I. Matthews, J. A. Bangham, and S. Cox. Lip reading from scale-space measurements. In Proc. Computer Vision and Pattern Recognition, pages 582\u2013587, Puerto Rico, June 1997. IEEE."},{"issue":"1","key":"33_CR20","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1016\/1047-3203(92)90028-R","volume":"3","author":"H. J. A. M. Heijmans","year":"1992","unstructured":"H. J. A. M. Heijmans, P. Nacken, A. Toet, and L. Vincent. Graph morphology. 
Journal of Visual Computing and Image Representation, 3(1):24\u201338, March 1992.","journal-title":"Journal of Visual Computing and Image Representation"},{"key":"33_CR21","series-title":"NATO ASI Series F: Computer and Systems Sciences","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1007\/978-3-662-13015-5_25","volume-title":"Speechreading by Humans and Machines: Models, Systems and Applications","author":"M. E. Hennecke","year":"1996","unstructured":"M. E. Hennecke, D. G. Stork, and K. V. Prasad. Visionary Speech: Looking Ahead to Practical Speechreading Systems, pages 331\u2013349. In Stork and Hennecke [38], 1996."},{"doi-asserted-by":"crossref","unstructured":"A. Hill and C. J. Taylor. Automatic landmark generation for point distribution models. In Proc. British Machine Vision Conference, 1994.","key":"33_CR22","DOI":"10.5244\/C.8.42"},{"key":"33_CR23","first-page":"376","volume":"II","author":"R. Kaucic","year":"1996","unstructured":"R. Kaucic, B. Dalton, and A. Blake. Real-time lip tracking for audio-visual speech recognition applications. In Proc. European Conference on Computer Vision, volume II, pages 376\u2013387, 1996.","journal-title":"Proc. European Conference on Computer Vision"},{"key":"33_CR24","doi-asserted-by":"crossref","first-page":"1138","DOI":"10.1126\/science.7146899","volume":"218","author":"P. K. Kuhl","year":"1982","unstructured":"P. K. Kuhl and A. N. Meltzoff. The bimodal perception of speech in infancy. Science, 218:1138\u20131141, Dec. 1982.","journal-title":"Science"},{"issue":"4","key":"33_CR25","doi-asserted-by":"crossref","first-page":"1035","DOI":"10.1002\/j.1538-7305.1983.tb03114.x","volume":"62","author":"S. E. Levinson","year":"1983","unstructured":"S. E. Levinson, L. R. Rabiner, and M. M. Sondhi. An introduction to the application of the theory of probabilistic functions of a markov process to automatic speech recognition. The Bell System Technical Journal, 62(4):1035\u20131074, Apr. 
1983.","journal-title":"The Bell System Technical Journal"},{"doi-asserted-by":"crossref","unstructured":"J. Luettin. Towards speaker independent continuous speechreading. In Proc. of the European Conference on Speech Communication and Technology, 1997.","key":"33_CR26","DOI":"10.21437\/Eurospeech.1997-528"},{"unstructured":"J. Luettin. Visual Speech and Speaker Recognition. PhD thesis, University of Sheffield, May 1997.","key":"33_CR27"},{"issue":"6","key":"33_CR28","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1002\/scj.4690220607","volume":"22","author":"K. Mase","year":"1991","unstructured":"K. Mase and A. Pentland. Automatic lipreading by optical-flow analysis. Systems and Computers in Japan, 22(6):67\u201375, 1991.","journal-title":"Systems and Computers in Japan"},{"key":"33_CR29","first-page":"8\/1","volume":"number 1996\/213","author":"I. Matthews","year":"1996","unstructured":"I. Matthews, J. A. Bangham, and S. Cox. Scale based features for audiovisual speech recognition. In IEE Colloquium on Integrated Audio-Visual Processing, number 1996\/213, pages 8\/1\u20138\/7, Savoy Place, London, Nov. 1996.","journal-title":"IEE Colloquium on Integrated Audio-Visual Processing"},{"key":"33_CR30","doi-asserted-by":"publisher","first-page":"746","DOI":"10.1038\/264746a0","volume":"264","author":"H. McGurk","year":"1976","unstructured":"H. McGurk and J. McDonald. Hearing lips and seeing voices. Nature, 264:746\u2013748, Dec. 1976.","journal-title":"Nature"},{"unstructured":"U. Meier, R. Stiefelhagen, and J. Yang. Preprocessing of visual speech under real world conditions. In Beno\u00cet and Campbell [7], pages 113\u2013116.","key":"33_CR31"},{"unstructured":"J. R. Movellan. Visual speech recognition with stochastic networks. In G. Tesauro, D. Touretzky, and T. 
Leen, editors, Advances in Neural Information Processing Systems, volume 7, 1995.","key":"33_CR32"},{"issue":"6","key":"33_CR33","doi-asserted-by":"publisher","first-page":"1275","DOI":"10.1121\/1.1908620","volume":"28","author":"K. K. Neely","year":"1956","unstructured":"K. K. Neely. Effect of visual factors on the intelligibility of speech. Journal of the Acoustical Society of America, 28(6):1275\u20131277, Nov. 1956.","journal-title":"Journal of the Acoustical Society of America"},{"issue":"4","key":"33_CR34","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1093\/comjnl\/7.4.308","volume":"7","author":"J. A. Nelder","year":"1965","unstructured":"J. A. Nelder and R. Mead. A simplex method for function minimisation. Computing Journal, 7(4):308\u2013313, 1965.","journal-title":"Computing Journal"},{"key":"33_CR35","volume-title":"PhD thesis","author":"E. D. Petajan","year":"1984","unstructured":"E. D. Petajan. Automatic Lipreading to Enhance Speech Recognition. PhD thesis, University of Illinois, Urbana-Champaign, 1984."},{"unstructured":"G. Potamianos, Cosatto, H. P. Graf, and D. B. Roe. Speaker independent audiovisual database for bimodal ASR. In Beno\u00cet and Campbell [7], pages 65\u201368.","key":"33_CR36"},{"key":"33_CR37","volume-title":"PhD thesis","author":"P. L. Silsbee","year":"1993","unstructured":"P. L. Silsbee. Computer Lipreading for Improved Accuracy in Automatic Speech Recognition. PhD thesis, The University of Texas, Austin, Dec. 1993."},{"key":"33_CR38","series-title":"NATO ASI Series F: Computer and Systems Sciences","volume-title":"Speechreading by Humans and Machines: Models, Systems and Applications","year":"1996","unstructured":"D. G. Stork and M. E. Hennecke, editors. Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series F: Computer and Systems Sciences. 
Springer-Verlag, Berlin, 1996."},{"issue":"2","key":"33_CR39","doi-asserted-by":"publisher","first-page":"212","DOI":"10.1121\/1.1907309","volume":"26","author":"W. H. Sumby","year":"1954","unstructured":"W. H. Sumby and I. Pollack. Visual contribution to speech intelligibility in noise. Journal of the Acoustical Society of America, 26(2):212\u2013215, Mar. 1954.","journal-title":"Journal of the Acoustical Society of America"},{"key":"33_CR40","first-page":"3","volume-title":"Hearing by Eye: The Psychology of Lip-reading","author":"Q. Summerfield","year":"1987","unstructured":"Q. Summerfield. Some preliminaries to a comprehensive account of audio-visual speech perception. In B. Dodd and R. Campbell, editors, Hearing by Eye: The Psychology of Lip-reading, pages 3\u201351. Lawrence Erlbaum Associates, London, 1987."},{"unstructured":"S. Young, J. Jansen, J. Odell, D. Ollason, and P. Woodland. The HTK Book. Cambridge University, 1996.","key":"33_CR41"},{"key":"33_CR42","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1109\/35.41402","volume":"27","author":"B. P. Yuhas","year":"1989","unstructured":"B. P. Yuhas, M. H. Goldstein, Jr., and T. J. Sejnowski. Integration of acoustic and visual speech signals using neural networks. 
IEEE Communications Magazine, 27:65\u201371, 1989.","journal-title":"IEEE Communications Magazine"}],"container-title":["Lecture Notes in Computer Science","Computer Vision \u2014 ECCV\u201998"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/BFb0054762","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,8]],"date-time":"2023-05-08T10:25:02Z","timestamp":1683541502000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/BFb0054762"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1998]]},"ISBN":["9783540646136","9783540692355"],"references-count":42,"URL":"https:\/\/doi.org\/10.1007\/bfb0054762","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[1998]]},"assertion":[{"value":"26 May 2006","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}}]}}