{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:26:09Z","timestamp":1754155569824,"version":"3.41.2"},"reference-count":30,"publisher":"Emerald","issue":"8","license":[{"start":{"date-parts":[[2014,8,26]],"date-time":"2014-08-26T00:00:00Z","timestamp":1409011200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,8,26]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-heading\">Purpose<\/jats:title><jats:p>\u2013 The purpose of this paper is to present a novel coarticulation and speech synchronization framework compliant with MPEG-4 facial animation (FA).<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Design\/methodology\/approach<\/jats:title><jats:p>\u2013 The system the authors have developed uses MPEG-4 FA standard and other development to enable the creation, editing and playback of high-resolution 3D models; MPEG-4 animation streams; and is compatible with well-known related systems such as Greta and Xface. It supports text-to-speech for dynamic speech synchronization. The framework enables real-time model simplification using quadric-based surfaces.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Findings<\/jats:title><jats:p>\u2013 The preliminary experiments show that the coarticulation technique the authors have developed gives overall good and promising results when compared to related techniques.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Originality\/value<\/jats:title><jats:p>\u2013 The coarticulation approach provides realistic and high performance lip-sync animation, based on Cohen-Massaro's model of coarticulation adapted to MPEG-4 FA specification.<\/jats:p><\/jats:sec>","DOI":"10.1108\/k-07-2014-0139","type":"journal-article","created":{"date-parts":[[2014,10,3]],"date-time":"2014-10-03T11:46:53Z","timestamp":1412336813000},"page":"1165-1182","source":"Crossref","is-referenced-by-count":3,"title":["Coarticulation and speech synchronization in MPEG-4 based facial animation"],"prefix":"10.1108","volume":"43","author":[{"given":"Ricardo","family":"Leandro Parreira Duarte","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Abdennour","family":"El Rhalibi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Madjid","family":"Merabti","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","reference":[{"key":"key2020123000191063500_b1","unstructured":"Albrecht, I. , Haber, J. and Seidel, H.-P. (2002), \u201cSpeech synchronization for physics-based facial animation\u201d, Proc. Int'l Conf. in Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG)."},{"key":"key2020123000191063500_b2","doi-asserted-by":"crossref","unstructured":"Albrecht, I. , Schr\u00f6der, M. , Haber, J. and Seidel, H.-P. (2005), \u201cMixed feelings: expression of non-basic emotions in a muscle-based talking head\u201d, Special Issue of Journal of Virtual Reality on Language, Speech & Gesture, Vol. 8 No. 4, pp. 201-212.","DOI":"10.1007\/s10055-005-0153-5"},{"key":"key2020123000191063500_b3","unstructured":"Annosoft . \u2018Annosoft viseme to phoneme set\u201d, available at: www.annosoft.com\/phoneset.htm (accessed 14 January 2013)."},{"key":"key2020123000191063500_b5","doi-asserted-by":"crossref","unstructured":"Bell-Berti, F. and Harris, K. (1979), \u201cAnticipatory coarticulation: some implications from a study of lip rounding\u201d, Journal of the Acoustical Society of America, Vol. 65 No. 5, pp. 1268-1270.","DOI":"10.1121\/1.382794"},{"key":"key2020123000191063500_b6","doi-asserted-by":"crossref","unstructured":"Benguerel, A.-P. and Pichora-Fuller, M.K. (1982), \u201cCoarticulation effects in lipreading\u201d, Journal of Speech and Hearing Research, Vol. 25, pp. 600-607.","DOI":"10.1044\/jshr.2504.600"},{"key":"key2020123000191063500_b7","doi-asserted-by":"crossref","unstructured":"Bregler, C. , Covell, M. and Slaney, M. (1997), \u201cVideo rewrite: driving visual speech with audio\u201d, Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, ACM Press\/Addison-Wesley Publishing Co, 3-8 August, Los Angeles, California.","DOI":"10.1145\/258734.258880"},{"key":"key2020123000191063500_b26","doi-asserted-by":"crossref","unstructured":"Campbell, R. and Dodd, B. (1980), \u201cHearing by eye\u201d, Quarterly Journal of Experimental Psychology, Vol. 32, pp. 85-99.","DOI":"10.1080\/00335558008248235"},{"key":"key2020123000191063500_b8","unstructured":"Cohen, M.M. , Massaro, D.W. and Clark, R. (2002), \u201cTraining a talking head\u201d, Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, IEEE Computer Society, 14-16 October, Los Alamitos, California."},{"key":"key2020123000191063500_b9","doi-asserted-by":"crossref","unstructured":"Cosi, P. , Fusaro, A. and Tisato, G. (2003), \u201cLUCIA a new Italian talking-head based on a modified Cohen-Massaro's Labial coarticulation model\u201d, Eurospeech 2003, Geneva, Switzerland, Vol. III, pp. 2269-2272.","DOI":"10.21437\/Eurospeech.2003-634"},{"key":"key2020123000191063500_b10","doi-asserted-by":"crossref","unstructured":"Deng, Z. and Noh, J. (2007), \u201cComputer facial animation: a survey\u201d, in Deng, Z. and Neumann, U. (Eds), Data-Driven 3D Facial Animation, Springer, London, pp. 1-28.","DOI":"10.1007\/978-1-84628-907-1_1"},{"key":"key2020123000191063500_b11","doi-asserted-by":"crossref","unstructured":"El Rhalibi, A. , Carter, C. , Cooper, S. , Merabti, M. and Price, M. (2010), \u201cCharisma: high-performance web-based MPEG-compliant animation framework\u201d, ACM Comput. Entertain., Vol. 8 No. 2, pp. 1-15.","DOI":"10.1145\/1899687.1899690"},{"key":"key2020123000191063500_b13","doi-asserted-by":"crossref","unstructured":"Goff, B.L. (1997), \u201cAutomatic modeling of coarticulation in text-to-visual speech synthesis\u201d, in Kokkinakis G. , Fakotakis N . and Dermatas E. (Eds), Fifth European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, 22-25 September, ISCA.","DOI":"10.21437\/Eurospeech.1997-475"},{"key":"key2020123000191063500_b14","doi-asserted-by":"crossref","unstructured":"Kent, R.D. and Minifie, F.D. (1977), \u201cCoarticulation in recent speech production models\u201d, Journal of Phonetics, Vol. 5, pp. 115-135.","DOI":"10.1016\/S0095-4470(19)31123-4"},{"key":"key2020123000191063500_b15","doi-asserted-by":"crossref","unstructured":"King, S.A. and Parent, R.E. (2005), \u201cCreating speech-synchronized animation\u201d, IEEE Transactions on Visualization and Computer Graphics, Vol. 11 No. 3, pp. 341-352.","DOI":"10.1109\/TVCG.2005.43"},{"key":"key2020123000191063500_b16","unstructured":"Koray, B. (2004), \u201cXface: MPEG-4 based open source toolkit for 3D Facial Animation\u201d, in Maria, F.C. (Ed.), Proceedings of the Working Conference on Advanced Visual Interfaces, 25-28 May, ACM , Gallipoli."},{"key":"key2020123000191063500_b17","doi-asserted-by":"crossref","unstructured":"L\u00f6fqvist, A. (1990), \u201cSpeech as audible gestures\u201d, in Hardcastle, W.J. and Marchal, A. (Eds), Speech Production and Speech Modelling, Kluwer Academic Publishers, Dordrecht, pp. 289-322.","DOI":"10.1007\/978-94-009-2037-8_12"},{"key":"key2020123000191063500_b18","doi-asserted-by":"crossref","unstructured":"McGurk, H. and Macdonald, J. (1976), \u201cHearing lips and seeing voices\u201d, Nature, Vol. 264 No. -, pp. 746-774.","DOI":"10.1038\/264746a0"},{"key":"key2020123000191063500_b19","unstructured":"Massaro, D.W. (1998), Perceiving Talking Faces: From Speech Perception to a Behavioral Principle, MIT Press\/Bradford Books."},{"key":"key2020123000191063500_b20","unstructured":"Massaro, D.W. and Cohen, M.M. (1993), \u201cModeling coarticulation in synthetic visual speech\u201d, in Thalman, N.M. and Thalmann, D. (Eds), Models and techniques in computer animation, Springer-Verlag, Tokyo, pp. 139-156."},{"key":"key2020123000191063500_b21","doi-asserted-by":"crossref","unstructured":"\u00d6hman, S.E.G. (1967), \u201cNumerical model of coarticulation\u201d, The Journal of the Acoustical Society of America, Vol. 41, pp. 310-320.","DOI":"10.1121\/1.1910340"},{"key":"key2020123000191063500_b22","doi-asserted-by":"crossref","unstructured":"Pandzic, I.S. and Forchheimer, R. (2002), MPEG-4 Facial Animation The Standard, Implementation and Applications, John Wiley & Sons Ltd, Chichester.","DOI":"10.1002\/0470854626"},{"key":"key2020123000191063500_b23","unstructured":"Pasquariello, S. and Pelachaud, C. (2001), \u201cGreta: a simple facial animation engine\u201d, 6th Online World Conference on Soft Computing in Industrial Applications, Session on Soft Computing for Intelligent 3D Agents, 10-24 September."},{"key":"key2020123000191063500_b24","unstructured":"Pelachaud, C. (1991), Comunication and coarticulation in facial animation, PhD, University of Pennylvania, Philadelphia, PA."},{"key":"key2020123000191063500_b25","doi-asserted-by":"crossref","unstructured":"Pelachaud, C. (2002), \u201cVisual text-to-speech\u201d, MPEG4 Facial Animation \u2013 The Standard, Implementations and Applications, in Igor, S. Pandzic, Robert Forchheimer (Eds), John Wiley & Sons.","DOI":"10.1002\/0470854626.ch8"},{"key":"key2020123000191063500_b27","doi-asserted-by":"crossref","unstructured":"Schr\u00f6der, M. (2003), \u201cThe German text-to-speech synthesis system MARY: a tool for research development and teaching\u201d, International Journal of Speech Technology, Vol. 6 No. 4, pp. 365-377.","DOI":"10.1023\/A:1025708916924"},{"key":"key2020123000191063500_b28","unstructured":"Somasundaram, A. (2006), A facial animation model for expressive audio-visual speech, PhD, The Ohio State University, Columbus, OH."},{"key":"key2020123000191063500_b29","doi-asserted-by":"crossref","unstructured":"Sumedha, K. and Magnenat-Thalmann, N. (2003), \u201cVisyllable based speech animation\u201d, Computer Graphics Forum, Vol. 22 No. 3, pp. 631-639.","DOI":"10.1111\/1467-8659.t01-2-00711"},{"key":"key2020123000191063500_b30","doi-asserted-by":"crossref","unstructured":"Terry, L. and Katsaggelos, A.K. (2008), \u201cA phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition\u201d, Pattern Recognition, 2008. ICPR 2008. 19th International Conference, 8-11 December 2008, pp. 1-4.","DOI":"10.1109\/ICPR.2008.4761927"},{"key":"key2020123000191063500_b4","unstructured":"Ardor3D , available at: http:\/\/github.com\/Renanse\/Ardor3D (accessed 28 August 2014)."},{"key":"key2020123000191063500_b12","unstructured":"FaceGen , available at: www.facegen.com\/2014 (accessed 28 August 2014)."}],"container-title":["Kybernetes"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/K-07-2014-0139","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/K-07-2014-0139\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/K-07-2014-0139\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T21:49:08Z","timestamp":1753393748000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/k\/article\/43\/8\/1165-1182\/265263"}},"subtitle":[],"editor":[{"given":"Dr","family":"Mourad Oussalah and Professor Ali Hessami","sequence":"first","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2014,8,26]]},"references-count":30,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2014,8,26]]}},"alternative-id":["10.1108\/K-07-2014-0139"],"URL":"https:\/\/doi.org\/10.1108\/k-07-2014-0139","relation":{},"ISSN":["0368-492X"],"issn-type":[{"type":"print","value":"0368-492X"}],"subject":[],"published":{"date-parts":[[2014,8,26]]}}}