{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:43:43Z","timestamp":1750308223370,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":27,"publisher":"ACM","license":[{"start":{"date-parts":[[2004,10,15]],"date-time":"2004-10-15T00:00:00Z","timestamp":1097798400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2004,10,15]]},"DOI":"10.1145\/1026799.1026810","type":"proceedings-article","created":{"date-parts":[[2005,1,30]],"date-time":"2005-01-30T17:55:16Z","timestamp":1107107716000},"page":"54-62","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["Multimodal group action clustering in meetings"],"prefix":"10.1145","author":[{"given":"Dong","family":"Zhang","sequence":"first","affiliation":[{"name":"IDIAP Research Institute, Martigny, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Gatica-Perez","sequence":"additional","affiliation":[{"name":"IDIAP Research Institute, Martigny, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Samy","family":"Bengio","sequence":"additional","affiliation":[{"name":"IDIAP Research Institute, Martigny, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Iain","family":"McCowan","sequence":"additional","affiliation":[{"name":"IDIAP Research Institute, Martigny, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guillaume","family":"Lathoud","sequence":"additional","affiliation":[{"name":"IDIAP Research Institute, Martigny, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2004,10,15]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"ICSLP","author":"Ajmera J.","year":"2002","unstructured":"J. Ajmera , H. Bourlard , I. Lapidot , and I. McCowan . Unknown-multiple speaker clustering using HMM . In ICSLP , Colorado , 2002 . J. Ajmera, H. Bourlard, I. Lapidot, and I. McCowan. Unknown-multiple speaker clustering using HMM. In ICSLP, Colorado, 2002."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2003.1318476"},{"key":"e_1_3_2_1_3_1","volume-title":"Proc. CVPR Workshop on Cues in Communication","author":"Basu S.","year":"2001","unstructured":"S. Basu , T. Choudhury , B. Clarkson , and A. Pentland . Towards measuring human interactions in conversational settings . In Proc. CVPR Workshop on Cues in Communication , Kawai , Dec. 2001 . S. Basu, T. Choudhury, B. Clarkson, and A. Pentland. Towards measuring human interactions in conversational settings. In Proc. CVPR Workshop on Cues in Communication, Kawai, Dec. 2001."},{"key":"e_1_3_2_1_4_1","volume-title":"Proc. NIPS","author":"Bengio S.","year":"2003","unstructured":"S. Bengio . An asynchronous hidden Markov model for audio-visual speech recognition . In Proc. NIPS , 2003 . S. Bengio. An asynchronous hidden Markov model for audio-visual speech recognition. In Proc. NIPS, 2003."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/641007.641112"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-04619-7_8"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2004.1327189"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/6046.865479"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073483.1073495"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2001.937608"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1037\/0022-3514.35.7.523"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAU.1972.1162410"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2003.1202751"},{"key":"e_1_3_2_1_14_1","volume-title":"IDIAP","author":"McCowan I.","year":"2003","unstructured":"I. McCowan , D. Gatica-Perez , S. Bengio , and G. Lathoud . Automatic analysis of multimodal group actions in meetings. IDIAP-RR 27 , IDIAP , Martigny, Switzerland , May 2003 . I. McCowan, D. Gatica-Perez, S. Bengio, and G. Lathoud. Automatic analysis of multimodal group actions in meetings. IDIAP-RR 27, IDIAP, Martigny, Switzerland, May 2003."},{"key":"e_1_3_2_1_15_1","volume-title":"Groups: Interaction and Performance","author":"McGrath J. E.","year":"1984","unstructured":"J. E. McGrath . Groups: Interaction and Performance . Prentice-Hall , 1984 . J. E. McGrath. Groups: Interaction and Performance. Prentice-Hall, 1984."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072133.1072203"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1998.675368"},{"key":"e_1_3_2_1_19_1","volume-title":"Proc. ICMI","author":"Oliver N.","year":"2002","unstructured":"N. Oliver , E. Horvitz , and A. Garg . Layered representations for learning and inferring office activity from multiple sensory channels . In Proc. ICMI , Pittsburgh , Oct. 2002 . N. Oliver, E. Horvitz, and A. Garg. Layered representations for learning and inferring office activity from multiple sensory channels. In Proc. ICMI, Pittsburgh, Oct. 2002."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.868684"},{"key":"e_1_3_2_1_21_1","volume-title":"Fundamentals of Speech Recognition","author":"Rabiner L. R.","year":"1993","unstructured":"L. R. Rabiner and B.-H. Juang . Fundamentals of Speech Recognition . Prentice-Hall , 1993 . L. R. Rabiner and B.-H. Juang. Fundamentals of Speech Recognition. Prentice-Hall, 1993."},{"key":"e_1_3_2_1_22_1","volume-title":"Proc. Int. Work. on AFGR","author":"Starner T.","year":"1995","unstructured":"T. Starner and A. Pentland . Visual recognition of american sign language using HMMs . In Proc. Int. Work. on AFGR , Zurich , 1995 . T. Starner and A. Pentland. Visual recognition of american sign language using HMMs. In Proc. Int. Work. on AFGR, Zurich, 1995."},{"key":"e_1_3_2_1_23_1","volume-title":"Proc. IEEE ICASSP","author":"Waibel A.","year":"1999","unstructured":"A. Waibel , M. Bett , F. Metze , K. Ries , T. Schaaf , T. Schultz , H. Soltau , H. Yu , and K. Zechner . Advances in automatic meeting record creation and access . In Proc. IEEE ICASSP , May 1999 . A. Waibel, M. Bett, F. Metze, K. Ries, T. Schaaf, T. Schultz, H. Soltau, H. Yu, and K. Zechner. Advances in automatic meeting record creation and access. In Proc. IEEE ICASSP, May 1999."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2003.1318425"},{"key":"e_1_3_2_1_25_1","volume-title":"Proc. ICME","author":"Xie L.","year":"2003","unstructured":"L. Xie , S.-F. Chang , A. Divakaran , and H. Sun . Unsupervised discovery of multilevel statistical video structures using hierarchical hidden markov models . In Proc. ICME , July 2003 . L. Xie, S.-F. Chang, A. Divakaran, and H. Sun. Unsupervised discovery of multilevel statistical video structures using hierarchical hidden markov models. In Proc. ICME, July 2003."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2001.990935"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.5555\/1032638.1033008"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/1896300.1896418"}],"event":{"name":"MM04: 2004 12th Annual ACM International Conference on Multimedia","sponsor":["SIGGRAPH ACM Special Interest Group on Computer Graphics and Interactive Techniques","ACM Association for Computing Machinery","SIGMM ACM Special Interest Group on Multimedia"],"location":"New York NY USA","acronym":"MM04"},"container-title":["Proceedings of the ACM 2nd international workshop on Video surveillance &amp; sensor networks"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1026799.1026810","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1026799.1026810","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T16:31:23Z","timestamp":1750264283000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1026799.1026810"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,10,15]]},"references-count":27,"alternative-id":["10.1145\/1026799.1026810","10.1145\/1026799"],"URL":"https:\/\/doi.org\/10.1145\/1026799.1026810","relation":{},"subject":[],"published":{"date-parts":[[2004,10,15]]},"assertion":[{"value":"2004-10-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}