{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,3]],"date-time":"2025-09-03T10:26:57Z","timestamp":1756895217350,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":51,"publisher":"ACM","license":[{"start":{"date-parts":[[2015,8,24]],"date-time":"2015-08-24T00:00:00Z","timestamp":1440374400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2015,8,24]]},"DOI":"10.1145\/2801040.2801058","type":"proceedings-article","created":{"date-parts":[[2015,11,30]],"date-time":"2015-11-30T19:03:51Z","timestamp":1448910231000},"page":"75-82","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["Vision-Based Technique and Issues for Multimodal Interaction in Augmented Reality"],"prefix":"10.1145","author":[{"given":"Ajune Wanis","family":"Ismail","sequence":"first","affiliation":[{"name":"MaGIC-X (Media and Games Innovation Centre of Excellence), UTM-IRDA Digital Media Centre, Universiti Teknologi Malaysia, 81310 Skudai Johor, Malaysia"}]},{"given":"Mark","family":"Billinghurst","sequence":"additional","affiliation":[{"name":"Human Interface Technology, Laboratory New Zealand (HITLabNZ), University of Canterbury, 8041 Christchurch, New Zealand"}]},{"given":"Mohd Shahrizal","family":"Sunar","sequence":"additional","affiliation":[{"name":"MaGIC-X (Media and Games Innovation Centre of Excellence), UTM-IRDA Digital Media Centre, Universiti Teknologi Malaysia, 81310 Skudai Johor, Malaysia"}]}],"member":"320","published-online":{"date-parts":[[2015,8,24]]},"reference":[{"volume-title":"Computational Intelligence in Information Systems: Proceedings of the Fourth INNS Symposia Series on Computational Intelligence in Information Systems (INNS-CIIS 2014) (Vol. 331","author":"Ismail A. W.","key":"e_1_3_2_1_1_1","unstructured":"Ismail , A. W. , & Sunar , M. S. (2014, November ). Multimodal Fusion: Gesture and Speech Input in Augmented Reality Environment . In Computational Intelligence in Information Systems: Proceedings of the Fourth INNS Symposia Series on Computational Intelligence in Information Systems (INNS-CIIS 2014) (Vol. 331 , p. 245). Springer . Ismail, A. W., & Sunar, M. S. (2014, November). Multimodal Fusion: Gesture and Speech Input in Augmented Reality Environment. In Computational Intelligence in Information Systems: Proceedings of the Fourth INNS Symposia Series on Computational Intelligence in Information Systems (INNS-CIIS 2014) (Vol. 331, p. 245). Springer."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/38.963459"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISMAR.2008.4637362"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCC.2007.893280"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/958432.958438"},{"volume-title":"Proceedings of the International CLASS Workshop on Natural, Intelligent and Effective Interaction in Multimodal Dialogue Systems, 52--61","author":"Corradini A.","key":"e_1_3_2_1_6_1","unstructured":"Corradini , A. and Cohen , P .: 2002, On the Relationships among Speech, Gestures, and Object Manipulation in Virtual Environments: Initial Evidence , Proceedings of the International CLASS Workshop on Natural, Intelligent and Effective Interaction in Multimodal Dialogue Systems, 52--61 . Corradini, A. and Cohen, P.: 2002, On the Relationships among Speech, Gestures, and Object Manipulation in Virtual Environments: Initial Evidence, Proceedings of the International CLASS Workshop on Natural, Intelligent and Effective Interaction in Multimodal Dialogue Systems, 52--61."},{"issue":"1","key":"e_1_3_2_1_7_1","first-page":"67","article-title":"Human Factors and Design Issues in Multimodal (Speech\/Gesture) Interface","volume":"2","author":"Lim C. J.","year":"2008","unstructured":"Lim , C. J. , Younghwan Pan , and Jane Lee. \" Human Factors and Design Issues in Multimodal (Speech\/Gesture) Interface .\" JDCTA 2 . 1 ( 2008 ): 67 -- 77 . Lim, C. J., Younghwan Pan, and Jane Lee. \"Human Factors and Design Issues in Multimodal (Speech\/Gesture) Interface.\" JDCTA 2.1 (2008): 67--77.","journal-title":"JDCTA"},{"key":"e_1_3_2_1_8_1","volume-title":"Technology, literacy and learning: A multimodal approach","author":"Jewitt","year":"2006","unstructured":"Jewitt , Carey. Technology, literacy and learning: A multimodal approach . Psychology Press , 2006 . Jewitt, Carey. Technology, literacy and learning: A multimodal approach. Psychology Press, 2006."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","DOI":"10.4018\/978-1-59904-066-0","volume-title":"Emerging technologies of augmented reality: interfaces and design. IGI Global","author":"Haller M.","year":"2007","unstructured":"Haller , M. , Billinghurst , M. , & Thomas , B. H. (Eds.). ( 2007 ). Emerging technologies of augmented reality: interfaces and design. IGI Global . Haller, M., Billinghurst, M., & Thomas, B. H. (Eds.). (2007). Emerging technologies of augmented reality: interfaces and design. IGI Global."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/800250.807503"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1180639.1180831"},{"key":"e_1_3_2_1_12_1","first-page":"407","volume-title":"Huang","author":"Pan H.","year":"1999","unstructured":"Pan , H. , Liang , Z.P. , Anastasio , T.J. , Huang , T.S. : Exploiting the dependencies in information fusion. In : CVPR , vol. 2 , pp. 407 -- 412 ( 1999 ). Pan, H., Liang, Z.P., Anastasio, T.J., Huang, T.S.: Exploiting the dependencies in information fusion. In: CVPR, vol. 2, pp. 407--412 (1999)."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/266180.266328"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/5.664275"},{"volume-title":"Proceedings of 1997 IEEE International Conference on Robotics and Automation, 2, 1329--1334","author":"Chu C.","key":"e_1_3_2_1_15_1","unstructured":"Chu , C. , Dani , T. , and Gadh , R .: 1997, Multimodal Interface for a virtual reality based computer aided design system . Proceedings of 1997 IEEE International Conference on Robotics and Automation, 2, 1329--1334 . Chu, C., Dani, T., and Gadh, R.: 1997, Multimodal Interface for a virtual reality based computer aided design system. Proceedings of 1997 IEEE International Conference on Robotics and Automation, 2, 1329--1334."},{"key":"e_1_3_2_1_16_1","volume-title":"An approach to reducing screen clutter in mobile computing.\" Mobile Human-Computer Interaction-MobileHCI","author":"Nicholson","year":"2004","unstructured":"Nicholson , Mark, and Paul Vickers . \"Pen-Based gestures : An approach to reducing screen clutter in mobile computing.\" Mobile Human-Computer Interaction-MobileHCI 2004 . Springer Berlin Heidelberg , 2004. 320--324. Nicholson, Mark, and Paul Vickers. \"Pen-Based gestures: An approach to reducing screen clutter in mobile computing.\" Mobile Human-Computer Interaction-MobileHCI 2004. Springer Berlin Heidelberg, 2004. 320--324."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/330534.330538"},{"key":"e_1_3_2_1_18_1","unstructured":"ARToolKit http:\/\/www.hitl.washington.edu\/artoolkit  ARToolKit http:\/\/www.hitl.washington.edu\/artoolkit"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.5555\/946248.946836"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1027933.1027944"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/1647314.1647368"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/11941354_28"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISAR.2000.880934"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISWC.2007.4373785"},{"volume-title":"Proceedings of Joint Eurographics-IEEE TCVG Symposium on visualization, 195--200","author":"Krum D. M.","key":"e_1_3_2_1_26_1","unstructured":"Krum , D. M. , Omotesto , O. , Ribarsky , W. , Starner , T. , and Hodges , L. F .: 2002, Speech and gesture control of a whole earth 3D visualization environment , Proceedings of Joint Eurographics-IEEE TCVG Symposium on visualization, 195--200 . Krum, D. M., Omotesto, O., Ribarsky, W., Starner, T., and Hodges, L. F.: 2002, Speech and gesture control of a whole earth 3D visualization environment, Proceedings of Joint Eurographics-IEEE TCVG Symposium on visualization, 195--200."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2522848.2532202"},{"key":"e_1_3_2_1_28_1","unstructured":"Lee Minkyung. \"Multimodal Speech-Gesture Interaction with 3D Objects in Augmented Reality Environments.\" (2010).  Lee Minkyung. \"Multimodal Speech-Gesture Interaction with 3D Objects in Augmented Reality Environments.\" (2010)."},{"key":"e_1_3_2_1_29_1","volume-title":"Design and implementation of multi-modal AR-based interaction for cooperative planning tasks","author":"Neumann A.","year":"2011","unstructured":"Neumann A. Design and implementation of multi-modal AR-based interaction for cooperative planning tasks . Bielefeld : Bielefeld University ; 2011 . Neumann A. Design and implementation of multi-modal AR-based interaction for cooperative planning tasks. Bielefeld: Bielefeld University; 2011."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2522848.2522892"},{"key":"e_1_3_2_1_31_1","volume-title":"Multimodal corpora: Beyond audio and video (IVA 2013 workshop).","author":"Pitsch K","year":"2013","unstructured":"Pitsch K , Neumann A , Schnier C , Hermann T. Augmented reality as a tool for linguistic research: Intercepting and manipulating multimodal interaction . In: Multimodal corpora: Beyond audio and video (IVA 2013 workshop). ; 2013 : 23--29. Pitsch K, Neumann A, Schnier C, Hermann T. Augmented reality as a tool for linguistic research: Intercepting and manipulating multimodal interaction. In: Multimodal corpora: Beyond audio and video (IVA 2013 workshop).; 2013: 23--29."},{"key":"e_1_3_2_1_32_1","volume-title":"Intuitiveness 3D objects Interaction in Augmented Reality Using S-PI Algorithm. TELKOMNIKA Indonesian Journal of Electrical Engineering, 11(7), 3561--3567","author":"Ismail A. W.","year":"2013","unstructured":"Ismail , A. W. , & Sunar , M. S. ( 2013 ). Intuitiveness 3D objects Interaction in Augmented Reality Using S-PI Algorithm. TELKOMNIKA Indonesian Journal of Electrical Engineering, 11(7), 3561--3567 . Ismail, A. W., & Sunar, M. S. (2013). Intuitiveness 3D objects Interaction in Augmented Reality Using S-PI Algorithm. TELKOMNIKA Indonesian Journal of Electrical Engineering, 11(7), 3561--3567."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2543651.2543667"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2468356.2468527"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10055-013-0230-0"},{"key":"e_1_3_2_1_36_1","first-page":"1","volume-title":"2013 IEEE International Symposium on","author":"Bai","year":"2013","unstructured":"Bai , Huidong, Lei Gao , Jihad El-Sana, and Mark Billinghurst . \"Markerless 3 D gesture-based interaction for handheld augmented reality interfaces.\" In Mixed and Augmented Reality (ISMAR) , 2013 IEEE International Symposium on , pp. 1 -- 6 . IEEE, 2013 Bai, Huidong, Lei Gao, Jihad El-Sana, and Mark Billinghurst. \"Markerless 3D gesture-based interaction for handheld augmented reality interfaces.\" In Mixed and Augmented Reality (ISMAR), 2013 IEEE International Symposium on, pp. 1--6. IEEE, 2013"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/67449.67496"},{"key":"e_1_3_2_1_38_1","volume-title":"Processing Iconic Gestures in a Multimodal Virtual Construction Environment","author":"Fr\u00f6hlich C.","year":"2009","unstructured":"Fr\u00f6hlich , C. , Biermann , P. , Latoschik , M. E. , & Wachsmuth , I. ( 2009 ). Processing Iconic Gestures in a Multimodal Virtual Construction Environment . In: M. Dias, S. Gibet, M. M. Wanderley, R. Bastos (Eds.), Gesture-Based Human-Computer Interaction and Simulation. Springer LNAI 5085, 187--192. Fr\u00f6hlich, C., Biermann, P., Latoschik, M. E., & Wachsmuth, I. (2009). Processing Iconic Gestures in a Multimodal Virtual Construction Environment. In: M. Dias, S. Gibet, M. M. Wanderley, R. Bastos (Eds.), Gesture-Based Human-Computer Interaction and Simulation. Springer LNAI 5085, 187--192."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-012-9356-9"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2013.2246148"},{"key":"e_1_3_2_1_41_1","volume-title":"Computational Intelligence in Information Systems (pp. 201--210)","author":"Rozan M. R.","year":"2015","unstructured":"Rozan , M. R. , Sidik , M. K. M. , Sunar , M. S. , & Omar , A. H. ( 2015 ). KIHECT&copy;\u00a9: Reliability of Hand-Eye Coordination among Rugby Players Using Consumer Depth Camera . In Computational Intelligence in Information Systems (pp. 201--210) . Springer International Publishing . Rozan, M. R., Sidik, M. K. M., Sunar, M. S., & Omar, A. H. (2015). KIHECT&copy;\u00a9: Reliability of Hand-Eye Coordination among Rugby Players Using Consumer Depth Camera. In Computational Intelligence in Information Systems (pp. 201--210). Springer International Publishing."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.895976"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964963"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.5555\/1051814"},{"key":"e_1_3_2_1_45_1","volume-title":"Issues in Visual and Audio-Visual Speech Processing","author":"Bailly E.","year":"2004","unstructured":"G. Bailly , E. Vatikiotis , and P. Perrier , Issues in Visual and Audio-Visual Speech Processing , MIT Press , 2004 . G. Bailly, E. Vatikiotis, and P. Perrier, Issues in Visual and Audio-Visual Speech Processing, MIT Press, 2004."},{"key":"e_1_3_2_1_46_1","first-page":"598","volume-title":"10th International Conference on Information Science, Signal Processing and their Applications","author":"Navarathna P.","year":"2010","unstructured":"R. Navarathna , P. Lucey , D. Dean , C. Fookes , and S. Sridharan , \" Lip Detection for Audio-Visual Speech Recognition In-Car Environment,\" in Proc . 10th International Conference on Information Science, Signal Processing and their Applications , 2010 , pp. 598 -- 601 . R. Navarathna, P. Lucey, D. Dean, C. Fookes, and S. Sridharan, \"Lip Detection for Audio-Visual Speech Recognition In-Car Environment,\" in Proc. 10th International Conference on Information Science, Signal Processing and their Applications, 2010, pp. 598--601."},{"key":"e_1_3_2_1_47_1","volume-title":"Japan","author":"Shen S.","year":"2010","unstructured":"P. Shen , S. Tamura , and S. Hayamizu , \" Evaluation of Real-time Audio-Visual Speech Recognition,\" presented at International Conference on Audio-Visual Speech Processing , Japan , 2010 . P. Shen, S. Tamura, and S. Hayamizu, \"Evaluation of Real-time Audio-Visual Speech Recognition,\" presented at International Conference on Audio-Visual Speech Processing, Japan, 2010."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2008.12.004"},{"key":"e_1_3_2_1_49_1","first-page":"84","volume-title":"USA","author":"Rashad H.M.","year":"2010","unstructured":"M.Z. Rashad , H.M. El-Bakry , I.R. Isma'il , and N. Mastorakis , \" An Overview of Text-to-Speech Synthesis Techniques,\" in Proc. of the 4th International Conference on Communications and Information Technology , USA , 2010 , pp. 84 -- 89 . M.Z. Rashad, H.M. El-Bakry, I.R. Isma'il, and N. Mastorakis, \"An Overview of Text-to-Speech Synthesis Techniques,\" in Proc. of the 4th International Conference on Communications and Information Technology, USA, 2010, pp. 84--89."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/MPRV.2003.1186727"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSPEC.2013.6395297"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12193-014-0172-1"}],"event":{"name":"VINCI '15: The 8th International Symposium on Visual Information Communication and Interaction","acronym":"VINCI '15","location":"Tokyo AA Japan"},"container-title":["Proceedings of the 8th International Symposium on Visual Information Communication and Interaction"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2801040.2801058","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2801040.2801058","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:07:13Z","timestamp":1750223233000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2801040.2801058"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,8,24]]},"references-count":51,"alternative-id":["10.1145\/2801040.2801058","10.1145\/2801040"],"URL":"https:\/\/doi.org\/10.1145\/2801040.2801058","relation":{},"subject":[],"published":{"date-parts":[[2015,8,24]]},"assertion":[{"value":"2015-08-24","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}