{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,5]],"date-time":"2025-12-05T12:12:16Z","timestamp":1764936736298,"version":"build-2065373602"},"reference-count":30,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2014,8,5]],"date-time":"2014-08-05T00:00:00Z","timestamp":1407196800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/3.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>There is an urgent need for intelligent home surveillance systems to provide home security, monitor health conditions, and detect emergencies of family members. One of the fundamental problems to realize the power of these intelligent services is how to detect, track, and identify people at home. Compared to RFID tags that need to be worn all the time, vision-based sensors provide a natural and nonintrusive solution. Observing that body appearance and body build, as well as face, provide valuable cues for human identification, we model and record multi-view faces, full-body colors and shapes of family members in an appearance database by using two Kinects located at a home\u2019s entrance. Then the Kinects and another set of color cameras installed in other parts of the house are used to detect, track, and identify people by matching the captured color images with the registered templates in the appearance database. People are detected and tracked by multisensor fusion (Kinects and color cameras) using a Kalman filter that can handle duplicate or partial measurements. People are identified by multimodal fusion (face, body appearance, and silhouette) using a track-based majority voting. Moreover, the appearance-based human detection, tracking, and identification modules can cooperate seamlessly and benefit from each other. Experimental results show the effectiveness of the human tracking across multiple sensors and human identification considering the information of multi-view faces, full-body clothes, and silhouettes. The proposed home surveillance system can be applied to domestic applications in digital home security and intelligent healthcare.<\/jats:p>","DOI":"10.3390\/s140814253","type":"journal-article","created":{"date-parts":[[2014,8,5]],"date-time":"2014-08-05T10:59:37Z","timestamp":1407236377000},"page":"14253-14277","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Appearance-Based Multimodal Human Tracking and Identification for Healthcare in the Digital Home"],"prefix":"10.3390","volume":"14","author":[{"given":"Mau-Tsuen","family":"Yang","sequence":"first","affiliation":[{"name":"Department of Computer Science & Information Engineering, National Dong-Hwa University, No. 1, Sec. 2, Da-Hsueh Rd., Shoufeng, Hualien 974, Taiwan"}]},{"given":"Shen-Yen","family":"Huang","sequence":"additional","affiliation":[{"name":"Department of Computer Science & Information Engineering, National Dong-Hwa University, No. 1, Sec. 2, Da-Hsueh Rd., Shoufeng, Hualien 974, Taiwan"}]}],"member":"1968","published-online":{"date-parts":[[2014,8,5]]},"reference":[{"key":"ref_1","unstructured":"Microsoft Corp. Kinect for Xbox 360 Available online: http:\/\/www.xbox.com\/en-GB\/kinect."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2235","DOI":"10.1177\/0956797613492986","article-title":"Unaware Person Recognition from the Body When Face Identification Fails","volume":"24","author":"Rice","year":"2013","journal-title":"Psychol. Sci."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1109\/MPRV.2004.1316817","article-title":"A smart sensor to detect the falls of the elderly","volume":"3","author":"Sixsmith","year":"2004","journal-title":"IEEE Pervasive Comput."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"16920","DOI":"10.3390\/s121216920","article-title":"Privacy-preserved behavior analysis and fall detection by an infrared ceiling sensor network","volume":"12","author":"Tao","year":"2012","journal-title":"Sensors"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ni, B., Dat, N., and Moulin, P. (2012, January 25\u201330). RGBD-Camera Based Get-up Event Detection for Hospital Fall Prevention. Kyoto, Japan.","DOI":"10.1109\/ICASSP.2012.6287947"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"6695","DOI":"10.3390\/s120506695","article-title":"Categorization of Indoor Places Using the Kinect Sensor","volume":"12","author":"Mozos","year":"2012","journal-title":"Sensors"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"16985","DOI":"10.3390\/s131216985","article-title":"Fall Risk Assessment and Early-Warning for Toddler Behaviors at Home","volume":"13","author":"Yang","year":"2013","journal-title":"Sensors"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1023\/B:VISI.0000013087.49260.fb","article-title":"Robust real-time object detection","volume":"57","author":"Viola","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_9","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201326). Histograms of Oriented Gradients for Human Detection. San Diego, CA, USA."},{"key":"ref_10","unstructured":"Zhu, Q., Avidan, S., Yeh, M., and Cheng, K. (2006, January 17\u201322). Fast Human Detection Using a Cascade of Histograms of Oriented Gradients. New York, NY, USA."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Dollar, P., Tu, Z., Perona, P., and Belongie, S. (2009, January 7\u201310). Integral Channel Features. London, UK.","DOI":"10.5244\/C.23.91"},{"key":"ref_12","unstructured":"Dollar, P., Belongie, S., and Perona, P. (September, January 31). The Fastest Pedestrian Detector in the West. London, UK."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Benenson, R., Mathias, M., Timofte, R., and Gool, L. (2012, January 16\u201321). Pedestrian detection at 100 frames per second. Providence, RI, USA.","DOI":"10.1109\/CVPR.2012.6248017"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"790","DOI":"10.1109\/34.400568","article-title":"Mean Shift, Mode Seeking Clustering","volume":"17","author":"Cheng","year":"1995","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_15","unstructured":"Bradski, G. (1998, January 19\u201321). Real Time Face and Object Tracking as a Component of a Perceptual User Interface. Princeton, NJ, USA."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1631","DOI":"10.1109\/TPAMI.2005.205","article-title":"Online Selection of Discriminative Tracking Features","volume":"27","author":"Collins","year":"2005","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Babenko, B., Yang, M., and Belongie, S. (2009, January 20\u201325). Visual Tracking with Online Multiple Instance Learning. Miami, FL, USA.","DOI":"10.1109\/CVPRW.2009.5206737"},{"key":"ref_18","first-page":"1","article-title":"Tracking-Learning-Detection","volume":"6","author":"Kalal","year":"2010","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1162\/jocn.1991.3.1.71","article-title":"Eigenfaces for recognition","volume":"3","author":"Turk","year":"1991","journal-title":"J. Cogn. Neurosci."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"711","DOI":"10.1109\/34.598228","article-title":"Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection","volume":"19","author":"Belhumeur","year":"1997","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1007\/s00138-006-0063-x","article-title":"Person tracking and reidentification: Introducing Panoramic Appearance Map (PAM) for feature representation","volume":"18","author":"Gandhi","year":"2007","journal-title":"Mach. Vis. Appl."},{"key":"ref_22","unstructured":"Prosser, B., Zheng, W., Gong, S., and Xiang, T. (September, January 31). Person re-identification by support vector ranking. London, UK."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., and Blake, A. (2011, January 21\u201323). Real-Time Human Pose Recognition in Parts from Single Depth Images. Colorado Springs, CO, USA.","DOI":"10.1109\/CVPR.2011.5995316"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Li, B., Mian, A., Liu, W., and Krishna, A. (2013, January 15\u201317). Using Kinect for Face Recognition Under Varying Poses, Expressions, Illumination and Disguise. Clearwater, FL, USA.","DOI":"10.1109\/WACV.2013.6475017"},{"key":"ref_25","unstructured":"Ahmed, N. (2012, January 9\u201311). A System for 360 degree Acquisition and 3D Animation Reconstruction using Multiple RGB-D Cameras. Singapore."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1049\/iet-ipr:20070113","article-title":"Moving cast shadow detection by exploiting multiple cues","volume":"2","author":"Yang","year":"2008","journal-title":"IET Image Process."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1115\/1.3662552","article-title":"A New Approach to Linear Filtering and Prediction Problems","volume":"82","author":"Kalman","year":"1960","journal-title":"J. Basic Eng."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1016\/j.inffus.2004.07.002","article-title":"GPS\/IMU data fusion using multisensor Kalman filtering: Introduction of contextual aspects","volume":"7","author":"Caron","year":"2006","journal-title":"Inf. Fusion"},{"key":"ref_29","unstructured":"Ahonen, T., Hadid, A., and Pietikainen, M. (2004). Computer Vision\u2014ECCV 2004, Springer-Verlag."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"850","DOI":"10.1109\/34.232073","article-title":"Comparing Images Using the Hausdorff Distance","volume":"15","author":"Huttenlocher","year":"1993","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/14\/8\/14253\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T21:14:25Z","timestamp":1760217265000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/14\/8\/14253"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,8,5]]},"references-count":30,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2014,8]]}},"alternative-id":["s140814253"],"URL":"https:\/\/doi.org\/10.3390\/s140814253","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2014,8,5]]}}}