{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,25]],"date-time":"2025-09-25T18:11:37Z","timestamp":1758823897247,"version":"3.43.0"},"reference-count":59,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2018,7,5]],"date-time":"2018-07-05T00:00:00Z","timestamp":1530748800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Interact. Mob. Wearable Ubiquitous Technol."],"published-print":{"date-parts":[[2018,7,5]]},"abstract":"<jats:p>Studies have linked excessive TV watching to obesity in adults and children. In addition, TV content represents an important source of visual exposure to cues which can effect a broad set of health-related behaviors. This paper presents a ubiquitous sensing system which can detect moments of screen-watching during daily life activities. We utilize machine learning techniques to analyze video captured by a head-mounted wearable camera. Although wearable cameras do not directly provide a measure of visual attention, we show that attention to screens can be reliably inferred by detecting and tracking the location of screens within the camera's field-of-view. We utilize a computational model of the head movements associated with TV watching to identify TV watching events. We have evaluated our method on 13 hours of TV watching videos recorded from 16 participants in a home environment. Our model achieves a precision of 0.917 and a recall of 0.945 in identifying attention to screens. We validated the third-person annotations used to determine accuracy and further evaluated our system in a multi-device environment using gold standard attention measurements obtained from a wearable eye-tracker. Finally, we tested our system in a natural environment. Our system achieves a precision of 0.87 and a recall of 0.82 on challenging videos capturing the daily life activities of participants.<\/jats:p>","DOI":"10.1145\/3214291","type":"journal-article","created":{"date-parts":[[2018,7,5]],"date-time":"2018-07-05T15:19:10Z","timestamp":1530803950000},"page":"1-27","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Watching the TV Watchers"],"prefix":"10.1145","volume":"2","author":[{"given":"Yun C.","family":"Zhang","sequence":"first","affiliation":[{"name":"Georgia Institute of Technology, Center for Behavioral Imaging and School of Electrical and Computer Engineering, Atlanta, GA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"James M.","family":"Rehg","sequence":"additional","affiliation":[{"name":"Georgia Institute of Technology, Center for Behavioral Imaging and College of Computing, Atlanta, GA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,7,5]]},"reference":[{"key":"e_1_2_2_1_1","volume-title":"Catrine Tudor-Locke, Jennifer L Greer, Jesse Vezina, Melicia C Whitt-Glover, and Arthur S Leon.","author":"Ainsworth Barbara E","year":"2011","unstructured":"Barbara E Ainsworth, William L Haskell, Stephen D Herrmann, Nathanael Meckes, David R Bassett Jr, Catrine Tudor-Locke, Jennifer L Greer, Jesse Vezina, Melicia C Whitt-Glover, and Arthur S Leon. 2011. 2011 Compendium of Physical Activities: a second update of codes and MET values. Medicine and science in sports and exercise 43, 8 (2011), 1575--1581."},{"key":"e_1_2_2_2_1","unstructured":"Aitor Apaolaza Andy Brown Caroline Jay and Simon Harper. 2014. Understanding the division of attention between TV and companion content: experiment 2 without eye-tracking. Technical report."},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2851581.2892483"},{"key":"e_1_2_2_4_1","volume-title":"Simple Online and Realtime Tracking. arXiv:1602.00763","author":"Bewley Alex","year":"2016","unstructured":"Alex Bewley, Zonguan Ge, Lionel Ott, Fabio Ramos, and Ben Upcroft. 2016. Simple Online and Realtime Tracking. arXiv:1602.00763 (2016). http:\/\/arxiv.org\/abs\/1602.00763"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/2047196.2047256"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2016.7532629"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3131902"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1080\/13506280902834696"},{"key":"e_1_2_2_9_1","first-page":"807","article-title":"Do We Fatten Our Children at the Television Set","volume":"75","author":"Dietz William H.","year":"1985","unstructured":"William H. Dietz and Steven L. Gortmaker. 1985. Do We Fatten Our Children at the Television Set? Obesity and Television Viewing in Children and Adolescents. Pediatrics 75, 5 (1985), 807--812. arXiv:http:\/\/pediatrics.aappublications.org\/content\/75\/5\/807.full.pdf http:\/\/pediatrics.aappublications.org\/content\/75\/5\/807","journal-title":"Obesity and Television Viewing in Children and Adolescents. Pediatrics"},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0275-4"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.333"},{"key":"e_1_2_2_12_1","unstructured":"RR Fletcher D Chamberlain D Richman N Oreskovic and E Taveras. 2016. Wearable sensor and algorithm for automated measurement of screen time. (2016) 1--8."},{"key":"e_1_2_2_13_1","unstructured":"Daniel P Hallahan James M Kauffman and Paige C Pullen. 2011. Exceptional learners: An introduction to special education. Pearson Higher Ed."},{"key":"e_1_2_2_14_1","volume-title":"Television viewing and unhealthy diet: implications for children and media interventions. Health communication 24, 7","author":"Harris Jennifer L","year":"2009","unstructured":"Jennifer L Harris and John A Bargh. 2009. Television viewing and unhealthy diet: implications for children and media interventions. Health communication 24, 7 (2009), 660--673."},{"key":"e_1_2_2_15_1","volume-title":"Eye movements in natural behavior. Trends in cognitive sciences 9, 4","author":"Hayhoe Mary","year":"2005","unstructured":"Mary Hayhoe and Dana Ballard. 2005. Eye movements in natural behavior. Trends in cognitive sciences 9, 4 (2005), 188--194."},{"volume-title":"Automatic Face and Gesture Recognition (FG), 2013 10th IEEE International Conference and Workshops on. 1--7.","author":"Hernandez J.","key":"e_1_2_2_16_1","unstructured":"J. Hernandez, Zicheng Liu, G. Hulten, D. DeBarr, K. Krum, and Z. Zhang. 2013. Measuring the engagement level of TV viewers. In Automatic Face and Gesture Recognition (FG), 2013 10th IEEE International Conference and Workshops on. 1--7."},{"key":"e_1_2_2_17_1","volume-title":"Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861","author":"Howard Andrew G","year":"2017","unstructured":"Andrew G Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017)."},{"key":"e_1_2_2_18_1","volume-title":"TabletGaze: unconstrained appearance-based gaze estimation in mobile tablets. arXiv preprint arXiv:1508.01244","author":"Huang Qiong","year":"2015","unstructured":"Qiong Huang, Ashok Veeraraghavan, and Ashutosh Sabharwal. 2015. TabletGaze: unconstrained appearance-based gaze estimation in mobile tablets. arXiv preprint arXiv:1508.01244 (2015)."},{"key":"e_1_2_2_19_1","volume-title":"SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and &lt","author":"Iandola Forrest N","year":"2016","unstructured":"Forrest N Iandola, Song Han, Matthew W Moskewicz, Khalid Ashraf, William J Dally, and Kurt Keutzer. 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and &lt; 0.5 MB model size. arXiv preprint arXiv:1602.07360 (2016)."},{"key":"e_1_2_2_20_1","volume-title":"Extensive television viewing and the development of attention and learning difficulties during adolescence. Archives of pediatrics 8 adolescent medicine 161, 5","author":"Johnson Jeffrey G","year":"2007","unstructured":"Jeffrey G Johnson, Patricia Cohen, Stephanie Kasen, and Judith S Brook. 2007. Extensive television viewing and the development of attention and learning difficulties during adolescence. Archives of pediatrics 8 adolescent medicine 161, 5 (2007), 480--486."},{"key":"e_1_2_2_21_1","volume-title":"Television viewing and aggressive behavior during adolescence and adulthood. Science 295, 5564","author":"Johnson Jeffrey G","year":"2002","unstructured":"Jeffrey G Johnson, Patricia Cohen, Elizabeth M Smailes, Stephanie Kasen, and Judith S Brook. 2002. Television viewing and aggressive behavior during adolescence and adulthood. Science 295, 5564 (2002), 2468--2471."},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2009.5459462"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1279540.1279548"},{"key":"e_1_2_2_24_1","volume-title":"Using the SenseCam to improve classifications of sedentary behavior in free-living settings. American journal of preventive medicine 44, 3","author":"Kerr Jacqueline","year":"2013","unstructured":"Jacqueline Kerr, Simon J Marshall, Suneeta Godbole, Jacqueline Chen, Amanda Legge, Aiden R Doherty, Paul Kelly, Melody Oliver, Hannah M Badland, and Charlie Foster. 2013. Using the SenseCam to improve classifications of sedentary behavior in free-living settings. American journal of preventive medicine 44, 3 (2013), 290--296."},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","unstructured":"D. Koller and N. Friedman. 2009. Probabilistic Graphical Models: Principles and Techniques. MIT Press.","DOI":"10.5555\/1795555"},{"key":"e_1_2_2_26_1","volume-title":"Eye Tracking for Everyone. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Krafka Kyle","year":"2016","unstructured":"Kyle Krafka, Aditya Khosla, Petr Kellnhofer, Harini Kannan, Suchendra Bhandarkar, Wojciech Matusik, and Antonio Torralba. 2016. Eye Tracking for Everyone. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1068\/p2935"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1542\/peds.2007-0978"},{"key":"e_1_2_2_29_1","volume-title":"Seventh International Conference on Machine Vision (ICMV","volume":"9445","author":"Lee Dongjin","year":"2015","unstructured":"Dongjin Lee, Woo han Yun, Chan kyu Park, H Yoon, Jaehong Kim, and CH Park. 2015. Measuring the engagement level of children for multiple intelligence test using Kinect. In Seventh International Conference on Machine Vision (ICMV 2014), Vol. 9445. International Society for Optics and Photonics, 944529."},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.399"},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.43"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1088\/0967-3334\/37\/10\/1834"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1111\/acer.12986"},{"key":"e_1_2_2_36_1","volume-title":"The party effect: prediction of future alcohol use based on exposure to specific alcohol advertising content. Addiction","author":"Morgenstern Matthis","year":"2016","unstructured":"Matthis Morgenstern, Zhongze Li, Zhigang Li, and James D Sargent. 2016. The party effect: prediction of future alcohol use based on exposure to specific alcohol advertising content. Addiction (2016)."},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV.2014.6835987"},{"key":"e_1_2_2_38_1","first-page":"10","article-title":"Understanding the relationship between television use and unhealthy eating: The mediating role of fatalistic views of eating well and nutritional knowledge","volume":"1","author":"Northup Temple","year":"2014","unstructured":"Temple Northup. 2014. Understanding the relationship between television use and unhealthy eating: The mediating role of fatalistic views of eating well and nutritional knowledge. The International Journal of Communication and Health 1, 3 (2014), 10--15.","journal-title":"The International Journal of Communication and Health"},{"key":"e_1_2_2_39_1","unstructured":"U.S. Bureau of Labor Statistics. 2016. ATUS Table. Time spent in detailed primary activities and percent of the civilian population engaging in each activity averages per day by sex annual averages. https:\/\/www.bls.gov\/tus\/a1_2016.pdf"},{"key":"e_1_2_2_40_1","volume-title":"The attention system of the human brain: 20 years after. Annual review of neuroscience 35","author":"Petersen Steven E","year":"2012","unstructured":"Steven E Petersen and Michael I Posner. 2012. The attention system of the human brain: 20 years after. Annual review of neuroscience 35 (2012), 73--89."},{"key":"e_1_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_32"},{"key":"e_1_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.5555\/3152576"},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In Advances in Neural Information Processing Systems (NIPS).","DOI":"10.5555\/2969239.2969250"},{"key":"e_1_2_2_44_1","unstructured":"AGB Nielsen Media Research. {n. d.}. Unitam. http:\/\/mail.agb-it.com\/"},{"volume-title":"Computer Vision and Pattern Recognition, 1994. Proceedings CVPR '94., 1994 IEEE Computer Society Conference on. 593--600","author":"Shi Jianbo","key":"e_1_2_2_45_1","unstructured":"Jianbo Shi and C. Tomasi. 1994. Good features to track. In Computer Vision and Pattern Recognition, 1994. Proceedings CVPR '94., 1994 IEEE Computer Society Conference on. 593--600."},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2501988.2501994"},{"key":"e_1_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2009.5204354"},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1542\/peds.2009-1508"},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-014-2352-0"},{"key":"e_1_2_2_50_1","volume-title":"The impact of television viewing on brain structures: cross-sectional and longitudinal analyses. Cerebral Cortex","author":"Takeuchi Hikaru","year":"2013","unstructured":"Hikaru Takeuchi, Yasuyuki Taki, Hiroshi Hashizume, Kohei Asano, Michiko Asano, Yuko Sassa, Susumu Yokota, Yuka Kotozaki, Rui Nouchi, and Ryuta Kawashima. 2013. The impact of television viewing on brain structures: cross-sectional and longitudinal analyses. Cerebral Cortex (2013), bht315."},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1167\/11.5.5"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1167\/7.14.4"},{"key":"e_1_2_2_53_1","unstructured":"Ray van Brandenburg Hans van den Berg M Oskar van Deventer and Ir Mike Schenk. 2009. Towards multi-user personalized TV services introducing combined RFID Digest authentication. (2009)."},{"key":"e_1_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00779-015-0862-z"},{"key":"e_1_2_2_55_1","volume-title":"Automatic Face and Gesture Recognition (FG), 2015 11th IEEE International Conference and Workshops on","volume":"1","author":"Ye Zhefan","year":"2015","unstructured":"Zhefan Ye, Yin Li, Yun Liu, Chanel Bridges, Agata Rozga, and James M Rehg. 2015. Detecting bids for eye contact using a wearable camera. In Automatic Face and Gesture Recognition (FG), 2015 11th IEEE International Conference and Workshops on, Vol. 1. IEEE, 1--8."},{"key":"e_1_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299183"},{"key":"e_1_2_2_57_1","volume-title":"Pyramidal implementation of the Lucas Kanade feature tracker","author":"Bouguet Jean","year":"2000","unstructured":"Jean yves Bouguet. 2000. Pyramidal implementation of the Lucas Kanade feature tracker. Intel Corporation, Microprocessor Research Labs (2000)."},{"key":"e_1_2_2_58_1","volume-title":"Shufflenet: An extremely efficient convolutional neural network for mobile devices. arXiv preprint arXiv:1707.01083","author":"Zhang Xiangyu","year":"2017","unstructured":"Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, and Jian Sun. 2017. Shufflenet: An extremely efficient convolutional neural network for mobile devices. arXiv preprint arXiv:1707.01083 (2017)."},{"key":"e_1_2_2_59_1","volume-title":"Children's television viewing and cognitive outcomes: a longitudinal analysis of national data. Archives of Pediatrics 8 Adolescent Medicine 159, 7","author":"Zimmerman Frederick J","year":"2005","unstructured":"Frederick J Zimmerman and Dimitri A Christakis. 2005. Children's television viewing and cognitive outcomes: a longitudinal analysis of national data. Archives of Pediatrics 8 Adolescent Medicine 159, 7 (2005), 619--625."}],"container-title":["Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3214291","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3214291","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,12]],"date-time":"2025-08-12T15:07:14Z","timestamp":1755011234000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3214291"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,5]]},"references-count":59,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2018,7,5]]}},"alternative-id":["10.1145\/3214291"],"URL":"https:\/\/doi.org\/10.1145\/3214291","relation":{},"ISSN":["2474-9567"],"issn-type":[{"type":"electronic","value":"2474-9567"}],"subject":[],"published":{"date-parts":[[2018,7,5]]},"assertion":[{"value":"2017-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-04-01","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-07-05","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}