{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T12:53:41Z","timestamp":1769950421328,"version":"3.49.0"},"reference-count":35,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2009,8,1]],"date-time":"2009-08-01T00:00:00Z","timestamp":1249084800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Appl. Percept."],"published-print":{"date-parts":[[2009,8]]},"abstract":"<jats:p>Video-based eye trackers produce an output video showing where a subject is looking, the subject's Point-of-Regard (POR), for each frame of a video of the scene. This information can be extremely valuable, but its analysis can be overwhelming. Analysis of eye-tracked data from portable (wearable) eye trackers is especially daunting, as the scene video may be constantly changing, rendering automatic analysis more difficult. A common way to begin analysis of POR data is to group these data into fixations. In a previous article, we compared the fixations identified (i.e., start and end marked) automatically by an algorithm to those identified manually by users (i.e., manual coders). Here, we extend this automatic identification of fixations to tagging each fixation to a Region-of-Interest (ROI). Our fixation tagging algorithm, FixTag, requires the relative 3D positions of the vertices of ROIs and calibration of the scene camera. Fixation tagging is performed by first calculating the camera projection matrices for keyframes of the scene video (captured by the eye tracker) via an iterative structure and motion recovery algorithm. These matrices are then used to project 3D ROI vertices into the keyframes. A POR for each fixation is matched to a point in the closest keyframe, which is then checked against the 2D projected ROI vertices for tagging. Our fixation tags were compared to those produced by three manual coders tagging the automatically identified fixations for two different scenarios. For each scenario, eight ROIs were defined along with the 3D positions of eight calibration points. Therefore, 17 tags were available for each fixation: 8 for ROIs, 8 for calibration points, and 1 for \u201cother.\u201d For the first scenario, a subject was tracked looking through products on four store shelves, resulting in 182 automatically identified fixations. Our automatic tagging algorithm produced tags that matched those produced by at least one manual coder for 181 out of the 182 fixations (99.5% agreement). For the second scenario, a subject was tracked looking at two posters on adjoining walls of a room. Our algorithm matched at least one manual coder's tag for 169 fixations out of 172 automatically identified (98.3% agreement).<\/jats:p>","DOI":"10.1145\/1577755.1577759","type":"journal-article","created":{"date-parts":[[2009,9,1]],"date-time":"2009-09-01T17:52:59Z","timestamp":1251827579000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["FixTag: An algorithm for identifying and tagging fixations to simplify the analysis of data collected by portable eye trackers"],"prefix":"10.1145","volume":"6","author":[{"given":"Susan M.","family":"Munn","sequence":"first","affiliation":[{"name":"Rochester Institute of Technology, Rochester, NY"}]},{"given":"Jeff B.","family":"Pelz","sequence":"additional","affiliation":[{"name":"Rochester Institute of Technology, Rochester, NY"}]}],"member":"320","published-online":{"date-parts":[[2009,9,3]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/968363.968386"},{"key":"e_1_2_1_2_1","unstructured":"Bouguet J. 2007. Camera Calibration Toolbox for Matlab\u00ae. www.vision.caltech.edu\/bouguetj\/calib doc\/.  Bouguet J. 2007. Camera Calibration Toolbox for Matlab\u00ae. www.vision.caltech.edu\/bouguetj\/calib doc\/."},{"key":"e_1_2_1_3_1","volume-title":"Movements of the Eyes","author":"Carpenter R. H. S.","unstructured":"Carpenter , R. H. S. 1988. Movements of the Eyes , 2 nd Ed. Pion Limited , London . Carpenter, R. H. S. 1988. Movements of the Eyes, 2nd Ed. Pion Limited, London.","edition":"2"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/505008.505012"},{"key":"e_1_2_1_5_1","volume-title":"Eye Tracking Methodology: Theory and Practice","author":"Duchowski A. T.","unstructured":"Duchowski , A. T. 2007. Eye Tracking Methodology: Theory and Practice , 2 nd ed. Springer-Verlag , London . Duchowski, A. T. 2007. Eye Tracking Methodology: Theory and Practice, 2nd ed. Springer-Verlag, London.","edition":"2"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/507072.507094"},{"key":"e_1_2_1_7_1","first-page":"402","article-title":"A note on the difference between the camera resection and the pnp problem","volume":"31","author":"Feng-Mei S.","year":"2005","unstructured":"Feng-Mei , S. 2005 . A note on the difference between the camera resection and the pnp problem . Acta Automatica Sinica 31 , 3, 402 -- 405 . Feng-Mei, S. 2005. A note on the difference between the camera resection and the pnp problem. Acta Automatica Sinica 31, 3, 402--405.","journal-title":"Acta Automatica Sinica"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/358669.358692"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 4th Alvey Vision Conference. 147--151","author":"Harris C.","unstructured":"Harris , C. and Stephens , M . 1988. A combined corner and edge detector . In Proceedings of the 4th Alvey Vision Conference. 147--151 . Harris, C. and Stephens, M. 1988. A combined corner and edge detector. In Proceedings of the 4th Alvey Vision Conference. 147--151."},{"key":"e_1_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Hartley R. and Zisserman A. 2004. Multiple View Geometry in Computer Vision 2nd ed. Cambridge University Press Cambridge UK.   Hartley R. and Zisserman A. 2004. Multiple View Geometry in Computer Vision 2nd ed. Cambridge University Press Cambridge UK.","DOI":"10.1017\/CBO9780511811685"},{"key":"e_1_2_1_11_1","volume-title":"Software functions. School of Computer Science and Software Engineering","author":"Huynh D.","unstructured":"Huynh , D. 2004. Software functions. School of Computer Science and Software Engineering , The University of Western Australia . http:\/\/www.csse.uwa.edu.au\/~du\/Software\/Welcome.html. Huynh, D. 2004. Software functions. School of Computer Science and Software Engineering, The University of Western Australia. http:\/\/www.csse.uwa.edu.au\/~du\/Software\/Welcome.html."},{"key":"e_1_2_1_12_1","volume-title":"Eye Movements and Visual Cognition: Scene Perception and Reading","author":"Irwin D. E.","unstructured":"Irwin , D. E. 1992. Eye Movements and Visual Cognition: Scene Perception and Reading . Springer-Verlag , Berlin , 146--165. Irwin, D. E. 1992. Eye Movements and Visual Cognition: Scene Perception and Reading. Springer-Verlag, Berlin, 146--165."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/0010-0285(76)90015-3"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the European Signal Processing Conference (EUSIPCO'06)","author":"Knorr S.","unstructured":"Knorr , S. , Imre , E. , Alatan , A. A. , and Sikora , T . 2006. A geometric segmentation approach for the 3D reconstruction of dynamic scenes in 2D video sequences . In Proceedings of the European Signal Processing Conference (EUSIPCO'06) . Knorr, S., Imre, E., Alatan, A. A., and Sikora, T. 2006. A geometric segmentation approach for the 3D reconstruction of dynamic scenes in 2D video sequences. In Proceedings of the European Signal Processing Conference (EUSIPCO'06)."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1117309.1117348"},{"key":"e_1_2_1_16_1","volume-title":"Matlab and Octave functions for computer vision and image processing. School of Computer Science and Software Engineering","author":"Kovesi P. D.","unstructured":"Kovesi , P. D. 2007. Matlab and Octave functions for computer vision and image processing. School of Computer Science and Software Engineering , The University of Western Australia . http:\/\/www.csse.uwa.edu.au\/~pk\/research\/matlabfns\/. Kovesi, P. D. 2007. Matlab and Octave functions for computer vision and image processing. School of Computer Science and Software Engineering, The University of Western Australia. http:\/\/www.csse.uwa.edu.au\/~pk\/research\/matlabfns\/."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1117309.1117350"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.531"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1080\/09500340701467827"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2003.1217601"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBME.1974.324318"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the 11th IEEE International Conference on Computer Vision (ICCV'07)","author":"Moreno-Noguer F.","unstructured":"Moreno-Noguer , F. , Lepetit , V. , and Fua , P . 2007. Accurate non-iterative o(n) solution to the pnp problem . In Proceedings of the 11th IEEE International Conference on Computer Vision (ICCV'07) . IEEE, Los Alamitos, CA, 1--8. Moreno-Noguer, F., Lepetit, V., and Fua, P. 2007. Accurate non-iterative o(n) solution to the pnp problem. In Proceedings of the 11th IEEE International Conference on Computer Vision (ICCV'07). IEEE, Los Alamitos, CA, 1--8."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1344471.1344517"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the International Conference on Artificial Intelligence and Pattern Recognition (AIPR-09)","author":"Munn S. M.","unstructured":"Munn , S. M. and Pelz , J. B . 2009. Simple routines to improve feature tracks . In Proceedings of the International Conference on Artificial Intelligence and Pattern Recognition (AIPR-09) . Munn, S. M. and Pelz, J. B. 2009. Simple routines to improve feature tracks. In Proceedings of the International Conference on Artificial Intelligence and Pattern Recognition (AIPR-09)."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1394281.1394287"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/968363.968387"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0042-6989(01)00245-0"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000025798.50602.3a"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DIM.2005.4"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/968363.968388"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/355017.355028"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/968363.968368"},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of International Conference on Computer Vision. IEEE","author":"Torr P.","unstructured":"Torr , P. , Fitzgibbon , A. , and Zisserman , A . 1998. Maintaining multiple motion model hypotheses through many views to recover matching and structure . In Proceedings of International Conference on Computer Vision. IEEE , Los Alamitos, CA, 485--491. Torr, P., Fitzgibbon, A., and Zisserman, A. 1998. Maintaining multiple motion model hypotheses through many views to recover matching and structure. In Proceedings of International Conference on Computer Vision. IEEE, Los Alamitos, CA, 485--491."},{"key":"e_1_2_1_35_1","unstructured":"Trucco E. and Verri A. 1998. Introductory Techniques for 3D Computer Vision. Prentice Hall PTR Upper Saddle River NJ.   Trucco E. and Verri A. 1998. Introductory Techniques for 3D Computer Vision. Prentice Hall PTR Upper Saddle River NJ."},{"key":"e_1_2_1_36_1","volume-title":"Matlab functions for multiple view geometry. Department of Engineering Science","author":"Visual Geometry Group","unstructured":"Visual Geometry Group . 2005. Matlab functions for multiple view geometry. Department of Engineering Science , University of Oxford. http :\/\/www.robots.ox.ac.uk\/~vgg\/hzbook\/code\/. Visual Geometry Group. 2005. Matlab functions for multiple view geometry. Department of Engineering Science, University of Oxford. http:\/\/www.robots.ox.ac.uk\/~vgg\/hzbook\/code\/."}],"container-title":["ACM Transactions on Applied Perception"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1577755.1577759","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1577755.1577759","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T20:22:35Z","timestamp":1750278155000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1577755.1577759"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,8]]},"references-count":35,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2009,8]]}},"alternative-id":["10.1145\/1577755.1577759"],"URL":"https:\/\/doi.org\/10.1145\/1577755.1577759","relation":{},"ISSN":["1544-3558","1544-3965"],"issn-type":[{"value":"1544-3558","type":"print"},{"value":"1544-3965","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,8]]},"assertion":[{"value":"2009-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-05-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-09-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}