{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,5]],"date-time":"2026-05-05T06:18:43Z","timestamp":1777961923714,"version":"3.51.4"},"reference-count":24,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2005,8,1]],"date-time":"2005-08-01T00:00:00Z","timestamp":1122854400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2005,8]]},"abstract":"<jats:p>Organizing digital photograph collections according to events such as holiday gatherings or vacations is a common practice among photographers. To support photographers in this task, we present similarity-based methods to cluster digital photos by time and image content. The approach is general and unsupervised, and makes minimal assumptions regarding the structure or statistics of the photo collection. We present several variants of an automatic unsupervised algorithm to partition a collection of digital photographs based either on temporal similarity alone, or on temporal and content-based similarity. First, interphoto similarity is quantified at multiple temporal scales to identify likely event clusters. Second, the final clusters are determined according to one of three clustering goodness criteria. The clustering criteria trade off computational complexity and performance. We also describe a supervised clustering method based on learning vector quantization. Finally, we review the results of an experimental evaluation of the proposed algorithms and existing approaches on two test collections.<\/jats:p>","DOI":"10.1145\/1083314.1083317","type":"journal-article","created":{"date-parts":[[2005,11,7]],"date-time":"2005-11-07T16:00:45Z","timestamp":1131379245000},"page":"269-288","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":126,"title":["Temporal event clustering for digital photo collections"],"prefix":"10.1145","volume":"1","author":[{"given":"Matthew","family":"Cooper","sequence":"first","affiliation":[{"name":"FX Palo Alto Laboratory, Palo Alto, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jonathan","family":"Foote","sequence":"additional","affiliation":[{"name":"FX Palo Alto Laboratory, Palo Alto, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andreas","family":"Girgensohn","sequence":"additional","affiliation":[{"name":"FX Palo Alto Laboratory, Palo Alto, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lynn","family":"Wilcox","sequence":"additional","affiliation":[{"name":"FX Palo Alto Laboratory, Palo Alto, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2005,8]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Boreczky J. and Rowe L. 1996. Comparison of video shot boundary detection techniques. In SPIE Storage and Retrieval for Image and Video Databases. SPIE Press Bellingham WA 170--179.  Boreczky J. and Rowe L. 1996. Comparison of video shot boundary detection techniques. In SPIE Storage and Retrieval for Image and Video Databases. SPIE Press Bellingham WA 170--179.","DOI":"10.1117\/12.234794"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the 11th ACM International Conference on Multimedia. ACM Press","author":"Cooper M.","unstructured":"Cooper , M. , Foote , J. , Girgensohn , A. , and Wilcox , L . 2003. Temporal event clustering for digital photo collections . In Proceedings of the 11th ACM International Conference on Multimedia. ACM Press , New York, NY, 364--373. 10.1145\/957013.957093 Cooper, M., Foote, J., Girgensohn, A., and Wilcox, L. 2003. Temporal event clustering for digital photo collections. In Proceedings of the 11th ACM International Conference on Multimedia. ACM Press, New York, NY, 364--373. 10.1145\/957013.957093"},{"key":"e_1_2_1_3_1","unstructured":"Duda R. and Hart P. 1973. Pattern Classification and Scene Analysis. Wiley-Interscience New York NY.   Duda R. and Hart P. 1973. Pattern Classification and Scene Analysis. Wiley-Interscience New York NY."},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo. IEEE, Computer Society Press","author":"Foote J.","year":"2000","unstructured":"Foote , J. 2000 . Automatic audio segmentation using a measure of audio novelty . In Proceedings of the IEEE International Conference on Multimedia and Expo. IEEE, Computer Society Press , Los Alamitos, CA, 452--55. Foote, J. 2000. Automatic audio segmentation using a measure of audio novelty. In Proceedings of the IEEE International Conference on Multimedia and Expo. IEEE, Computer Society Press, Los Alamitos, CA, 452--55."},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the ACM Conference on CSCW. ACM Press","author":"Frohlich D.","unstructured":"Frohlich , D. , Kuchinsky , A. , Pering , C. , Don , A. , and Ariss , S . 2002. Requirements for photoware . In Proceedings of the ACM Conference on CSCW. ACM Press , New York, NY, 166--175. 10.1145\/587078.587102 Frohlich, D., Kuchinsky, A., Pering, C., Don, A., and Ariss, S. 2002. Requirements for photoware. In Proceedings of the ACM Conference on CSCW. ACM Press, New York, NY, 166--175. 10.1145\/587078.587102"},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the 5th ACM SIGMM Workshop on Multimedia Information Retrieval. ACM Press","author":"Gargi U.","year":"2003","unstructured":"Gargi , U. 2003 . Modeling and clustering of photo capture streams . In Proceedings of the 5th ACM SIGMM Workshop on Multimedia Information Retrieval. ACM Press , New York, NY, 47--54. 10.1145\/973264.973273 Gargi, U. 2003. Modeling and clustering of photo capture streams. In Proceedings of the 5th ACM SIGMM Workshop on Multimedia Information Retrieval. ACM Press, New York, NY, 47--54. 10.1145\/973264.973273"},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of Human-Computer Interaction INTERACT '03","author":"Girgensohn A.","unstructured":"Girgensohn , A. , Adcock , J. , Cooper , M. , Foote , J. , and Wilcox , L . 2003. Simplifying the management of large photo collections . In Proceedings of Human-Computer Interaction INTERACT '03 . IOS Press, Amsterdam, The Netherlands, 196--203. Girgensohn, A., Adcock, J., Cooper, M., Foote, J., and Wilcox, L. 2003. Simplifying the management of large photo collections. In Proceedings of Human-Computer Interaction INTERACT '03. IOS Press, Amsterdam, The Netherlands, 196--203."},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the Joint Conference on Digital Libraries. ACM Press","author":"Graham A.","unstructured":"Graham , A. , Garcia-Molina , H. , Paepcke , A. , and Winograd , T . 2002. Time as the essence for photo browsing through personal digital libraries . In Proceedings of the Joint Conference on Digital Libraries. ACM Press , New York, NY, 326--35. 10.1145\/544220.544301 Graham, A., Garcia-Molina, H., Paepcke, A., and Winograd, T. 2002. Time as the essence for photo browsing through personal digital libraries. In Proceedings of the Joint Conference on Digital Libraries. ACM Press, New York, NY, 326--35. 10.1145\/544220.544301"},{"key":"e_1_2_1_9_1","volume-title":"Clustering Algorithms","author":"Hartigan J.","unstructured":"Hartigan , J. 1975. Clustering Algorithms . Wiley & Sons , New York, NY . Hartigan, J. 1975. Clustering Algorithms. Wiley & Sons, New York, NY."},{"key":"e_1_2_1_10_1","volume-title":"IEEE International Conference on Image Processing","volume":"2","author":"Jaimes A.","unstructured":"Jaimes , A. , Benitez , A. B. , Chang , S.-F. , and Loui , A. C . 2000. Discovering recurrent visual semantics in consumer photographs . In IEEE International Conference on Image Processing , Vol. 2 . IEEE Press, Los Alamitos, CA, 528--531. Jaimes, A., Benitez, A. B., Chang, S.-F., and Loui, A. C. 2000. Discovering recurrent visual semantics in consumer photographs. In IEEE International Conference on Image Processing, Vol. 2. IEEE Press, Los Alamitos, CA, 528--531."},{"key":"e_1_2_1_11_1","volume-title":"Digital Still Camera Image File Format Standard","author":"Jeida","unstructured":"Jeida . 1998. Digital Still Camera Image File Format Standard . Japan Electronic Industry Development Association , Tokyo, Japan . Jeida. 1998. Digital Still Camera Image File Format Standard. Japan Electronic Industry Development Association, Tokyo, Japan."},{"key":"e_1_2_1_12_1","volume-title":"Self-Organization and Associative Memory","author":"Kohonen T.","unstructured":"Kohonen , T. 1989. Self-Organization and Associative Memory . Springer-Verlag , Berlin, Germany . Kohonen, T. 1989. Self-Organization and Associative Memory. Springer-Verlag, Berlin, Germany."},{"key":"e_1_2_1_13_1","volume-title":"International Joint Conference on Neural Networks. ACM Press","author":"Kohonen T.","unstructured":"Kohonen , T. , Kangas , J. , Laaksonen , J. , and Torkkola , K . 1992. Lvq pak: A program package for the correct application of learning vector quantization algorithms . In International Joint Conference on Neural Networks. ACM Press , New York, NY, 725--730. Kohonen, T., Kangas, J., Laaksonen, J., and Torkkola, K. 1992. Lvq pak: A program package for the correct application of learning vector quantization algorithms. In International Joint Conference on Neural Networks. ACM Press, New York, NY, 725--730."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.895974"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2003.1237548"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2003.814723"},{"key":"e_1_2_1_17_1","volume-title":"Shoebox: A digital photo management system. In Technical Report","author":"Mills T.","year":"2000","unstructured":"Mills , T. , Pye , D. , Sinclair , D. , and Wood , K . 2000 . Shoebox: A digital photo management system. In Technical Report 2000.10. AT&T Laboratories Cambridge, Cambridge, U.K. Mills, T., Pye, D., Sinclair, D., and Wood, K. 2000. Shoebox: A digital photo management system. In Technical Report 2000.10. AT&T Laboratories Cambridge, Cambridge, U.K."},{"key":"e_1_2_1_18_1","volume-title":"Isee: Perceptual features for image library navigation. In SPIE Human Vision and Electronic Imaging","author":"Mojsilovic A.","year":"2002","unstructured":"Mojsilovic , A. , Gomes , J. , and Rogowitz , B . 2002 . Isee: Perceptual features for image library navigation. In SPIE Human Vision and Electronic Imaging . SPIE Press , Bellingham, WA , 266--277. Mojsilovic, A., Gomes, J., and Rogowitz, B. 2002. Isee: Perceptual features for image library navigation. In SPIE Human Vision and Electronic Imaging. SPIE Press, Bellingham, WA, 266--277."},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the Joint Conference on Digital Libraries. ACM Press","author":"Naaman M.","unstructured":"Naaman , M. , Song , Y. J. , Paepcke , A. , and Garcia-Molina , H . 2004. Automatic organization for digital photographs with geographic coordinates . In Proceedings of the Joint Conference on Digital Libraries. ACM Press , New York, NY, 326--35. 10.1145\/996350.996366 Naaman, M., Song, Y. J., Paepcke, A., and Garcia-Molina, H. 2004. Automatic organization for digital photographs with geographic coordinates. In Proceedings of the Joint Conference on Digital Libraries. ACM Press, New York, NY, 326--35. 10.1145\/996350.996366"},{"key":"e_1_2_1_20_1","volume-title":"Fourth IEEE Pacific Rim Conference on Multimedia. IEEE Press","author":"Platt J.","unstructured":"Platt , J. , Czerwinski , M. , and Field , B . 2003. Simplifying the management of large photo collections . In Fourth IEEE Pacific Rim Conference on Multimedia. IEEE Press , Los Alamitos, CA, 6--10. Platt, J., Czerwinski, M., and Field, B. 2003. Simplifying the management of large photo collections. In Fourth IEEE Pacific Rim Conference on Multimedia. IEEE Press, Los Alamitos, CA, 6--10."},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI). ACM Press","author":"Rodden K.","unstructured":"Rodden , K. and Wood , K . 2003. How do people manage their digital photographs? In Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI). ACM Press , New York, NY, 409--416. 10.1145\/642611.642682 Rodden, K. and Wood, K. 2003. How do people manage their digital photographs? In Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI). ACM Press, New York, NY, 409--416. 10.1145\/642611.642682"},{"key":"e_1_2_1_23_1","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1214\/aos\/1176344136","article-title":"Estimating the dimension of a model","volume":"6","author":"Schwarz G.","year":"1978","unstructured":"Schwarz , G. 1978 . Estimating the dimension of a model . Ann. Statist. 6 , 461 -- 464 . Schwarz, G. 1978. Estimating the dimension of a model. Ann. Statist. 6, 461--64.","journal-title":"Ann. Statist."},{"key":"e_1_2_1_24_1","volume-title":"ACM International Conference on Multimedia. ACM Press","author":"Slaney M.","unstructured":"Slaney , M. , Ponceleon , D. , and Kaufman , J . 2001. Multimedia edges: Finding hierarchy in all dimensions . In ACM International Conference on Multimedia. ACM Press , New York, NY, 29--40. 10.1145\/500141.500149 Slaney, M., Ponceleon, D., and Kaufman, J. 2001. Multimedia edges: Finding hierarchy in all dimensions. In ACM International Conference on Multimedia. ACM Press, New York, NY, 29--40. 10.1145\/500141.500149"},{"key":"e_1_2_1_25_1","volume-title":"IEEE ICASSP","volume":"9","author":"Witkin A.","year":"1984","unstructured":"Witkin , A. 1984 . Scale-space filtering: A new approach to multi-scale description . In IEEE ICASSP , Vol. 9 . IEEE Press, Los Alamitos, CA, 150--153. Witkin, A. 1984. Scale-space filtering: A new approach to multi-scale description. In IEEE ICASSP, Vol. 9. IEEE Press, Los Alamitos, CA, 150--153."}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1083314.1083317","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1083314.1083317","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T16:08:20Z","timestamp":1750262900000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1083314.1083317"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,8]]},"references-count":24,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2005,8]]}},"alternative-id":["10.1145\/1083314.1083317"],"URL":"https:\/\/doi.org\/10.1145\/1083314.1083317","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2005,8]]},"assertion":[{"value":"2005-08-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}