{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,4]],"date-time":"2025-12-04T09:51:11Z","timestamp":1764841871661,"version":"3.41.0"},"reference-count":58,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2016,7,11]],"date-time":"2016-07-11T00:00:00Z","timestamp":1468195200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2016,7,11]]},"abstract":"<jats:p>People often take a series of nearly redundant pictures to capture a moment or scene. However, selecting photos to keep or share from a large collection is a painful chore. To address this problem, we seek a relative quality measure within a series of photos taken of the same scene, which can be used for automatic photo triage. Towards this end, we gather a large dataset comprised of photo series distilled from personal photo albums. The dataset contains 15, 545 unedited photos organized in 5,953 series. By augmenting this dataset with ground truth human preferences among photos within each series, we establish a benchmark for measuring the effectiveness of algorithmic models of how people select photos. We introduce several new approaches for modeling human preference based on machine learning. We also describe applications for the dataset and predictor, including a smart album viewer, automatic photo enhancement, and providing overviews of video clips.<\/jats:p>","DOI":"10.1145\/2897824.2925908","type":"journal-article","created":{"date-parts":[[2016,7,11]],"date-time":"2016-07-11T16:04:33Z","timestamp":1468253073000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":39,"title":["Automatic triage for a photo series"],"prefix":"10.1145","volume":"35","author":[{"given":"Huiwen","family":"Chang","sequence":"first","affiliation":[{"name":"Princeton University"}]},{"given":"Fisher","family":"Yu","sequence":"additional","affiliation":[{"name":"Princeton University"}]},{"given":"Jue","family":"Wang","sequence":"additional","affiliation":[{"name":"Adobe Research"}]},{"given":"Douglas","family":"Ashley","sequence":"additional","affiliation":[{"name":"Princeton University"}]},{"given":"Adam","family":"Finkelstein","sequence":"additional","affiliation":[{"name":"Princeton University"}]}],"member":"320","published-online":{"date-parts":[[2016,7,11]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766959"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1873951.1873990"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001493000339"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995413"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-013-0667-3"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.202"},{"key":"e_1_2_2_8_1","doi-asserted-by":"crossref","unstructured":"Cootes T. F. Edwards G. J. and Taylor C. J. 1998. Active appearance models. In Computer Vision?ECCV?98. Springer 484--498.   Cootes T. F. Edwards G. J. and Taylor C. J. 1998. Active appearance models. In Computer Vision?ECCV?98. Springer 484--498.","DOI":"10.1007\/BFb0054760"},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/11744078_23"},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995467"},{"volume-title":"Photo-triage: Rapidly annotating your digital photographs. Tech. rep., Microsoft Research Technical Report, MSR-TR-2003-99.","year":"2003","author":"Drucker S.","key":"e_1_2_2_11_1"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-8659.2012.03212.x"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964965"},{"key":"e_1_2_2_16_1","doi-asserted-by":"crossref","unstructured":"Hariharan B. Arbel\u00e1ez P. Girshick R. and Malik J. 2014. Hypercolumns for object segmentation and fine-grained localization. arXiv preprint arXiv:1411.5752.  Hariharan B. Arbel\u00e1ez P. Girshick R. and Malik J. 2014. Hypercolumns for object segmentation and fine-grained localization. arXiv preprint arXiv:1411.5752.","DOI":"10.1109\/CVPR.2015.7298642"},{"key":"e_1_2_2_17_1","doi-asserted-by":"crossref","unstructured":"He K. Zhang X. Ren S. and Sun J. 2015. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385.  He K. Zhang X. Ren S. and Sun J. 2015. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385.","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/383259.383295"},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1866029.1866066"},{"volume-title":"IEEE International Conference on Computer Vision (ICCV), IEEE.","author":"Judd T.","key":"e_1_2_2_20_1"},{"key":"e_1_2_2_21_1","doi-asserted-by":"crossref","unstructured":"Karayev S. Trentacoste M. Han H. Agarwala A. Darrell T. Hertzmann A. and Winnemoeller H. 2013. Recognizing image style. arXiv preprint arXiv:1311.3715.  Karayev S. Trentacoste M. Han H. Agarwala A. Darrell T. Hertzmann A. and Winnemoeller H. 2013. Recognizing image style. arXiv preprint arXiv:1311.3715.","DOI":"10.5244\/C.28.122"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-8659.2012.03225.x"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.303"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.275"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357127"},{"key":"e_1_2_2_26_1","unstructured":"Krizhevsky A. Sutskever I. and Hinton G. E. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems 1097--1105.  Krizhevsky A. Sutskever I. and Hinton G. E. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems 1097--1105."},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-8659.2009.01616.x"},{"key":"e_1_2_2_28_1","doi-asserted-by":"crossref","unstructured":"Long J. Shelhamer E. and Darrell T. 2014. Fully convolutional networks for semantic segmentation. arXiv preprint arXiv:1411.4038.  Long J. Shelhamer E. and Darrell T. 2014. Fully convolutional networks for semantic segmentation. arXiv preprint arXiv:1411.4038.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654927"},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88690-7_29"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126498"},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/641007.641116"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126444"},{"key":"e_1_2_2_35_1","unstructured":"Megvii Inc. 2013. Face++ research toolkit. www.faceplusplus.com.  Megvii Inc. 2013. Face++ research toolkit. www.faceplusplus.com."},{"volume-title":"Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, IEEE, 2408--2415","author":"Murray N.","key":"e_1_2_2_36_1"},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995539"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1011139631724"},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/355984.355989"},{"volume-title":"Image Processing (ICIP), 2012 19th IEEE International Conference on, IEEE, 2741--2744","author":"Park J.","key":"e_1_2_2_40_1"},{"key":"e_1_2_2_41_1","doi-asserted-by":"crossref","unstructured":"Ralph Allan Bradley M. E. T. 1952. Rank analysis of incomplete block designs: I. the method of paired comparisons. Biometrika 39 3\/4 324--345.  Ralph Allan Bradley M. E. T. 1952. Rank analysis of incomplete block designs: I. the method of paired comparisons. Biometrika 39 3\/4 324--345.","DOI":"10.1093\/biomet\/39.3-4.324"},{"volume-title":"Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on, IEEE, 10--17","author":"Ren X.","key":"e_1_2_2_42_1"},{"key":"e_1_2_2_43_1","unstructured":"Ren S. He K. Girshick R. and Sun J. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in Neural Information Processing Systems 91--99.   Ren S. He K. Girshick R. and Sun J. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in Neural Information Processing Systems 91--99."},{"key":"e_1_2_2_44_1","unstructured":"Ren S. He K. Girshick R. B. Zhang X. and Sun J. 2015. Object detection networks on convolutional feature maps. CoRR abs\/1504.06066.  Ren S. He K. Girshick R. B. Zhang X. and Sun J. 2015. Object detection networks on convolutional feature maps. CoRR abs\/1504.06066."},{"key":"e_1_2_2_45_1","doi-asserted-by":"crossref","unstructured":"Simon I. Snavely N. and Seitz S. M. 2007. Scene summarization for online image collections. In ICCV IEEE.  Simon I. Snavely N. and Seitz S. M. 2007. Scene summarization for online image collections. In ICCV IEEE.","DOI":"10.1109\/ICCV.2007.4408863"},{"key":"e_1_2_2_46_1","unstructured":"Simonyan K. and Zisserman A. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.  Simonyan K. and Zisserman A. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556."},{"key":"e_1_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/1991996.1992000"},{"key":"e_1_2_2_48_1","doi-asserted-by":"crossref","unstructured":"Szegedy C. Liu W. Jia Y. Sermanet P. Reed S. Anguelov D. Erhan D. Vanhoucke V. and Rabinovich A. 2014. Going deeper with convolutions. arXiv preprint arXiv:1409.4842.  Szegedy C. Liu W. Jia Y. Sermanet P. Reed S. Anguelov D. Erhan D. Vanhoucke V. and Rabinovich A. 2014. Going deeper with convolutions. arXiv preprint arXiv:1409.4842.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_2_2_49_1","doi-asserted-by":"crossref","unstructured":"Szegedy C. Vanhoucke V. Ioffe S. Shlens J. and Wojna Z. 2015. Rethinking the inception architecture for computer vision. arXiv preprint arXiv:1512.00567.  Szegedy C. Vanhoucke V. Ioffe S. Shlens J. and Wojna Z. 2015. Rethinking the inception architecture for computer vision. arXiv preprint arXiv:1512.00567.","DOI":"10.1109\/CVPR.2016.308"},{"key":"e_1_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995446"},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2013.71"},{"volume-title":"Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, IEEE, 1098--1105","author":"Ye P.","key":"e_1_2_2_52_1"},{"key":"e_1_2_2_53_1","unstructured":"Yu F. and Koltun V. 2015. Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122.  Yu F. and Koltun V. 2015. Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122."},{"volume-title":"Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365.","year":"2015","author":"Yu F.","key":"e_1_2_2_54_1"},{"key":"e_1_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33765-9_55"},{"key":"e_1_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2012.2223226"},{"key":"e_1_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2013.58"},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661287"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2897824.2925908","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2897824.2925908","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:55:03Z","timestamp":1750222503000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2897824.2925908"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,7,11]]},"references-count":58,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2016,7,11]]}},"alternative-id":["10.1145\/2897824.2925908"],"URL":"https:\/\/doi.org\/10.1145\/2897824.2925908","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"type":"print","value":"0730-0301"},{"type":"electronic","value":"1557-7368"}],"subject":[],"published":{"date-parts":[[2016,7,11]]},"assertion":[{"value":"2016-07-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}