{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:29:38Z","timestamp":1750220978561,"version":"3.41.0"},"reference-count":45,"publisher":"Association for Computing Machinery (ACM)","issue":"2s","license":[{"start":{"date-parts":[[2019,4,30]],"date-time":"2019-04-30T00:00:00Z","timestamp":1556582400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"MMU-GRA Scheme, Multimedia University","award":["MMUI\/160085"],"award-info":[{"award-number":["MMUI\/160085"]}]},{"DOI":"10.13039\/501100003093","name":"Ministry of Higher Education, Malaysia","doi-asserted-by":"crossref","award":["FRGS\/1\/2018\/ICT02\/MMU\/02\/2"],"award-info":[{"award-number":["FRGS\/1\/2018\/ICT02\/MMU\/02\/2"]}],"id":[{"id":"10.13039\/501100003093","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2019,4,30]]},"abstract":"<jats:p>Aesthetics is a subjective concept that is likely to be perceived differently among people of different ages, genders, and cultural backgrounds. While techniques that directly compute this concept in images have seen increasing attention by the multimedia and machine-learning community, there are very few attempts at encoding the influences from the photographer\u2019s viewpoint. This work demonstrates how the aesthetic quality of photos can be better learned by accounting for the demographic background of a photographer. A new AVA-PD (Photographer Demographic) dataset is created to supplement the AVA dataset by providing photographers\u2019 age, gender and location attributes. 
Two deep convolutional neural network (CNN) architectures are proposed to utilize demographic information for aesthetic prediction of photos; both are shown to yield better prediction capabilities compared to most existing approaches. By leveraging on AVA-PD meta-data, we also present some additional machine-learnable tasks such as identifying the photographer and predicting photography styles from a person\u2019s gallery of photos.<\/jats:p>","DOI":"10.1145\/3328993","type":"journal-article","created":{"date-parts":[[2019,7,25]],"date-time":"2019-07-25T12:34:33Z","timestamp":1564058073000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Beauty Is in the Eye of the Beholder"],"prefix":"10.1145","volume":"15","author":[{"given":"Magzhan","family":"Kairanbay","sequence":"first","affiliation":[{"name":"Multimedia University, Cyberjaya, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3005-4109","authenticated-orcid":false,"given":"John","family":"See","sequence":"additional","affiliation":[{"name":"Multimedia University, Cyberjaya, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4517-0391","authenticated-orcid":false,"given":"Lai-Kuan","family":"Wong","sequence":"additional","affiliation":[{"name":"Multimedia University, Cyberjaya, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,7,25]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the Workshop at the European Conference on Computer Vision. 71--84","author":"Bar Yaniv","year":"2014","unstructured":"Yaniv Bar, Noga Levy, and Lior Wolf. 2014. Classification of artistic styles using binarized features derived from a deep neural network. 
In Proceedings of the Workshop at the European Conference on Computer Vision. 71--84."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1097\/SCS.0000000000000406"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0900304106"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2964284.2967251"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/11744078_23"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1291233.1291364"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995467"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-14442-9_57"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201917)","author":"Kairanbay Magzhan","year":"2017","unstructured":"Yong-Lian Hii, John See, Magzhan Kairanbay, and Lai-Kuan Wong. 2017. Multigap: Multi-pooled inception network with text augmentation for aesthetic prediction of photographs. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201917). IEEE, 1722--1726."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1469-7580.2009.01164.x"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/WCSP.2016.7752571"},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the Asian Conference on Computer Vision. Springer, 462--474","author":"Wong Lai-Kuan","year":"2016","unstructured":"Magzhan Kairanbay, John See, and Lai-Kuan Wong. 2016. Aesthetic evaluation of facial portraits using compositional augmentation for deep CNNs. In Proceedings of the Asian Conference on Computer Vision. Springer, 462--474."},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the International Conference on Multimedia Modeling. Springer, 531--543","author":"Wong Lai-Kuan","year":"2018","unstructured":"Magzhan Kairanbay, John See, and Lai-Kuan Wong. 2018. 
Towards demographic-based photographic aesthetics prediction for portraitures. In Proceedings of the International Conference on Multimedia Modeling. Springer, 531--543."},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the IEEE International Conference on Image Processing (ICIP\u201917)","author":"Wong Lai-Kuan","year":"2017","unstructured":"Magzhan Kairanbay, John See, Lai-Kuan Wong, and Yong-Lian Hii. 2017. Filling the gaps: Reducing the complexity of networks for multi-attribute image aesthetic prediction. In Proceedings of the IEEE International Conference on Image Processing (ICIP\u201917). IEEE, 3051--3055."},{"key":"e_1_2_1_15_1","volume-title":"Visual aesthetic quality assessment with multi-task deep learning. Retrieved from arXiv preprint arXiv:1604.049705","author":"Kao Yueying","year":"2016","unstructured":"Yueying Kao, Ran He, and Kaiqi Huang. 2016a. Visual aesthetic quality assessment with multi-task deep learning. Retrieved from arXiv preprint arXiv:1604.049705 (2016)."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2651399"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2016.05.004"},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Sergey Karayev Matthew Trentacoste Helen Han Aseem Agarwala Trevor Darrell Aaron Hertzmann and Holger Winnemoeller. 2013. Recognizing image style. Retrieved from: arXiv preprint arXiv:1311.3715.","DOI":"10.5244\/C.28.122"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.303"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46448-0_40"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/2999134.2999257"},{"key":"e_1_2_1_22_1","unstructured":"Challenging Technologies LLC. 2018. DpChallenge dataset. 
Retrieved from: http:\/\/www.dpchallenge.com\/."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654927"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2015.2477040"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.119"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.60"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126444"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1177\/0146167208320555"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/2354409.2354807"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1631144.1631158"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2010.5654231"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/FG.2015.7163086"},{"volume-title":"Proceedings of the IEEE International Conference on Computer Vision. 638--647","author":"Ren Jian","key":"e_1_2_1_33_1","unstructured":"Jian Ren, Xiaohui Shen, Zhe Lin, Radom\u00edr Mech, and David J. Foran. 2017. Personalized image aesthetics. In Proceedings of the IEEE International Conference on Computer Vision. 638--647."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1117\/12.387147"},{"key":"e_1_2_1_35_1","volume-title":"Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV\u201918). 2048--2057","author":"Schwarz Katharina","year":"2018","unstructured":"Katharina Schwarz, Patrick Wieschollek, and Hendrik P. A. Lensch. 2018. Will people like your image? Learning the aesthetic space. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV\u201918). 
2048--2057."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2710631"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1631272.1631351"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.308"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.380"},{"key":"e_1_2_1_40_1","unstructured":"Stanford University. 2018. ImageNet dataset. Retrieved from: http:\/\/www.image-net.org\/."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/3304415.3304551"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2016.05.009"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2017.7965953"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.5555\/1818719.1819008"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2014.2303650"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3328993","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3328993","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:54:41Z","timestamp":1750204481000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3328993"}},"subtitle":["Demographically Oriented Analysis of Aesthetics in 
Photographs"],"short-title":[],"issued":{"date-parts":[[2019,4,30]]},"references-count":45,"journal-issue":{"issue":"2s","published-print":{"date-parts":[[2019,4,30]]}},"alternative-id":["10.1145\/3328993"],"URL":"https:\/\/doi.org\/10.1145\/3328993","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2019,4,30]]},"assertion":[{"value":"2018-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-04-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-07-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}