{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,2]],"date-time":"2026-01-02T07:45:35Z","timestamp":1767339935723,"version":"3.41.0"},"reference-count":56,"publisher":"Association for Computing Machinery (ACM)","issue":"1s","license":[{"start":{"date-parts":[[2021,1,31]],"date-time":"2021-01-31T00:00:00Z","timestamp":1612051200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2021,1,31]]},"abstract":"<jats:p>Social scientists have shown evidence that visual perceptions of urban attributes, such as safe, wealthy, and beautiful perspectives of the given cities, are highly correlated to the residents\u2019 behaviors and quality of life. Despite their significance, measuring visual perceptions of urban attributes is challenging due to the following facts: (1) Visual perceptions are subjectively contradistinctive rather than absolute. (2) Perception comparisons between image pairs are usually conducted region by region, and highly related to the specific urban attributes. And (3) the urban attributes have both the shared and specific information. To address these problems, in this article, we present a Deep inteRActive Multi-task leArning scheme, DRAMA for short. DRAMA comparatively quantifies the perceptions of urban attributes by jointly integrating the pairwise comparisons, regional interactions, and urban attribute correlations within a unified deep scheme. In DRAMA, each urban attribute is treated as a task, whereby the task-sharing and the task-specific information is fully explored. By conducting extensive experiments over a public large-scale benchmark dataset, it is demonstrated that our proposed DRAMA scheme outperforms several state-of-the-art baselines. Meanwhile, we applied the pairwise comparisons of our DRAMA model to further quantify the urban attributes and hence rank cities with respect to the given urban attributes. As a byproduct, we have released the codes and parameter settings to facilitate other researches.<\/jats:p>","DOI":"10.1145\/3424115","type":"journal-article","created":{"date-parts":[[2021,4,1]],"date-time":"2021-04-01T01:53:55Z","timestamp":1617242035000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["Urban Perception: Sensing Cities via a Deep Interactive Multi-task Learning Framework"],"prefix":"10.1145","volume":"17","author":[{"given":"Weili","family":"Guan","sequence":"first","affiliation":[{"name":"Monash University, Melbourne, Australia"}]},{"given":"Zhaozheng","family":"Chen","sequence":"additional","affiliation":[{"name":"Singapore Management University, Singapore"}]},{"given":"Fuli","family":"Feng","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore"}]},{"given":"Weifeng","family":"Liu","sequence":"additional","affiliation":[{"name":"China University of Petroleum (East China), China"}]},{"given":"Liqiang","family":"Nie","sequence":"additional","affiliation":[{"name":"Shandong University, China"}]}],"member":"320","published-online":{"date-parts":[[2021,3,31]]},"reference":[{"key":"e_1_2_1_1_1","first-page":"29","article-title":"Broken windows: The police and neighborhood safety","volume":"249","author":"Wilson James Q.","year":"1982","unstructured":"James Q. Wilson . 1982 . Broken windows: The police and neighborhood safety . Atlan. Month. 249 , 2 (1982), 29 \u2013 38 . James Q. Wilson. 1982. Broken windows: The police and neighborhood safety. Atlan. Month. 249, 2 (1982), 29\u201338.","journal-title":"Atlan. Month."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1126\/science.1161405"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11256-010-0165-7"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.2105\/AJPH.93.3.467"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1136\/jech.2005.042697"},{"key":"e_1_2_1_6_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_2_1_7_1","volume-title":"HyperFace: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition","author":"Ranjan Rajeev","year":"2016","unstructured":"Rajeev Ranjan , Vishal M. Patel , and Rama Chellappa . 2016. HyperFace: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition . IEEE Trans. Pattern Anal. Mach. Intell . ( 2016 ), 1\u20131. https:\/\/pubmed.ncbi.nlm.nih.gov\/29990235\/. Rajeev Ranjan, Vishal M. Patel, and Rama Chellappa. 2016. HyperFace: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans. Pattern Anal. Mach. Intell. (2016), 1\u20131. https:\/\/pubmed.ncbi.nlm.nih.gov\/29990235\/."},{"volume-title":"Proceedings of the European Conference on Computer Vision. IEEE, 196\u2013212","author":"Dubey Abhimanyu","key":"e_1_2_1_8_1","unstructured":"Abhimanyu Dubey , Nikhil Naik , Devi Parikh , Ramesh Raskar , and C\u00e9sar A. Hidalgo . 2016. Deep learning the city: Quantifying urban perception at a global scale . In Proceedings of the European Conference on Computer Vision. IEEE, 196\u2013212 . Abhimanyu Dubey, Nikhil Naik, Devi Parikh, Ramesh Raskar, and C\u00e9sar A. Hidalgo. 2016. Deep learning the city: Quantifying urban perception at a global scale. In Proceedings of the European Conference on Computer Vision. IEEE, 196\u2013212."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1068\/p251203"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.35632\/ajis.v16i2.2126"},{"key":"e_1_2_1_11_1","volume-title":"Maloney","author":"Mamassian Pascal","year":"2002","unstructured":"Pascal Mamassian , Michael Landy , and Laurence T . Maloney . 2002 . Bayesian modelling of visual perception. Probabil. Mod. Brain ( 2002), 13\u201336. https:\/\/psycnet.apa.org\/record\/2002-02646-001. Pascal Mamassian, Michael Landy, and Laurence T. Maloney. 2002. Bayesian modelling of visual perception. Probabil. Mod. Brain (2002), 13\u201336. https:\/\/psycnet.apa.org\/record\/2002-02646-001."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00503"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0068400"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the International ACM SIGIR Conference. ACM, 76\u201379","author":"Maggini Marco","year":"2008","unstructured":"Marco Maggini , Franco Scarselli , Leonardo Rigutini , and Tiziano Papini . 2008 . SortNet: Learning to rank by a neural-based sorting algorithm . In Proceedings of the International ACM SIGIR Conference. ACM, 76\u201379 . Marco Maggini, Franco Scarselli, Leonardo Rigutini, and Tiziano Papini. 2008. SortNet: Learning to rank by a neural-based sorting algorithm. In Proceedings of the International ACM SIGIR Conference. ACM, 76\u201379."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1102351.1102363"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277808"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/945365.964285"},{"key":"e_1_2_1_18_1","unstructured":"Ralf Herbrich. 2000. Large margin rank boundaries for ordinal regression. Adv. Large Marg. Classif. (2000) 115\u2013132. https:\/\/www.bibsonomy.org\/bibtex\/c1aab52010073f7f01771dabde1e5b9a.  Ralf Herbrich. 2000. Large margin rank boundaries for ordinal regression. Adv. Large Marg. Classif. (2000) 115\u2013132. https:\/\/www.bibsonomy.org\/bibtex\/c1aab52010073f7f01771dabde1e5b9a."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277792"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2017.2700206"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01105"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123270"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123424"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123382"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2733373.2806217"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2964284.2967271"},{"key":"e_1_2_1_27_1","volume-title":"A survey on multi-task learning. arXiv preprint arXiv:1707.08114","author":"Zhang Yu","year":"2017","unstructured":"Yu Zhang and Qiang Yang . 2017. A survey on multi-task learning. arXiv preprint arXiv:1707.08114 ( 2017 ). Yu Zhang and Qiang Yang. 2017. A survey on multi-task learning. arXiv preprint arXiv:1707.08114 (2017)."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2390959"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2495116"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2014.2331755"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2762591"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2016.2543099"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2016.2524212"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2932502"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2014.2302684"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2011.2163522"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2456502"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2010.2057438"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123277"},{"key":"e_1_2_1_40_1","first-page":"291","article-title":"Reflecting on the subject: A critique of the social influence conception of deterrence, the broken windows theory, and order-maintenance policing New York style","volume":"97","author":"Harcourt Bernard E.","year":"1998","unstructured":"Bernard E. Harcourt . 1998 . Reflecting on the subject: A critique of the social influence conception of deterrence, the broken windows theory, and order-maintenance policing New York style . Soc. Sci. Electron. Publish. 97 , 2 (1998), 291 \u2013 389 . Bernard E. Harcourt. 1998. Reflecting on the subject: A critique of the social influence conception of deterrence, the broken windows theory, and order-maintenance policing New York style. Soc. Sci. Electron. Publish. 97, 2 (1998), 291\u2013389.","journal-title":"Soc. Sci. Electron. Publish."},{"key":"e_1_2_1_41_1","volume-title":"There are no cracks in the broken windows. Nat. Rev. 28","author":"Bratton William","year":"2006","unstructured":"William Bratton and George Kelling . 2006. There are no cracks in the broken windows. Nat. Rev. 28 ( 2006 ). William Bratton and George Kelling. 2006. There are no cracks in the broken windows. Nat. Rev. 28 (2006)."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.2105\/AJPH.90.2.230"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.2307\/3090214"},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","first-page":"e0119352","DOI":"10.1371\/journal.pone.0119352","article-title":"Correction: The collaborative image of the city: Mapping the inequality of urban perception","volume":"10","author":"Staff Plos One","year":"2015","unstructured":"Plos One Staff . 2015 . Correction: The collaborative image of the city: Mapping the inequality of urban perception . PLoS One 10 , 3 (2015), e0119352 . Plos One Staff. 2015. Correction: The collaborative image of the city: Mapping the inequality of urban perception. PLoS One 10, 3 (2015), e0119352.","journal-title":"PLoS One"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2014.121"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1257\/aer.p20161030"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jue.2015.12.002"},{"key":"e_1_2_1_48_1","volume-title":"Ramesh Raskar, Edward L. Glaeser, and C\u00e9sar Hidalgo.","author":"Naik Nikhil","year":"2015","unstructured":"Nikhil Naik , Scott Duke Kominers , Ramesh Raskar, Edward L. Glaeser, and C\u00e9sar Hidalgo. 2015 . Preserving history or restricting development? The co-evolution of physical, social, and economic change in five major U.S. cities. Soc. Sci. Electron. Pub . (2015). https:\/\/www.hbs.edu\/faculty\/Pages\/item.aspx?num=50631. Nikhil Naik, Scott Duke Kominers, Ramesh Raskar, Edward L. Glaeser, and C\u00e9sar Hidalgo. 2015. Preserving history or restricting development? The co-evolution of physical, social, and economic change in five major U.S. cities. Soc. Sci. Electron. Pub. (2015). https:\/\/www.hbs.edu\/faculty\/Pages\/item.aspx?num=50631."},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.landurbplan.2015.05.007"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-05716-9_3"},{"key":"e_1_2_1_51_1","volume-title":"An overview of multi-task learning in deep neural networks. arXiv preprint arXiv:1706.05098","author":"Ruder Sebastian","year":"2017","unstructured":"Sebastian Ruder . 2017. An overview of multi-task learning in deep neural networks. arXiv preprint arXiv:1706.05098 ( 2017 ). Sebastian Ruder. 2017. An overview of multi-task learning in deep neural networks. arXiv preprint arXiv:1706.05098 (2017)."},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2466106"},{"key":"e_1_2_1_53_1","volume-title":"Proceedings of the International Conference on Neural Information Processing Systems, Workshop on Machine Learning Systems.","author":"Chen Tianqi","year":"2015","unstructured":"Tianqi Chen , Mu Li , Yutian Li , Min Lin , Naiyan Wang , Minjie Wang , Tianjun Xiao , Bing Xu , Chiyuan Zhang , and Zheng Zhang . 2015 . MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems . In Proceedings of the International Conference on Neural Information Processing Systems, Workshop on Machine Learning Systems. Tianqi Chen, Mu Li, Yutian Li, Min Lin, Naiyan Wang, Minjie Wang, Tianjun Xiao, Bing Xu, Chiyuan Zhang, and Zheng Zhang. 2015. MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems. In Proceedings of the International Conference on Neural Information Processing Systems, Workshop on Machine Learning Systems."},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.5555\/2976456.2976528"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.5555\/2999325.2999411"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/582415.582418"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3424115","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3424115","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:01:51Z","timestamp":1750197711000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3424115"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,31]]},"references-count":56,"journal-issue":{"issue":"1s","published-print":{"date-parts":[[2021,1,31]]}},"alternative-id":["10.1145\/3424115"],"URL":"https:\/\/doi.org\/10.1145\/3424115","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2021,1,31]]},"assertion":[{"value":"2020-01-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-09-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-03-31","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}