{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T06:58:49Z","timestamp":1758265129745,"version":"3.41.0"},"reference-count":52,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2017,8,12]],"date-time":"2017-08-12T00:00:00Z","timestamp":1502496000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Regional Administration of Sardinia, Italy"},{"name":"Advanced and secure sharing of multimedia data over social networks in the future Internet","award":["CUP F71J11000690002"],"award-info":[{"award-number":["CUP F71J11000690002"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2017,11,30]]},"abstract":"<jats:p>\n            In this article, we present a novel framework that can produce a visual description of a tourist attraction by choosing the most diverse pictures from community-contributed datasets, which describe different details of the queried location. The main strength of the proposed approach is its flexibility that permits us to filter out non-relevant images and to obtain a reliable set of diverse and relevant images by first clustering similar images according to their textual descriptions and their visual content and then extracting images from different clusters according to a measure of the user\u2019s credibility. Clustering is based on a two-step process, where textual descriptions are used first and the clusters are then refined according to the visual features. The degree of diversification can be further increased by exploiting users\u2019 judgments on the results produced by the proposed algorithm through a novel approach, where users not only provide a\n            <jats:italic>relevance<\/jats:italic>\n            feedback but also a\n            <jats:italic>diversity<\/jats:italic>\n            feedback. Experimental results performed on the MediaEval 2015 \u201cRetrieving Diverse Social Images\u201d dataset show that the proposed framework can achieve very good performance both in the case of automatic retrieval of diverse images and in the case of the exploitation of the users\u2019 feedback. The effectiveness of the proposed approach has been also confirmed by a small case study involving a number of real users.\n          <\/jats:p>","DOI":"10.1145\/3103613","type":"journal-article","created":{"date-parts":[[2017,8,14]],"date-time":"2017-08-14T12:24:33Z","timestamp":1502713473000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["Multimodal Retrieval with Diversification and Relevance Feedback for Tourist Attraction Images"],"prefix":"10.1145","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2761-2213","authenticated-orcid":false,"given":"Duc-Tien","family":"Dang-Nguyen","sequence":"first","affiliation":[{"name":"University of Trento and Dublin City University, Dublin, Ireland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luca","family":"Piras","sequence":"additional","affiliation":[{"name":"University of Cagliari"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Giorgio","family":"Giacinto","sequence":"additional","affiliation":[{"name":"University of Cagliari"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Giulia","family":"Boato","sequence":"additional","affiliation":[{"name":"University of Trento"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Francesco G. B. DE","family":"Natale","sequence":"additional","affiliation":[{"name":"University of Trento"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,8,12]]},"reference":[{"volume-title":"Cluster Analysis for Applications","author":"Anderberg M. R.","unstructured":"M. R. Anderberg . 1973. Cluster Analysis for Applications . Academic Press . M. R. Anderberg. 1973. Cluster Analysis for Applications. Academic Press.","key":"e_1_2_1_1_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_2_1","DOI":"10.1109\/TMM.2014.2384912"},{"unstructured":"G. Boato D.-T. Dang-Nguyen O. Muratov N. Alajlan and F. G. B. De Natale. 2015. Exploiting visual saliency for increasing diversity of image retrieval results. Multimed. Tools. Appl. (2015) 1--22.  G. Boato D.-T. Dang-Nguyen O. Muratov N. Alajlan and F. G. B. De Natale. 2015. Exploiting visual saliency for increasing diversity of image retrieval results. Multimed. Tools. Appl. (2015) 1--22.","key":"e_1_2_1_3_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_4_1","DOI":"10.1109\/ICCP.2014.6936979"},{"doi-asserted-by":"publisher","key":"e_1_2_1_5_1","DOI":"10.1109\/CBMI.2015.7153613"},{"doi-asserted-by":"publisher","key":"e_1_2_1_6_1","DOI":"10.1145\/290941.291025"},{"doi-asserted-by":"publisher","key":"e_1_2_1_7_1","DOI":"10.1109\/TMM.2014.2301978"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the IEEE International Conference on Image Processing","volume":"1","author":"Chen Y.","unstructured":"Y. Chen , X. S. Zhou , and T. S. Huang . 2001. One-class SVM for learning in image retrieval . In Proceedings of the IEEE International Conference on Image Processing , Vol. 1 . 34--37. Y. Chen, X. S. Zhou, and T. S. Huang. 2001. One-class SVM for learning in image retrieval. In Proceedings of the IEEE International Conference on Image Processing, Vol. 1. 34--37."},{"doi-asserted-by":"publisher","key":"e_1_2_1_9_1","DOI":"10.1007\/s00530-015-0491-4"},{"doi-asserted-by":"publisher","key":"e_1_2_1_10_1","DOI":"10.1109\/CVPR.2005.177"},{"key":"e_1_2_1_11_1","volume-title":"MediaEval","volume":"1436","author":"Dang-Nguyen D.-T.","unstructured":"D.-T. Dang-Nguyen , G. Boato , F. G.B. De Natale , L. Piras , G. Giacinto , F. Tuveri , and M. Angioni . 2015a. Multimodal-based diversified summarization in social image retrieval . In MediaEval , Vol. 1436 . D.-T. Dang-Nguyen, G. Boato, F. G.B. De Natale, L. Piras, G. Giacinto, F. Tuveri, and M. Angioni. 2015a. Multimodal-based diversified summarization in social image retrieval. In MediaEval, Vol. 1436."},{"doi-asserted-by":"publisher","key":"e_1_2_1_12_1","DOI":"10.1109\/ICME.2015.7177486"},{"doi-asserted-by":"publisher","key":"e_1_2_1_13_1","DOI":"10.1109\/TIP.2009.2019809"},{"volume-title":"Proceedings of the ACM International Conference on Multimedia. 1021--1024","author":"G\u00eensc\u0103 A. L.","unstructured":"A. L. G\u00eensc\u0103 , A. Popescu , B. Ionescu , A. Armagan , and I. Kanellos . 2014. Toward an estimation of user tagging credibility for social image retrieval . In Proceedings of the ACM International Conference on Multimedia. 1021--1024 . A. L. G\u00eensc\u0103, A. Popescu, B. Ionescu, A. Armagan, and I. Kanellos. 2014. Toward an estimation of user tagging credibility for social image retrieval. In Proceedings of the ACM International Conference on Multimedia. 1021--1024.","key":"e_1_2_1_14_1"},{"volume-title":"Proceedings of the Conference on Intelligent Signal Processing and Communication Systems. 157--160","author":"Huang J.-T.","unstructured":"J.-T. Huang , C.-H. Shen , S.-M. Phoong , and H. Chen . 2005. Robust measure of image focus in the wavelet domain . In Proceedings of the Conference on Intelligent Signal Processing and Communication Systems. 157--160 . J.-T. Huang, C.-H. Shen, S.-M. Phoong, and H. Chen. 2005. Robust measure of image focus in the wavelet domain. In Proceedings of the Conference on Intelligent Signal Processing and Communication Systems. 157--160.","key":"e_1_2_1_15_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_16_1","DOI":"10.1145\/1852102.1852108"},{"key":"e_1_2_1_17_1","volume-title":"MediaEval","volume":"1436","author":"Ionescu B.","unstructured":"B. Ionescu , A.-L. G\u00eensc\u0103 , B. Boteanu , A. Popescu , M. Lupu , and H. M\u00fcller . 2015. Retrieving diverse social images at mediaeval 2015: Challenge, dataset and evaluation . In MediaEval , Vol. 1436 . B. Ionescu, A.-L. G\u00eensc\u0103, B. Boteanu, A. Popescu, M. Lupu, and H. M\u00fcller. 2015. Retrieving diverse social images at mediaeval 2015: Challenge, dataset and evaluation. In MediaEval, Vol. 1436."},{"unstructured":"B. Ionescu A. Popescu M. Lupu A. L. G\u00eensc\u0103 and M\u00fcller. 2014. Retrieving diverse social images at mediaeval 2014: Challenge dataset and evaluation. In MediaEval.  B. Ionescu A. Popescu M. Lupu A. L. G\u00eensc\u0103 and M\u00fcller. 2014. Retrieving diverse social images at mediaeval 2014: Challenge dataset and evaluation. In MediaEval.","key":"e_1_2_1_18_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_19_1","DOI":"10.1109\/TMM.2015.2417506"},{"doi-asserted-by":"publisher","key":"e_1_2_1_20_1","DOI":"10.1145\/1367497.1367539"},{"doi-asserted-by":"publisher","key":"e_1_2_1_21_1","DOI":"10.1016\/j.jss.2005.02.005"},{"doi-asserted-by":"publisher","key":"e_1_2_1_22_1","DOI":"10.1109\/TNN.2002.1021885"},{"doi-asserted-by":"publisher","key":"e_1_2_1_23_1","DOI":"10.1109\/CVPR.2006.68"},{"doi-asserted-by":"publisher","key":"e_1_2_1_24_1","DOI":"10.1016\/j.patrec.2008.05.004"},{"doi-asserted-by":"publisher","key":"e_1_2_1_25_1","DOI":"10.1109\/TMM.2016.2568099"},{"doi-asserted-by":"publisher","key":"e_1_2_1_26_1","DOI":"10.1109\/TMM.2010.2041100"},{"doi-asserted-by":"publisher","key":"e_1_2_1_27_1","DOI":"10.1109\/76.927424"},{"doi-asserted-by":"publisher","key":"e_1_2_1_28_1","DOI":"10.1109\/CBMI.2012.6269811"},{"doi-asserted-by":"publisher","key":"e_1_2_1_29_1","DOI":"10.1109\/ICPR.1994.576366"},{"volume-title":"Proceedings of the International Conference on Cross-language Evaluation Forum: Multimedia Experiments.","author":"Paramita M.","unstructured":"M. Paramita , M. Sanderson , and P. Clough . 2009. Diversity in photo retrieval: Overview of the ImageCLEF photo task 2009 . In Proceedings of the International Conference on Cross-language Evaluation Forum: Multimedia Experiments. M. Paramita, M. Sanderson, and P. Clough. 2009. Diversity in photo retrieval: Overview of the ImageCLEF photo task 2009. In Proceedings of the International Conference on Cross-language Evaluation Forum: Multimedia Experiments.","key":"e_1_2_1_30_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_31_1","DOI":"10.1109\/WIAMIS.2009.5031477"},{"doi-asserted-by":"publisher","key":"e_1_2_1_32_1","DOI":"10.1016\/j.inffus.2017.01.003"},{"doi-asserted-by":"publisher","key":"e_1_2_1_33_1","DOI":"10.1109\/TIP.2015.2497145"},{"doi-asserted-by":"publisher","key":"e_1_2_1_34_1","DOI":"10.1109\/TCSVT.2014.2369731"},{"unstructured":"S. S. Ravindranath M. Gygli and L. van Gool. In MediaEval.  S. S. Ravindranath M. Gygli and L. van Gool. In MediaEval.","key":"e_1_2_1_35_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_36_1","DOI":"10.1109\/TMM.2013.2237896"},{"doi-asserted-by":"publisher","key":"e_1_2_1_37_1","DOI":"10.1109\/ICIP.1997.638621"},{"doi-asserted-by":"publisher","key":"e_1_2_1_38_1","DOI":"10.1109\/76.718510"},{"key":"e_1_2_1_39_1","volume-title":"MediaEval","volume":"1436","author":"Sabetghadam S.","unstructured":"S. Sabetghadam , J. R. M. Palotti , N. Rekabsaz , M. Lupu , and A. Hanbury . 2015. TUW @ MediaEval 2015 retrieving diverse social images task . In MediaEval , Vol. 1436 . S. Sabetghadam, J. R. M. Palotti, N. Rekabsaz, M. Lupu, and A. Hanbury. 2015. TUW @ MediaEval 2015 retrieving diverse social images task. In MediaEval, Vol. 1436."},{"doi-asserted-by":"publisher","key":"e_1_2_1_40_1","DOI":"10.1109\/ICCV.2007.4408863"},{"doi-asserted-by":"publisher","key":"e_1_2_1_41_1","DOI":"10.1007\/s13735-012-0014-4"},{"doi-asserted-by":"publisher","key":"e_1_2_1_42_1","DOI":"10.1007\/978-3-642-31546-6_4"},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo. 1270--1273","author":"Tsai C.-M.","year":"2006","unstructured":"C.-M. Tsai , A. Qamra , E. Y. Chang , and Y.-F. Wang . 2006 . Extent: Interring image metadata from context and content . In Proceedings of the IEEE International Conference on Multimedia and Expo. 1270--1273 . C.-M. Tsai, A. Qamra, E. Y. Chang, and Y.-F. Wang. 2006. Extent: Interring image metadata from context and content. In Proceedings of the IEEE International Conference on Multimedia and Expo. 1270--1273."},{"doi-asserted-by":"publisher","key":"e_1_2_1_44_1","DOI":"10.1145\/1526709.1526756"},{"doi-asserted-by":"publisher","key":"e_1_2_1_45_1","DOI":"10.1007\/s00530-003-0084-5"},{"doi-asserted-by":"publisher","key":"e_1_2_1_46_1","DOI":"10.1109\/CVPR.2010.5539970"},{"key":"e_1_2_1_47_1","volume-title":"USEMP: Finding diverse images at MediaEval","author":"Xioufis E. S.","year":"2015","unstructured":"E. S. Xioufis , A. Popescu , S. Papadopoulos , and I. Kompatsiaris . USEMP: Finding diverse images at MediaEval 2015 . In MediaEval . E. S. Xioufis, A. Popescu, S. Papadopoulos, and I. Kompatsiaris. USEMP: Finding diverse images at MediaEval 2015. In MediaEval."},{"key":"e_1_2_1_48_1","volume-title":"MediaEval","volume":"1436","author":"Zaharieva M.","unstructured":"M. Zaharieva and L. Diem . 2015. MIS @ retrieving diverse social images task 2015 . In MediaEval , Vol. 1436 . M. Zaharieva and L. Diem. 2015. MIS @ retrieving diverse social images task 2015. In MediaEval, Vol. 1436."},{"doi-asserted-by":"publisher","key":"e_1_2_1_49_1","DOI":"10.1109\/ICIP.2001.958595"},{"doi-asserted-by":"publisher","key":"e_1_2_1_50_1","DOI":"10.1007\/s00530-005-0180-9"},{"doi-asserted-by":"publisher","key":"e_1_2_1_51_1","DOI":"10.1145\/233269.233324"},{"doi-asserted-by":"publisher","key":"e_1_2_1_52_1","DOI":"10.1109\/TMM.2015.2431496"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3103613","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3103613","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:30:37Z","timestamp":1750217437000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3103613"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,8,12]]},"references-count":52,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2017,11,30]]}},"alternative-id":["10.1145\/3103613"],"URL":"https:\/\/doi.org\/10.1145\/3103613","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2017,8,12]]},"assertion":[{"value":"2016-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-05-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-08-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}