{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,30]],"date-time":"2025-10-30T07:05:24Z","timestamp":1761807924915,"version":"3.41.0"},"reference-count":106,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2015,3,11]],"date-time":"2015-03-11T00:00:00Z","timestamp":1426032000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Nature Science Foundation of China","doi-asserted-by":"crossref","award":["61422210, 61373076"],"award-info":[{"award-number":["61422210, 61373076"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2015,3,11]]},"abstract":"<jats:p>Coming with the popularity of multimedia sharing platforms such as Facebook and Flickr, recent years have witnessed an explosive growth of geographical tags on social multimedia content. This trend enables a wide variety of emerging applications, for example, mobile location search, landmark recognition, scene reconstruction, and touristic recommendation, which range from purely research prototype to commercial systems. In this article, we give a comprehensive survey on these applications, covering recent advances in recognition and mining of geographical-aware social multimedia. We review related work in the past decade regarding to location recognition, scene summarization, tourism suggestion, 3D building modeling, mobile visual search and city navigation. At the end, we further discuss potential challenges, future topics, as well as open issues related to geo-social multimedia computing, recognition, mining, and analytics.<\/jats:p>","DOI":"10.1145\/2597181","type":"journal-article","created":{"date-parts":[[2015,4,1]],"date-time":"2015-04-01T14:59:12Z","timestamp":1427900352000},"page":"1-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":33,"title":["When Location Meets Social Multimedia"],"prefix":"10.1145","volume":"6","author":[{"given":"Rongrong","family":"Ji","sequence":"first","affiliation":[{"name":"Department of Cognitive Science, Xiamen University, Fujian, China"}]},{"given":"Yue","family":"Gao","sequence":"additional","affiliation":[{"name":"School of Computing, National University of Singapore"}]},{"given":"Wei","family":"Liu","sequence":"additional","affiliation":[{"name":"IBM T. J. Watson Research Center, NY, USA"}]},{"given":"Xing","family":"Xie","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia, Beijing, China"}]},{"given":"Qi","family":"Tian","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Texas at San Antonio, San Antonio, TX, USA"}]},{"given":"Xuelong","family":"Li","sequence":"additional","affiliation":[{"name":"Chinese Academy of Science Ch'ang-an, Shaanxi, China"}]}],"member":"320","published-online":{"date-parts":[[2015,3,26]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DPVT.2006.141"},{"volume-title":"International Symposium on Interactive 3D Graphics.","author":"Aliaga D. G.","key":"e_1_2_1_2_1"},{"volume-title":"Active Perception","author":"Aloimonos Y.","key":"e_1_2_1_3_1","doi-asserted-by":"crossref","DOI":"10.4324\/9780203773178"},{"key":"e_1_2_1_4_1","doi-asserted-by":"crossref","unstructured":"S. Ben-David J. Blitzer K. Crammer and F. Pereira. 2006. Analysis of representations for domain adaptation. In NIPS. S. Ben-David J. Blitzer K. Crammer and F. Pereira. 2006. Analysis of representations for domain adaptation. In NIPS.","DOI":"10.7551\/mitpress\/7503.003.0022"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944937"},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","unstructured":"S. Brin and L. Page. 1998. The anatomy of a large-scale hypertextual Web search engine. In World Wide Web. S. Brin and L. Page. 1998. The anatomy of a large-scale hypertextual Web search engine. In World Wide Web.","DOI":"10.1016\/S0169-7552(98)00110-X"},{"key":"e_1_2_1_7_1","unstructured":"S. Brin. 1995. Near neighbor search in large metric spaces. In VLDB. 574--584. S. Brin. 1995. Near neighbor search in large metric spaces. In VLDB. 574--584."},{"key":"e_1_2_1_8_1","doi-asserted-by":"crossref","unstructured":"D. Brockmann L. Hufnagel and T. Geisel. 2006. The scaling laws of human travel. Nature 439 7075 462--465. D. Brockmann L. Hufnagel and T. Geisel. 2006. The scaling laws of human travel. Nature 439 7075 462--465.","DOI":"10.1038\/nature04292"},{"volume-title":"ACM SIGMOD Workshop on the Web and Databases.","author":"Buyukkokten O.","key":"e_1_2_1_9_1"},{"key":"e_1_2_1_10_1","unstructured":"I. Cadez and P. Bradley. 2001. Model based population tracking and automatic detection of distribution changes. In NIPS. I. Cadez and P. Bradley. 2001. Model based population tracking and automatic detection of distribution changes. In NIPS."},{"key":"e_1_2_1_11_1","unstructured":"C. Campbell and K. P. Bennett. 2001. A linear programming approach to novelty detection. In NIPS. C. Campbell and K. P. Bennett. 2001. A linear programming approach to novelty detection. In NIPS."},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"L. Cao Y. Gao Q. Liu and R. Ji. 2012. Geographical retagging. In Multimedia Modeling. L. Cao Y. Gao Q. Liu and R. Ji. 2012. Geographical retagging. In Multimedia Modeling.","DOI":"10.1007\/978-3-642-35728-2_5"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1631272.1631292"},{"key":"e_1_2_1_14_1","doi-asserted-by":"crossref","unstructured":"D. Crandall L. Backstrom D. Huttenlocher and J. Kleinberg. 2009. Mapping the world\u2019s photos. In WWW. D. Crandall L. Backstrom D. Huttenlocher and J. Kleinberg. 2009. Mapping the world\u2019s photos. In WWW.","DOI":"10.1145\/1526709.1526812"},{"key":"e_1_2_1_15_1","doi-asserted-by":"crossref","unstructured":"M. Cristani A. Perina U. Castellani and V. Murino. 2008. Geolocated image analysis using latent representations. In CVPR. M. Cristani A. Perina U. Castellani and V. Murino. 2008. Geolocated image analysis using latent representations. In CVPR.","DOI":"10.1109\/CVPR.2008.4587390"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237191"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpa.20131"},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"M. Dundar and J. Bi. 2007. Joint optimization of cascaded classifiers for computer aided detection. In CVPR. M. Dundar and J. Bi. 2007. Joint optimization of cascaded classifiers for computer aided detection. In CVPR.","DOI":"10.1109\/CVPR.2007.383093"},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","unstructured":"E. D. Eade and T. W. Drummond. 2008. Unified loop closing and recovery for real time monocular SLAM. In BMVC. E. D. Eade and T. W. Drummond. 2008. Unified loop closing and recovery for real time monocular SLAM. In BMVC.","DOI":"10.5244\/C.22.6"},{"key":"e_1_2_1_20_1","unstructured":"EveryScape. 2009. Homepage. Retrieved from www.everyscape.com. EveryScape. 2009. Homepage. Retrieved from www.everyscape.com."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/T-C.1975.224297"},{"key":"e_1_2_1_22_1","unstructured":"L. Fei-Fei and P. Perona. 2007. A Bayesian hierarchical model for learning natural scene categories. In ICCV. L. Fei-Fei and P. Perona. 2007. A Bayesian hierarchical model for learning natural scene categories. In ICCV."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1873951.1873970"},{"key":"e_1_2_1_24_1","doi-asserted-by":"crossref","unstructured":"T. Goedeme and T. Tuytelaars. 2004. Fast wide baseline matching for visual navigation. In CVPR. 24--29. T. Goedeme and T. Tuytelaars. 2004. Fast wide baseline matching for visual navigation. In CVPR. 24--29.","DOI":"10.1109\/CVPR.2004.1315009"},{"key":"e_1_2_1_25_1","doi-asserted-by":"crossref","unstructured":"M. C. Gonzalez C. A. Hidalgo and A.-L. Barabasi. 2008. Understanding individual human mobility patterns. Nature 453 7196 779--782. M. C. Gonzalez C. A. Hidalgo and A.-L. Barabasi. 2008. Understanding individual human mobility patterns. Nature 453 7196 779--782.","DOI":"10.1038\/nature06958"},{"key":"e_1_2_1_26_1","doi-asserted-by":"crossref","unstructured":"K. Grauman and T. Darrell. 2007. Approximate correspondences in high dimensions. In NIPS. K. Grauman and T. Darrell. 2007. Approximate correspondences in high dimensions. In NIPS.","DOI":"10.7551\/mitpress\/7503.003.0068"},{"key":"e_1_2_1_27_1","doi-asserted-by":"crossref","unstructured":"A. Irschara C. Zach J. M. Frahm and H. Bischof. 2009. From structure-from-motion point clouds to fast location recognition. In CVPR. A. Irschara C. Zach J. M. Frahm and H. Bischof. 2009. From structure-from-motion point clouds to fast location recognition. In CVPR.","DOI":"10.1109\/CVPR.2009.5206587"},{"key":"e_1_2_1_28_1","unstructured":"R. Ji X. Xie H. Yao and W.-Y. Ma. 2009. Hierarchical optimization of visual vocabulary for effective and transferable retrieval. In CVPR. R. Ji X. Xie H. Yao and W.-Y. Ma. 2009. Hierarchical optimization of visual vocabulary for effective and transferable retrieval. In CVPR."},{"key":"e_1_2_1_29_1","doi-asserted-by":"crossref","unstructured":"R. I. Hartley and A. Zisserman. 2004. Multiple View Geometry. Cambridge University Press. R. I. Hartley and A. Zisserman. 2004. Multiple View Geometry. Cambridge University Press.","DOI":"10.1017\/CBO9780511811685"},{"key":"e_1_2_1_30_1","doi-asserted-by":"crossref","unstructured":"J. Hays and A. Efros. 2008. IMG2GPS: Estimating geographic information from a single image. In CVPR. J. Hays and A. Efros. 2008. IMG2GPS: Estimating geographic information from a single image. In CVPR.","DOI":"10.1109\/CVPR.2008.4587784"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007617005950"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/2111235.2111259"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCG.2003.1242383"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0308344101"},{"key":"e_1_2_1_35_1","unstructured":"Y. Li D. J. Crandall and D. P. Huttenlocher. 2009. Landmark recognition in large-scale image collections. In ICCV. Y. Li D. J. Crandall and D. P. Huttenlocher. 2009. Landmark recognition in large-scale image collections. In ICCV."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072228.1072355"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.3115\/1119394.1119400"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/800250.807465"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/1631272.1631416"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/324133.324140"},{"key":"e_1_2_1_41_1","doi-asserted-by":"crossref","unstructured":"H. Jegou H. Harzallah and C. Schmid. 2007. A contextual dissimilarity measure for accurate and efficient image search. CVPR. H. Jegou H. Harzallah and C. Schmid. 2007. A contextual dissimilarity measure for accurate and efficient image search. CVPR.","DOI":"10.1109\/CVPR.2007.382970"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2037676.2037688"},{"volume-title":"IEEE International Conference on Multimedia and Expo.","author":"Ji R.","key":"e_1_2_1_43_1"},{"key":"e_1_2_1_44_1","unstructured":"R. Ji X. Xie H. Yao and W.-Y. Ma. 2009a. Hierarchical optimization of visual vocabulary for effective and transferable retrieval. CVPR. R. Ji X. Xie H. Yao and W.-Y. Ma. 2009a. Hierarchical optimization of visual vocabulary for effective and transferable retrieval. CVPR."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/1631272.1631289"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/MDM.2006.122"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/1367497.1367540"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/1180639.1180762"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-010-0553-8"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2005.66"},{"key":"e_1_2_1_51_1","doi-asserted-by":"crossref","unstructured":"E. Kalogerakis O. Vesselova J. Hays A. Efros and A. Hertzmann. 2009. Image sequence geolocation with human travel priors. In CVPR. E. Kalogerakis O. Vesselova J. Hays A. Efros and A. Hertzmann. 2009. Image sequence geolocation with human travel priors. In CVPR.","DOI":"10.1109\/ICCV.2009.5459259"},{"key":"e_1_2_1_52_1","unstructured":"Y. Keiji and B. Qiu. 2010. Mining regional representative photos from consumer-generated geotagged photos. In Handbook of Social Network Technologies and Applications. Y. Keiji and B. Qiu. 2010. Mining regional representative photos from consumer-generated geotagged photos. In Handbook of Social Network Technologies and Applications."},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/1291233.1291384"},{"volume-title":"IEEE Conference on Intelligent Robots and Systems.","author":"Kretzschmar H.","key":"e_1_2_1_54_1"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-89646-3_34"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1011126920638"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88682-2_33"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1038\/293133a0"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-010-0623-y"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.1085"},{"key":"e_1_2_1_62_1","unstructured":"H. Mannilla H. Toivonen and A. Verkamo. 1997. Discovery of frequent episodes in event sequences. ACM SIGKDD. H. Mannilla H. Toivonen and A. Verkamo. 1997. Discovery of frequent episodes in event sequences. ACM SIGKDD."},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2004.02.006"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2005.146"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.264"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1008139920864"},{"volume-title":"European Workshop on the Integration of Knowledge, Semantic and Digital Media Technology.","author":"O\u2019Hare N.","key":"e_1_2_1_67_1"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1007\/11499145_64"},{"key":"e_1_2_1_69_1","doi-asserted-by":"crossref","unstructured":"J. Philbin O. Chum M. Isard J. Sivic and A. Zisserman. 2007. Object retrieval with large vocabulary and fast spatial matching. In CVPR. J. Philbin O. Chum M. Isard J. Sivic and A. Zisserman. 2007. Object retrieval with large vocabulary and fast spatial matching. In CVPR.","DOI":"10.1109\/CVPR.2007.383172"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1008109111715"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/514236.514263"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/192844.192905"},{"key":"e_1_2_1_73_1","doi-asserted-by":"crossref","unstructured":"D. Robertson and R. Cipolla. 2004. An image-based system for urban navigation. In BMVC. D. Robertson and R. Cipolla. 2004. An image-based system for urban navigation. In BMVC.","DOI":"10.5244\/C.18.84"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1109\/VISUAL.2004.50"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"e_1_2_1_76_1","doi-asserted-by":"crossref","unstructured":"G. Schindler and M. Brown. 2007. City-scale location recognition. In CVPR. G. Schindler and M. Brown. 2007. City-scale location recognition. In CVPR.","DOI":"10.1109\/CVPR.2007.383150"},{"key":"e_1_2_1_77_1","unstructured":"H. Shao T. Svoboda T. Tuytelaars and L. J. Van Gool. 2003. Hpat indexing for fast object\/scene recognition based on local appearance. In CIVR. H. Shao T. Svoboda T. Tuytelaars and L. J. Van Gool. 2003. Hpat indexing for fast object\/scene recognition based on local appearance. In CIVR."},{"key":"e_1_2_1_79_1","doi-asserted-by":"crossref","unstructured":"I. Simmon N. Snavely and S. M. Seitz. 2007. Scene summarization for online image collections. In ICCV. I. Simmon N. Snavely and S. M. Seitz. 2007. Scene summarization for online image collections. In ICCV.","DOI":"10.1109\/ICCV.2007.4408863"},{"volume-title":"Video Google: A text retrieval approach to object matching in videos. In ICCV.","year":"2003","author":"Sivic J.","key":"e_1_2_1_80_1"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/1179352.1141964"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2010.68"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1006\/jvci.1994.1002"},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1561\/0600000009"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1023035826052"},{"key":"e_1_2_1_86_1","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the Lasso","volume":"58","author":"Tibshirani R.","year":"1997","journal-title":"Journal of the Royal Statistical Society"},{"key":"e_1_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF00129684"},{"key":"e_1_2_1_88_1","doi-asserted-by":"crossref","unstructured":"C. Torniai S. Batte and S. Cayzer. 2007. Sharing Discovering and Browsing Geotagged Pictures on the Web. HP Lab. Technical Report. C. Torniai S. Batte and S. Cayzer. 2007. Sharing Discovering and Browsing Geotagged Pictures on the Web. HP Lab. Technical Report.","DOI":"10.1007\/978-1-84628-827-2_15"},{"key":"e_1_2_1_89_1","unstructured":"Travel Guide. 2008. Homepage. Retrieved from www.travel.msra.cn. Travel Guide. 2008. Homepage. Retrieved from www.travel.msra.cn."},{"volume-title":"International Workshop on Vision Algorithms. 298--372","author":"Triggs B.","key":"e_1_2_1_90_1"},{"volume-title":"Extent: Inferring image metadata from context and content. In ICME.","year":"2005","author":"Tsai C.","key":"e_1_2_1_91_1"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00138-006-0027-1"},{"key":"e_1_2_1_93_1","unstructured":"P. Viola and M. Jones. 2001. Rapid object detection using a boosted cascade of simple features. In CVPR. P. Viola and M. Jones. 2001. Rapid object detection using a boosted cascade of simple features. In CVPR."},{"key":"e_1_2_1_94_1","doi-asserted-by":"crossref","unstructured":"L. Wang. 2007. Toward a discriminative codebook: Codeword selection across multi-resolution. In CVPR. L. Wang. 2007. Toward a discriminative codebook: Codeword selection across multi-resolution. In CVPR.","DOI":"10.1109\/CVPR.2007.383374"},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1145\/1096985.1096991"},{"volume-title":"International Conference of the Language Resources and Evaluation.","year":"2000","author":"Wayne C. L.","key":"e_1_2_1_96_1"},{"key":"e_1_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.79"},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88690-7_54"},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.1145\/1661412.1618460"},{"key":"e_1_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.1145\/1290082.1290111"},{"key":"e_1_2_1_101_1","unstructured":"J. Yang J. Wright T. Huang and Y. Ma. 2008. Image super-resolution as sparse representation of raw image patches. In CVPR. J. Yang J. Wright T. Huang and Y. Ma. 2008. Image super-resolution as sparse representation of raw image patches. In CVPR."},{"key":"e_1_2_1_102_1","unstructured":"R. B. Yates and B. R. Neto. 1999. Modern Information Retrieval. ACM Press. R. B. Yates and B. R. Neto. 1999. Modern Information Retrieval. ACM Press."},{"key":"e_1_2_1_103_1","doi-asserted-by":"crossref","unstructured":"T. Yeh J. Lee and T. Darell. 2007. Adaptive vocabulary forest for dynamic indexing and category learning. In CVPR. T. Yeh J. Lee and T. Darell. 2007. Adaptive vocabulary forest for dynamic indexing and category learning. In CVPR.","DOI":"10.1109\/ICCV.2007.4409053"},{"key":"e_1_2_1_104_1","doi-asserted-by":"crossref","unstructured":"T. Yeh K. Tollmar and T. Darrell. 2004. Searching the web with mobile images for location recognition. In CVPR. T. Yeh K. Tollmar and T. Darrell. 2004. Searching the web with mobile images for location recognition. In CVPR.","DOI":"10.1007\/978-3-540-28637-0_25"},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DPVT.2006.80"},{"key":"e_1_2_1_106_1","doi-asserted-by":"publisher","DOI":"10.1145\/1367497.1367532"},{"key":"e_1_2_1_107_1","doi-asserted-by":"crossref","unstructured":"Y.-T. Zheng M. Zhao Y. Song and H. Adam. 2009. Tour the world: Building a web-scale landmark recognition engine. In CVPR. Y.-T. Zheng M. Zhao Y. Song and H. Adam. 2009. Tour the world: Building a web-scale landmark recognition engine. In CVPR.","DOI":"10.1109\/CVPR.2009.5206749"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2597181","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2597181","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:09:58Z","timestamp":1750234198000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2597181"}},"subtitle":["A Survey on Vision-Based Recognition and Mining for Geo-Social Multimedia Analytics"],"short-title":[],"issued":{"date-parts":[[2015,3,11]]},"references-count":106,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,3,11]]}},"alternative-id":["10.1145\/2597181"],"URL":"https:\/\/doi.org\/10.1145\/2597181","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"type":"print","value":"2157-6904"},{"type":"electronic","value":"2157-6912"}],"subject":[],"published":{"date-parts":[[2015,3,11]]},"assertion":[{"value":"2013-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-03-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}