{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,7]],"date-time":"2026-06-07T17:05:24Z","timestamp":1780851924632,"version":"3.54.1"},"reference-count":57,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2019,12,3]],"date-time":"2019-12-03T00:00:00Z","timestamp":1575331200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61972439"],"award-info":[{"award-number":["61972439"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61672039"],"award-info":[{"award-number":["61672039"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61702010"],"award-info":[{"award-number":["61702010"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61772034"],"award-info":[{"award-number":["61772034"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Tongling College Talent Research Fund Project","award":["2017tlxyrc07"],"award-info":[{"award-number":["2017tlxyrc07"]}]},{"DOI":"10.13039\/501100003995","name":"Natural Science Foundation of Anhui Province","doi-asserted-by":"publisher","award":["1808085MF172"],"award-info":[{"award-number":["1808085MF172"]}],"id":[{"id":"10.13039\/501100003995","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Technological advances have led to numerous developments in data sources. Geo-tagged photo metadata has provided a new source of mass research data for tourism studies. A series of data processing methods centering on the various types of information contained in geo-tagged photo metadata have thus been proposed; as a result, the development of tourism studies based on such data has advanced. However, an in-depth study of the data processing methods designed to conduct tourist flow prediction based on geo-tagged photo metadata has not yet been conducted. In order to acquire accurate substitutive data regarding inbound flows in cities, this paper introduces and designs several methods, including data screening, text data similarity calculation, geographical location clustering, and time series data modelling, in order to realize a data preprocessing model for inbound tourist flows in cities based on geo-tagged photo metadata. Wherein, the entropy filtering method was introduced to aid in determining whether the data were posted by inbound tourists; whether the inbound persons\u2019 activities were related to tourism was judged through the calculation of tag text similarity; an efficient clustering method based on geographic grid partition was designed for cases in which the tag values were empty; finally, the time series of the inbound tourist flows of a certain region and period were obtained through data statistics and normalization. For the empirical research, Beijing City in China was selected as the research case, after which the feasibility and accuracy of the methods proposed in this paper were verified through data correlation analysis between Flickr data and real statistical yearbook data, as well as analysis of the prediction results based on a machine learning algorithm. The data preprocessing method introduced and designed in this paper provides a reference for the study of geo-tagged photo metadata in the field of tourism flow prediction. These methods can effectively filter out inbound tourist flow data from geotag photo metadata, thus providing a novel, reliable, and low-cost research data source for urban inbound tourism flow forecasting.<\/jats:p>","DOI":"10.3390\/ijgi8120556","type":"journal-article","created":{"date-parts":[[2019,12,4]],"date-time":"2019-12-04T04:30:35Z","timestamp":1575433835000},"page":"556","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["Geo-Tagged Photo Metadata Processing Method for Beijing Inbound Tourism Flow"],"prefix":"10.3390","volume":"8","author":[{"given":"Wen","family":"Chen","sequence":"first","affiliation":[{"name":"School of Geography and Tourism, Anhui Normal University, Wuhu 241003, China"},{"name":"School of Mathematics and Computer Science, Tongling College, Tongling 244000, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhiyun","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Geography and Tourism, Anhui Normal University, Wuhu 241003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoyao","family":"Zheng","sequence":"additional","affiliation":[{"name":"Anhui Provincial Key Laboratory of Network and Information Security, Anhui Normal University, Wuhu 241003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4987-0376","authenticated-orcid":false,"given":"Yonglong","family":"Luo","sequence":"additional","affiliation":[{"name":"School of Geography and Tourism, Anhui Normal University, Wuhu 241003, China"},{"name":"Anhui Provincial Key Laboratory of Network and Information Security, Anhui Normal University, Wuhu 241003, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2019,12,3]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/j.tourman.2007.07.016","article-title":"Tourism demand modelling and forecasting\u2014A review of recent research","volume":"29","author":"Song","year":"2008","journal-title":"Tour. Manag."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1080\/14616680802236352","article-title":"Movement Patterns of Tourists within a Destination","volume":"10","author":"Mckercher","year":"2008","journal-title":"Tour. Geogr."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"684","DOI":"10.1016\/j.tra.2006.07.003","article-title":"Analysis of tourist behaviour based on the tracking data collected using a mobile communication instrument","volume":"41","author":"Asakura","year":"2007","journal-title":"Transp. Res. Part Policy Pract."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1016\/j.tourman.2014.07.003","article-title":"Exploring the travel behaviors of inbound tourists to Hong Kong using geotagged photos","volume":"46","author":"Vu","year":"2015","journal-title":"Tour. Manag."},{"key":"ref_5","unstructured":"(1981). Technical Handbook on the Collection and Presentation of Domestic and International Tourism Statistics, World Tourism Organization."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1080\/17489720802261138","article-title":"Leveraging explicitly disclosed location information to understand tourist dynamics: A case study","volume":"2","author":"Girardin","year":"2008","journal-title":"J. Locat. Based Serv."},{"key":"ref_7","unstructured":"Jones, B. (1996). Reforming the System?. A Review of Australian Tourism Statistics, Bureau of Tourism Research."},{"key":"ref_8","unstructured":"Finn, M., Walton, M., and Elliott-White, M. (2000). Tourism and Leisure Research Methods: Data Collection, Analysis, and Interpretation, Pearson Education."},{"key":"ref_9","first-page":"1556","article-title":"Exploring the spatial characteristics of Beijing inbound tourist flow based on geotagged photos","volume":"73","author":"Qin","year":"2018","journal-title":"Acta Geogr. Sin."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1012","DOI":"10.1038\/nature07634","article-title":"Detecting influenza epidemics using search engine query data","volume":"457","author":"Ginsberg","year":"2009","journal-title":"Nature"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1016\/j.tourman.2014.07.019","article-title":"Forecasting Chinese tourist volume with search engine data","volume":"46","author":"Yang","year":"2015","journal-title":"Tour. Manag."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1016\/j.tourman.2016.07.005","article-title":"Forecasting tourism demand with composite search index","volume":"59","author":"Li","year":"2017","journal-title":"Tour. Manag."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1016\/j.tourman.2018.03.009","article-title":"Big data in tourism research: A literature review","volume":"68","author":"Li","year":"2018","journal-title":"Tour. Manag."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"771","DOI":"10.1016\/j.im.2016.11.011","article-title":"A Big Data Analytics Method for Tourist Behaviour Analysis","volume":"54","author":"Miah","year":"2017","journal-title":"Inf. Manag."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Kisilevich, S., Krstajic, M., Keim, D., Andrienko, N., and Andrienko, G. (2010, January 26\u201329). Event-Based Analysis of People\u2019s Activities and Behavior Using Flickr and Panoramio Geotagged Photo Collections. Proceedings of the IEEE 2010 14th International Conference Information Visualisation, London, UK.","DOI":"10.1109\/IV.2010.94"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Lucchese, C., Perego, R., Silvestri, F., Vahabi, H., and Venturini, R. (2012, January 1\u20135). How Random Walks Can Help Tourism. Proceedings of the Advances in Information Retrieval, Barcelona, Spain.","DOI":"10.1007\/978-3-642-28997-2_17"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"103955","DOI":"10.1016\/j.tourman.2019.103955","article-title":"Exploring spatio-temporal changes of city inbound tourism flow: The case of Shanghai, China","volume":"76","author":"Mou","year":"2020","journal-title":"Tour. Manag."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1862","DOI":"10.1080\/13683500.2018.1564739","article-title":"Managing customer knowledge through the use of big data analytics in tourism research","volume":"22","author":"Centobelli","year":"2019","journal-title":"Curr. Issues Tour."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"3514","DOI":"10.1108\/IJCHM-07-2017-0461","article-title":"Business intelligence and big data in hospitality and tourism: A systematic literature review","volume":"30","author":"Mariani","year":"2018","journal-title":"Int. J. Contemp. Hosp. Manag."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Peng, X., and Huang, Z.A. (2017). Novel Popular Tourist Attraction Discovering Approach Based on Geo-Tagged Social Media Big Data. ISPRS Int. J. Geo-Inf., 6.","DOI":"10.3390\/ijgi6070216"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Kisilevich, S., Mansmann, F., and Keim, D. (2010, January 21\u201323). P-DBSCAN: A Density Based Clustering Algorithm for Exploration and Analysis of Attractive Areas Using Collections of Geo-tagged Photos. Proceedings of the 1st International Conference and Exhibition on Computing for Geospatial Research & Application, Washington, DC, USA.","DOI":"10.1145\/1823854.1823897"},{"key":"ref_22","first-page":"226","article-title":"A density-based algorithm for discovering clusters in large spatial databases with noise","volume":"96","author":"Ester","year":"1996","journal-title":"KDD"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1016\/j.tourman.2019.07.002","article-title":"Discovering the tourists\u2019 behaviors and perceptions in a tourism destination by analyzing photos\u2019 visual content with a computer deep learning model: The case of Beijing","volume":"75","author":"Zhang","year":"2019","journal-title":"Tour. Manag."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Xia, C., Schwartz, R., Xie, K., Krebs, A., Langdon, A., Ting, J., and Naaman, M. (2014, January 7\u201311). CityBeat: Real-time Social Media Visualization of Hyper-local City Data. Proceedings of the 23rd International Conference on World Wide Web, Seoul, Korea.","DOI":"10.1145\/2567948.2577020"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1016\/j.jretconser.2019.06.018","article-title":"Store buildings as tourist attractions: Mining retail meaning of store building pictures through a machine learning approach","volume":"51","author":"Pantano","year":"2019","journal-title":"J. Retail. Consum. Serv."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Giglio, S., Bertacchini, F., Bilotta, E., and Pantano, P. (2019). Machine learning and point of interests: Typical tourist Italian cities. Curr. Issues Tour., 1\u201313.","DOI":"10.1080\/13683500.2019.1637827"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1007\/s10707-015-0226-x","article-title":"Travel topic analysis: A mutually reinforcing method for geo-tagged photos","volume":"19","author":"Kou","year":"2015","journal-title":"Geoinformatica"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1885","DOI":"10.14778\/2824032.2824092","article-title":"I2RS: A Distributed Geo-Textual Image Retrieval and Recommendation System","volume":"8","author":"Chen","year":"2015","journal-title":"Proc. Vldb Endow."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"356","DOI":"10.1504\/IJES.2018.095023","article-title":"User similarity-based gender-aware travel location recommendation by mining geotagged photos","volume":"10","author":"Xu","year":"2018","journal-title":"Int. J. Embed. Syst."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1016\/j.apgeog.2016.06.001","article-title":"Characterizing geographical preferences of international tourists and the local influential factors in China using geo-tagged photos on social media","volume":"73","author":"Su","year":"2016","journal-title":"Appl. Geogr."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Koylu, C., Zhao, C., and Shao, W. (2019). Deep Neural Networks and Kernel Density Estimation for Detecting Human Activity Patterns from Geo-Tagged Images: A Case Study of Birdwatching on Flickr. ISPRS Int. J. Geo-Inf., 8.","DOI":"10.3390\/ijgi8010045"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Yang, L., Wu, L., Liu, Y., and Kang, C. (2017). Quantifying Tourist Behavior Patterns by Travel Motifs and Geo-Tagged Photos from Flickr. ISPRS Int. J. Geo-Inf., 6.","DOI":"10.3390\/ijgi6110345"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Zamir, A.R., Hakeem, A., Van Gool, L., Shah, M., and Szeliski, R. (2016). Where the Photos Were Taken: Location Prediction by Learning from Flickr Photos. Advances in Computer Vision and Pattern Recognition. Large-Scale Visual Geo-Localization, Springer International Publishing.","DOI":"10.1007\/978-3-319-25781-5"},{"key":"ref_34","unstructured":"Baraglia, R., Muntean, C.I., Nardini, F.M., and Silvestri, F. (November, January 27). LearNext: Learning to Predict Tourists Movements. Proceedings of the 22Nd ACM International Conference on Information & Knowledge Management, San Francisco, CA, USA."},{"key":"ref_35","first-page":"155","article-title":"Intelligent travel information platform based on location base services to predict user travel behavior from user-generated GPS traces","volume":"39","author":"Arain","year":"2017","journal-title":"Int. J. Comput. Appl."},{"key":"ref_36","unstructured":"Kang, B.H., and Bai, Q. A Framework for Mining Semantic-Level Tourist Movement Behaviours from Geo-Tagged Photos. Proceedings of the AI 2016: Advances in Artificial Intelligence."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1177\/0047287514528283","article-title":"A Meta-Analysis of International Tourism Demand Elasticities","volume":"54","author":"Peng","year":"2015","journal-title":"J. Travel Res."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1300\/J073v13n01_02","article-title":"Forecasting China\u2019s Monthly Inbound Travel Demand","volume":"13","author":"Kulendran","year":"2002","journal-title":"J. Travel Tour. Mark."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1016\/j.compenvurbsys.2013.07.006","article-title":"Road-based travel recommendation using geo-tagged images","volume":"53","author":"Sun","year":"2015","journal-title":"Comput. Environ. Urban Syst."},{"key":"ref_40","unstructured":"Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. arXiv."},{"key":"ref_41","unstructured":"Weiss, S.M., Indurkhya, N., Zhang, T., and Damerau, F. (2010). Text Mining: Predictive Methods for Analyzing Unstructured Information, Springer Publishing Company, Incorporated. [1st ed.]."},{"key":"ref_42","unstructured":"Wei, X. (2000). Introduction to Tourism, China Forestry Publishing House."},{"key":"ref_43","unstructured":"Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., and Weinberger, K.Q. (2013). Distributed Representations of Words and Phrases and their Compositionality. Advances in Neural Information Processing Systems 26, Curran Associates, Inc."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/1961189.1961199","article-title":"LIBSVM: A Library for Support Vector Machines","volume":"2","author":"Chang","year":"2011","journal-title":"Acm Trans. Intell. Syst. Technol."},{"key":"ref_45","first-page":"985","article-title":"Extreme learning machine: A new learning scheme of feedforward neural networks","volume":"2","author":"Huang","year":"2004","journal-title":"Neural Netw."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Cheng, G.-J., Cai, L., and Pan, H.-X. (2009, January 11\u201314). Comparison of Extreme Learning Machine with Support Vector Regression for Reservoir Permeability Prediction. Proceedings of the 2009 International Conference on Computational Intelligence and Security, Beijing, China.","DOI":"10.1109\/CIS.2009.124"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1007\/s13042-011-0019-y","article-title":"Extreme learning machines: A survey","volume":"2","author":"Huang","year":"2011","journal-title":"Int. J. Mach. Learn. Cybern."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.ijepes.2016.03.001","article-title":"A systematic design of interval type-2 fuzzy logic system using extreme learning machine for electricity load demand forecasting","volume":"82","author":"Hassan","year":"2016","journal-title":"Int. J. Electr. Power Energy Syst."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1016\/j.chb.2016.08.014","article-title":"Economic growth forecasting by artificial neural network with extreme learning machine based on trade, import and export parameters","volume":"65","author":"Milovancevic","year":"2016","journal-title":"Comput. Hum. Behav."},{"key":"ref_50","unstructured":"China Tourism Academy (2013). Annual Report of China Inbound Tourism Development 2013, Tourism Education Press."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1080\/19388160.2018.1516584","article-title":"Research on China\u2019s Inbound Tourism: A Comparative Review","volume":"15","author":"Wu","year":"2019","journal-title":"J. China Tour. Res."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1007\/BF02289588","article-title":"Hierarchical clustering schemes","volume":"32","author":"Johnson","year":"1967","journal-title":"Psychometrika"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1136\/amiajnl-2011-000480","article-title":"A vector space model approach to identify genetically related diseases","volume":"19","author":"Sarkar","year":"2012","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1016\/j.ipm.2016.12.002","article-title":"Opinion mining from online hotel reviews\u2014A text summarization approach","volume":"53","author":"Hu","year":"2017","journal-title":"Inf. Process. Manag."},{"key":"ref_55","first-page":"54","article-title":"Classifying multi-destination trips in Austria with big data","volume":"21","author":"Oender","year":"2017","journal-title":"Tour. Manag. Perspect."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.tourman.2017.11.001","article-title":"Tourists\u2019 digital footprint in cities: Comparing Big Data sources","volume":"66","author":"Gutierrez","year":"2018","journal-title":"Tour. Manag."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1109\/TPAMI.1979.4766902","article-title":"N-Gram Statistics for Natural Language Understanding and Text Processing","volume":"1","author":"Suen","year":"1979","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/8\/12\/556\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:39:55Z","timestamp":1760189995000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/8\/12\/556"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,3]]},"references-count":57,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2019,12]]}},"alternative-id":["ijgi8120556"],"URL":"https:\/\/doi.org\/10.3390\/ijgi8120556","relation":{},"ISSN":["2220-9964"],"issn-type":[{"value":"2220-9964","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,12,3]]}}}