{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T18:58:27Z","timestamp":1774551507781,"version":"3.50.1"},"reference-count":51,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2023,6,21]],"date-time":"2023-06-21T00:00:00Z","timestamp":1687305600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"FCT\u2014Foundation for Science and Technology","award":["UIDB\/04020\/2020"],"award-info":[{"award-number":["UIDB\/04020\/2020"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["BDCC"],"abstract":"<jats:p>Social Media Analytics (SMA) is more and more relevant in today\u2019s market dynamics. However, it is necessary to use it wisely, either in promoting any kind of product\/brand, or interacting with customers. This requires its effective understanding and monitoring. One way is through web data scraping (WDS) tools that allow to select sites and platforms to compare them in their performances. They can optimize extraction of big data published on social media. Due to current challenges, a sector that can particularly take advantage of this source is tourism (and its related sectors). This year has the hope of tourism\u2019s revival after a pandemic whose impacts are still affecting several activities. Many traders and entrepreneurs have already used these versatile tools. However, do they really know their potential? The present study highlights the use of WDS to collect data from TripAdvisor\u2019s social pages. Besides comparing competitors\u2019 performance, companies also gain new knowledge of unnoticed preferences\/habits. This contributes to more interesting innovations and results for them and for their customers. The approach used here is based on a project for smart tourism consultancy, from the identification of a gap in our region, to aid tourism organizations to enhance their digital presence and business model. Many things can be detected in this big source of unstructured data very quickly and easily without programming. Moreover, exploring code, either to refine the web scraper or connect it with other platforms\/apps, can be an object of future research to leverage consumer behavior prediction for more advanced interactions.<\/jats:p>","DOI":"10.3390\/bdcc7030121","type":"journal-article","created":{"date-parts":[[2023,6,21]],"date-time":"2023-06-21T01:35:11Z","timestamp":1687311311000},"page":"121","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":22,"title":["The Value of Web Data Scraping: An Application to TripAdvisor"],"prefix":"10.3390","volume":"7","author":[{"given":"Gianluca","family":"Barbera","sequence":"first","affiliation":[{"name":"School of Political Sciences \u201cCesare Alfieri\u201d, University of Florence, 50127 Florence, Italy"}]},{"given":"Luiz","family":"Araujo","sequence":"additional","affiliation":[{"name":"Faculty of Economics, University of Algarve, 8005-139 Faro, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1699-5415","authenticated-orcid":false,"given":"Silvia","family":"Fernandes","sequence":"additional","affiliation":[{"name":"Faculty of Economics, University of Algarve, 8005-139 Faro, Portugal"},{"name":"CinTurs\u2014Research Centre for Tourism, Sustainability and Well-being, University of Algarve, 8005-139 Faro, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2023,6,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"120","DOI":"10.1590\/s1677-5538.ibju.2020.s121","article-title":"Social media influence in the COVID-19 Pandemic","volume":"46","year":"2020","journal-title":"Int. Braz. J. Urol."},{"key":"ref_2","first-page":"100819","article-title":"COVID-19 and tourism vulnerability","volume":"38","author":"Duro","year":"2021","journal-title":"Tour. Manag. Perspect."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1002\/mar.20761","article-title":"Creative Strategies in Social Media Marketing: An Exploratory Study of Branded Social Content and Consumer Engagement","volume":"32","author":"Ashley","year":"2015","journal-title":"Psychol. Mark."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Zhao, B. (2017). Web Scraping. Encyclopedia of Big Data, Springer.","DOI":"10.1007\/978-3-319-32001-4_483-1"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Kaisler, S., Armour, F., Espinosa, J., and Money, W. (2013, January 7\u201310). Big Data: Issues and Challenges Moving Forward. Proceedings of the 46th Hawaii International Conference on System Sciences 2013, Wailea, HI, USA.","DOI":"10.1109\/HICSS.2013.645"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1023\/A:1005682102768","article-title":"Data collection methods on the Web for infometric purposes-A review and analysis","volume":"50","year":"2001","journal-title":"Scientometrics"},{"key":"ref_7","unstructured":"Mitchell, R. (2018). Web Scraping with Python: Collecting More Data from the Modern Web, O\u2019Reilly. [2nd ed.]."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"3415","DOI":"10.1007\/s11069-020-04136-z","article-title":"Scraping social media data for disaster communication: How the pattern of Twitter users affects disasters in Asia and the Pacific","volume":"103","author":"Kusumasari","year":"2020","journal-title":"Nat. Hazards"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Kaburuan, E., Lindawati, A., Putra, M., and Utama, D. (2019, January 6\u20138). A Model Configuration of Social Media Text Mining for Projecting the Online-Commerce Transaction (Case: Twitter Tweets Scraping). Proceedings of the 7th International Conference on Cyber and IT Service Management (CITSM) 2019, Jakarta, Indonesia.","DOI":"10.1109\/CITSM47753.2019.8965417"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Kaur, C., and Sharma, A. (2020, January 14\u201316). Social Issues Sentiment Analysis using Python. Proceedings of the 5th International Conference on Computing, Communication and Security (ICCCS) 2020, Patna, India.","DOI":"10.1109\/ICCCS49678.2020.9277251"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Raman, D., Jayalakshmi, S., Arumugam, K., Raj, A., Balaji, D., and Brightsingh, R. (2022, January 21\u201323). Implementation of Data Analysis and Document Summarization in Social Media Data Using R and Python. Proceedings of the 4th International Conference on Inventive Research in Computing Applications (ICIRCA) 2022, Coimbatore, India.","DOI":"10.1109\/ICIRCA54612.2022.9985479"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Bhardwaj, B., Ahmed, S., Jaiharie, J., Dadhich, R., and Ganesan, M. (2021, January 19\u201320). Web Scraping Using Summarization and Named Entity Recognition (NER). Proceedings of the 7th International Conference on Advanced Computing and Communication Systems (ICACCS) 2021, Coimbatore, India.","DOI":"10.1109\/ICACCS51430.2021.9441888"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Dansana, D., Adhikari, J., Mohapatra, M., and Sahoo, S. (2020, January 13\u201314). An Approach to Analyse and Forecast Social media Data using Machine Learning and Data Analysis. Proceedings of the International Conference on Computer Science, Engineering and Applications (ICCSEA) 2020, Gunupur, India.","DOI":"10.1109\/ICCSEA49143.2020.9132895"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Camargo-Henr\u00edquez, I., and N\u00fa\u00f1ez-Bernal, Y. (2022, January 14\u201316). A Web Scraping based approach for data research through social media: An Instagram case. Proceedings of the V Congreso Internacional en Inteligencia Ambiental, Ingenier\u00eda de Software y Salud Electr\u00f3nica y M\u00f3vil (AmITIC) 2022, San Jose, Costa Rica.","DOI":"10.1109\/AmITIC55733.2022.9941290"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1007\/s10032-009-0105-9","article-title":"Locating and parsing bibliographic references in HTML medical articles","volume":"13","author":"Zou","year":"2010","journal-title":"Int. J. Doc. Anal. Recognit."},{"key":"ref_16","unstructured":"Korab, P. (2023, May 21). Text Network Analysis: Generate Beautiful Network Visualisations. Available online: https:\/\/towardsdatascience.com\/text-network-analysis-generate-beautiful-network-visualisations-a373dbe183ca."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1177\/0047287517747753","article-title":"Sentiment Analysis in Tourism: Capitalizing on Big Data","volume":"58","author":"Alaei","year":"2019","journal-title":"J. Travel Res."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1177\/00222429221100750","article-title":"Fields of Gold: Scraping Web Data for Marketing Insights","volume":"86","author":"Boegershausen","year":"2022","journal-title":"J. Mark."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"M\u00e0rquez-Dom\u00ednguez, C., L\u00f3pez L\u00f3pez, P., and Arias, T. (2017, January 21\u201324). Social networking and political agenda: Donald Trump\u2019s Twitter accounts. Proceedings of the 12th Iberian Conference on Information Systems and Technologies (CISTI) 2017, Lisbon, Portugal.","DOI":"10.23919\/CISTI.2017.7976052"},{"key":"ref_20","first-page":"89","article-title":"Political Social Media Campaigning in Fiji\u2019s 2014 Elections","volume":"35","author":"Tarai","year":"2015","journal-title":"J. Pac. Stud."},{"key":"ref_21","first-page":"309","article-title":"\u201cSometimes the Crisis Makes the Leader?\u201d A Comparison of Giuseppe Conte Digital Communication before and during the COVID-19 Pandemic","volume":"3","author":"Rullo","year":"2021","journal-title":"Comun. Politica"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Mabillard, V., Zumofen, R., and Pasquier, M. (2022). Local governments\u2019 communication on social media platforms: Refining and assessing patterns of adoption in Belgium. Int. Rev. Adm. Sci., 1\u201317.","DOI":"10.1177\/00208523221133229"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"96","DOI":"10.17979\/redma.2022.26.1.8869","article-title":"Comunicaci\u00f3n y diabetes, un camino para la reflexi\u00f3n","volume":"26","year":"2022","journal-title":"RedMarka-Rev. De Mark. Apl."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"19","DOI":"10.5539\/ass.v11n26p19","article-title":"Customer Engagement Factors in Facebook Brand Pages","volume":"11","author":"Jayasingh","year":"2015","journal-title":"Asian Soc. Sci."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1007\/s40558-015-0045-9","article-title":"User reactions to destination brand contents in social media","volume":"15","author":"Huertas","year":"2016","journal-title":"Inf. Technol. Tour."},{"key":"ref_26","first-page":"1","article-title":"Dilemmas Between Freedom of Speech and Hate Speech: Russophobia on Facebook and Instagram in the Spanish Media","volume":"11","year":"2022","journal-title":"Politics Gov."},{"key":"ref_27","first-page":"47","article-title":"La gesti\u00f3n de los medios sociales en la dinamizaci\u00f3n de destinos tur\u00edsticos termales: An\u00e1lisis crosscultural de modelos aplicados en Espa\u00f1a, Portugal y Ecuador","volume":"2","author":"Amboage","year":"2015","journal-title":"Hologram\u00e1tica"},{"key":"ref_28","first-page":"210","article-title":"Evolution of the presence and engagement of official social networks in promoting tourism in Spain","volume":"7","author":"Matos","year":"2019","journal-title":"J. Spat. Organ. Dyn."},{"key":"ref_29","first-page":"17","article-title":"An\u00e1lisis de la comunicaci\u00f3n digital oficial en la promoci\u00f3n tur\u00edstica de Brasil","volume":"9","year":"2020","journal-title":"3c TIC-Cuad. De Desarro. Apl. A Las TIC"},{"key":"ref_30","first-page":"62","article-title":"Evolution of hospitality and tourism technology research from Journal of Hospitality and Tourism Technology: A computer-assisted qualitative data analysis","volume":"13","author":"Lee","year":"2021","journal-title":"J. Hosp. Tour. Technol."},{"key":"ref_31","unstructured":"Pereira, P. (2023). Social Media Influencers in Travel and Tourism. [Master\u2019s Thesis, Nova Information Management School]. Master Course in Information Management."},{"key":"ref_32","unstructured":"Phaujdar, A. (2023, May 22). 9 Best Web Scraping Tools. Available online: https:\/\/hevodata.com\/learn\/web-scraping-tools\/."},{"key":"ref_33","unstructured":"Rizkallah, J. (2023, March 23). The Big (Unstructured) Data Problem. Available online: https:\/\/www.forbes.com\/sites\/forbestechcouncil\/2017\/06\/05\/the-big-unstructured-data-problem\/?sh=cd00fa3493a3."},{"key":"ref_34","unstructured":"Selz, D. (2023, March 23). Unstructured Data Is Key to True Customer Insight. Available online: https:\/\/www.linkedin.com\/pulse\/unstructured-data-key-true-customer-insight-dorian-selz."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"570","DOI":"10.1108\/EJM-01-2019-0092","article-title":"Cognitive computing on unstructured data for customer co-innovation","volume":"54","author":"Chen","year":"2020","journal-title":"Eur. J. Mark."},{"key":"ref_36","unstructured":"Marr, B. (2023, March 12). How Much Data Do We Create Every Day?. Available online: https:\/\/www.forbes.com\/sites\/bernardmarr\/2018\/05\/21\/how-much-data-do-we-create-every-day-the-mind-blowing-stats-everyone-should-read\/?sh=4de1a9aa60ba."},{"key":"ref_37","unstructured":"Ruan, Z., and Siau, K. (2023, March 13). Digital Marketing in the Artificial Intelligence and Machine Learning Age. Americas Conference on Information Systems. Available online: https:\/\/www.semanticscholar.org\/paper\/Digital-Marketing-in-the-Artificial-Intelligence-Ruan-Siau\/5d0764dbe4cb3beb6c194b49a4eae1a991a72cd8."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1108\/ITP-09-2014-0197","article-title":"Examining information systems infusion from a user commitment perspective","volume":"29","author":"Kim","year":"2016","journal-title":"Inf. Technol. People"},{"key":"ref_39","first-page":"1","article-title":"Cloud computing: An examination of factors impacting users\u2019 adoption","volume":"58","author":"Changchit","year":"2018","journal-title":"J. Comput. Inf. Syst."},{"key":"ref_40","unstructured":"Biedrzycki, N. (2023, May 26). Cognitive Computing. What Can It Be Used for?. Available online: https:\/\/towardsdatascience.com\/cognitive-computing-what-can-it-be-used-for-8af4721928f5."},{"key":"ref_41","unstructured":"Frackiewicz, M. (2023, May 26). The Role of NLP in Cognitive Computing. Available online: https:\/\/ts2.space\/en\/the-role-of-nlp-in-cognitive-computing\/."},{"key":"ref_42","unstructured":"Rao, L. (2023, June 05). Instagram Copies Snapchat Once again with Face Filters. Available online: https:\/\/tinyurl.com\/ybcuxxdv."},{"key":"ref_43","unstructured":"Perry, E. (2023, June 05). Meet HearMeOut: The Social Media Platform Looking to Bring Audio Back into the Mainstream. Available online: https:\/\/tinyurl.com\/y8yxbzah."},{"key":"ref_44","unstructured":"Katai, L. (2023, June 05). 3 Reasons Why Audio Will Conquer All Social Media. Available online: https:\/\/www.adweek.com\/performance-marketing\/3-reasons-why-audio-will-conquer-social-media\/."},{"key":"ref_45","first-page":"27","article-title":"Impact of Artificial Intelligence in Marketing: A Perspective of Marketing Professionals of Pakistan","volume":"19","author":"Shahid","year":"2019","journal-title":"Glob. J. Manag. Bus. Res."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"102168","DOI":"10.1016\/j.ijinfomgt.2020.102168","article-title":"Setting the future of digital and social media marketing research: Perspectives and research propositions","volume":"59","author":"Dwivedi","year":"2021","journal-title":"Int. J. Inf. Manag."},{"key":"ref_47","unstructured":"Zoho Social (2023, June 05). Social Media Marketing Trends for 2022. Available online: https:\/\/www.zoho.com\/social\/journal\/social-media-marketing-trends-2022.html."},{"key":"ref_48","unstructured":"NBBJ (2023, June 05). Social Media Is Evolving Quickly, and Your Business Needs to Also. Available online: https:\/\/www.northbaybusinessjournal.com\/article\/industrynews\/social-media-is-evolving-quickly-and-your-business-needs-to-also\/."},{"key":"ref_49","unstructured":"Corcoran, S. (2023, March 12). Defining Earned, Owned and Paid Media. Available online: https:\/\/www.forrester.com\/blogs\/09-12-16-defining_earned_owned_and_paid_media\/."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Wozniak, T., Stangl, B., Schegg, R., and Liebrich, A. (2016, January 2\u20135). Do Social Media Investments Pay Off? Preliminary Evidence from Swiss Destination Marketing Organizations. Proceedings of the ENTER eTourism Conference 2016, Bilbao, Spain.","DOI":"10.1007\/978-3-319-28231-2_20"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1016\/j.intmar.2013.09.007","article-title":"Social media metrics-A framework and guidelines for managing social media","volume":"27","author":"Peters","year":"2013","journal-title":"J. Interact. Mark."}],"container-title":["Big Data and Cognitive Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-2289\/7\/3\/121\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:57:36Z","timestamp":1760126256000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-2289\/7\/3\/121"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,21]]},"references-count":51,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2023,9]]}},"alternative-id":["bdcc7030121"],"URL":"https:\/\/doi.org\/10.3390\/bdcc7030121","relation":{},"ISSN":["2504-2289"],"issn-type":[{"value":"2504-2289","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6,21]]}}}