{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T22:49:41Z","timestamp":1777416581413,"version":"3.51.4"},"reference-count":41,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2022,9,15]],"date-time":"2022-09-15T00:00:00Z","timestamp":1663200000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Res. Metr. Anal."],"abstract":"<jats:p>Infodemiologic methods could be used to enhance modeling infectious diseases. It is of interest to verify the utility of these methods using a Nigerian case study. We used Google Trends data to track COVID-19 incidences and assessed whether they could complement traditional data based solely on reported case numbers. Data on the Nigerian weekly COVID-19 cases spanning through March 1, 2020, to May 31, 2021, were matched with internet search data from Google Trends. The reported weekly incidence numbers and the GT data were split into training and testing sets. ARIMA models were fitted to describe reported weekly COVID cases using the training set. Several COVID-related search terms were theoretically and empirically assessed for initial screening. The utilized Google Trends (GT) variable was added to the ARIMA model as a regressor. Model forecasts, both with and without GTD, were compared with weekly cases in the test set over 13 weeks. Forecast accuracies were compared visually and using RMSE (root mean square error) and MAE (mean average error). Statistical significance of the difference in predictions was determined with the two-sided Diebold-Mariano test. Preliminary results of contemporaneous correlations between COVID-related search terms and weekly COVID cases reveal \u201closs of smell,\u201d \u201closs of taste,\u201d \u201cfever\u201d (in order of magnitude) as significantly associated with the official cases. Predictions of the ARIMA model using solely reported case numbers resulted in an RMSE (root mean squared error) of 411.4 and mean absolute error (MAE) of 354.9. The GT expanded model achieved better forecasting accuracy (RMSE: 388.7 and MAE = 340.1). Corrected Akaike Information Criteria also favored the GT expanded model (869.4 vs. 872.2). The difference in predictive performances was significant when using a two-sided Diebold-Mariano test (DM = 6.75, <jats:italic>p<\/jats:italic> &amp;lt; 0.001) for the 13 weeks. Google trends data enhanced the predictive ability of a traditionally based model and should be considered a suitable method to enhance infectious disease modeling.<\/jats:p>","DOI":"10.3389\/frma.2022.1003972","type":"journal-article","created":{"date-parts":[[2022,9,15]],"date-time":"2022-09-15T09:50:34Z","timestamp":1663235434000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["Modeling COVID-19 incidence with Google Trends"],"prefix":"10.3389","volume":"7","author":[{"given":"Lateef Babatunde","family":"Amusa","sequence":"first","affiliation":[]},{"given":"Hossana","family":"Twinomurinzi","sequence":"additional","affiliation":[]},{"given":"Chinedu Wilfred","family":"Okonkwo","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2022,9,15]]},"reference":[{"key":"B1","first-page":"327","article-title":"Use of time-series analysis in infectious disease surveillance","volume":"76","author":"Allard","year":"1998","journal-title":"Bull. World Health Organ."},{"key":"B2","doi-asserted-by":"publisher","first-page":"e18828","DOI":"10.2196\/18828","article-title":"Predicting COVID-19 incidence through analysis of google trends data in iran: data mining and deep learning pilot study","volume":"6","author":"Ayyoubzadeh","year":"2020","journal-title":"JMIR Public Health Surveill."},{"key":"B3","doi-asserted-by":"publisher","first-page":"1557","DOI":"10.1086\/630200","article-title":"Google trends: a web-based tool for real-time surveillance of disease outbreaks","volume":"49","author":"Carneiro","year":"2009","journal-title":"Clin. Infect. Dis."},{"key":"B4","doi-asserted-by":"publisher","first-page":"1165","DOI":"10.1080\/14787210.2020.1792289","article-title":"Loss of smell and taste: a new marker of COVID-19? Tracking reduced sense of smell during the coronavirus pandemic using search trends","volume":"18","author":"Cherry","year":"2020","journal-title":"Expert Rev. Anti. Infect. Ther."},{"key":"B5","doi-asserted-by":"publisher","first-page":"134","DOI":"10.1198\/073500102753410444","article-title":"Comparing predictive accuracy","volume":"20","author":"Diebold","year":"2002","journal-title":"J. Bus. Econ. Stat."},{"key":"B6","doi-asserted-by":"publisher","first-page":"533","DOI":"10.1016\/S1473-3099(20)30120-1","article-title":"An interactive web-based dashboard to track COVID-19 in real time","volume":"20","author":"Dong","year":"2020","journal-title":"Lancet Infect. Dis."},{"key":"B7","doi-asserted-by":"publisher","first-page":"e1157","DOI":"10.2196\/jmir.1157","article-title":"Infodemiology and infoveillance: framework for an emerging set of public health informatics methods to analyze search, communication and publication behavior on the Internet","volume":"11","author":"Eysenbach","year":"2009","journal-title":"J. Med. Internet Res."},{"key":"B8","doi-asserted-by":"publisher","first-page":"e8186","DOI":"10.2196\/publichealth.8186","article-title":"Associations of topics of discussion on Twitter with survey measures of attitudes, knowledge, and behaviors related to Zika: probabilistic study in the United States","volume":"4","author":"Farhadloo","year":"2018","journal-title":"JMIR Public Health Surveill."},{"key":"B9","doi-asserted-by":"publisher","DOI":"10.1101\/2021.03.26.21254369","article-title":"Using Google Health Trends to investigate COVID19 incidence in Africa","author":"Fulk","year":"2021","journal-title":"medRxiv"},{"key":"B10","unstructured":"How Data Is Adjusted2018"},{"key":"B11","volume-title":"Forecasting: Principles and Practice","author":"Hyndman","year":"2018"},{"key":"B12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v027.i03","article-title":"Automatic time series forecasting: the forecast package for R","volume":"27","author":"Hyndman","year":"2008","journal-title":"J. Stat. Softw."},{"key":"B13","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/srep33707","article-title":"Evaluating the performance of infectious disease forecasts: a comparison of climate-driven and seasonal dengue forecasts for Mexico","volume":"6","author":"Johansson","year":"2016","journal-title":"Sci. Rep."},{"key":"B14","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1016\/j.epidem.2019.01.002","article-title":"Near-term forecasts of influenza-like illness: an evaluation of autoregressive time series approaches","volume":"27","author":"Kandula","year":"2019","journal-title":"Epidemics"},{"key":"B15","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1471-2105-15-276","article-title":"Comparison of ARIMA and Random Forest time series models for prediction of avian influenza H5N1 outbreaks","volume":"15","author":"Kane","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"B16","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1016\/0304-4076(92)90104-Y","article-title":"Testing the null hypothesis of stationarity against the alternative of a unit root: How sure are we that economic time series have a unit root?","volume":"54","author":"Kwiatkowski","year":"1992","journal-title":"J. Econom."},{"key":"B17","doi-asserted-by":"publisher","first-page":"e8950","DOI":"10.2196\/publichealth.8950","article-title":"Accurate influenza monitoring and forecasting using novel internet data streams: a case study in the Boston Metropolis","volume":"4","author":"Lu","year":"2018","journal-title":"JMIR Public Health Surveill."},{"key":"B18","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-020-77275-9","article-title":"COVID-19 predictability in the United States using Google Trends time series","volume":"10","author":"Mavragani","year":"2020","journal-title":"Sci. Rep."},{"key":"B19","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-018-0126-7","article-title":"Forecasting AIDS prevalence in the United States using online search traffic data","volume":"5","author":"Mavragani","year":"","journal-title":"J. Big Data"},{"key":"B20","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-018-0140-9","article-title":"Infoveillance of infectious diseases in USA: STDs, tuberculosis, and hepatitis","volume":"5","author":"Mavragani","year":"","journal-title":"J. Big Data"},{"key":"B21","doi-asserted-by":"publisher","first-page":"e9366","DOI":"10.2196\/jmir.9366","article-title":"Assessing the methods, tools, and statistical approaches in Google Trends research: systematic review","volume":"20","author":"Mavragani","year":"","journal-title":"J. Med. Internet Res."},{"key":"B22","doi-asserted-by":"publisher","first-page":"e8726","DOI":"10.2196\/publichealth.8726","article-title":"Integrating smart health in the us health care system: infodemiology study of asthma monitoring in the google era","volume":"4","author":"Mavragani","year":"","journal-title":"JMIR Public Health Surveill."},{"key":"B23","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1475-2875-11-165","article-title":"Remote sensing-based time series models for malaria early warning in the highlands of Ethiopia","volume":"11","author":"Midekisa","year":"2012","journal-title":"Malar. J."},{"key":"B24","doi-asserted-by":"publisher","first-page":"e08386","DOI":"10.1016\/j.heliyon.2021.e08386","article-title":"Forecasting the future number of pertussis cases using data from Google Trends","volume":"7","author":"Nann","year":"2021","journal-title":"Heliyon"},{"key":"B25","unstructured":"COVID-19 Outbreak in Nigeria: Situation Reports2020"},{"key":"B26","doi-asserted-by":"publisher","first-page":"e109583","DOI":"10.1371\/journal.pone.0109583","article-title":"The use of google trends in health care research: a systematic review","volume":"9","author":"Nuti","year":"2014","journal-title":"PLoS ONE"},{"key":"B27","doi-asserted-by":"publisher","first-page":"3438","DOI":"10.1109\/BigData50022.2020.9377852","article-title":"\u201cGoogle trends analysis of covid-19 pandemic,\u201d","author":"Pan","year":"2020","journal-title":"2020 IEEE International Conference on Big Data (Big Data)"},{"key":"B28","volume-title":"R: A Language and Environment for Statistical Computing","year":"2020"},{"key":"B29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1471-2334-13-421","article-title":"The development of a combined mathematical model to forecast the incidence of hepatitis E in Shanghai, China","volume":"13","author":"Ren","year":"2013","journal-title":"BMC Infect. Dis."},{"key":"B30","doi-asserted-by":"publisher","first-page":"e10719","DOI":"10.7759\/cureus.10719","article-title":"The impact of COVID-19 on Italian web users: a quantitative analysis of regional hygiene interest and emotional response","volume":"12","author":"Rovetta","year":"2020","journal-title":"Cureus"},{"key":"B31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40504-017-0065-7","article-title":"Digital epidemiology: what is it, and where is it going?","volume":"14","author":"Salath\u00e9","year":"2018","journal-title":"Life Sci. Soc. Policy"},{"key":"B32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1017\/dmp.2021.249","article-title":"Suitability of Google Trends\u2122 for digital surveillance during ongoing COVID-19 epidemic: a case study from India","author":"Satpathy","year":"2021","journal-title":"Disaster Med. Public Health Prep"},{"key":"B33","doi-asserted-by":"publisher","first-page":"e3929","DOI":"10.1097\/MD.0000000000003929","article-title":"Time series analysis of influenza incidence in Chinese provinces from 2004 to 2011","volume":"95","author":"Song","year":"2016","journal-title":"Medicine"},{"key":"B34","doi-asserted-by":"publisher","first-page":"101306","DOI":"10.1016\/j.ttbdis.2019.101306","article-title":"Predicting tick-borne encephalitis using Google Trends","volume":"11","author":"Sulyok","year":"2020","journal-title":"Ticks Tick Borne Dis."},{"key":"B35","doi-asserted-by":"publisher","first-page":"e0165085","DOI":"10.1371\/journal.pone.0165085","article-title":"Dynamic forecasting of Zika epidemics using Google Trends","volume":"12","author":"Teng","year":"2017","journal-title":"PLoS ONE"},{"key":"B36","doi-asserted-by":"publisher","first-page":"e7219","DOI":"10.2196\/jmir.7219","article-title":"Too far to care? Measuring public attention and fear for Ebola using Twitter","volume":"19","author":"Van Lent","year":"2017","journal-title":"J. Med. Internet Res."},{"key":"B37","doi-asserted-by":"publisher","first-page":"e228","DOI":"10.2196\/jmir.7215","article-title":"Researching mental health disorders in the era of social media: systematic review","volume":"19","author":"Wongkoblap","year":"2017","journal-title":"J. Med. Internet Res."},{"key":"B38","first-page":"339","article-title":"Assessing the temporal modelling for prediction of dengue infection in northern and northeastern, Thailand","volume":"29","author":"Wongkoon","year":"2012","journal-title":"Trop. Biomed."},{"key":"B39","unstructured":"Coronavirus Update (Live): COVID-19 Virus Outbreak2022"},{"key":"B40","doi-asserted-by":"publisher","first-page":"e16184","DOI":"10.2196\/16184","article-title":"Detecting lung cancer trends by leveraging real-world and internet-based data: Infodemiology study","volume":"22","author":"Xu","year":"2020","journal-title":"J. Med. Internet Res."},{"key":"B41","doi-asserted-by":"publisher","first-page":"284","DOI":"10.1016\/j.envint.2018.05.016","article-title":"Using Google Trends and ambient temperature to predict seasonal influenza outbreaks","volume":"117","author":"Zhang","year":"2018","journal-title":"Environ. Int."}],"container-title":["Frontiers in Research Metrics and Analytics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/frma.2022.1003972\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,15]],"date-time":"2022-09-15T09:50:42Z","timestamp":1663235442000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/frma.2022.1003972\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,15]]},"references-count":41,"alternative-id":["10.3389\/frma.2022.1003972"],"URL":"https:\/\/doi.org\/10.3389\/frma.2022.1003972","relation":{},"ISSN":["2504-0537"],"issn-type":[{"value":"2504-0537","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,9,15]]},"article-number":"1003972"}}