{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T16:12:33Z","timestamp":1763395953300,"version":"3.45.0"},"reference-count":105,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2025,11,14]],"date-time":"2025-11-14T00:00:00Z","timestamp":1763078400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Ministry of Economic Development of the Russian Federation","award":["000000C313925P4G0002"],"award-info":[{"award-number":["000000C313925P4G0002"]}]},{"name":"Ivannikov Institute for System Programming of the Russian Academy of Sciences","award":["139-15-2025-011"],"award-info":[{"award-number":["139-15-2025-011"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["BDCC"],"abstract":"<jats:p>Natural Language Processing is being used for Disease Outbreak Prediction using news data. However, the available research focuses on predicting outbreaks for only specific diseases using disease-specific data such as COVID-19, Zika, SARS, MERS, and Ebola, etc. To address the challenge of disease outbreak prediction without relying on prior knowledge or introducing bias, this research proposes a model that leverages a news dataset devoid of specific disease names. This approach ensures generalizability and domain independence in identifying potential outbreaks. To facilitate supervised learning, spaCy was employed to annotate the dataset, enabling the classification of articles as either related or unrelated to disease outbreaks. LSTM, Bi-LSTM, and Bi-LSTM with a Multi-Head Attention mechanism, and transformer have been used and compared for the purpose of classification. Experimental results exhibit good prediction accuracy with Bi-LSTM with Multi-Head Attention and transformer on the test dataset. The work serves as a pro-active and unbiased approach to predict any disease outbreak without being specific to any disease.<\/jats:p>","DOI":"10.3390\/bdcc9110291","type":"journal-article","created":{"date-parts":[[2025,11,14]],"date-time":"2025-11-14T14:37:52Z","timestamp":1763131072000},"page":"291","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Attention-Driven Deep Learning for News-Based Prediction of Disease Outbreaks"],"prefix":"10.3390","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8603-6455","authenticated-orcid":false,"given":"Avneet Singh","family":"Gautam","sequence":"first","affiliation":[{"name":"School of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi 110067, India"}]},{"given":"Zahid","family":"Raza","sequence":"additional","affiliation":[{"name":"School of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi 110067, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8117-9142","authenticated-orcid":false,"given":"Maria","family":"Lapina","sequence":"additional","affiliation":[{"name":"Research Center for Trusted Artificial Intelligence, Ivannikov Institute for System Programming of the Russian Academy of Science, 109004 Moscow, Russia"},{"name":"Department of Computational Mathematics and Cybernetics, North-Caucasus Federal University, 355017 Stavropol, Russia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7066-0061","authenticated-orcid":false,"given":"Mikhail","family":"Babenko","sequence":"additional","affiliation":[{"name":"Research Center for Trusted Artificial Intelligence, Ivannikov Institute for System Programming of the Russian Academy of Science, 109004 Moscow, Russia"},{"name":"Department of Computational Mathematics and Cybernetics, North-Caucasus Federal University, 355017 Stavropol, Russia"}]}],"member":"1968","published-online":{"date-parts":[[2025,11,14]]},"reference":[{"key":"ref_1","unstructured":"(2025, August 19). Disease Outbreak News. Available online: https:\/\/www.who.int\/emergencies\/disease-outbreak-news."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"6561","DOI":"10.1007\/s10115-024-02192-6","article-title":"Disease Outbreak Prediction Using Natural Language Processing: A Review","volume":"66","author":"Gautam","year":"2024","journal-title":"Knowl. Inf. Syst."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Gautam, A.S., and Raza, Z. (2024, January 28\u201329). Autoencoder and Multi-Head Attention with GRU Based Approach to Predict Disease Outbreak Using News-Crawl 2019 Data. Proceedings of the 2024 International Conference on Computational Intelligence and Network Systems (CINS), Dubai, United Arab Emirates.","DOI":"10.1109\/CINS63881.2024.10864399"},{"key":"ref_4","unstructured":"(2025, August 19). Vector-Borne Diseases. Available online: https:\/\/www.who.int\/news-room\/fact-sheets\/detail\/vector-borne-diseases."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"e739","DOI":"10.1016\/S2542-5196(21)00141-8","article-title":"Digital and Technological Innovation in Vector-Borne Disease Surveillance to Predict, Detect, and Control Climate-Driven Outbreaks","volume":"5","author":"Pley","year":"2021","journal-title":"Lancet Planet. Health"},{"key":"ref_6","unstructured":"(2025, August 19). Coronavirus Disease ({COVID-19})\u2013World Health Organization. Available online: https:\/\/www.who.int\/emergencies\/diseases\/novel-coronavirus-2019."},{"key":"ref_7","unstructured":"(2025, August 19). New Research on Deaths and Economic Impact in the First Year of the COVID-19 Pandemic. Available online: https:\/\/siepr.stanford.edu\/news\/new-research-deaths-and-economic-impact-first-year-covid-19-pandemic."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Wang, W., Gurgone, A., Mart\u00ednez, H., Barbieri G\u00f3es, M.C., Gallo, E., Ker\u00e9nyi, \u00c1., Turco, E.M., Coburger, C., and Andrade, P.D.S. (2022). COVID-19 Mortality and Economic Losses: The Role of Policies and Structural Conditions. JRFM, 15.","DOI":"10.3390\/jrfm15080354"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Khotimah, P.H., Fachrur Rozie, A., Nugraheni, E., Arisal, A., Suwarningsih, W., and Purwarianti, A. (2020, January 18\u201320). Deep Learning for Dengue Fever Event Detection Using Online News. Proceedings of the 2020 International Conference on Radar, Antenna, Microwave, Electronics, and Telecommunications (ICRAMET), Tangerang, Indonesia.","DOI":"10.1109\/ICRAMET51080.2020.9298630"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Li, J., Sia, C.-L., Chen, Z., and Huang, W. (2021). Enhancing Influenza Epidemics Forecasting Accuracy in China with Both Official and Unofficial Online News Articles, 2019\u20132020. Int. J. Environ. Res. Public Health, 18.","DOI":"10.3390\/ijerph18126591"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Ibaraki, M., and Schwartz, F.W. (2020). Disease Surveillance Using Online News: Dengue and Zika in Tropical Countries. J. Biomed. Inform., 102.","DOI":"10.1016\/j.jbi.2020.103374"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1007\/s10479-017-2480-9","article-title":"Predicting Social Response to Infectious Disease Outbreaks from Internet-Based News Streams","volume":"263","author":"Fast","year":"2018","journal-title":"Ann. Oper. Res."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Azam, N., Tahir, B., and Mehmood, M.A. (2020, January 16\u201317). News-EDS: News Based Epidemic Disease Surveillance Using Machine Learning. Proceedings of the 2020 14th International Conference on Open Source Systems and Technologies (ICOSST), Lahore, Pakistan.","DOI":"10.1109\/ICOSST51357.2020.9333083"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Chakraborty, S., and Subramanian, L. (2016, January 7\u20139). Extracting Signals from News Streams for Disease Outbreak Prediction. Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Washington, DC, USA.","DOI":"10.1109\/GlobalSIP.2016.7906051"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Li, Z., Wang, B., Li, M., and Ma, W.-Y. (2005, January 15\u201319). A Probabilistic Model for Retrospective News Event Detection. Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil.","DOI":"10.1145\/1076034.1076055"},{"key":"ref_16","unstructured":"(2025, August 19). PULS Project: Surveillance of Global News Media. Available online: http:\/\/puls.cs.helsinki.fi\/static\/index.html."},{"key":"ref_17","first-page":"163","article-title":"Identifying Associations between Epidemiological Entities in News Data for Animal Disease Surveillance","volume":"5","author":"Valentin","year":"2021","journal-title":"Artif. Intell. Agric."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"e23305","DOI":"10.2196\/23305","article-title":"Effective Training Data Extraction Method to Improve Influenza Outbreak Prediction from Online News Articles: Deep Learning Model Study","volume":"9","author":"Jang","year":"2021","journal-title":"JMIR Med. Inform."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"378","DOI":"10.4103\/1995-7645.279651","article-title":"Using Twitter and Web News Mining to Predict COVID-19 Outbreak","volume":"13","author":"Jahanbin","year":"2020","journal-title":"Asian Pac. J. Trop. Med."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Liu, D., Clemente, L., Poirier, C., Ding, X., Chinazzi, M., Davis, J.T., Vespignani, A., and Santillana, M. (2025, August 19). A Machine Learning Methodology for Real-Time Forecasting of the 2019\u20132020 COVID-19 Outbreak Using Internet Searches, News Alerts, and Estimates from Mechanistic Models. Available online: https:\/\/www.jmir.org\/2020\/8\/e20285\/.","DOI":"10.2196\/20285"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Collier, N. (2010). What\u2019s Unusual in Online Disease Outbreak News?. J. Biomed. Sem., 1.","DOI":"10.1186\/2041-1480-1-2"},{"key":"ref_22","first-page":"983","article-title":"GODSN: Global News Driven Disease Outbreak and Surveillance","volume":"2006","author":"Khan","year":"2006","journal-title":"AMIA Annu. Symp. Proc."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"969","DOI":"10.1016\/j.ipm.2019.02.003","article-title":"Event Mining and Timeliness Analysis from Heterogeneous News Streams","volume":"56","author":"Mele","year":"2019","journal-title":"Inf. Process. Manag."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"100936","DOI":"10.1016\/j.cola.2019.100936","article-title":"EpidNews: Extracting, Exploring and Annotating News for Monitoring Animal Diseases","volume":"56","author":"Goel","year":"2020","journal-title":"J. Comput. Lang."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Ghosh, S., Chakraborty, P., Nsoesie, E.O., Cohn, E., Mekaru, S.R., Brownstein, J.S., and Ramakrishnan, N. (2017). Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks. Sci. Rep., 7.","DOI":"10.1038\/srep40841"},{"key":"ref_26","unstructured":"(2025, August 19). All the News 2-News Articles Dataset. Available online: https:\/\/components.one\/datasets\/all-the-news-2-news-articles-503."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Camacho-Collados, J., and Pilehvar, M.T. (November, January 31). On the Role of Text Preprocessing in Neural Network Architectures: An Evaluation Study on Text Categorization and Sentiment Analysis. Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, Brussels, Belgium.","DOI":"10.18653\/v1\/W18-5406"},{"key":"ref_28","unstructured":"Honnibal, M., Montani, I., Van Landeghem, S., and Boyd, A. (2025, August 19). Others Industrial-Strength Natural Language Processing. Available online: https:\/\/spacy.io."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Reimers, N., and Gurevych, I. (2019, January 3\u20137). Sentence-BERT: Sentence Embeddings Using Siamese BERT-Networks 2019. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Stroudsburg, PA, USA.","DOI":"10.18653\/v1\/D19-1410"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1007\/978-3-642-24797-2_4","article-title":"Long Short-Term Memory","volume":"Volume 385","author":"Graves","year":"2012","journal-title":"Supervised Sequence Labelling with Recurrent Neural Networks"},{"key":"ref_31","unstructured":"Bahdanau, D., Cho, K., and Bengio, Y. (2015, January 7\u20139). Neural Machine Translation by Jointly Learning to Align and Translate 2016. Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA."},{"key":"ref_32","first-page":"1","article-title":"Attention Is All You Need","volume":"Volume 30","author":"Vaswani","year":"2017","journal-title":"Proceedings of the Advances in Neural Information Processing Systems"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"taaa011","DOI":"10.1093\/jtm\/taaa011","article-title":"Potential for Global Spread of a Novel Coronavirus from China","volume":"27","author":"Bogoch","year":"2020","journal-title":"J. Travel. Med."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Fong, S.J., Dey, N., and Chaki, J. (2021). AI-Empowered Data Analytics for Coronavirus Epidemic Monitoring and Control. Artificial Intelligence for Coronavirus Outbreak, Springer. Springer Briefs in Applied Sciences and Technology.","DOI":"10.1007\/978-981-15-5936-5_3"},{"key":"ref_35","unstructured":"(2025, August 19). Ten Threats to Global Health in 2019. Available online: https:\/\/www.who.int\/news-room\/spotlight\/ten-threats-to-global-health-in-2019."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"189054","DOI":"10.1109\/ACCESS.2020.3031174","article-title":"Detecting Dengue\/Flu Infections Based on Tweets Using LSTM and Word Embedding","volume":"8","author":"Amin","year":"2020","journal-title":"IEEE Access"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"13","DOI":"10.52131\/jcsit.2021.0201.0007","article-title":"Dengue Cases Prediction Using Machine Learning Approach","volume":"2","author":"Aziz","year":"2021","journal-title":"Irasd J. Comp. Info Tech."},{"key":"ref_38","first-page":"1317","article-title":"Detecting Information on the Spread of Dengue on Twitter Using Artificial Neural Networks","volume":"67","author":"Amin","year":"2021","journal-title":"Comput. Mater. Contin."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/2049-9957-2-31","article-title":"Chinese Social Media Reaction to the MERS-CoV and Avian Influenza A(H7N9) Outbreaks","volume":"2","author":"Fung","year":"2013","journal-title":"Infect. Dis. Poverty"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"2321","DOI":"10.1007\/s11063-022-10795-9","article-title":"HFMD Cases Prediction Using Transfer One-Step-Ahead Learning","volume":"55","author":"Huang","year":"2023","journal-title":"Neural Process Lett."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Wang, Y., Cao, Z., Zeng, D., Wang, X., and Wang, Q. (2020). Using Deep Learning to Predict the Hand-Foot-and-Mouth Disease of Enterovirus A71 Subtype in Beijing from 2011 to 2018. Sci. Rep., 10.","DOI":"10.1038\/s41598-020-68840-3"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Meng, D., Xu, J., and Zhao, J. (2021). Analysis and Prediction of Hand, Foot and Mouth Disease Incidence in China Using Random Forest and XGBoost. PLoS ONE, 16.","DOI":"10.1371\/journal.pone.0261629"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1016\/j.idh.2017.08.005","article-title":"Twitter and Middle East Respiratory Syndrome, South Korea, 2015: A Multi-Lingual Study","volume":"23","author":"Fung","year":"2018","journal-title":"Infect. Dis. Health"},{"key":"ref_44","first-page":"227","article-title":"Stochastic and Spatio-Temporal Analysis of the Middle East Respiratory Syndrome Outbreak in South Korea, 2015","volume":"4","author":"Lee","year":"2019","journal-title":"Infect. Dis. Model."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Balashankar, A., Dugar, A., Subramanian, L., and Fraiberger, S. (2019, January 3\u20135). Reconstructing the MERS Disease Outbreak from News. Proceedings of the 2nd ACM SIGCAS Conference on Computing and Sustainable Societies, Accra, Ghana.","DOI":"10.1145\/3314344.3332498"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1016\/j.ajic.2015.02.023","article-title":"What Can We Learn about the Ebola Outbreak from Tweets?","volume":"43","author":"Odlum","year":"2015","journal-title":"Am. J. Infect. Control"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Joshi, A., Sparks, R., Karimi, S., Yan, S.-L.J., Chughtai, A.A., Paris, C., and MacIntyre, C.R. (2020). Automated Monitoring of Tweets for Early Detection of the 2014 Ebola Epidemic. PLoS ONE, 15.","DOI":"10.1371\/journal.pone.0230322"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1080\/02664763.2020.1825646","article-title":"A Non-Parametric Hawkes Model of the Spread of Ebola in West Africa","volume":"49","author":"Park","year":"2022","journal-title":"J. Appl. Stat."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"e65","DOI":"10.2196\/publichealth.8627","article-title":"Twitter-Based Influenza Detection After Flu Peak via Tweets with Indirect Information: Text Mining Study","volume":"4","author":"Wakamiya","year":"2018","journal-title":"JMIR Public Health Surveill."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Nsoesie, E.O., Oladeji, O., Abah, A.S.A., and Ndeffo-Mbah, M.L. (2021). Forecasting Influenza-like Illness Trends in Cameroon Using Google Search Data. Sci. Rep., 11.","DOI":"10.1038\/s41598-021-85987-9"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Kim, M., Chae, K., Lee, S., Jang, H.-J., and Kim, S. (2020). Automated Classification of Online Sources for Infectious Disease Occurrences Using Machine-Learning-Based Natural Language Processing Approaches. Int. J. Environ. Res. Public Health, 17.","DOI":"10.3390\/ijerph17249467"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"100357","DOI":"10.1016\/j.onehlt.2021.100357","article-title":"PADI-Web 3.0: A New Framework for Extracting and Disseminating Fine-Grained Information from the News for Animal Disease Surveillance","volume":"13","author":"Valentin","year":"2021","journal-title":"One Health"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"105163","DOI":"10.1016\/j.compag.2019.105163","article-title":"PADI-Web: A Multilingual Event-Based Surveillance System for Monitoring Animal Infectious Diseases","volume":"169","author":"Valentin","year":"2020","journal-title":"Comput. Electron. Agric."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Jang, B., Kim, M., Kim, I., and Kim, J.W. (2021). EagleEye: A Worldwide Disease-Related Topic Extraction System Using a Deep Learning Based Ranking Algorithm and Internet-Sourced Data. Sensors, 21.","DOI":"10.3390\/s21144665"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"1166","DOI":"10.1016\/j.ipm.2018.04.011","article-title":"Real-Time Processing of Social Media with SENTINEL: A Syndromic Surveillance System Incorporating Deep Learning for Health Classification","volume":"56","author":"Thapen","year":"2019","journal-title":"Inf. Process. Manag."},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Abbood, A., Ullrich, A., Busche, R., and Ghozzi, S. (2020). EventEpi\u2014A Natural Language Processing Framework for Event-Based Surveillance. PLoS Comput. Biol., 16.","DOI":"10.1371\/journal.pcbi.1008277"},{"key":"ref_57","unstructured":"(2025, August 19). EpiTator. Available online: https:\/\/github.com\/ecohealthalliance\/EpiTator."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"131522","DOI":"10.1109\/ACCESS.2020.3009058","article-title":"Recurrent Neural Networks with TF-IDF Embedding Technique for Detection and Classification in Tweets of Dengue Disease","volume":"8","author":"Amin","year":"2020","journal-title":"IEEE Access"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1016\/j.neucom.2021.10.035","article-title":"Time Series Predicting of COVID-19 Based on Deep Learning","volume":"468","author":"Alassafi","year":"2022","journal-title":"Neurocomputing"},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Nguyen, V.-H., Tuyet-Hanh, T.T., Mulhall, J., Minh, H.V., Duong, T.Q., Chien, N.V., Nhung, N.T.T., Lan, V.H., Minh, H.B., and Cuong, D. (2022). Deep Learning Models for Forecasting Dengue Fever Based on Climate Data in Vietnam. PLoS Negl. Trop. Dis., 16.","DOI":"10.1371\/journal.pntd.0010509"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"e42519","DOI":"10.2196\/42519","article-title":"Long Short-Term Memory\u2013Based Prediction of the Spread of Influenza-Like Illness Leveraging Surveillance, Weather, and Twitter Data: Model Development and Validation","volume":"25","author":"Athanasiou","year":"2023","journal-title":"J. Med. Internet Res."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Zhu, X., Fu, B., Yang, Y., Ma, Y., Hao, J., Chen, S., Liu, S., Li, T., Liu, S., and Guo, W. (2019). Attention-Based Recurrent Neural Network for Influenza Epidemic Prediction. BMC Bioinform., 20.","DOI":"10.1186\/s12859-019-3131-8"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Kim, Y., Park, C.-R., Ahn, J.-P., and Jang, B. (2023). COVID-19 Outbreak Prediction Using Seq2Seq + Attention and Word2Vec Keyword Time Series Data. PLoS ONE, 18.","DOI":"10.1371\/journal.pone.0284298"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Dai, S., and Han, L. (2023). Influenza Surveillance with Baidu Index and Attention-Based Long Short-Term Memory Model. PLoS ONE, 18.","DOI":"10.1371\/journal.pone.0280834"},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"e44238","DOI":"10.2196\/44238","article-title":"Deep-Learning Model for Influenza Prediction from Multisource Heterogeneous Data in a Megacity: Model Development and Evaluation","volume":"25","author":"Yang","year":"2023","journal-title":"J. Med. Internet Res."},{"key":"ref_66","unstructured":"Wu, N., Green, B., Ben, X., and O\u2019Banion, S. (2020). Deep Transformer Models for Time Series Forecasting: The Influenza Prevalence Case 2020. arXiv."},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Li, L., Jiang, Y., and Huang, B. (2021). Long-Term Prediction for Temporal Propagation of Seasonal Influenza Using Transformer-Based Model. J. Biomed. Inform., 122.","DOI":"10.1016\/j.jbi.2021.103894"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Li, Y., Wang, Y., and Ma, K. (2022). Integrating Transformer and GCN for COVID-19 Forecasting. Sustainability, 14.","DOI":"10.3390\/su141610393"},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Yom-Tov, E. (2015, January 18\u201320). Ebola Data from the Internet: An Opportunity for Syndromic Surveillance or a News Event?. Proceedings of the 5th International Conference on Digital Health 2015, Florence, Italy.","DOI":"10.1145\/2750511.2750512"},{"key":"ref_70","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1016\/j.ymeth.2017.07.027","article-title":"Large-Scale Machine Learning of Media Outlets for Understanding Public Reactions to Nation-Wide Viral Infection Outbreaks","volume":"129","author":"Choi","year":"2017","journal-title":"Methods"},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"McGough, S.F., Brownstein, J.S., Hawkins, J.B., and Santillana, M. (2017). Forecasting Zika Incidence in the 2016 Latin America Outbreak Combining Traditional Disease Surveillance with Search, Social Media, and News Report Data. PLoS Negl. Trop Dis., 11.","DOI":"10.1371\/journal.pntd.0005295"},{"key":"ref_72","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1111\/j.1467-9868.2011.00771.x","article-title":"Regression Shrinkage and Selection via the Lasso: A Retrospective","volume":"73","author":"Tibshirani","year":"2011","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"ref_73","doi-asserted-by":"crossref","first-page":"1057","DOI":"10.1016\/j.cell.2019.07.018","article-title":"Travel Surveillance and Genomics Uncover a Hidden Zika Outbreak during the Waning Epidemic","volume":"178","author":"Grubaugh","year":"2019","journal-title":"Cell"},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Yong, B., and Owen, L. (2016, January 22\u201323). Dynamical Transmission Model of MERS-CoV in Two Areas. Proceedings of the AIP Conference Proceedings, Bandung, Indonesia.","DOI":"10.1063\/1.4942993"},{"key":"ref_75","doi-asserted-by":"crossref","first-page":"424","DOI":"10.2307\/1912791","article-title":"Investigating Causal Relations by Econometric Models and Cross-Spectral Methods","volume":"37","author":"Granger","year":"1969","journal-title":"Econometrica"},{"key":"ref_76","unstructured":"Ramos, J. (2003, January 9\u201315). Using TF-IDF to Determine Word Relevance in Document Queries. Proceedings of the First Instructional Conference on Machine Learning, West New York, NJ, USA."},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"012018","DOI":"10.1088\/1757-899X\/261\/1\/012018","article-title":"Comparisons and Selections of Features and Classifiers for Short Text Classification","volume":"261","author":"Wang","year":"2017","journal-title":"IOP Conf. Ser. Mater. Sci. Eng."},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"61","DOI":"10.14257\/ijdta.2014.7.1.06","article-title":"KNN Based Machine Learning Approach for Text and Document Mining","volume":"7","author":"Bijalwan","year":"2014","journal-title":"Int. J. Database Theory Appl."},{"key":"ref_79","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1109\/72.159058","article-title":"Multilayer Perceptron, Fuzzy Sets, and Classification","volume":"3","author":"Pal","year":"1992","journal-title":"IEEE Trans. Neural Netw."},{"key":"ref_80","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random Forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_81","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/3-540-59119-2_166","article-title":"A Desicion-Theoretic Generalization of on-Line Learning and an Application to Boosting","volume":"Volume 904","year":"1995","journal-title":"Computational Learning Theory"},{"key":"ref_82","unstructured":"Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2019, January 2\u20137). BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North, Minneapolis, MN, USA."},{"key":"ref_83","unstructured":"Zhang, X., and LeCun, Y. (2016). Text Understanding from Scratch. arXiv."},{"key":"ref_84","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1162\/tacl_a_00051","article-title":"Enriching Word Vectors with Subword Information","volume":"5","author":"Bojanowski","year":"2017","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_85","unstructured":"(2025, August 19). International Health Regulations (2005) (IHR). Available online: https:\/\/www.who.int\/teams\/ihr."},{"key":"ref_86","unstructured":"(2025, August 19). Weekly Bulletin on Outbreak and Other Emergencies: Week 29: 14\u201320 July 2025. Available online: https:\/\/www.afro.who.int\/countries\/democratic-republic-of-congo\/publication\/weekly-bulletin-outbreak-and-other-emergencies-week-29-14-20-july-2025."},{"key":"ref_87","unstructured":"(2025, August 19). Nigeria Centre for Disease Control and Prevention, Available online: https:\/\/ncdc.gov.ng\/."},{"key":"ref_88","doi-asserted-by":"crossref","first-page":"4446","DOI":"10.1093\/bioinformatics\/btac497","article-title":"BioCaster in 2021: Automatic Disease Outbreaks Detection from Global News Media","volume":"38","author":"Meng","year":"2022","journal-title":"Bioinformatics"},{"key":"ref_89","doi-asserted-by":"crossref","unstructured":"Liu, F., Shareghi, E., Meng, Z., Basaldella, M., and Collier, N. (2021, January 6\u201311). Self-Alignment Pretraining for Biomedical Entity Representations 2021. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Virtual Meeting.","DOI":"10.18653\/v1\/2021.naacl-main.334"},{"key":"ref_90","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1007\/978-3-030-91669-5_12","article-title":"Multilingual Epidemic Event Extraction","volume":"Volume 13133","author":"Ke","year":"2021","journal-title":"Towards Open and Trustworthy Digital Societies"},{"key":"ref_91","doi-asserted-by":"crossref","unstructured":"Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzm\u00e1n, F., Grave, E., Ott, M., Zettlemoyer, L., and Stoyanov, V. (2020, January 5\u201310). Unsupervised Cross-Lingual Representation Learning at Scale 2020. Proceedings of the 58th annual meeting of the association for computational linguistics, Virtual Meeting.","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"ref_92","unstructured":"Mutuvi, S., Doucet, A., Lejeune, G., and Odeo, M. (2020, January 11\u201316). A Dataset for Multilingual Epidemiological Event Extraction. Proceedings of the 12th Conference on Language Resources and Evaluation, Marseille, France."},{"key":"ref_93","unstructured":"Lejeune, G. (2025, August 19). Daniel_corpus: A Corpus for Evaluating Multilingual Epidemic Surveillance Systems (2089 Annotated Documents in 5 Languages). Available online: https:\/\/aclanthology.org\/2021.ranlp-1.138.pdf."},{"key":"ref_94","doi-asserted-by":"crossref","unstructured":"Menya, E., Roche, M., Interdonato, R., and Owuor, D. (2022, January 20\u201325). Enriching Epidemiological Thematic Features for Disease Surveillance Corpora Classification. Proceedings of the Thirteenth Language Resources and Evaluation Conference, Marseille, France.","DOI":"10.2139\/ssrn.4471792"},{"key":"ref_95","doi-asserted-by":"crossref","unstructured":"Parekh, T., Mac, A., Yu, J., Dong, Y., Shahriar, S., Liu, B., Yang, E., Huang, K.-H., Wang, W., and Peng, N. (2024). Event Detection from Social Media for Epidemic Prediction 2024. arXiv.","DOI":"10.18653\/v1\/2024.naacl-long.322"},{"key":"ref_96","doi-asserted-by":"crossref","unstructured":"Wadden, D., Wennberg, U., Luan, Y., and Hajishirzi, H. (2019). Entity, Relation, and Event Extraction with Contextualized Span Representations 2019. arXiv.","DOI":"10.18653\/v1\/D19-1585"},{"key":"ref_97","doi-asserted-by":"crossref","unstructured":"Du, X., and Cardie, C. (2020). Event Extraction by Answering (Almost) Natural Questions 2021. arXiv.","DOI":"10.18653\/v1\/2020.emnlp-main.49"},{"key":"ref_98","doi-asserted-by":"crossref","unstructured":"Hsu, I.-H., Huang, K.-H., Boschee, E., Miller, S., Natarajan, P., Chang, K.-W., and Peng, N. (2022, January 10\u201315). DEGREE: A Data-Efficient Generation-Based Event Extraction Model. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Seattle, DC, USA.","DOI":"10.18653\/v1\/2022.naacl-main.138"},{"key":"ref_99","unstructured":"Hsu, I.-H., Huang, K.-H., Zhang, S., Cheng, W., Natarajan, P., Chang, K.-W., and Peng, N. (2022). TAGPRIME: A Unified Framework for Relational Structure Extraction 2022. arXiv."},{"key":"ref_100","doi-asserted-by":"crossref","first-page":"1075","DOI":"10.1057\/s41599-024-03589-y","article-title":"Leveraging Social Media Data for Pandemic Detection and Prediction","volume":"11","author":"Shi","year":"2024","journal-title":"Humanit. Soc. Sci. Commun."},{"key":"ref_101","doi-asserted-by":"crossref","unstructured":"Cui, Y., Che, W., Liu, T., Qin, B., Wang, S., and Hu, G. (2020, January 16\u201320). Revisiting Pre-Trained Models for Chinese Natural Language Processing. Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, Virtual Meeting.","DOI":"10.18653\/v1\/2020.findings-emnlp.58"},{"key":"ref_102","unstructured":"(2025, August 19). Keras 3: Deep Learning for Humans. Available online: https:\/\/github.com\/fchollet\/keras."},{"key":"ref_103","unstructured":"Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., and Isard, M. (2016, January 2\u20134). Tens\u00a7orflow: A System for Large-Scale Machine Learning. Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Savannah, GA, USA."},{"key":"ref_104","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1109\/TSIPN.2025.3530346","article-title":"DeepTrace: Learning to Optimize Contact Tracing in Epidemic Networks with Graph Neural Networks. IEEE Transactions on Signal and Information Processing over Networks","volume":"11","author":"Tan","year":"2025","journal-title":"IEEE Trans. Signal Inf. Process. over Networks"},{"key":"ref_105","first-page":"106","article-title":"Contagion Source Detection in Epidemic and Infodemic Outbreaks: Mathematical Analysis and Network Algorithms","volume":"13","author":"Tan","year":"2023","journal-title":"Found. Trends\u00ae Netw."}],"container-title":["Big Data and Cognitive Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-2289\/9\/11\/291\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T16:08:02Z","timestamp":1763395682000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-2289\/9\/11\/291"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,14]]},"references-count":105,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2025,11]]}},"alternative-id":["bdcc9110291"],"URL":"https:\/\/doi.org\/10.3390\/bdcc9110291","relation":{},"ISSN":["2504-2289"],"issn-type":[{"type":"electronic","value":"2504-2289"}],"subject":[],"published":{"date-parts":[[2025,11,14]]}}}