{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,9]],"date-time":"2026-03-09T11:53:32Z","timestamp":1773057212412,"version":"3.50.1"},"reference-count":75,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2019,10,16]],"date-time":"2019-10-16T00:00:00Z","timestamp":1571184000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2020,11,30]]},"abstract":"<jats:p>Epidemic intelligence deals with the detection of outbreaks using formal (such as hospital records) and informal sources (such as user-generated text on the web) of information. In this survey, we discuss approaches for epidemic intelligence that use textual datasets, referring to it as \u201ctext-based epidemic intelligence.\u201d We view past work in terms of two broad categories: health mention classification (selecting relevant text from a large volume) and health event detection (predicting epidemic events from a collection of relevant text). The focus of our discussion is the underlying computational linguistic techniques in the two categories. The survey also provides details of the state of the art in annotation techniques, resources, and evaluation strategies for epidemic intelligence.<\/jats:p>","DOI":"10.1145\/3361141","type":"journal-article","created":{"date-parts":[[2019,10,16]],"date-time":"2019-10-16T18:55:35Z","timestamp":1571252135000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":23,"title":["Survey of Text-based Epidemic Intelligence"],"prefix":"10.1145","volume":"52","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2200-9703","authenticated-orcid":false,"given":"Aditya","family":"Joshi","sequence":"first","affiliation":[{"name":"CSIRO Data61, Epping, NSW, Australia"}]},{"given":"Sarvnaz","family":"Karimi","sequence":"additional","affiliation":[{"name":"CSIRO Data61, Epping, NSW, Australia"}]},{"given":"Ross","family":"Sparks","sequence":"additional","affiliation":[{"name":"CSIRO Data61, Epping, NSW, Australia"}]},{"given":"C\u00e9cile","family":"Paris","sequence":"additional","affiliation":[{"name":"CSIRO Data61, Epping, NSW, Australia"}]},{"given":"C. Raina","family":"Macintyre","sequence":"additional","affiliation":[{"name":"Kirby Institute, University of New South Wales, Australia"}]}],"member":"320","published-online":{"date-parts":[[2019,10,16]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the Australasian Language Technology Association Workshop","author":"Aamer Hafsah","year":"2016"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the International Workshop on Digital Disease Detection Using Social Media 2017 (DDDSM\u201917)","author":"Adam Dillon C.","year":"2017"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2016.05.005"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1186\/s40249-015-0090-9"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1568--1576","author":"Aramaki Eiji","year":"2011"},{"key":"e_1_2_1_6_1","volume-title":"C","author":"Arsevska Elena","year":"2016"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0199960"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-1612"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.2740"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.3109\/17538157.2011.590258"},{"key":"e_1_2_1_11_1","volume-title":"Jordan","author":"Blei David M.","year":"2003"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkh061"},{"key":"e_1_2_1_13_1","first-page":"S28","article-title":"Prediction and surveillance of influenza epidemics","volume":"194","author":"Boyle Justin R.","year":"2011","journal-title":"Med. J. Austr."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMp0900702"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2004.04.001"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0139701"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-015-0434-x"},{"key":"e_1_2_1_18_1","volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics. Association for Computational Linguistics, 215--222","author":"Collier Nigel","year":"2010"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-007-9019-7"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the 3rd International Workshop on Health Document Text Mining and Information Analysis (LOUHI\u201911)","author":"Conway Mike","year":"2011"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2013.04.003"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2005.91"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/SECON.2017.7925400"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3308560.3316741"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 3rd International Joint Conference on Natural Language Processing.","author":"Doan Son","year":"2008"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1197\/jamia.M2544"},{"key":"e_1_2_1_27_1","first-page":"3","article-title":"The use of social media in public health surveillance","volume":"6","author":"Chun-Hai Fung Isaac","year":"2015","journal-title":"West. Pac. Surveill. Resp. J."},{"key":"e_1_2_1_28_1","volume-title":"Detecting influenza epidemics using search engine query data. Nature 457, 7232","author":"Ginsberg Jeremy","year":"2009"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2527031.2527049"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1006\/knac.1993.1008"},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the 26th International Conference on Computational Linguistics. 76--86","author":"Hayate I. S. O.","year":"2016"},{"key":"e_1_2_1_32_1","first-page":"7","article-title":"What is syndromic surveillance","volume":"53","author":"Henning Kelly J.","year":"2004","journal-title":"Morbid. Mortal. Week. Rep."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1177\/0033354917709784"},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the Australasian Language Technology Association Workshop","author":"Huang Pin","year":"2016"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0004378"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1108"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-2917"},{"key":"e_1_2_1_38_1","volume-title":"A Practical Guide to Sentiment Analysis","author":"Joshi Aditya"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-5015"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P15-1160"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2719920"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3186055"},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 789--795","author":"Lamb Alex","year":"2013"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3038912.3052622"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/JBHI.2015.2403839"},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the 4th International Workshop on Cross-lingual Information Access.","author":"Lejeune Ga\u00ebl","year":"2010"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1055\/s-0038-1634945"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2008.08.004"},{"key":"e_1_2_1_49_1","volume-title":"Foundations of Statistical Natural Language Processing","author":"Manning Christopher D."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.5555\/1572364.1572385"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1142\/9789814749411_0046"},{"key":"e_1_2_1_52_1","volume-title":"Proceedings of the International Society for Disease Surveillance.","author":"Okhmatovskaia A."},{"key":"e_1_2_1_53_1","volume-title":"Proceedings of the International Florida Artificial Intelligence Research Society Conference. 412--416","author":"Olszewski Robert T.","year":"2003"},{"key":"e_1_2_1_54_1","first-page":"265","article-title":"You are what you Tweet: Analyzing Twitter for public health","volume":"20","author":"Paul Michael J.","year":"2011","journal-title":"International AAAI Conference on Web and Social Media"},{"key":"e_1_2_1_55_1","first-page":"16","article-title":"A model for mining public health topics from Twitter","volume":"11","author":"Paul Michael J.","year":"2012","journal-title":"Health"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.2102"},{"key":"e_1_2_1_57_1","volume-title":"Proceedings of the Conference on Artificial Intelligence (AAAI). 136--142","author":"Sadilek Adam","year":"2012"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1142\/9789814749411_0054"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3038912.3052588"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1080\/07408170903468597"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1080\/03610918.2016.1186182"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2014.01.002"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1111\/1468-0009.12038"},{"key":"e_1_2_1_64_1","volume-title":"Aryel","author":"Wagner Michael M.","year":"2011"},{"key":"e_1_2_1_65_1","volume-title":"Proceedings of the International Workshop on Digital Disease Detection Using Social Media 2017 (DDDSM\u201917)","author":"Wang Chen-Kai","year":"2017"},{"key":"e_1_2_1_66_1","volume-title":"Proceedings of the AAAI Workshop on the World Wide Web and Public Health Intelligence","volume":"31","author":"Wang Shiliang","year":"2014"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W18-5904"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0172457"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.4955"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1007\/11760146_22"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijid.2017.07.020"},{"key":"e_1_2_1_72_1","volume-title":"Proceedings of the International Workshop on Describing Medical Web Resources (DrMED\u201908)","author":"Yangarber Roman","year":"2008"},{"key":"e_1_2_1_73_1","volume-title":"Language Resources and Evaluation Conference. 475--482","author":"Yates Andrew","year":"2014"},{"key":"e_1_2_1_74_1","volume-title":"Proceedings of the Workshop on Biomedical Natural Language Processing. 164--170","author":"Yepes Antonio Jimeno","year":"2015"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3186050"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3361141","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3361141","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:12:51Z","timestamp":1750201971000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3361141"}},"subtitle":["A Computational Linguistics Perspective"],"short-title":[],"issued":{"date-parts":[[2019,10,16]]},"references-count":75,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2020,11,30]]}},"alternative-id":["10.1145\/3361141"],"URL":"https:\/\/doi.org\/10.1145\/3361141","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,10,16]]},"assertion":[{"value":"2018-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-10-16","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}