{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T16:55:36Z","timestamp":1771001736737,"version":"3.50.1"},"reference-count":24,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2012,9,1]],"date-time":"2012-09-01T00:00:00Z","timestamp":1346457600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100000266","name":"Engineering and Physical Sciences Research Council","doi-asserted-by":"publisher","award":["DTA\/SB1826"],"award-info":[{"award-number":["DTA\/SB1826"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2012,9]]},"abstract":"<jats:p>\n            We present a general methodology for inferring the occurrence and magnitude of an event or phenomenon by exploring the rich amount of unstructured textual information on the social part of the Web. Having geo-tagged user posts on the microblogging service of\n            <jats:italic>Twitter<\/jats:italic>\n            as our input data, we investigate two case studies. The first consists of a benchmark problem, where actual levels of rainfall in a given location and time are inferred from the content of\n            <jats:italic>tweets<\/jats:italic>\n            . The second one is a real-life task, where we infer regional Influenza-like Illness rates in the effort of detecting timely an emerging epidemic disease. Our analysis builds on a statistical learning framework, which performs sparse learning via the bootstrapped version of LASSO to select a consistent subset of textual features from a large amount of candidates. In both case studies, selected features indicate close semantic correlation with the target topics and inference, conducted by regression, has a significant performance, especially given the short length --approximately one year-- of Twitter\u2019s data time series.\n          <\/jats:p>","DOI":"10.1145\/2337542.2337557","type":"journal-article","created":{"date-parts":[[2012,10,12]],"date-time":"2012-10-12T20:56:02Z","timestamp":1350075362000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":98,"title":["Nowcasting Events from the Social Web with Statistical Learning"],"prefix":"10.1145","volume":"3","author":[{"given":"Vasileios","family":"Lampos","sequence":"first","affiliation":[{"name":"University of Bristol, UK"}]},{"given":"Nello","family":"Cristianini","sequence":"additional","affiliation":[{"name":"University of Bristol, UK"}]}],"member":"320","published-online":{"date-parts":[[2012,9]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/WI-IAT.2010.63"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390161"},{"key":"e_1_2_1_3_1","unstructured":"Bartlett P. L. Mendelson S. and Neeman J. 2009. l1-regularized linear regression: Persistence and oracle inequalities. Tech. rep. UC-Berkeley. Bartlett P. L. Mendelson S. and Neeman J. 2009. l1-regularized linear regression: Persistence and oracle inequalities. Tech. rep. UC-Berkeley."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2010.12.007"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1018054314350"},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the International Conference on Bioinformatics and Computational Biology. 340--346","author":"Corley C. D.","unstructured":"Corley , C. D. , Mikler , A. R. , Singh , K. P. , and Cook , D. J . 2009. Monitoring influenza trends through mining social media . In Proceedings of the International Conference on Bioinformatics and Computational Biology. 340--346 . Corley, C. D., Mikler, A. R., Singh, K. P., and Cook, D. J. 2009. Monitoring influenza trends through mining social media. In Proceedings of the International Conference on Bioinformatics and Computational Biology. 340--346."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1964858.1964874"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1176344552"},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Efron B. and Tibshirani R. J. 1993. An Introduction to the Bootstrap. Chapman & Hall. Efron B. and Tibshirani R. J. 1993. An Introduction to the Bootstrap . Chapman & Hall.","DOI":"10.1007\/978-1-4899-4541-9"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1214\/009053604000000067"},{"key":"e_1_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Ginsberg J. Mohebbi M. H. Patel R. S. Brammer L. Smolinski M. S. and Brilliant L. 2008. Detecting influenza epidemics using search engine query data. Nature 457 7232 1012--1014. Ginsberg J. Mohebbi M. H. Patel R. S. Brammer L. Smolinski M. S. and Brilliant L. 2008. Detecting influenza epidemics using search engine query data. Nature 457 7232 1012--1014.","DOI":"10.1038\/nature07634"},{"key":"e_1_2_1_12_1","first-page":"7","article-title":"An introduction to variable and feature selection","volume":"3","author":"Guyon I.","year":"2003","unstructured":"Guyon , I. and Elisseeff , A. 2003 . An introduction to variable and feature selection . J. Mach. Learn. Resear. 3 , 7 -- 8 , 1157--1182. Guyon, I. and Elisseeff, A. 2003. An introduction to variable and feature selection. J. Mach. Learn. Resear. 3, 7--8, 1157--1182.","journal-title":"J. Mach. Learn. Resear."},{"key":"e_1_2_1_13_1","unstructured":"Jenkins G. J. Perry M. C. and Prior M. J. 2008. The Climate of the United Kingdom and Recent Trends. Met Office Hadley Centre Exeter UK. Jenkins G. J. Perry M. C. and Prior M. J. 2008. The Climate of the United Kingdom and Recent Trends . Met Office Hadley Centre Exeter UK."},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the 2nd IAPR Workshop on Cognitive Information Processing. IEEE Press, 411--416","author":"Lampos V.","unstructured":"Lampos , V. and Cristianini , N . 2010. Tracking the flu pandemic by monitoring the Social Web . In Proceedings of the 2nd IAPR Workshop on Cognitive Information Processing. IEEE Press, 411--416 . Lampos, V. and Cristianini, N. 2010. Tracking the flu pandemic by monitoring the Social Web. In Proceedings of the 2nd IAPR Workshop on Cognitive Information Processing. IEEE Press, 411--416."},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Springer, 599--602","author":"Lampos V.","unstructured":"Lampos , V. , De Bie , T. , and Cristianini , N . 2010. Flu detector---Tracking epidemics on Twitter . In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Springer, 599--602 . Lampos, V., De Bie, T., and Cristianini, N. 2010. Flu detector---Tracking epidemics on Twitter. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Springer, 599--602."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1214\/09-AOS683"},{"key":"e_1_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Manning C. D. Raghavan P. and Sch\u00fctze H. 2008. Introduction to Information Retrieval. Cambridge University Press. Manning C. D. Raghavan P. and Sch\u00fctze H. 2008. Introduction to Information Retrieval . Cambridge University Press.","DOI":"10.1017\/CBO9780511809071"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1561\/1500000011"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1086\/593098"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1108\/eb046814"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772690.1772777"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.2517-6161.1996.tb02080.x"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the International AAAI Conference on Weblogs and Social Media. 178--185","author":"Tumasjan A.","unstructured":"Tumasjan , A. , Sprenger , T. O. , Sandner , P. G. , and Welpe , I. M . 2010. Predicting elections with Twitter: What 140 characters reveal about political sentiment . In Proceedings of the International AAAI Conference on Weblogs and Social Media. 178--185 . Tumasjan, A., Sprenger, T. O., Sandner, P. G., and Welpe, I. M. 2010. Predicting elections with Twitter: What 140 characters reveal about political sentiment. In Proceedings of the International AAAI Conference on Weblogs and Social Media. 178--185."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/1248547.1248637"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2337542.2337557","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2337542.2337557","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:49:00Z","timestamp":1750236540000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2337542.2337557"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,9]]},"references-count":24,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,9]]}},"alternative-id":["10.1145\/2337542.2337557"],"URL":"https:\/\/doi.org\/10.1145\/2337542.2337557","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,9]]},"assertion":[{"value":"2011-04-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-09-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2012-09-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}