{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T18:38:25Z","timestamp":1768588705638,"version":"3.49.0"},"reference-count":57,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2015,6,1]],"date-time":"2015-06-01T00:00:00Z","timestamp":1433116800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2015,6]]},"abstract":"<jats:p>\n            We address the problem of\n            <jats:italic>quantification<\/jats:italic>\n            , a supervised learning task whose goal is, given a class, to estimate the relative frequency (or\n            <jats:italic>prevalence<\/jats:italic>\n            ) of the class in a dataset of unlabeled items. Quantification has several applications in data and text mining, such as estimating the prevalence of positive reviews in a set of reviews of a given product or estimating the prevalence of a given support issue in a dataset of transcripts of phone calls to tech support. So far, quantification has been addressed by learning a general-purpose classifier, counting the unlabeled items that have been assigned the class, and tuning the obtained counts according to some heuristics. In this article, we depart from the tradition of using general-purpose classifiers and use instead a supervised learning model for\n            <jats:italic>structured prediction<\/jats:italic>\n            , capable of generating classifiers directly optimized for the (multivariate and nonlinear) function used for evaluating quantification accuracy. 
The experiments that we have run on 5,500 binary high-dimensional datasets (averaging more than 14,000 documents each) show that this method is more accurate, more stable, and more efficient than existing state-of-the-art quantification methods.\n          <\/jats:p>","DOI":"10.1145\/2700406","type":"journal-article","created":{"date-parts":[[2015,6,2]],"date-time":"2015-06-02T15:13:25Z","timestamp":1433258005000},"page":"1-27","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":42,"title":["Optimizing Text Quantifiers for Multivariate Loss Functions"],"prefix":"10.1145","volume":"9","author":[{"given":"Andrea","family":"Esuli","sequence":"first","affiliation":[{"name":"Consiglio Nazionale delle Ricerche, Pisa, Italy"}]},{"given":"Fabrizio","family":"Sebastiani","sequence":"additional","affiliation":[{"name":"Qatar Computing Research Institute, Doha, Qatar"}]}],"member":"320","published-online":{"date-parts":[[2015,6]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2011.03.019"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2012.12.052"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2014.07.032"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2012.07.022"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2010.75"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-013-0308-z"},{"key":"e_1_2_1_7_1","volume-title":"Every tweet counts?","author":"Ceron Andrea","year":"2014","unstructured":"Andrea Ceron, Luigi Curini, Stefano M. 
Iacus, and Giuseppe Porro. 2014. Every tweet counts? How sentiment analysis of social media can improve our knowledge of citizens\u2019 political preferences with an application to Italy and France. New Media & Society 16, 2 (2014), 340--358."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1642293.1642455"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220175.1220187"},{"key":"e_1_2_1_10_1","volume-title":"Thomas","author":"Cover Thomas M.","year":"1991","unstructured":"Thomas M. Cover and Joy A. Thomas. 1991. Elements of Information Theory. John Wiley & Sons, New York, NY."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1561\/0100000004"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1177\/0894439314521983"},{"key":"e_1_2_1_13_1","volume-title":"Isabel M. Kloumann, Catherine A. Bliss, and Christopher M. Danforth.","author":"Dodds Peter Sheridan","year":"2011","unstructured":"Peter Sheridan Dodds, Kameron Decker Harris, Isabel M. Kloumann, Catherine A. Bliss, and Christopher M. Danforth. 2011. Temporal patterns of happiness and information in a global social network: Hedonometrics and Twitter. 
PLoS ONE 6, 12 (2011)."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.2501\/S147078531020165X"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2010.94"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2516889"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-005-5256-4"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/11564096_55"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150423"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148216"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-008-0097-y"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150520"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220355.1220476"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.10335"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2012.05.028"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/188490.188557"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1540-5907.2009.00428.x"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1102351.1102399"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150429"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-009-5108-8"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1592761.1592783"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/312129.312285"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1214\/07-STS247"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijar.2012.06.013"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1093\/oxfordjournals.aje.a121122"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1002\/sim.4780081006
"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/215206.215366"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.5555\/1005332.1005345"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-25856-5_2"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/2390374.2390378"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2013.122"},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the 4th AAAI Conference on Weblogs and Social Media (ICWSM\u201910)","author":"O\u2019Connor Brendan","unstructured":"Brendan O\u2019Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith. 2010. From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of the 4th AAAI Conference on Weblogs and Social Media (ICWSM\u201910)."},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","unstructured":"Joaquin Qui\u00f1onero-Candela, Masashi Sugiyama, Anton Schwaighofer, and Neil D. Lawrence (Eds.). 2009. Dataset Shift in Machine Learning. MIT Press, Cambridge, MA.","DOI":"10.7551\/mitpress\/9780262170055.001.0001"},{"key":"e_1_2_1_45_1","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1111\/1467-9884.00120","article-title":"Estimating the prevalence of a rare disease: Adjusted maximum likelihood","volume":"47","author":"Rahme Elham","year":"1998","unstructured":"
Elham Rahme and Lawrence Joseph. 1998. Estimating the prevalence of a rare disease: Adjusted maximum likelihood. The Statistician 47 (1998), 149--158.","journal-title":"The Statistician"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"e_1_2_1_47_1","volume-title":"Encyclopedia of Machine Learning, Claude Sammut and Geoffrey I","author":"Sammut Claude","unstructured":"Claude Sammut and Michael Harries. 2011. Concept drift. In Encyclopedia of Machine Learning, Claude Sammut and Geoffrey I. Webb (Eds.). Springer, Berlin, 202--205."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-69812-8_82"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007649029923"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1609\/aimag.v29i3.2157"},{"key":"e_1_2_1_51_1","volume-title":"Density Estimation for Statistics and Data Analysis","author":"Silverman Bernard W.","unstructured":"Bernard W. Silverman. 1986. Density Estimation for Statistics and Data Analysis. 
Chapman and Hall, London, UK."},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/1830252.1830271"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015330.1015341"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2494091.2495972"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/1557019.1557117"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1561\/1500000008"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2010.03.021"},{"key":"e_1_2_1_58_1","volume-title":"Obuchowski","author":"Zhou Xiao-Hua","year":"2002","unstructured":"Xiao-Hua Zhou, Donna K. McClish, and Nancy A. Obuchowski. 2002. Statistical Methods in Diagnostic Medicine. Wiley, New York, NY."}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2700406","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2700406","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:07:44Z","timestamp":1750223264000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2700406"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,6]]},"references-count":57,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2015,6]]}},"alternative-id":["10.1145\/2700406"],"URL":"https:\/\/doi.org\/10.1145\/2700406","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,6]]},"assertion":[{"value":"2013-04-01","or
der":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-06-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}