{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,6]],"date-time":"2026-04-06T21:18:00Z","timestamp":1775510280644,"version":"3.50.1"},"reference-count":69,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2015,9,21]],"date-time":"2015-09-21T00:00:00Z","timestamp":1442793600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2015,10]]},"abstract":"<jats:p>Multidocument summarization addresses the selection of a compact subset of highly informative sentences, i.e., the summary, from a collection of textual documents. To perform sentence selection, two parallel strategies have been proposed: (a) apply general-purpose techniques relying on data mining or information retrieval techniques, and\/or (b) perform advanced linguistic analysis relying on semantics-based models (e.g., ontologies) to capture the actual sentence meaning. Since there is an increasing need for processing documents written in different languages, the attention of the research community has recently focused on summarizers based on strategy (a).<\/jats:p>\n          <jats:p>This article presents a novel multilingual summarizer, namely MWI-Sum (Multilingual Weighted Itemset-based Summarizer), that exploits an itemset-based model to summarize collections of documents ranging over the same topic. Unlike previous approaches, it extracts frequent weighted itemsets tailored to the analyzed collection and uses them to drive the sentence selection process. Weighted itemsets represent correlations among multiple highly relevant terms that are neglected by previous approaches. The proposed approach makes minimal use of language-dependent analyses. Thus, it is easily applicable to document collections written in different languages.<\/jats:p>\n          <jats:p>Experiments performed on benchmark and real-life collections, English-written and not, demonstrate that the proposed approach performs better than state-of-the-art multilingual document summarizers.<\/jats:p>","DOI":"10.1145\/2809786","type":"journal-article","created":{"date-parts":[[2015,9,22]],"date-time":"2015-09-22T12:31:00Z","timestamp":1442925060000},"page":"1-35","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":42,"title":["MWI-Sum"],"prefix":"10.1145","volume":"34","author":[{"given":"Elena","family":"Baralis","sequence":"first","affiliation":[{"name":"Politecnico di Torino, Torino (Italy)"}]},{"given":"Luca","family":"Cagliero","sequence":"additional","affiliation":[{"name":"Politecnico di Torino, Torino (Italy)"}]},{"given":"Alessandro","family":"Fiori","sequence":"additional","affiliation":[{"name":"IRCC: Institute for Cancer Research at Candiolo, Strada Provinciale, Candiolo (Italy)"}]},{"given":"Paolo","family":"Garza","sequence":"additional","affiliation":[{"name":"Politecnico di Torino, Torino (Italy)"}]}],"member":"320","published-online":{"date-parts":[[2015,9,21]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2013.01.017"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMPSAC.2015.15"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2245276.2245427"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2013.06.047"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2013.06.046"},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the Mining Complex Patterns Workshop. 18--29","author":"Baralis Elena Maria","year":"2011"},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization. 10--17","author":"Barzilay Regina","year":"1997"},{"key":"e_1_2_1_8_1","unstructured":"S. Bird E. Klein and E. Loper. 2009. Natural Language Processing with Python. O\u2019Reilly Media.   S. Bird E. Klein and E. Loper. 2009. Natural Language Processing with Python. O\u2019Reilly Media."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/297805.297827"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2013.69"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242586"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/345508.345566"},{"key":"e_1_2_1_13_1","volume-title":"CLASSY 2011 at TAC: Guided and multi-lingual summaries and evaluation metrics. In TAC\u201911: Proceedings of the the 2011 Text Analysis Conference (TAC\u201911)","author":"Conroy John","year":"2011"},{"key":"e_1_2_1_14_1","volume-title":"DUC 2004 Conference Proceedings.","author":"Conroy John M."},{"key":"e_1_2_1_15_1","unstructured":"Wordnet Lexical Database. 2012. Homepage. Available at http:\/\/wordnet.princeton.edu.  Wordnet Lexical Database. 2012. Homepage. Available at http:\/\/wordnet.princeton.edu."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1162\/089976698300017197"},{"key":"e_1_2_1_17_1","volume-title":"Understanding Conference. 2004. HTL\/NAACL Workshop on Text Summarization. http:\/\/duc.nist.gov\/pubs.html#2004","author":"Document"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1378773.1378800"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220355.1220412"},{"key":"e_1_2_1_20_1","volume-title":"Peter P. Chen and Leah Y. Wong (Eds.)","volume":"4512","author":"Fernando Fortes Garcia Lus","year":"2006"},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of the TAC 2011 Workshop. NIST","author":"Giannakopoulos George","year":"2011"},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the TAC 2011 Workshop. NIST.","author":"Giannakopoulos George","year":"2011"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the Text Analysis Conference (TAC\u201908)","author":"Gillick Dan","year":"2008"},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of the Text Analysis Conference (TAC\u201909)","author":"Gillick Dan","year":"2009"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the Workshop on Frequent Itemset Mining Implementations, FIMI\u201903 (CEUR-WS), Bart Goethals and Mohammed J. Zaki (Eds.)","volume":"90","author":"Grahne G\u00f6sta","year":"2003"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2600428.2609500"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2003.1198387"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-006-0059-1"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/335191.335372"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/WIIAT.2008.175"},{"key":"e_1_2_1_31_1","unstructured":"Jiri Hynek and Karel Jezek. 2003. Practical approach to automatic text summarization. In ELPUB.  Jiri Hynek and Karel Jezek. 2003. Practical approach to automatic text summarization. In ELPUB."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1014052.1014074"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/324133.324140"},{"key":"e_1_2_1_34_1","article-title":"Ontology enhanced clustering based summarization of medical documents","volume":"1","author":"Kogilavani A.","year":"2009","journal-title":"Int. J. Recent Trends Engin."},{"key":"e_1_2_1_35_1","unstructured":"Eugene Krapivin Mark Last and Marina Litvak. 2014. JRouge - Java ROUGE Implementation. Retrieved from https:\/\/bitbucket.org\/nocgod\/jrouge\/wiki\/Home\/.  Eugene Krapivin Mark Last and Marina Litvak. 2014. JRouge - Java ROUGE Implementation. Retrieved from https:\/\/bitbucket.org\/nocgod\/jrouge\/wiki\/Home\/."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835449.1835632"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073445.1073465"},{"key":"e_1_2_1_38_1","volume-title":"Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies -","volume":"1","author":"Lin Hui","year":"2011"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118108.1118117"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2020408.2020499"},{"key":"e_1_2_1_41_1","unstructured":"Michael McCandless Erik Hatcher and Otis Gospodnetic. 2010. Lucene in Action Second Edition: Covers Apache Lucene 3.0. Manning Publications Co. Greenwich CT.   Michael McCandless Erik Hatcher and Otis Gospodnetic. 2010. Lucene in Action Second Edition: Covers Apache Lucene 3.0. Manning Publications Co. Greenwich CT."},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of the ANLP\/NAACL Workshop on Automatic Summarization. 40--48","author":"Vibhu Mittal Jade Goldstein","year":"2000"},{"key":"e_1_2_1_43_1","first-page":"1","article-title":"Lexical cohesion computed by thesaural relations as an indicator of the structure of text","volume":"17","author":"Morris Jane","year":"1991","journal-title":"Comput. Linguist."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.5555\/645503.656256"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CBMS.2006.25"},{"key":"e_1_2_1_46_1","unstructured":"Mohsen Pourvali and Mohammad Saniee Abadeh. 2012. Automated text summarization base on lexicales chain and graph using of WordNet and Wikipedia knowledge base. CoRR abs\/1203.3586 (2012).  Mohsen Pourvali and Mohammad Saniee Abadeh. 2012. Automated text summarization base on lexicales chain and graph using of WordNet and Wikipedia knowledge base. CoRR abs\/1203.3586 (2012)."},{"key":"e_1_2_1_47_1","first-page":"2004","article-title":"Lexrank: Graph-based lexical centrality as salience in text summarization","volume":"22","author":"Radev Dragomir R.","year":"2004","journal-title":"J. Artif. Intell. Res."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2003.10.006"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/0-387-23529-9_5"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/276304.276313"},{"key":"e_1_2_1_51_1","unstructured":"N. Rotem. 2011. Open Text Summarizer (OTS). Retrieved from http:\/\/libots.sourceforge.net\/.  N. Rotem. 2011. Open Text Summarizer (OTS). Retrieved from http:\/\/libots.sourceforge.net\/."},{"key":"e_1_2_1_52_1","volume-title":"Proceedings of the 2011 Text Analysis Conference (TAC\u201911)","author":"Steinberger Josef","year":"2011"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2007.190723"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/1645953.1646179"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/775047.775053"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835843"},{"key":"e_1_2_1_57_1","unstructured":"TexLexAn. 2011. TexLexAn: An Open-Source Text Summarizer. Retrieved from http:\/\/texlexan.sourceforge.net\/.  TexLexAn. 2011. TexLexAn: An Open-Source Text Summarizer. Retrieved from http:\/\/texlexan.sourceforge.net\/."},{"key":"e_1_2_1_58_1","volume-title":"Analysis Conference. 2011. NIST Text Analysis Conference Summarization Track.","author":"Text"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICETET.2010.104"},{"key":"e_1_2_1_60_1","volume-title":"Proceedings of the 7th International Workshop on Frontiers in Handwriting Recognition. 443--452","author":"van Erp Merijn","year":"2000"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.5555\/1614049.1614095"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871476"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/1993077.1993078"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/2435209.2435211"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/347090.347149"},{"key":"e_1_2_1_66_1","unstructured":"Chia-Wei Wu and Chao-Lin Liu. 2003. Ontology-based text summarization for business news articles. In Computers and Their Applications Narayan C. Debnath (Ed.). ISCA 389--392.  Chia-Wei Wu and Chao-Lin Liu. 2003. Ontology-based text summarization for business news articles. In Computers and Their Applications Narayan C. Debnath (Ed.). ISCA 389--392."},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2010.11"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/2009916.2009954"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526925"}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2809786","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2809786","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:42:54Z","timestamp":1750225374000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2809786"}},"subtitle":["A Multilingual Summarizer Based on Frequent Weighted Itemsets"],"short-title":[],"issued":{"date-parts":[[2015,9,21]]},"references-count":69,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,10]]}},"alternative-id":["10.1145\/2809786"],"URL":"https:\/\/doi.org\/10.1145\/2809786","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"value":"1046-8188","type":"print"},{"value":"1558-2868","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,9,21]]},"assertion":[{"value":"2014-06-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-09-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}