{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:08:06Z","timestamp":1750306086678,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":60,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,8,23]],"date-time":"2017-08-23T00:00:00Z","timestamp":1503446400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","award":["DP140103157"],"award-info":[{"award-number":["DP140103157"]}],"id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,8,23]]},"DOI":"10.1145\/3106426.3106440","type":"proceedings-article","created":{"date-parts":[[2017,8,10]],"date-time":"2017-08-10T12:12:36Z","timestamp":1502367156000},"page":"654-661","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Topical term weighting based on extended random sets for relevance feature selection"],"prefix":"10.1145","author":[{"given":"Abdullah Semran","family":"Alharbi","sequence":"first","affiliation":[{"name":"Queensland University of Technology, Brisbane, QLD, Australia"}]},{"given":"Yuefeng","family":"Li","sequence":"additional","affiliation":[{"name":"Queensland University of Technology, Brisbane, QLD, Australia"}]},{"given":"Yue","family":"Xu","sequence":"additional","affiliation":[{"name":"Queensland University of Technology, Brisbane, QLD, Australia"}]}],"member":"320","published-online":{"date-parts":[[2017,8,23]]},"reference":[{"volume-title":"Mining text data","author":"Aggarwal Charu C","key":"e_1_3_2_1_1_1","unstructured":"Charu C Aggarwal and ChengXiang Zhai . 2012. A survey of text clustering algorithms . In Mining text data . Springer , 77--128. Charu C Aggarwal and ChengXiang Zhai. 2012. A survey of text clustering algorithms. In Mining text data. Springer, 77--128."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-03680-9_46"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/WI-IAT.2014.77"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/WI-IAT.2014.53"},{"key":"e_1_3_2_1_5_1","first-page":"1964","article-title":"A comprehensive empirical comparison of modern supervised classification and feature selection methods for text categorization","volume":"65","author":"Aphinyanaphongs Yindalon","year":"2014","unstructured":"Yindalon Aphinyanaphongs , Lawrence D Fu , Zhiguo Li , Eric R Peskin , Efstratios Efstathiadis , Constantin F Aliferis , and Alexander Statnikov . 2014 . A comprehensive empirical comparison of modern supervised classification and feature selection methods for text categorization . Journal of the Association for IST 65 , 10 (2014), 1964 -- 1987 . Yindalon Aphinyanaphongs, Lawrence D Fu, Zhiguo Li, Eric R Peskin, Efstratios Efstathiadis, Constantin F Aliferis, and Alexander Statnikov. 2014. A comprehensive empirical comparison of modern supervised classification and feature selection methods for text categorization. Journal of the Association for IST 65, 10 (2014), 1964--1987.","journal-title":"Journal of the Association for IST"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.4304\/jait.1.1.4-20"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/WI.2016.0025"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2133806.2133826"},{"key":"e_1_3_2_1_9_1","volume-title":"Latent dirichlet allocation. the Journal of machine Learning research 3","author":"Blei David M","year":"2003","unstructured":"David M Blei , Andrew Y Ng , and Michael I Jordan . 2003. Latent dirichlet allocation. the Journal of machine Learning research 3 ( 2003 ), 993--1022. David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. the Journal of machine Learning research 3 (2003), 993--1022."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/345508.345543"},{"key":"e_1_3_2_1_11_1","unstructured":"Allison June-Barlow Chaney and David M Blei. 2012. Visualizing Topic Models.. In ICWSM.  Allison June-Barlow Chaney and David M Blei. 2012. Visualizing Topic Models.. In ICWSM."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"e_1_3_2_1_13_1","volume-title":"Mar","author":"Forman George","year":"2003","unstructured":"George Forman . 2003. An extensive empirical study of feature selection metrics for text classification. Journal of machine learning research 3 , Mar ( 2003 ), 1289--1305. George Forman. 2003. An extensive empirical study of feature selection metrics for text classification. Journal of machine learning research 3, Mar (2003), 1289--1305."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2013.30"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-11749-2_15"},{"key":"e_1_3_2_1_16_1","first-page":"1629","article-title":"Pattern-based topics for document modelling in information filtering","volume":"27","author":"Gao Yang","year":"2015","unstructured":"Yang Gao , Yue Xu , and Yuefeng Li . 2015 . Pattern-based topics for document modelling in information filtering . IEEE TKDE 27 , 6 (2015), 1629 -- 1642 . Yang Gao, Yue Xu, and Yuefeng Li. 2015. Pattern-based topics for document modelling in information filtering. IEEE TKDE 27, 6 (2015), 1629--1642.","journal-title":"IEEE TKDE"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277811"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-006-0059-1"},{"key":"e_1_3_2_1_19_1","volume-title":"Unsupervised learning by probabilistic latent semantic analysis. Machine learning 42, 1--2","author":"Hofmann Thomas","year":"2001","unstructured":"Thomas Hofmann . 2001. Unsupervised learning by probabilistic latent semantic analysis. Machine learning 42, 1--2 ( 2001 ), 177--196. Thomas Hofmann. 2001. Unsupervised learning by probabilistic latent semantic analysis. Machine learning 42, 1--2 (2001), 177--196."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2086737.2086740"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1080\/03772063.2015.1021385"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/775047.775067"},{"key":"e_1_3_2_1_23_1","volume-title":"Farhad Oroumchian, and Hassan Seyed Razi.","author":"Keikha Mostafa","year":"2008","unstructured":"Mostafa Keikha , Narjes Sharif Razavian , Farhad Oroumchian, and Hassan Seyed Razi. 2008 . Document representation and quality of text: An analysis. In Survey of Text Mining II. Springer , 219--232. Mostafa Keikha, Narjes Sharif Razavian, Farhad Oroumchian, and Hassan Seyed Razi. 2008. Document representation and quality of text: An analysis. In Survey of Text Mining II. Springer, 219--232."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-011-9168-6"},{"volume-title":"Uncertainty and vagueness in knowledge based systems: numerical methods","author":"Kruse Rudolf","key":"e_1_3_2_1_25_1","unstructured":"Rudolf Kruse , Erhard Schwecke , and Jochen Heinsohn . 2012. Uncertainty and vagueness in knowledge based systems: numerical methods . Springer Science & Business Media . Rudolf Kruse, Erhard Schwecke, and Jochen Heinsohn. 2012. Uncertainty and vagueness in knowledge based systems: numerical methods. Springer Science & Business Media."},{"key":"e_1_3_2_1_26_1","unstructured":"Simon Lacoste-Julien Fei Sha and Michael I Jordan. 2009. DiscLDA: Discriminative learning for dimensionality reduction and classification. In Advances in neural information processing systems. 897--904.   Simon Lacoste-Julien Fei Sha and Michael I Jordan. 2009. DiscLDA: Discriminative learning for dimensionality reduction and classification. In Advances in neural information processing systems. 897--904."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.110"},{"volume-title":"RSFDGrC'03","author":"Yuefeng Li.","key":"e_1_3_2_1_28_1","unstructured":"Yuefeng Li. 2003. Extended random sets for knowledge discovery in information systems . In RSFDGrC'03 . Springer , 524--532. Yuefeng Li. 2003. Extended random sets for knowledge discovery in information systems. In RSFDGrC'03. Springer, 524--532."},{"key":"e_1_3_2_1_29_1","first-page":"1656","article-title":"Relevance feature discovery for text mining","volume":"27","author":"Li Yuefeng","year":"2015","unstructured":"Yuefeng Li , Abdulmohsen Algarni , Mubarak Albathan , Yan Shen , and Moch Arif Bijaksana . 2015 . Relevance feature discovery for text mining . IEEE TKDE 27 , 6 (2015), 1656 -- 1669 . Yuefeng Li, Abdulmohsen Algarni, Mubarak Albathan, Yan Shen, and Moch Arif Bijaksana. 2015. Relevance feature discovery for text mining. IEEE TKDE 27, 6 (2015), 1656--1669.","journal-title":"IEEE TKDE"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-010-9154-4"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835900"},{"key":"e_1_3_2_1_32_1","first-page":"488","article-title":"An evaluation on feature selection for text clustering","volume":"3","author":"Liu Tao","year":"2003","unstructured":"Tao Liu , Shengping Liu , Zheng Chen , and Wei-Ying Ma . 2003 . An evaluation on feature selection for text clustering . In Icml , Vol. 3. 488 -- 495 . Tao Liu, Shengping Liu, Zheng Chen, and Wei-Ying Ma. 2003. An evaluation on feature selection for text clustering. In Icml, Vol. 3. 488--495.","journal-title":"Icml"},{"key":"e_1_3_2_1_33_1","volume-title":"Web N-gram Workshop. Citeseer, 30","author":"Macdonald Craig","year":"2010","unstructured":"Craig Macdonald and Iadh Ounis . 2010 . Global statistics in proximity weighting models . In Web N-gram Workshop. Citeseer, 30 . Craig Macdonald and Iadh Ounis. 2010. Global statistics in proximity weighting models. In Web N-gram Workshop. Citeseer, 30."},{"volume-title":"Introduction to information retrieval","author":"Manning Christopher D","key":"e_1_3_2_1_34_1","unstructured":"Christopher D Manning , Prabhakar Raghavan , and Hinrich Sch\u00fctze . 2008. Introduction to information retrieval . Cambridge University Press . Christopher D Manning, Prabhakar Raghavan, and Hinrich Sch\u00fctze. 2008. Introduction to information retrieval. Cambridge University Press."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2484028.2484096"},{"key":"e_1_3_2_1_36_1","volume-title":"Mallet: A machine learning for language toolkit.","author":"McCallum Andrew Kachites","year":"2002","unstructured":"Andrew Kachites McCallum . 2002 . Mallet: A machine learning for language toolkit. (2002). Andrew Kachites McCallum. 2002. Mallet: A machine learning for language toolkit. (2002)."},{"volume-title":"Theory of random sets","author":"Molchanov Ilya","key":"e_1_3_2_1_37_1","unstructured":"Ilya Molchanov . 2006. Theory of random sets . Springer Science & Business Media . Ilya Molchanov. 2006. Theory of random sets. Springer Science & Business Media."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2431211.2431218"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-24752-4_14"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.4249\/scholarpedia.3383"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1108\/eb046814"},{"volume-title":"The probabilistic relevance framework: BM25 and beyond","author":"Robertson Stephen","key":"e_1_3_2_1_42_1","unstructured":"Stephen Robertson and Hugo Zaragoza . 2009. The probabilistic relevance framework: BM25 and beyond . Now Publishers Inc . Stephen Robertson and Hugo Zaragoza. 2009. The probabilistic relevance framework: BM25 and beyond. Now Publishers Inc."},{"key":"e_1_3_2_1_43_1","volume-title":"The TREC 2002 Filtering Track Report. In TREC","volume":"2002","author":"Robertson Stephen E","year":"2002","unstructured":"Stephen E Robertson and Ian Soboroff . 2002 . The TREC 2002 Filtering Track Report. In TREC , Vol. 2002 . 5. Stephen E Robertson and Ian Soboroff. 2002. The TREC 2002 Filtering Track Report. In TREC, Vol. 2002. 5."},{"key":"e_1_3_2_1_44_1","volume-title":"Relevance feedback in information retrieval. THE SMART RETRIEVAL SYSTEM","author":"Rocchio Joseph John","year":"1971","unstructured":"Joseph John Rocchio . 1971. Relevance feedback in information retrieval. THE SMART RETRIEVAL SYSTEM ( 1971 ). Joseph John Rocchio. 1971. Relevance feedback in information retrieval. THE SMART RETRIEVAL SYSTEM (1971)."},{"key":"e_1_3_2_1_45_1","volume-title":"ICML","volume":"99","author":"Scott Sam","year":"1999","unstructured":"Sam Scott and Stan Matwin . 1999 . Feature engineering for text classification . In ICML , Vol. 99 . Citeseer, 379--388. Sam Scott and Stan Matwin. 1999. Feature engineering for text classification. In ICML, Vol. 99. Citeseer, 379--388."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860481"},{"key":"e_1_3_2_1_47_1","volume-title":"Probabilistic topic models. Handbook of latent semantic analysis 427, 7","author":"Steyvers Mark","year":"2007","unstructured":"Mark Steyvers and Tom Griffiths . 2007. Probabilistic topic models. Handbook of latent semantic analysis 427, 7 ( 2007 ), 424--440. Mark Steyvers and Tom Griffiths. 2007. Probabilistic topic models. Handbook of latent semantic analysis 427, 7 (2007), 424--440."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCIS.2009.5291818"},{"key":"e_1_3_2_1_49_1","volume-title":"Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society","author":"Tibshirani Robert","year":"1996","unstructured":"Robert Tibshirani . 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society ( 1996 ), 267--288. Robert Tibshirani. 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society (1996), 267--288."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2007.86"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148204"},{"key":"e_1_3_2_1_52_1","volume-title":"Individual comparisons by ranking methods. Biometrics bulletin 1, 6","author":"Wilcoxon Frank","year":"1945","unstructured":"Frank Wilcoxon . 1945. Individual comparisons by ranking methods. Biometrics bulletin 1, 6 ( 1945 ), 80--83. Frank Wilcoxon. 1945. Individual comparisons by ranking methods. Biometrics bulletin 1, 6 (1945), 80--83."},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2006.50"},{"key":"e_1_3_2_1_54_1","volume-title":"WI'04","author":"Wu Sheng-Tang","year":"2004","unstructured":"Sheng-Tang Wu , Yuefeng Li , Yue Xu , Binh Pham , and Phoebe Chen . 2004 . Automatic pattern-taxonomy extraction for web mining . In WI'04 . IEEE, 242--248. Sheng-Tang Wu, Yuefeng Li, Yue Xu, Binh Pham, and Phoebe Chen. 2004. Automatic pattern-taxonomy extraction for web mining. In WI'04. IEEE, 242--248."},{"key":"e_1_3_2_1_55_1","volume-title":"Mining Topically Coherent Patterns for Unsupervised Extractive Multi-document Summarization. In WI'16","author":"Wu Yutong","year":"2016","unstructured":"Yutong Wu , Yuefeng Li , Yue Xu , and Wei Huang . 2016 . Mining Topically Coherent Patterns for Unsupervised Extractive Multi-document Summarization. In WI'16 . IEEE, 129--136. Yutong Wu, Yuefeng Li, Yue Xu, and Wei Huang. 2016. Mining Topically Coherent Patterns for Unsupervised Extractive Multi-document Summarization. In WI'16. IEEE, 129--136."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2011.02.003"},{"key":"e_1_3_2_1_57_1","first-page":"412","article-title":"A comparative study on feature selection in text categorization","volume":"97","author":"Yang Yiming","year":"1997","unstructured":"Yiming Yang and Jan O Pedersen . 1997 . A comparative study on feature selection in text categorization . In Icml , Vol. 97. 412 -- 420 . Yiming Yang and Jan O Pedersen. 1997. A comparative study on feature selection in text categorization. In Icml, Vol. 97. 412--420.","journal-title":"Icml"},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/WAINA.2008.137"},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2010.211"},{"key":"e_1_3_2_1_60_1","unstructured":"Weidong Zhu and Yongmin Lin. 2013. Using GINI-index for feature weighting in text categorization. (2013).  Weidong Zhu and Yongmin Lin. 2013. Using GINI-index for feature weighting in text categorization. (2013)."}],"event":{"name":"WI '17: International Conference on Web Intelligence 2017","sponsor":["SIGAI ACM Special Interest Group on Artificial Intelligence","TCII IEEE Computer Society Technical Committee on Intelligent Informatics","Web Intelligence Consortium"],"location":"Leipzig Germany","acronym":"WI '17"},"container-title":["Proceedings of the International Conference on Web Intelligence"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3106426.3106440","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3106426.3106440","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:30:17Z","timestamp":1750217417000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3106426.3106440"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,8,23]]},"references-count":60,"alternative-id":["10.1145\/3106426.3106440","10.1145\/3106426"],"URL":"https:\/\/doi.org\/10.1145\/3106426.3106440","relation":{},"subject":[],"published":{"date-parts":[[2017,8,23]]},"assertion":[{"value":"2017-08-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}