{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T12:10:38Z","timestamp":1763467838582,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":40,"publisher":"ACM","license":[{"start":{"date-parts":[[2007,8,12]],"date-time":"2007-08-12T00:00:00Z","timestamp":1186876800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2007,8,12]]},"DOI":"10.1145\/1281192.1281220","type":"proceedings-article","created":{"date-parts":[[2007,12,7]],"date-time":"2007-12-07T19:19:41Z","timestamp":1197055181000},"page":"230-239","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":85,"title":["Feature selection methods for text classification"],"prefix":"10.1145","author":[{"given":"Anirban","family":"Dasgupta","sequence":"first","affiliation":[{"name":"Yahoo Research"}]},{"given":"Petros","family":"Drineas","sequence":"additional","affiliation":[{"name":"RPI"}]},{"given":"Boulos","family":"Harb","sequence":"additional","affiliation":[{"name":"U Penn"}]},{"given":"Vanja","family":"Josifovski","sequence":"additional","affiliation":[{"name":"Yahoo Research"}]},{"given":"Michael W.","family":"Mahoney","sequence":"additional","affiliation":[{"name":"Yahoo Research"}]}],"member":"320","published-online":{"date-parts":[[2007,8,12]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"20\n    Newsgroups Dataset. J. Rennie. http:\/\/people.csail.mit.edu\/jrennie\/20Newsgroups\/.  20 Newsgroups Dataset. J. Rennie. http:\/\/people.csail.mit.edu\/jrennie\/20Newsgroups\/."},{"key":"e_1_3_2_2_2_1","unstructured":"20\n    Newsgroups Dataset. UCI KDD Archive. http:\/\/kdd.ics.uci.edu\/databases\/20newsgroups\/20newsgroups.html.  20 Newsgroups Dataset. UCI KDD Archive. http:\/\/kdd.ics.uci.edu\/databases\/20newsgroups\/20newsgroups.html."},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/775047.775073"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(97)00063-5"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0893-6080(05)80010-3"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/795666.796576"},{"key":"e_1_3_2_2_7_1","unstructured":"A. Das and D. Kempe. Algorithms for subset selection in linear regression. Manuscript.  A. Das and D. Kempe. Algorithms for subset selection in linear regression. Manuscript."},{"key":"e_1_3_2_2_8_1","unstructured":"A. Dasgupta P. Drineas B. Harb R. Kumar and M. W. Mahoney. Sampling algorithms and coresets for lp regression. In Manuscript submitted for publication.   A. Dasgupta P. Drineas B. Harb R. Kumar and M. W. Mahoney. Sampling algorithms and coresets for l p regression. In Manuscript submitted for publication."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1008992.1009036"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1137\/0613074"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/1109557.1109682"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1018946025316"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944974"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/775047.775120"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/502512.502527"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015330.1015388"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944968"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/645326.649721"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/1046920.1046922"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(97)00043-X"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/3091622.3091662"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1012491419635"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/1005332.1005345"},{"key":"e_1_3_2_2_24_1","first-page":"258","volume-title":"Proceedings of the 16th International Conference on Machine Learning","author":"Mladenic D.","year":"1999","unstructured":"D. Mladenic and M. Grobelnik . Feature selection for unbalanced class distribution and Naive Bayes . In Proceedings of the 16th International Conference on Machine Learning , pages 258 -- 267 , 1999 . D. Mladenic and M. Grobelnik. Feature selection for unbalanced class distribution and Naive Bayes. In Proceedings of the 16th International Conference on Machine Learning, pages 258--267, 1999."},{"issue":"5","key":"e_1_3_2_2_25_1","first-page":"537","article-title":"The mathematics of learning: Dealing with data","volume":"50","author":"Poggio T.","year":"2003","unstructured":"T. Poggio and S. Smale . The mathematics of learning: Dealing with data . Notices of the AMS , 50 ( 5 ): 537 -- 544 , May 2003 . T. Poggio and S. Smale. The mathematics of learning: Dealing with data. Notices of the AMS, 50(5):537--544, May 2003.","journal-title":"Notices of the AMS"},{"key":"e_1_3_2_2_27_1","first-page":"131","volume-title":"Advances in Learning Theory: Methods, Models and Applications, NATO Science Series III: Computer and Systems Sciences","author":"Rifkin R.","year":"2003","unstructured":"R. Rifkin , G. Yeo , and T. Poggio . Regularized least-squares classification . In J. A. K. Suykens, G. Horvath, S. Basu, C. Micchelli, and J. Vandewalle, editors, Advances in Learning Theory: Methods, Models and Applications, NATO Science Series III: Computer and Systems Sciences , pages 131 -- 154 . VIOS Press , 2003 . R. Rifkin, G. Yeo, and T. Poggio. Regularized least-squares classification. In J. A. K. Suykens, G. Horvath, S. Basu, C. Micchelli, and J. Vandewalle, editors, Advances in Learning Theory: Methods, Models and Applications, NATO Science Series III: Computer and Systems Sciences, pages 131--154. VIOS Press, 2003."},{"key":"e_1_3_2_2_28_1","first-page":"356","volume-title":"Manuscript. Salton. Proceedings of the 14th Annual International ACM SIGIR Conference","author":"Rudelson M.","year":"1991","unstructured":"M. Rudelson and R. Vershynin . Sampling from large matrices: an approach through geometric functional analysis . Manuscript. Salton. Proceedings of the 14th Annual International ACM SIGIR Conference , pages 356 -- 358 , 1991 . M. Rudelson and R. Vershynin. Sampling from large matrices: an approach through geometric functional analysis. Manuscript. Salton. Proceedings of the 14th Annual International ACM SIGIR Conference, pages 356--358, 1991."},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/648300.755324"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"e_1_3_2_2_31_1","volume-title":"Matrix Perturbation Theory","author":"Stewart G. W.","year":"1990","unstructured":"G. W. Stewart and J. G. Sun . Matrix Perturbation Theory . Academic Press , New York , 1990 . G. W. Stewart and J. G. Sun. Matrix Perturbation Theory. Academic Press, New York, 1990."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1018628609742"},{"key":"e_1_3_2_2_33_1","volume-title":"Solutions of Ill-Posed Problems. W. H","author":"Tikhonov A. N.","year":"1977","unstructured":"A. N. Tikhonov and V. Y. Arsenin . Solutions of Ill-Posed Problems. W. H . Winston, Washington, D. C. , 1977 . A. N. Tikhonov and V. Y. Arsenin. Solutions of Ill-Posed Problems. W. H. Winston, Washington, D. C., 1977."},{"key":"e_1_3_2_2_34_1","volume-title":"Statistical Learning Theory","author":"Vapnik V. N.","year":"1998","unstructured":"V. N. Vapnik . Statistical Learning Theory . Wiley , New York , 1998 . V. N. Vapnik. Statistical Learning Theory. Wiley, New York, 1998."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/215206.215367"},{"key":"e_1_3_2_2_36_1","first-page":"88","volume-title":"AAAI Spring Symposium on Machine Learning in Information Access","author":"Yang Y.","year":"1996","unstructured":"Y. Yang . Sampling strategies and learning efficiency in text categorization . In AAAI Spring Symposium on Machine Learning in Information Access , pages 88 -- 95 , 1996 . Y. Yang. Sampling strategies and learning efficiency in text categorization. In AAAI Spring Symposium on Machine Learning in Information Access, pages 88--95, 1996."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/312624.312647"},{"key":"e_1_3_2_2_38_1","first-page":"412","volume-title":"Proceedings of the 14th International Conference on Machine Learning","author":"Yang Y.","year":"1997","unstructured":"Y. Yang and J. O. Pedersen . A comparative study on feature selection in text categorization . In Proceedings of the 14th International Conference on Machine Learning , pages 412 -- 420 , 1997 . Y. Yang and J. O. Pedersen. A comparative study on feature selection in text categorization. In Proceedings of the 14th International Conference on Machine Learning, pages 412--420, 1997."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860471"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.5555\/1018427.1020467"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1011441423217"}],"event":{"name":"KDD07: The 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data","ACM Association for Computing Machinery"],"location":"San Jose California USA","acronym":"KDD07"},"container-title":["Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1281192.1281220","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1281192.1281220","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T14:57:54Z","timestamp":1750258674000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1281192.1281220"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,8,12]]},"references-count":40,"alternative-id":["10.1145\/1281192.1281220","10.1145\/1281192"],"URL":"https:\/\/doi.org\/10.1145\/1281192.1281220","relation":{},"subject":[],"published":{"date-parts":[[2007,8,12]]},"assertion":[{"value":"2007-08-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}