{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,4]],"date-time":"2022-04-04T23:28:48Z","timestamp":1649114928242},"reference-count":8,"publisher":"World Scientific Pub Co Pte Lt","issue":"01","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Unc. Fuzz. Knowl. Based Syst."],"published-print":{"date-parts":[[2003,2]]},"abstract":"<jats:p> This paper discusses the notion of Uncertainty, which has a prominent place in the theory and experimental practice of modern Physics. It argues that the awareness of Uncertainty may also be of tremendous importance to the field of Information Retrieval, and in particular Text Categorization. <\/jats:p><jats:p> As an application of Uncertainty in Text Categorization, a new criterion for Term Selection is described, which is based on the Uncertainty in Term Frequency across categories. This criterion allows to distinguish between low-quality (or \"noisy\") and high-quality (\"stiff\") terms. <\/jats:p><jats:p> We describe an experiment investigating the effect of eliminating noisy and stiff terms in the context of text classification. In the experiment we applied the Rocchio and Winnow classification algorithms to a collection of newspaper items, a mono-classified subset of the well-known Reuters 21578 corpus. <\/jats:p><jats:p> This investigation shows that both the local elimination of noisy terms and the global elimination of stiff terms can be used for Term Selection in Text Categorization. <\/jats:p>","DOI":"10.1142\/s0218488503001977","type":"journal-article","created":{"date-parts":[[2003,2,26]],"date-time":"2003-02-26T10:09:39Z","timestamp":1046254179000},"page":"115-137","source":"Crossref","is-referenced-by-count":2,"title":["UNCERTAINTY AND TERM SELECTION IN TEXT CATEGORIZATION"],"prefix":"10.1142","volume":"11","author":[{"given":"CHARLES M. E. E.","family":"PETERS","sequence":"first","affiliation":[{"name":"Department of Computer Science,  University of Nijmegen, Nijmegen, The Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"CORNELIS H. A.","family":"KOSTER","sequence":"additional","affiliation":[{"name":"Department of Computer Science,  University of Nijmegen, Nijmegen, The Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2011,11,21]]},"reference":[{"key":"rf2","doi-asserted-by":"publisher","DOI":"10.1145\/183422.183423"},{"key":"rf5","first-page":"100","volume":"13","author":"Cohen W. W.","journal-title":"ACM Transactions on Information Systems"},{"key":"rf6","unstructured":"R. P.\u00a0Feynman, R. B.\u00a0Leighton and M.\u00a0Sands\u00a0II (Addison Wesley Publishing Co., 1989)."},{"key":"rf9","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010844028087"},{"key":"rf12","unstructured":"B. B.\u00a0Mandelbrot, The fractal geometry of nature (Freeman an Company, 1982)\u00a0pp. 344\u2013348."},{"key":"rf15","unstructured":"F.\u00a0Reif, Statistical and thermal physics (MacGraw-Hill Kogakusha, LTD., 1965)\u00a0pp. 201\u2013232."},{"key":"rf16","unstructured":"J. J.\u00a0Rocchio, The Smart Retrieval system - experiments in automatic document processing, ed. G.\u00a0Salton (Prentice - Hall, Englewood Cliffs, NJ, 1971)\u00a0pp. 313\u2013323."},{"key":"rf17","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"}],"container-title":["International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218488503001977","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,7]],"date-time":"2019-08-07T03:02:26Z","timestamp":1565146946000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218488503001977"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2003,2]]},"references-count":8,"journal-issue":{"issue":"01","published-online":{"date-parts":[[2011,11,21]]},"published-print":{"date-parts":[[2003,2]]}},"alternative-id":["10.1142\/S0218488503001977"],"URL":"https:\/\/doi.org\/10.1142\/s0218488503001977","relation":{},"ISSN":["0218-4885","1793-6411"],"issn-type":[{"value":"0218-4885","type":"print"},{"value":"1793-6411","type":"electronic"}],"subject":[],"published":{"date-parts":[[2003,2]]}}}