{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T17:37:59Z","timestamp":1760981879512,"version":"3.41.0"},"reference-count":4,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2005,12,1]],"date-time":"2005-12-01T00:00:00Z","timestamp":1133395200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGKDD Explor. Newsl."],"published-print":{"date-parts":[[2005,12]]},"abstract":"<jats:p>In this paper, we present a general solution for the KDD Cup 2005 problem. It uses the Internet as source of knowledge and extends it to categorize very short (less than 5 words) documents with reasonable accuracy. Our approach consists of three main parts: i.) a central knowledge filter ii.) an on-demand web crawler and iii.) a very efficient categorizer system. Our solution obtained Creativity and Precision Runner-up Awards at the competition. The main idea of Ferrety Algorithm can be generalized for mapping one taxonomy to another if training documents are available.<\/jats:p>","DOI":"10.1145\/1117454.1117468","type":"journal-article","created":{"date-parts":[[2007,1,17]],"date-time":"2007-01-17T18:32:02Z","timestamp":1169058722000},"page":"111-116","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":18,"title":["The Ferrety algorithm for the KDD Cup 2005 problem"],"prefix":"10.1145","volume":"7","author":[{"given":"Zsolt T.","family":"Kardkov\u00e1cs","sequence":"first","affiliation":[{"name":"Budapest University of Technology and Economics, Hungary"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Domonkos","family":"Tikk","sequence":"additional","affiliation":[{"name":"Budapest University of Technology and Economics, Hungary"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zolt\u00e1n","family":"B\u00e1ns\u00e1ghi","sequence":"additional","affiliation":[{"name":"Budapest University of Technology and Economics, Hungary"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2005,12]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"HITEC categorizer online. http:\/\/categorizer.tmit.bme.hu.  HITEC categorizer online. http:\/\/categorizer.tmit.bme.hu."},{"key":"e_1_2_1_2_1","volume-title":"An Introduction to Modern Information Retrieval","author":"Salton G.","year":"1983","unstructured":"G. Salton and M. J. McGill . An Introduction to Modern Information Retrieval . McGraw-Hill , 1983 . G. Salton and M. J. McGill. An Introduction to Modern Information Retrieval. McGraw-Hill, 1983."},{"issue":"3","key":"e_1_2_1_3_1","first-page":"123","article-title":"A hierarchical text categorization approach and its application to FRT expansion","volume":"8","author":"Tikk D.","year":"2004","unstructured":"D. Tikk , Gy. Bir\u00f3 , and J. D. Yang . A hierarchical text categorization approach and its application to FRT expansion . Australian Journal of Intelligent Information Processing Systems , 8 ( 3 ): 123 -- 131 , 2004 . D. Tikk, Gy. Bir\u00f3, and J. D. Yang. A hierarchical text categorization approach and its application to FRT expansion. Australian Journal of Intelligent Information Processing Systems, 8(3):123--131, 2004.","journal-title":"Australian Journal of Intelligent Information Processing Systems"},{"key":"e_1_2_1_4_1","first-page":"283","volume-title":"Applied Research in Uncertainty Modelling and Analysis, number 20 in International Series in Intelligent Technologies","author":"Tikk D.","year":"2005","unstructured":"D. Tikk , Gy. Bir\u00f3 , and J. D. Yang . Experiments with a hierarchical text categorization method on WIPO patent collections . In N. O. Attok-Okine and B. M. Ayyub, editors, Applied Research in Uncertainty Modelling and Analysis, number 20 in International Series in Intelligent Technologies , pages 283 -- 302 . Springer , 2005 . D. Tikk, Gy. Bir\u00f3, and J. D. Yang. Experiments with a hierarchical text categorization method on WIPO patent collections. In N. O. Attok-Okine and B. M. Ayyub, editors, Applied Research in Uncertainty Modelling and Analysis, number 20 in International Series in Intelligent Technologies, pages 283--302. Springer, 2005."}],"container-title":["ACM SIGKDD Explorations Newsletter"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1117454.1117468","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1117454.1117468","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T16:18:45Z","timestamp":1750263525000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1117454.1117468"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,12]]},"references-count":4,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2005,12]]}},"alternative-id":["10.1145\/1117454.1117468"],"URL":"https:\/\/doi.org\/10.1145\/1117454.1117468","relation":{},"ISSN":["1931-0145","1931-0153"],"issn-type":[{"type":"print","value":"1931-0145"},{"type":"electronic","value":"1931-0153"}],"subject":[],"published":{"date-parts":[[2005,12]]},"assertion":[{"value":"2005-12-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}