{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:53:34Z","timestamp":1754157214853,"version":"3.41.2"},"reference-count":32,"publisher":"Emerald","issue":"5","license":[{"start":{"date-parts":[[1998,12,1]],"date-time":"1998-12-01T00:00:00Z","timestamp":912470400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[1998,12,1]]},"abstract":"<jats:p>The authors propose a client\u2010side agent for exploring and categorizing documents on the World Wide Web. As the user browses the Web using a usual Web browser, this agent is designed to aid the user by classifying the documents the user finds most interesting into clusters. The agent carries out the task completely automatically and autonomously, with as little user intervention as the user desires. The principal novel components in this agent that make it possible are a scalable hierarchical clustering algorithm and a taxonomic label generator. In this paper, the overall architecture of this agent is described and the details of the algorithms within its key components are discussed.<\/jats:p>","DOI":"10.1108\/10662249810241257","type":"journal-article","created":{"date-parts":[[2002,7,27]],"date-time":"2002-07-27T02:09:54Z","timestamp":1027735794000},"page":"387-399","source":"Crossref","is-referenced-by-count":2,"title":["A client\u2010side Web agent for document categorization"],"prefix":"10.1108","volume":"8","author":[{"given":"Daniel","family":"Boley","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Maria","family":"Gini","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kyle","family":"Hastings","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bamshad","family":"Mobasher","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jerry","family":"Moore","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","reference":[{"key":"key2022021920090472800_b1","doi-asserted-by":"crossref","unstructured":"Ackerman, M., Billsus, D., Goffney, S., Hettich, S., Khoo, G., Kim, D.J., Klefstad, R., Lowe, C., Ludeman, A., Mutamatsu, J., Omori, K., Pazzani, M., Semler, D., Starr, B. and Yap, P. (1997, \u201cLearning probabilistic user profiles\u201d, AI Magazine, Vol. 18 No. 2, pp. 47\u201056.","DOI":"10.1080\/01626620.1997.10463363"},{"key":"key2022021920090472800_b2","unstructured":"Agrawal, A., Mannila, H., Srikant, R., Toivonen, H. and Verkamo, A. (1996, \u201cFast discovery of association rules\u201d, in Fayyad, U., Piatetsky\u2010Shapiro, G., Smyth, P. and Uthurusamy, R. (Eds), Advances in Knowledge Discovery and Data Mining, AAAI\/MIT Press, Boston, MA, pp. 307\u201028."},{"key":"key2022021920090472800_b3","doi-asserted-by":"crossref","unstructured":"Armstrong, R., Freitag, D., Joachims, T. and Mitchell, T. (1995, \u201cWebWatcher: a learning apprentice for the World Wide Web\u201d, Proceedings, AAAI Spring Symposium on Information Gathering from Heterogeneous, Distributed Environments. AAAI Press..","DOI":"10.21236\/ADA640219"},{"key":"key2022021920090472800_b4","unstructured":"Balabanovic, M., Shoham, Y. and Yun, Y. (1995, \u201cAn adaptive agent for automated Web browsing\u201d, Journal of Visual Communication and Image Representation, Vol. 6 No. 4."},{"key":"key2022021920090472800_b5","doi-asserted-by":"crossref","unstructured":"Berry, M.W., Dumais, S.T. and O\u2019Brien, G.W. (1995, \u201cUsing linear algebra for intelligent information retrieval\u201d, SIAM Review, Vol. 37, pp. 573\u201095.","DOI":"10.1137\/1037127"},{"key":"key2022021920090472800_b6","unstructured":"Boley, D. (1997,\u201dPrincipal direction divisive partitioning\u201d, Technical Report TR\u201097\u2010056, Department of Computer Science, University of Minnesota, Minneapolis, MN, to appear inData Mining and Knowledge Discovery."},{"key":"key2022021920090472800_b7","unstructured":"Boley, D. (1998, \u201cHierarchical taxonomies using divisive partitioning\u201d, Technical Report TR\u201098\u2010012, Department of Computer Science, University of Minnesota, MN."},{"key":"key2022021920090472800_b8","doi-asserted-by":"crossref","unstructured":"Broder, A.Z., Glassman, S.C., Manasse, M.S. and Zweig, G. (1997, \u201cSyntactic clustering of the Web\u201d, Proceedings of 6th International World Wide Web Conference, Computer Networks and ISDN Systems, Elsevier, Amsterdam, Vol. 29 No. 8\u201013, pp. 1157\u201066.","DOI":"10.1016\/S0169-7552(97)00031-7"},{"key":"key2022021920090472800_b9","doi-asserted-by":"crossref","unstructured":"Chang, C. and Hsu, C. (1997, \u201cCustomizable multi\u2010engine search tool with clustering\u201d, Proceedings of 6th International World Wide Web Conference, Computer Networks and ISDN Systems, Elsevier, Amsterdam, Vol. 29 No. 8\u201013, pp. 1217\u201024.","DOI":"10.1016\/S0169-7552(97)00053-6"},{"key":"key2022021920090472800_b10","unstructured":"Cheeseman, P. and Stutz, J. (1996, \u201cBayesian classification (Autoclass): theory and results\u201d, in Fayyad, U., Piatetsky\u2010Shapiro, G., Smyth, P. and Uthurusamy, R. (Eds), Advances in Knowledge Discovery and Data Mining, AAAI\/MIT Press, Boston, MA, pp. 153\u201080."},{"key":"key2022021920090472800_b11","doi-asserted-by":"crossref","unstructured":"Dubes, R. and Jain, A. (1980, \u201cClustering methodologies in exploratory data analysis\u201d, in Yovits, M. (Ed.), Advances in Computers, Academic Press Inc., New York, NY.","DOI":"10.1016\/S0065-2458(08)60034-0"},{"key":"key2022021920090472800_b12","unstructured":"Duda, R.O. and Hart, P.E. (1973, Pattern Classification and Scene Analysis, John Wiley & Sons, New York, NY."},{"key":"key2022021920090472800_b13","unstructured":"Fisher, D. (1995, \u201cOptimization and simplification of hierarchical clusterings\u201d, Proceedings of the First International Conference on Knowledge Discovery and Data Mining, American Association for Artificial Intelligence, Menlo Park, CA, pp. 118\u201023."},{"key":"key2022021920090472800_b14","unstructured":"Frakes, W.B. (1992, \u201cStemming algorithms\u201d in Frakes, W.B. and Baeza\u2010Yates, R. (Eds), Information Retrieval Data Structures and Algorithms, Prentice\u2010Hall, Englewood Cliffs, NJ, pp. 131\u201060."},{"key":"key2022021920090472800_b15","unstructured":"Frakes, W.B. and Baeza\u2010Yates, R. (1992, Information Retrieval Data Structures and Algorithms, Prentice\u2010Hall, Englewood Cliffs, NJ."},{"key":"key2022021920090472800_b16","unstructured":"Golub, G.H. and Van Loan, C.F. (1996, Matrix Computations (3rd ed.), Johns Hopkins University Press, Baltimore, MD."},{"key":"key2022021920090472800_b17","unstructured":"Han, E., Karypis, G., Kumar, V. and Mobasher, B. (1998b, \u201cHypergraph based clustering in high\u2010dimensional data sets: a summary of results\u201d, Bulletin of the Technical Committee on Data Engineering, Association for Computing Machinery, Vol. 21 No. 1, New York, NY."},{"key":"key2022021920090472800_b18","doi-asserted-by":"crossref","unstructured":"Han, E.H.S., Boley, D., Gini, M., Gross, R., Hastings., K., Karypis, G., Kumar, V., Mobasher, B. and Moore, J. (1998a, \u201cWebACE: a Web agent for document categorization and exploration\u201d, Proceedings of 2nd International Conference on Autonomous Agents.","DOI":"10.1145\/280765.280872"},{"key":"key2022021920090472800_b19","unstructured":"Jain, A. and Dubes, R.C. (1988, Algorithms for Clustering Data, Prentice\u2010Hall, Englewood Cliffs, NJ."},{"key":"key2022021920090472800_b20","doi-asserted-by":"crossref","unstructured":"Karypis, G., Aggarwal, R., Kumar, V. and Shekhar, S. (1997, \u201cMultilevel hypergraph partitioning: application in VLSI domain\u201d, Proceedings ACM\/IEEE Design Automation Conference, Association for Computing Machinery, New York, NY.","DOI":"10.1145\/266021.266273"},{"key":"key2022021920090472800_b21","doi-asserted-by":"crossref","unstructured":"Kohonen, T. (1988, Self\u2010Organization and Associated Memory, Springer\u2010Verlag, New York, NY.","DOI":"10.1007\/978-3-662-00784-6"},{"key":"key2022021920090472800_b22","doi-asserted-by":"crossref","unstructured":"Lee, R. (1981, \u201cClustering analysis and its applications\u201d, in Toum, J. (Ed.), Advances in Information Systems Science, Plenum Press, New York, NY.","DOI":"10.1007\/978-1-4613-9883-7_4"},{"key":"key2022021920090472800_b23","doi-asserted-by":"crossref","unstructured":"Lu, S. and Fu, K. (1978, \u201cA sentence\u2010to\u2010sentence clustering procedure for pattern analysis\u201d, IEEE Transactions on Systems, Man and Cybernetics, Vol. 8, pp. 381\u20109.","DOI":"10.1109\/TSMC.1978.4309979"},{"key":"key2022021920090472800_b24","doi-asserted-by":"crossref","unstructured":"Maarek, Y.S. and Shaul, I.Z.B. (1996, \u201cAutomatically organizing bookmarks per contents\u201d, Proceedings of 5th International World Wide Web Conference, Computer Networks and ISDN Systems, Elsevier, Amsterdam, Vol. 28 No. 7\u201011, pp. 1321\u201034.","DOI":"10.1016\/0169-7552(96)00024-4"},{"key":"key2022021920090472800_b25","unstructured":"Ng, R. and Han, J. (1994, \u201cEfficient and effective clustering method for spatial data mining\u201d, Proceedings of the 20th VLDB Conference, Morgan Kaufman, San Francisco, CA, pp. 144\u201055."},{"key":"key2022021920090472800_b26","unstructured":"Pazzani, M., Muramatsu, J. and Billsus, D. (1996, \u201cSyskill and Webert: identifying interesting Web sites\u201d, National Conference on Artificial Intelligence, Portland OR, pp. 54\u201061."},{"key":"key2022021920090472800_b27","doi-asserted-by":"crossref","unstructured":"Porter, M.F. (1980, \u201cAn algorithm for suffix stripping\u201d, Program, Vol. 14 No. 3, pp. 130\u20107.","DOI":"10.1108\/eb046814"},{"key":"key2022021920090472800_b28","unstructured":"Salton, G. and McGill, M.J. (1983, Introduction to Modern Information Retrieval, McGraw\u2010Hill, New York, NY."},{"key":"key2022021920090472800_b29","unstructured":"Shavlik, J. and Dietterich, T. (1990, Readings in Machine Learning, Morgan\u2010Kaufman, San Mateo, CA."},{"key":"key2022021920090472800_b30","unstructured":"Titterington, D., Smith, A. and Makov, U. (1985, Statistical Analysis of Finite Mixture Distributions, John Wiley & Sons, New York, NY."},{"key":"key2022021920090472800_b31","doi-asserted-by":"crossref","unstructured":"Weiss, R., Velez, B., Sheldon, M.A., Nemprempre, C., Szilagyi, P., Duda, A. and Gifford, D.K. (1996, \u201cHypursuit: a hierarchical network search engine that exploits content\u2010link hypertext clustering\u201d, Seventh ACM Conference on Hypertext, Association for Computing Machinery, New York, NY.","DOI":"10.1145\/234828.234846"},{"key":"key2022021920090472800_b32","doi-asserted-by":"crossref","unstructured":"Wulfekuhler, M.R. and Punch, W.F. (1997, \u201cFinding salient features for personal Web page categories\u201d, Proceedings of 6th International World Wide Web Conference, Computer Networks and ISDN Systems, Elsevier, Amsterdam, Vol. 29 No. 8\u201013, pp. 1147\u201056.","DOI":"10.1016\/S0169-7552(97)00010-X"}],"container-title":["Internet Research"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/10662249810241257","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/10662249810241257\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/10662249810241257\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:40:56Z","timestamp":1753400456000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/intr\/article\/8\/5\/387-399\/181700"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1998,12,1]]},"references-count":32,"journal-issue":{"issue":"5","published-print":{"date-parts":[[1998,12,1]]}},"alternative-id":["10.1108\/10662249810241257"],"URL":"https:\/\/doi.org\/10.1108\/10662249810241257","relation":{},"ISSN":["1066-2243"],"issn-type":[{"type":"print","value":"1066-2243"}],"subject":[],"published":{"date-parts":[[1998,12,1]]}}}