{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:36:28Z","timestamp":1750307788238,"version":"3.41.0"},"reference-count":49,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2008,1,1]],"date-time":"2008-01-01T00:00:00Z","timestamp":1199145600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2008,1]]},"abstract":"<jats:p>A topic taxonomy is an effective representation that describes salient features of virtual groups or online communities. A topic taxonomy consists of topic nodes. Each internal node is defined by its vertical path (i.e., ancestor and child nodes) and its horizonal list of attributes (or terms). In a text-dominant environment, a topic taxonomy can be used to flexibly describe a group's interests with varying granularity. However, the stagnant nature of a taxonomy may fail to timely capture the dynamic change of a group's interest. This article addresses the problem of how to adapt a topic taxonomy to the accumulated data that reflects the change of a group's interest to achieve dynamic group profiling. We first discuss the issues related to topic taxonomy. We next formulate taxonomy adaptation as an optimization problem to find the taxonomy that best fits the data. We then present a viable algorithm that can efficiently accomplish taxonomy adaptation. We conduct extensive experiments to evaluate our approach's efficacy for group profiling, compare the approach with some alternatives, and study its performance for dynamic group profiling. While pointing out various applications of taxonomy adaption, we suggest some future work that can take advantage of burgeoning Web 2.0 services for online targeted marketing, counterterrorism in connecting dots, and community tracking.<\/jats:p>","DOI":"10.1145\/1324172.1324173","type":"journal-article","created":{"date-parts":[[2008,2,8]],"date-time":"2008-02-08T15:32:16Z","timestamp":1202484736000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":27,"title":["Topic taxonomy adaptation for group profiling"],"prefix":"10.1145","volume":"1","author":[{"given":"Lei","family":"Tang","sequence":"first","affiliation":[{"name":"Arizona State University, Tempe, AZ"}]},{"given":"Huan","family":"Liu","sequence":"additional","affiliation":[{"name":"Arizona State University, Tempe, AZ"}]},{"given":"Jianping","family":"Zhang","sequence":"additional","affiliation":[{"name":"MITRE, McLean, VA"}]},{"given":"Nitin","family":"Agarwal","sequence":"additional","affiliation":[{"name":"Arizona State University, Tempe, AZ"}]},{"given":"John J.","family":"Salerno","sequence":"additional","affiliation":[{"name":"Air Force Research Laboratory, Rome, NY"}]}],"member":"320","published-online":{"date-parts":[[2008,2]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.901170"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/312129.312279"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/170036.170072"},{"key":"e_1_2_1_4_1","volume-title":"Tech. Rep. CMU-ML-06-101, School of Computer Science","author":"Airoldi E. M.","year":"2006","unstructured":"Airoldi , E. M. , Fienberg , S. E. , Joutard , C. , and Love , T. M . 2006 . Discovering latent patterns with hierarchical Bayesian mixed-membership models. Tech. Rep. CMU-ML-06-101, School of Computer Science , Carnegie Mellon University , Philadelphia, PA . Airoldi, E. M., Fienberg, S. E., Joutard, C., and Love, T. M. 2006. Discovering latent patterns with hierarchical Bayesian mixed-membership models. Tech. Rep. CMU-ML-06-101, School of Computer Science, Carnegie Mellon University, Philadelphia, PA."},{"volume-title":"Introduction to Topic Detection and Tracking","author":"Allan J.","key":"e_1_2_1_5_1","unstructured":"Allan , J. 2002. Introduction to Topic Detection and Tracking . Kluwer Academic , Norwell, MA , 1--16. Allan, J. 2002. Introduction to Topic Detection and Tracking. Kluwer Academic, Norwell, MA, 1--16."},{"key":"e_1_2_1_6_1","unstructured":"Blei D. Griffiths T. L. Jordan M. I. and Tenenbaum J. B. 2003. Hierarchical topic models and the nested Chinese restaurant process. In Advances in Neural Information Processing Systems 16 S. Thrun et al. eds. MIT Press Cambridge MA.  Blei D. Griffiths T. L. Jordan M. I. and Tenenbaum J. B. 2003. Hierarchical topic models and the nested Chinese restaurant process. In Advances in Neural Information Processing Systems 16 S. Thrun et al. eds. MIT Press Cambridge MA."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143859"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944937"},{"key":"e_1_2_1_9_1","unstructured":"Bounsaythip C. and Rinta-Runsala E. 2001. Overview of data mining for customer behavior modeling. http:\/\/virtual.vtt.fi\/inf\/julkaisut\/muut\/2001\/customerprofiling.pdf.  Bounsaythip C. and Rinta-Runsala E. 2001. Overview of data mining for customer behavior modeling. http:\/\/virtual.vtt.fi\/inf\/julkaisut\/muut\/2001\/customerprofiling.pdf."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1031171.1031186"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143867"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/1248547.1248549"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150467"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/s007780050061"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2004.12.033"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1031171.1031193"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015330.1015374"},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Dhillon I. S. Fan J. and Guan Y. 2001. Efficient clustering of very large document collections. In Data Mining for Scientific and Engineering Applications. Kluwer Academic.  Dhillon I. S. Fan J. and Guan Y. 2001. Efficient clustering of very large document collections. In Data Mining for Scientific and Engineering Applications. Kluwer Academic.","DOI":"10.1007\/978-1-4615-1733-7_20"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/345508.345593"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944974"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1099554.1099703"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/988672.988739"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI). Morgan Kaufmann","author":"Hofmann T.","year":"1999","unstructured":"Hofmann , T. 1999 . The cluster-abstraction model: Unsupervised learning of topic hierarchies from text data . In Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI). Morgan Kaufmann , San Francisco, CA, 682--687. Hofmann, T. 1999. The cluster-abstraction model: Unsupervised learning of topic hierarchies from text data. In Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI). Morgan Kaufmann, San Francisco, CA, 682--687."},{"key":"e_1_2_1_24_1","unstructured":"Hwang F. and Richards D. 1992. The Steiner tree problem. Ann. Discrete Math. 53.  Hwang F. and Richards D. 1992. The Steiner tree problem. Ann. Discrete Math. 53."},{"key":"e_1_2_1_25_1","unstructured":"Jain A. K. and Dubes R. C. 1988. Algorithms for Clustering Data. Prentice-Hall.   Jain A. K. and Dubes R. C. 1988. Algorithms for Clustering Data. Prentice-Hall."},{"volume-title":"Proceedings of the 14th International Conference on Machine Learning (ICML). Morgan Kaufmann","author":"Koller D.","key":"e_1_2_1_26_1","unstructured":"Koller , D. and Sahami , M . 1997. Hierarchically classifying documents using very few words . In Proceedings of the 14th International Conference on Machine Learning (ICML). Morgan Kaufmann , San Francisco, CA, 170--178. Koller, D. and Sahami, M. 1997. Hierarchically classifying documents using very few words. In Proceedings of the 14th International Conference on Machine Learning (ICML). Morgan Kaufmann, San Francisco, CA, 170--178."},{"volume-title":"SIAM International Data Mining Conference","author":"Li T.","key":"e_1_2_1_27_1","unstructured":"Li , T. and Zhu , S . 2005. Hierarchical document classification using automatically generated hierarchy . In SIAM International Data Mining Conference , Newport Beach, CA. Li, T. and Zhu, S. 2005. Hierarchical document classification using automatically generated hierarchy. In SIAM International Data Mining Conference, Newport Beach, CA."},{"key":"e_1_2_1_28_1","volume-title":"eds","author":"Liu H.","year":"2007","unstructured":"Liu , H. and Motoda , H. , eds . 2007 . Computational Methods of Feature Selection. Chapman and Hall\/CRC Press . Liu, H. and Motoda, H., eds. 2007. Computational Methods of Feature Selection. Chapman and Hall\/CRC Press."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2005.66"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1089815.1089821"},{"volume-title":"AAAI-98 Workshop on Learning for Text Categorization.","author":"McCallum A.","key":"e_1_2_1_31_1","unstructured":"McCallum , A. and Nigam , K . 1998. A comparison of event models for naive Bayes text classification . In AAAI-98 Workshop on Learning for Text Categorization. McCallum, A. and Nigam, K. 1998. A comparison of event models for naive Bayes text classification. In AAAI-98 Workshop on Learning for Text Categorization."},{"volume-title":"Proceedings of the 15th International Conference on Machine Learning (ICML). Morgan Kaufmann","author":"McCallum A.","key":"e_1_2_1_32_1","unstructured":"McCallum , A. , Rosenfeld , R. , Mitchell , T. M. , and Ng , A. Y . 1998. Improving text classification by shrinkage in a hierarchy of classes . In Proceedings of the 15th International Conference on Machine Learning (ICML). Morgan Kaufmann , San Francisco, CA, 359--367. McCallum, A., Rosenfeld, R., Mitchell, T. M., and Ng, A. Y. 1998. Improving text classification by shrinkage in a hierarchy of classes. In Proceedings of the 15th International Conference on Machine Learning (ICML). Morgan Kaufmann, San Francisco, CA, 359--367."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1062745.1062843"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1102351.1102445"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/312624.312700"},{"key":"e_1_2_1_36_1","unstructured":"Segal E. Koller D. and Ormoneit D. 2001. Probabilistic abstraction hierarchies. In Advances in Neural Information Processing Systems 14. MIT Press Vancouver British Columbia Canada 913--920.  Segal E. Koller D. and Ormoneit D. 2001. Probabilistic abstraction hierarchies. In Advances in Neural Information Processing Systems 14. MIT Press Vancouver British Columbia Canada 913--920."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-9236(00)00123-8"},{"volume-title":"Proceedings of the IEEE International Conference on Data Mining (ICDM). IEEE Computer Society","author":"Sun A.","key":"e_1_2_1_38_1","unstructured":"Sun , A. and Lim , E . -P. 2001. Hierarchical text classification and evaluation . In Proceedings of the IEEE International Conference on Data Mining (ICDM). IEEE Computer Society , Washington, DC, 521--528. Sun, A. and Lim, E.-P. 2001. Hierarchical text classification and evaluation. In Proceedings of the IEEE International Conference on Data Mining (ICDM). IEEE Computer Society, Washington, DC, 521--528."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2005.34"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150446"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/502585.502604"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015330.1015341"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/1102351.1102468"},{"volume-title":"Proceedings of the 25th International Conference on Very Large Data Bases (VLDB). Morgan Kaufmann","author":"Wang K.","key":"e_1_2_1_44_1","unstructured":"Wang , K. , Zhou , S. , and Liew , S. C . 1999. Building hierarchical classifiers using class proximity . In Proceedings of the 25th International Conference on Very Large Data Bases (VLDB). Morgan Kaufmann , San Francisco, CA, 363--374. Wang, K., Zhou, S., and Liew, S. C. 1999. Building hierarchical classifiers using class proximity. In Proceedings of the 25th International Conference on Very Large Data Bases (VLDB). Morgan Kaufmann, San Francisco, CA, 363--374."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009983522080"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/584792.584878"},{"volume-title":"Proceedings of the 14th International Conference on Machine Learning (ICML). Morgan Kaufmann","author":"Yang Y.","key":"e_1_2_1_47_1","unstructured":"Yang , Y. and Pedersen , J. O . 1997. A comparative study on feature selection in text categorization . In Proceedings of the 14th International Conference on Machine Learning (ICML). Morgan Kaufmann , San Francisco, CA, 412--420. Yang, Y. and Pedersen, J. O. 1997. A comparative study on feature selection in text categorization. In Proceedings of the 14th International Conference on Machine Learning (ICML). Morgan Kaufmann, San Francisco, CA, 412--420."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860455"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/1031171.1031263"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1324172.1324173","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1324172.1324173","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T13:56:15Z","timestamp":1750254975000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1324172.1324173"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,1]]},"references-count":49,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2008,1]]}},"alternative-id":["10.1145\/1324172.1324173"],"URL":"https:\/\/doi.org\/10.1145\/1324172.1324173","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"type":"print","value":"1556-4681"},{"type":"electronic","value":"1556-472X"}],"subject":[],"published":{"date-parts":[[2008,1]]},"assertion":[{"value":"2007-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2007-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2008-02-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}