{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:15:18Z","timestamp":1750306518815,"version":"3.41.0"},"reference-count":31,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2014,12,17]],"date-time":"2014-12-17T00:00:00Z","timestamp":1418774400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Internet Technol."],"published-print":{"date-parts":[[2014,12,17]]},"abstract":"<jats:p>\n            Microblogging platforms, such as Twitter, have already played an important role in recent cultural, social and political events. Discovering latent topics from social streams is therefore important for many downstream applications, such as clustering, classification or recommendation. However, traditional topic models that rely on the bag-of-words assumption are insufficient to uncover the rich semantics and temporal aspects of topics in Twitter. In particular, microblog content is often influenced by external information sources, such as Web documents linked from Twitter posts, and often focuses on specific entities, such as people or organizations. These external sources provide useful semantics to understand microblogs and we generally refer to these semantics as\n            <jats:italic>auxiliary semantics<\/jats:italic>\n            . In this article, we address the mentioned issues and propose a unified framework for Multifaceted Topic Modeling from Twitter streams. We first extract social semantics from Twitter by modeling the social chatter associated with hashtags. We further extract terms and named entities from linked Web documents to serve as auxiliary semantics during topic modeling. The Multifaceted Topic Model (MfTM) is then proposed to jointly model latent semantics among the social terms from Twitter, auxiliary terms from the linked Web documents and named entities. Moreover, we capture the temporal characteristics of each topic. An efficient online inference method for MfTM is developed, which enables our model to be applied to large-scale and streaming data. Our experimental evaluation shows the effectiveness and efficiency of our model compared with state-of-the-art baselines. We evaluate each aspect of our framework and show its utility in the context of tweet clustering.\n          <\/jats:p>","DOI":"10.1145\/2651403","type":"journal-article","created":{"date-parts":[[2014,12,19]],"date-time":"2014-12-19T13:38:51Z","timestamp":1418996331000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":26,"title":["Integrating Social and Auxiliary Semantics for Multifaceted Topic Modeling in Twitter"],"prefix":"10.1145","volume":"14","author":[{"given":"Jan","family":"Vosecky","sequence":"first","affiliation":[{"name":"Hong Kong University of Science and Technology, Kowloon, Hong Kong"}]},{"given":"Di","family":"Jiang","sequence":"additional","affiliation":[{"name":"Hong Kong University of Science and Technology, Kowloon, Hong Kong"}]},{"given":"Kenneth Wai-Ting","family":"Leung","sequence":"additional","affiliation":[{"name":"Hong Kong University of Science and Technology, Kowloon, Hong Kong"}]},{"given":"Kai","family":"Xing","sequence":"additional","affiliation":[{"name":"Hong Kong University of Science and Technology, Kowloon, Hong Kong"}]},{"given":"Wilfred","family":"Ng","sequence":"additional","affiliation":[{"name":"Hong Kong University of Science and Technology, Kowloon, Hong Kong"}]}],"member":"320","published-online":{"date-parts":[[2014,12,17]]},"reference":[{"volume-title":"Proceedings of the Extended Semantic Web Conference (ESWC'11)","author":"Abel F.","key":"e_1_2_1_1_1","unstructured":"F. Abel , Q. Gao , G.-J. Houben , and K. Tao . 2011. Semantic enrichment of twitter posts for user profile construction on the social web . In Proceedings of the Extended Semantic Web Conference (ESWC'11) , 375--389. F. Abel, Q. Gao, G.-J. Houben, and K. Tao. 2011. Semantic enrichment of twitter posts for user profile construction on the social web. In Proceedings of the Extended Semantic Web Conference (ESWC'11), 375--389."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2008.140"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1718487.1718524"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944937"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/2027776.2027790"},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the International Conference on Computational Linguistics (COLING'10)","author":"Duan Y.","year":"2010","unstructured":"Y. Duan , L. Jiang , T. Qin , M. Zhou , and H.-Y. Shum . 2010 . An empirical study on learning to rank of tweets . In Proceedings of the International Conference on Computational Linguistics (COLING'10) . 295--303. Y. Duan, L. Jiang, T. Qin, M. Zhou, and H.-Y. Shum. 2010. An empirical study on learning to rank of tweets. In Proceedings of the International Conference on Computational Linguistics (COLING'10). 295--303."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835449.1835616"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2009916.2009984"},{"volume-title":"Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'96)","author":"Ester M.","key":"e_1_2_1_9_1","unstructured":"M. Ester , H. P. Kriegel , J. Sander , and X. Xu . 1996. A density-based algorithm for discovering clusters in large spatial databases with noise . In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'96) . 226--231. M. Ester, H. P. Kriegel, J. Sander, and X. Xu. 1996. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'96). 226--231."},{"key":"e_1_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Y. Genc Y. Sakamoto and J. V. Nickerson. 2011. Discovering context: Classifying tweets through a semantic transform based on Wikipedia. In Foundations of Augmented Cognition. Directing the Future of Adaptive Systems 484--492.   Y. Genc Y. Sakamoto and J. V. Nickerson. 2011. Discovering context: Classifying tweets through a semantic transform based on Wikipedia. In Foundations of Augmented Cognition. Directing the Future of Adaptive Systems 484--492.","DOI":"10.1007\/978-3-642-21852-1_55"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2536689.2536803"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/2567709.2502622"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1964858.1964870"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1810617.1810647"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063576.2063689"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1935826.1935932"},{"volume-title":"Proceedings of the International Conference on Computational Linguistics. 1519--1534","author":"Lau J. H.","key":"e_1_2_1_17_1","unstructured":"J. H. Lau , N. Collier , and T. Baldwin . 2012. On-line trend analysis with topic models: #twitter trends detection topic model online . In Proceedings of the International Conference on Computational Linguistics. 1519--1534 . J. H. Lau, N. Collier, and T. Baldwin. 2012. On-line trend analysis with topic models: #twitter trends detection topic model online. In Proceedings of the International Conference on Computational Linguistics. 1519--1534."},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"C. D. Manning P. Raghavan and H. Sch\u00fctze. 2008. Introduction to Information Retrieval. Cambridge University Press.   C. D. Manning P. Raghavan and H. Sch\u00fctze. 2008. Introduction to Information Retrieval. Cambridge University Press.","DOI":"10.1017\/CBO9780511809071"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150487"},{"volume-title":"Proceedings of the National Conference on Artificial Intelligence (AAAI'10)","author":"Paul M.","key":"e_1_2_1_20_1","unstructured":"M. Paul and R. Girju . 2010. A two-dimensional topic-aspect model for discovering multi-faceted topics . In Proceedings of the National Conference on Artificial Intelligence (AAAI'10) . 545--550. M. Paul and R. Girju. 2010. A two-dimensional topic-aspect model for discovering multi-faceted topics. In Proceedings of the National Conference on Artificial Intelligence (AAAI'10). 545--550."},{"volume-title":"Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP'11)","author":"Ritter A.","key":"e_1_2_1_21_1","unstructured":"A. Ritter , S. Clark , Mausam, and O. Etzioni . 2011. Named entity recognition in tweets: An experimental study . In Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP'11) . 1524--1534. A. Ritter, S. Clark, Mausam, and O. Etzioni. 2011. Named entity recognition in tweets: An experimental study. In Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP'11). 1524--1534."},{"volume-title":"Proceedings of the ACM SIGIR Special Interest Group on Information Retrieval's 3rd Workshop on Social Web Search and Mining (SWSM'10)","author":"Rosa K. D.","key":"e_1_2_1_22_1","unstructured":"K. D. Rosa , R. Shah , B. Lin , A. Gershman , and R. Frederking . 2010. Topical clustering of tweets . In Proceedings of the ACM SIGIR Special Interest Group on Information Retrieval's 3rd Workshop on Social Web Search and Mining (SWSM'10) . K. D. Rosa, R. Shah, B. Lin, A. Gershman, and R. Frederking. 2010. Topical clustering of tweets. In Proceedings of the ACM SIGIR Special Interest Group on Information Retrieval's 3rd Workshop on Social Web Search and Mining (SWSM'10)."},{"volume-title":"Proceedings of the Conference on Uncertainty in Artificial Intelligence (AUAI'04)","author":"Rosen-Zvi M.","key":"e_1_2_1_23_1","unstructured":"M. Rosen-Zvi , T. Griffiths , M. Steyvers , and P. Smyth . 2004. The author-topic model for authors and documents . In Proceedings of the Conference on Uncertainty in Artificial Intelligence (AUAI'04) . 487--494. M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth. 2004. The author-topic model for authors and documents. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (AUAI'04). 487--494."},{"volume-title":"Proceedings of the International Conference on Weblogs and Social Media (ICWSM'13)","author":"Tsur O.","key":"e_1_2_1_24_1","unstructured":"O. Tsur and A. Rappoport . 2013. Efficient clustering of short messages into general domains . In Proceedings of the International Conference on Weblogs and Social Media (ICWSM'13) . 621--630. O. Tsur and A. Rappoport. 2013. Efficient clustering of short messages into general domains. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM'13). 621--630."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2505515.2505593"},{"key":"e_1_2_1_26_1","doi-asserted-by":"crossref","unstructured":"J. Vosecky. 2014. Online Appendix to: Integrating Social and Auxiliary Semantics for Multi-Faceted Topic Modeling in Twitter. http:\/\/www.cse.ust.hk\/_wilfred\/mftm.html.  J. Vosecky. 2014. Online Appendix to: Integrating Social and Auxiliary Semantics for Multi-Faceted Topic Modeling in Twitter. http:\/\/www.cse.ust.hk\/_wilfred\/mftm.html.","DOI":"10.1145\/2651403"},{"key":"e_1_2_1_27_1","unstructured":"B. Walsh. 2004. Markov chain Monte Carlo and Gibbs sampling. Lecture Notes MIT.  B. Walsh. 2004. Markov chain Monte Carlo and Gibbs sampling. Lecture Notes MIT."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150450"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063576.2063726"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2010.34"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.5555\/1996889.1996934"}],"container-title":["ACM Transactions on Internet Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2651403","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2651403","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:11:54Z","timestamp":1750227114000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2651403"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,12,17]]},"references-count":31,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2014,12,17]]}},"alternative-id":["10.1145\/2651403"],"URL":"https:\/\/doi.org\/10.1145\/2651403","relation":{},"ISSN":["1533-5399","1557-6051"],"issn-type":[{"type":"print","value":"1533-5399"},{"type":"electronic","value":"1557-6051"}],"subject":[],"published":{"date-parts":[[2014,12,17]]},"assertion":[{"value":"2013-11-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-12-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}