{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:08:24Z","timestamp":1750219704955,"version":"3.41.0"},"reference-count":61,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2023,12,29]],"date-time":"2023-12-29T00:00:00Z","timestamp":1703808000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61972184, 62272205, 62272206, and 62076112"],"award-info":[{"award-number":["61972184, 62272205, 62272206, and 62076112"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Natural Science and Foundation of Jiangxi Province","award":["20212ACB202002 and 20192BAB207017"],"award-info":[{"award-number":["20212ACB202002 and 20192BAB207017"]}]},{"name":"Funding Program for Academic and Technical Leaders in Major Disciplines of Jiangxi Province","award":["20213BCJL22041"],"award-info":[{"award-number":["20213BCJL22041"]}]},{"name":"Research Project for Science and Technology of Jiangxi Education Department","award":["GJJ2200643"],"award-info":[{"award-number":["GJJ2200643"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2024,5,31]]},"abstract":"<jats:p>Learning topic hierarchies from a multi-domain corpus is crucial in topic modeling as it reveals valuable structural information embedded within documents. Despite the extensive literature on hierarchical topic models, effectively discovering inter-topic correlations and differences among subtopics at the same level in the topic hierarchy, obtained from multiple domains, remains an unresolved challenge. 
This article proposes an enhanced nested Chinese restaurant process (nCRP), nCRP+, by introducing an additional mechanism based on Chinese restaurant franchise (CRF) for aspect-sharing pattern extraction in the original nCRP. Subsequently, by employing the distribution extracted from nCRP+ as the prior distribution for topic hierarchy in the hierarchical Dirichlet processes (HDP), we develop a hierarchical topic model for multi-domain corpus, named rHDP. We describe the model with the analogy of Chinese restaurant franchise based on the central kitchen and propose a hierarchical Gibbs sampling scheme to infer the model. Our method effectively constructs well-established topic hierarchies, accurately reflecting diverse parent-child topic relationships, explicit topic aspect sharing correlations for inter-topics, and differences between these shared topics. To validate the efficacy of our approach, we conduct experiments using a renowned public dataset and an online collection of Chinese financial documents. 
The experimental results confirm the superiority of our method over the state-of-the-art techniques in identifying multi-domain topic hierarchies, according to multiple evaluation metrics.<\/jats:p>","DOI":"10.1145\/3631352","type":"journal-article","created":{"date-parts":[[2023,11,2]],"date-time":"2023-11-02T22:19:01Z","timestamp":1698963541000},"page":"1-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["rHDP: An Aspect Sharing-Enhanced Hierarchical Topic Model for Multi-Domain Corpus"],"prefix":"10.1145","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4049-3001","authenticated-orcid":false,"given":"Yitao","family":"Zhang","sequence":"first","affiliation":[{"name":"Jiangxi University of Finance and Economics, China and East China Jiaotong University, China and Jiangxi Key Laboratory of Data and Knowledge Engineering, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6222-1015","authenticated-orcid":false,"given":"Changxuan","family":"Wan","sequence":"additional","affiliation":[{"name":"Jiangxi University of Finance and Economics, China and Jiangxi Key Laboratory of Data and Knowledge Engineering, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6494-1174","authenticated-orcid":false,"given":"Keli","family":"Xiao","sequence":"additional","affiliation":[{"name":"College of Business, Stony Brook University, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8835-5134","authenticated-orcid":false,"given":"Qizhi","family":"Wan","sequence":"additional","affiliation":[{"name":"Jiangxi University of Finance and Economics, China and Jiangxi Key Laboratory of Data and Knowledge Engineering, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1093-2744","authenticated-orcid":false,"given":"Dexi","family":"Liu","sequence":"additional","affiliation":[{"name":"Jiangxi University of Finance and Economics, China and Jiangxi Key Laboratory of Data and Knowledge Engineering, 
China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0230-8004","authenticated-orcid":false,"given":"Xiping","family":"Liu","sequence":"additional","affiliation":[{"name":"Jiangxi University of Finance and Economics, China and Jiangxi Key Laboratory of Data and Knowledge Engineering, China"}]}],"member":"320","published-online":{"date-parts":[[2023,12,29]]},"reference":[{"key":"e_1_3_2_2_2","first-page":"1426","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Ahmed Amr","year":"2013","unstructured":"Amr Ahmed, Liangjie Hong, and Alexander J. Smola. 2013. Nested Chinese restaurant franchise processes: Applications to user tracking and document modeling. In Proceedings of the International Conference on Machine Learning. 1426\u20131434."},{"doi-asserted-by":"publisher","key":"e_1_3_2_3_2","DOI":"10.1007\/978-3-030-01771-2_20"},{"doi-asserted-by":"publisher","key":"e_1_3_2_4_2","DOI":"10.1007\/BFb0099421"},{"doi-asserted-by":"publisher","key":"e_1_3_2_5_2","DOI":"10.1145\/3269206.3271696"},{"doi-asserted-by":"publisher","key":"e_1_3_2_6_2","DOI":"10.18653\/v1\/2021.acl-short.96"},{"doi-asserted-by":"publisher","key":"e_1_3_2_7_2","DOI":"10.1145\/1667053.1667056"},{"unstructured":"David M. Blei and John D. Lafferty. 2006. Correlated topic models. Advances in Neural Information Processing Systems and Information Sciences 18 (2006) 147\u2013154.","key":"e_1_3_2_8_2"},{"unstructured":"Jianfei Chen Jun Zhu Jie Lu and Shixia Liu. 2017. Scalable inference for nested Chinese restaurant process topic models. arXiv:1702.07083 . 
Retrieved from https:\/\/arxiv.org\/abs\/1702.07083","key":"e_1_3_2_9_2"},{"doi-asserted-by":"publisher","key":"e_1_3_2_10_2","DOI":"10.14778\/3192965.3192972"},{"doi-asserted-by":"publisher","key":"e_1_3_2_11_2","DOI":"10.21469\/22233792.2.2.05"},{"key":"e_1_3_2_12_2","volume-title":"Probabilistic generative models-based topic modeling of text and its applications","author":"Ding Yiqun","year":"2010","unstructured":"Yiqun Ding. 2010. Probabilistic generative models-based topic modeling of text and its applications. Ph.D. Dissertation. Zhejiang University, China."},{"doi-asserted-by":"publisher","key":"e_1_3_2_13_2","DOI":"10.1631\/jzus.A0820796"},{"key":"e_1_3_2_14_2","first-page":"2903","volume-title":"Proceedings of the 38th International Conference on Machine Learning","volume":"139","author":"Duan Zhibin","year":"2021","unstructured":"Zhibin Duan, Dongsheng Wang, Bo Chen, Chaojie Wang, Wenchao Chen, Yewen Li, Jie Ren, and Mingyuan Zhou. 2021. Sawtooth factorial topic embeddings guided gamma belief network. In Proceedings of the 38th International Conference on Machine Learning, Vol. 139. 2903\u20132913."},{"doi-asserted-by":"publisher","key":"e_1_3_2_15_2","DOI":"10.1145\/3404835.3462982"},{"key":"e_1_3_2_16_2","article-title":"BERTTM: Leveraging contextualized word embeddings from pre-trained language models for neural topic modeling","volume":"2305","author":"Fang Zheng","year":"2023","unstructured":"Zheng Fang, Yulan He, and Rob Procter. 2023. BERTTM: Leveraging contextualized word embeddings from pre-trained language models for neural topic modeling. arXiv:2305.09329. 
Retrieved from https:\/\/arxiv.org\/abs\/2305.09329","journal-title":"arXiv"},{"doi-asserted-by":"publisher","key":"e_1_3_2_17_2","DOI":"10.1609\/aaai.v34i05.6277"},{"key":"e_1_3_2_18_2","first-page":"1823","volume-title":"Proceedings of the 32nd International Conference on Machine Learning","author":"Gan Zhe","year":"2015","unstructured":"Zhe Gan, Changyou Chen, Ricardo Henao, David Carlson, and Lawrence Carin. 2015. Scalable deep Poisson factor analysis for topic modeling. In Proceedings of the 32nd International Conference on Machine Learning. PMLR, 1823\u20131832."},{"unstructured":"Maarten Grootendorst. 2022. BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv:2203.05794. Retrieved from https:\/\/arxiv.org\/abs\/2203.05794","key":"e_1_3_2_19_2"},{"issue":"7","key":"e_1_3_2_20_2","first-page":"1539","article-title":"Flow hierarchical dirichlet process for complex topic modeling","volume":"42","author":"Han Zhongming","year":"2019","unstructured":"Zhongming Han, Mengmei Zhang, Mengqi Li, Dagao Duan, and Yi Chen. 2019. Flow hierarchical dirichlet process for complex topic modeling. Chinese Journal of Computers 42, 7 (2019), 1539\u20131552.","journal-title":"Chinese Journal of Computers"},{"doi-asserted-by":"publisher","key":"e_1_3_2_21_2","DOI":"10.1145\/3488560.3498518"},{"doi-asserted-by":"publisher","key":"e_1_3_2_22_2","DOI":"10.18653\/v1\/P18-2082"},{"key":"e_1_3_2_23_2","first-page":"2018","volume-title":"Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021","author":"Hoyle Alexander Miserlis","year":"2021","unstructured":"Alexander Miserlis Hoyle, Pranav Goel, Andrew Hian-Cheong, Denis Peskov, Jordan L. Boyd-Graber, and Philip Resnik. 2021. Is automated topic model evaluation broken? the incoherence of coherence. In Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021. 
2018\u20132033."},{"unstructured":"Weip\u00e9ng Hu\u00e1ng Nishma Laitonjam Guangyuan Piao and Neil Hurley. 2019. Bayesian hierarchical mixture clustering using multilevel hierarchical Dirichlet processes. arXiv:1905.05022. Retrieved from https:\/\/arxiv.org\/abs\/1905.05022","key":"e_1_3_2_24_2"},{"doi-asserted-by":"publisher","key":"e_1_3_2_25_2","DOI":"10.4018\/IJERTCS.2020100107"},{"doi-asserted-by":"publisher","key":"e_1_3_2_26_2","DOI":"10.1145\/3502727"},{"doi-asserted-by":"publisher","key":"e_1_3_2_27_2","DOI":"10.18653\/v1\/2020.acl-main.73"},{"key":"e_1_3_2_28_2","article-title":"ArchiText: Interactive hierarchical topic modeling","author":"Kim Hannah","year":"2020","unstructured":"Hannah Kim, Barry Drake, Alex Endert, and Haesun Park. 2020. ArchiText: Interactive hierarchical topic modeling. IEEE Transactions on Visualization and Computer Graphics 27, 9 (2020), 3644\u20133655.","journal-title":"IEEE Transactions on Visualization and Computer Graphics"},{"doi-asserted-by":"publisher","key":"e_1_3_2_29_2","DOI":"10.1145\/2396761.2396861"},{"doi-asserted-by":"publisher","key":"e_1_3_2_30_2","DOI":"10.1145\/3485447.3512002"},{"doi-asserted-by":"publisher","key":"e_1_3_2_31_2","DOI":"10.1145\/3238250"},{"doi-asserted-by":"publisher","key":"e_1_3_2_32_2","DOI":"10.1145\/1143844.1143917"},{"doi-asserted-by":"publisher","key":"e_1_3_2_33_2","DOI":"10.1007\/s11518-018-5375-7"},{"doi-asserted-by":"publisher","key":"e_1_3_2_34_2","DOI":"10.1609\/aaai.v29i1.9591"},{"key":"e_1_3_2_35_2","volume-title":"Macroeconomics","author":"Mankiw N. Gregory","year":"2003","unstructured":"N. Gregory Mankiw et\u00a0al. 2003. Macroeconomics. Vol. 41. 
New York: Worth Publishers."},{"doi-asserted-by":"publisher","key":"e_1_3_2_36_2","DOI":"10.1145\/3485447.3512034"},{"doi-asserted-by":"publisher","key":"e_1_3_2_37_2","DOI":"10.1145\/3394486.3403242"},{"doi-asserted-by":"publisher","key":"e_1_3_2_38_2","DOI":"10.1109\/TPAMI.2014.2318728"},{"doi-asserted-by":"publisher","key":"e_1_3_2_39_2","DOI":"10.1016\/j.joi.2020.101047"},{"doi-asserted-by":"publisher","key":"e_1_3_2_40_2","DOI":"10.1145\/3184558.3186916"},{"doi-asserted-by":"publisher","key":"e_1_3_2_41_2","DOI":"10.1198\/016214508000000553"},{"doi-asserted-by":"publisher","key":"e_1_3_2_42_2","DOI":"10.1002\/asi.23439"},{"doi-asserted-by":"publisher","key":"e_1_3_2_43_2","DOI":"10.1198\/016214506000000302"},{"doi-asserted-by":"publisher","key":"e_1_3_2_44_2","DOI":"10.1145\/3289600.3291032"},{"doi-asserted-by":"publisher","key":"e_1_3_2_45_2","DOI":"10.18653\/v1\/2020.acl-main.724"},{"doi-asserted-by":"publisher","key":"e_1_3_2_46_2","DOI":"10.1016\/j.ins.2020.01.036"},{"key":"e_1_3_2_47_2","article-title":"A multi-channel hierarchical graph attention network for open event extraction","author":"Wan Qizhi","year":"2022","unstructured":"Qizhi Wan, Changxuan Wan, Keli Xiao, Rong Hu, and Dexi Liu. 2022. A multi-channel hierarchical graph attention network for open event extraction. ACM Transactions on Information Systems (TOIS) 41, 1 (2022), 1\u201327.","journal-title":"ACM Transactions on Information Systems (TOIS)"},{"doi-asserted-by":"publisher","key":"e_1_3_2_48_2","DOI":"10.1016\/j.ins.2023.01.143"},{"doi-asserted-by":"publisher","key":"e_1_3_2_49_2","DOI":"10.1145\/3527240"},{"key":"e_1_3_2_50_2","first-page":"1990","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Wang Chong","year":"2009","unstructured":"Chong Wang and David M. Blei. 2009. Variational inference for the nested Chinese restaurant process. In Proceedings of the Advances in Neural Information Processing Systems. 
1990\u20131998."},{"doi-asserted-by":"publisher","key":"e_1_3_2_51_2","DOI":"10.1007\/978-3-030-63820-7_56"},{"issue":"3","key":"e_1_3_2_52_2","first-page":"1","article-title":"Graph neural collaborative topic model for citation recommendation","volume":"40","author":"Xie Qianqian","year":"2021","unstructured":"Qianqian Xie, Yutao Zhu, Jimin Huang, Pan Du, and Jian-Yun Nie. 2021. Graph neural collaborative topic model for citation recommendation. ACM Transactions on Information Systems (TOIS) 40, 3 (2021), 1\u201330.","journal-title":"ACM Transactions on Information Systems (TOIS)"},{"doi-asserted-by":"publisher","key":"e_1_3_2_53_2","DOI":"10.1016\/j.eswa.2018.03.008"},{"doi-asserted-by":"publisher","key":"e_1_3_2_54_2","DOI":"10.1145\/3291044"},{"doi-asserted-by":"publisher","key":"e_1_3_2_55_2","DOI":"10.1109\/TKDE.2016.2636182"},{"doi-asserted-by":"crossref","unstructured":"Guangxu Xun Yaliang Li Wayne Xin Jing Gao and Aidong Zhang. 2017. A correlated topic model using word embeddings. In Proceedings of the 26th International Joint Conference on Artificial Intelligence 4207\u20134213.","key":"e_1_3_2_56_2","DOI":"10.24963\/ijcai.2017\/588"},{"doi-asserted-by":"publisher","key":"e_1_3_2_57_2","DOI":"10.1109\/TKDE.2019.2922179"},{"doi-asserted-by":"publisher","key":"e_1_3_2_58_2","DOI":"10.1109\/TKDE.2021.3093350"},{"doi-asserted-by":"publisher","key":"e_1_3_2_59_2","DOI":"10.1145\/3477495.3531990"},{"issue":"3","key":"e_1_3_2_60_2","first-page":"845","article-title":"Mining unstructured economic indicators based on PSP_HDP topic model","volume":"31","author":"Zhang Yitao","year":"2020","unstructured":"Yitao Zhang, Changxuan Wan, Xipin Liu, Tengjiao Jiang, Dexi Liu, and Guoqiong Liao. 2020. Mining unstructured economic indicators based on PSP_HDP topic model. 
Journal of Software 31, 3 (2020), 845\u2013865.","journal-title":"Journal of Software"},{"key":"e_1_3_2_61_2","first-page":"7966","article-title":"Dirichlet belief networks for topic structure learning","volume":"31","author":"Zhao He","year":"2018","unstructured":"He Zhao, Lan Du, Wray L. Buntine, and Mingyuan Zhou. 2018. Dirichlet belief networks for topic structure learning. Advances in Neural Information Processing Systems 31 (2018), 7966\u20137977.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_62_2","first-page":"5892","volume-title":"Proceedings of the 35th International Conference on Machine Learning 2018","author":"Zhao He","year":"2018","unstructured":"He Zhao, Lan Du, Wray L. Buntine, and Mingyuan Zhou. 2018. Inter and intra topic structure learning with word embeddings. In Proceedings of the 35th International Conference on Machine Learning 2018. Association for Computing Machinery, New York, NY, 5892\u20135901."}],"container-title":["ACM Transactions on Information 
Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3631352","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3631352","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:35:52Z","timestamp":1750178152000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3631352"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,29]]},"references-count":61,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,5,31]]}},"alternative-id":["10.1145\/3631352"],"URL":"https:\/\/doi.org\/10.1145\/3631352","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"type":"print","value":"1046-8188"},{"type":"electronic","value":"1558-2868"}],"subject":[],"published":{"date-parts":[[2023,12,29]]},"assertion":[{"value":"2022-08-24","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-10-25","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-12-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}