{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T02:30:33Z","timestamp":1775874633722,"version":"3.50.1"},"reference-count":50,"publisher":"Association for Computing Machinery (ACM)","issue":"7","license":[{"start":{"date-parts":[[2023,4,14]],"date-time":"2023-04-14T00:00:00Z","timestamp":1681430400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2020YFC0832600"],"award-info":[{"award-number":["2020YFC0832600"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Fund of China","doi-asserted-by":"crossref","award":["62076027, 42201461"],"award-info":[{"award-number":["62076027, 42201461"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2023,8,31]]},"abstract":"<jats:p>Discovering causal relationships among observed variables is an important research focus in data mining. Existing causal discovery approaches are mainly based on constraint-based methods and functional causal models (FCMs). However, the constraint-based method cannot identify the Markov equivalence class and the functional causal models cannot identify the complex interrelationships when multiple variables affect one variable. To address the two aforementioned problems, we propose a new graph structure Causal Star Graph (CSG) and a corresponding framework Causal Discovery via Causal Star Graphs (CD-CSG) to divide a causal directed acyclic graph into multiple CSGs for causal discovery. In this framework, we also propose a generalized learning in CSGs based on a variational approach to learn the representative intermediate variable of CSG\u2019s non-central variables. Through the generalized learning in CSGs, the asymmetry in the forward and backward model of CD-CSG can be found to identify the causal directions in the directed acyclic graphs. We further divide the CSGs into three categories and provide the causal identification principle under each category in our proposed framework. Experiments using synthetic data show that the causal relationships between variables can be effectively identified with CD-CSG and the accuracy of CD-CSG is higher than the best existing model. By applying CD-CSG to real-world data, our proposed method can greatly augment the applicability and effectiveness of causal discovery.<\/jats:p>","DOI":"10.1145\/3586997","type":"journal-article","created":{"date-parts":[[2023,3,6]],"date-time":"2023-03-06T12:37:28Z","timestamp":1678106248000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Causal Discovery via Causal Star Graphs"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8450-4676","authenticated-orcid":false,"given":"Boxiang","family":"Zhao","sequence":"first","affiliation":[{"name":"Beijing Institute of Technology, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5326-7209","authenticated-orcid":false,"given":"Shuliang","family":"Wang","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6851-0731","authenticated-orcid":false,"given":"Lianhua","family":"Chi","sequence":"additional","affiliation":[{"name":"La Trobe University, Bundoora, Melbourne, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1896-7044","authenticated-orcid":false,"given":"Qi","family":"Li","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0471-2251","authenticated-orcid":false,"given":"Xiaojia","family":"Liu","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4076-6134","authenticated-orcid":false,"given":"Jing","family":"Geng","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2023,4,14]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1031833662"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1995.10476535"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1996.10476902"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF02293905"},{"key":"e_1_3_2_6_2","first-page":"900","volume-title":"Proceedings of the International Conference on Artificial Intelligence and Statistics","author":"Bl\u00f6baum Patrick","year":"2018","unstructured":"Patrick Bl\u00f6baum, Dominik Janzing, Takashi Washio, Shohei Shimizu, and Bernhard Sch\u00f6lkopf. 2018. Cause-effect inference by comparing regression errors. In Proceedings of the International Conference on Artificial Intelligence and Statistics. PMLR, 900\u2013909."},{"issue":"2","key":"e_1_3_2_7_2","article-title":"Improving the reliability of causal discovery from small datasets using argumentation.","volume":"10","author":"Bromberg Facundo","year":"2009","unstructured":"Facundo Bromberg and Dimitris Margaritis. 2009. Improving the reliability of causal discovery from small datasets using argumentation. Journal of Machine Learning Research 10, 2 (2009), 141\u2013180.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1214\/14-AOS1260"},{"issue":"6","key":"e_1_3_2_9_2","first-page":"1470","article-title":"A survey on non-temporal series observational data based causal discovery","volume":"40","author":"Cai Ruichu","year":"2017","unstructured":"Ruichu Cai, Wei Chen, Kun Zhang, and Zhifeng Hao. 2017. A survey on non-temporal series observational data based causal discovery. Chinese Journal of Computers 40, 6 (2017), 1470\u20131490.","journal-title":"Chinese Journal of Computers"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.5555\/3367243.3367262"},{"key":"e_1_3_2_11_2","first-page":"316","volume-title":"Proceedings of the 30th International Conference on Machine Learning.","author":"Chang Billy","year":"2013","unstructured":"Billy Chang, Uwe Kr\u00fcger, Rafal Kustra, and Junping Zhang. 2013. Canonical correlation analysis based on Hilbert\u2013Schmidt independence criterion and centered kernel target alignment. In Proceedings of the 30th International Conference on Machine Learning.316\u2013324."},{"key":"e_1_3_2_12_2","first-page":"507","article-title":"Optimal structure identification with greedy search","volume":"3","author":"Chickering David Maxwell","year":"2002","unstructured":"David Maxwell Chickering. 2002. Optimal structure identification with greedy search. Journal of Machine Learning Research 3, Nov. (2002), 507\u2013554.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2750365"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2750365"},{"key":"e_1_3_2_15_2","first-page":"850","volume-title":"Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence","author":"Colombo Diego","year":"2011","unstructured":"Diego Colombo, Marloes H. Maathuis, Markus Kalisch, and Thomas S. Richardson. 2011. Learning high-dimensional DAGs with latent and selection variables. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence. 850."},{"key":"e_1_3_2_16_2","first-page":"116","volume-title":"Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence","author":"Cooper Gregory F.","year":"1999","unstructured":"Gregory F. Cooper and Changwon Yoo. 1999. Causal discovery from a mixture of experimental and observational data. In Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence. 116\u2013125."},{"key":"e_1_3_2_17_2","first-page":"143","volume-title":"Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence","author":"Daniusis Povilas","year":"2010","unstructured":"Povilas Daniusis, Dominik Janzing, Joris M. Mooij, Jakob Zscheischler, Bastian Steudel, Kun Zhang, and Bernhard Sch\u00f6lkopf. 2010. Inferring deterministic causal relations. In Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence. 143\u2013150."},{"key":"e_1_3_2_18_2","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1007\/978-3-030-21810-2_12","volume-title":"Proceedings of the Cause Effect Pairs in Machine Learning","author":"Fonollosa Jos\u00e9 A. R.","year":"2019","unstructured":"Jos\u00e9 A. R. Fonollosa. 2019. Conditional distribution variability measures for causality detection. In Proceedings of the Cause Effect Pairs in Machine Learning. Springer, 339\u2013347."},{"key":"e_1_3_2_19_2","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1109\/CIC.1997.647926","volume-title":"Proceedings of the Computers in Cardiology 1997","author":"Guvenir H. Altay","year":"1997","unstructured":"H. Altay Guvenir, Burak Acar, Gulsen Demiroz, and Ayhan Cekin. 1997. A supervised machine learning algorithm for arrhythmia analysis. In Proceedings of the Computers in Cardiology 1997. 433\u2013436."},{"key":"e_1_3_2_20_2","first-page":"969","volume-title":"Proceedings of the International Conference on Advances in Social Networks Analysis and Mining","author":"Hauffa Jan","year":"2019","unstructured":"Jan Hauffa, Wolfgang Br\u00e4u, and Georg Groh. 2019. Detection of topical influence in social networks via granger-causal inference: A Twitter case study. In Proceedings of the International Conference on Advances in Social Networks Analysis and Mining. 969\u2013977."},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.5555\/2503308.2503320"},{"key":"e_1_3_2_22_2","first-page":"689","volume-title":"Proceedings of the 22nd Annual Conference on Neural Information Processing Systems","author":"Hoyer Patrik O.","year":"2008","unstructured":"Patrik O. Hoyer, Dominik Janzing, Joris M. Mooij, Jonas Peters, and Bernhard Sch\u00f6lkopf. 2008. Nonlinear causal discovery with additive noise models. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. 689\u2013696."},{"key":"e_1_3_2_23_2","first-page":"5212","volume-title":"Proceedings of the Annual Conference on Neural Information Processing Systems","author":"Hu Shoubo","year":"2018","unstructured":"Shoubo Hu, Zhitang Chen, Vahid Partovi Nia, Lai-Wan Chan, and Yanhui Geng. 2018. Causal inference and mechanism clustering of a mixture of additive noise models. In Proceedings of the Annual Conference on Neural Information Processing Systems. 5212\u20135222."},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2012.01.002"},{"key":"e_1_3_2_25_2","first-page":"37:1\u201337:5","article-title":"Causal discovery toolbox: Uncovering causal relationships in python","volume":"21","author":"Kalainathan Diviyan","year":"2020","unstructured":"Diviyan Kalainathan, Olivier Goudet, and Ritik Dutta. 2020. Causal discovery toolbox: Uncovering causal relationships in python. Journal of Machine Learning Research 21, 1 (2020), 37:1\u201337:5.","journal-title":"Journal of Machine Learning Research"},{"issue":"3","key":"e_1_3_2_26_2","first-page":"613","article-title":"Estimating high-dimensional directed acyclic graphs with the PC-algorithm","volume":"8","author":"Kalisch Markus","year":"2007","unstructured":"Markus Kalisch and Peter B\u00fchlman. 2007. Estimating high-dimensional directed acyclic graphs with the PC-algorithm. Journal of Machine Learning Research 8, 3 (2007), 613\u2013636.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_27_2","volume-title":"Proceedings of the 2nd International Conference on Learning Representations","author":"Kingma Diederik P.","year":"2014","unstructured":"Diederik P. Kingma and Max Welling. 2014. Auto-encoding variational bayes. In Proceedings of the 2nd International Conference on Learning Representations."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCBB.2016.2591526"},{"key":"e_1_3_2_29_2","first-page":"14257","volume-title":"Proceedings of the Annual Conference on Neural Information Processing Systems 2019","author":"Li Honghao","year":"2019","unstructured":"Honghao Li, Vincent Cabeli, Nadir Sella, and Herv\u00e9 Isambert. 2019. Constraint-based causal structure learning with consistent separating sets. In Proceedings of the Annual Conference on Neural Information Processing Systems 2019. 14257\u201314266."},{"key":"e_1_3_2_30_2","first-page":"147","volume-title":"Proceedings of the Causality: Objectives and Assessment.","author":"Mooij Joris M.","year":"2010","unstructured":"Joris M. Mooij and Dominik Janzing. 2010. Distinguishing between cause and effect. In Proceedings of the Causality: Objectives and Assessment.147\u2013156."},{"key":"e_1_3_2_31_2","first-page":"32:1\u201332:102","article-title":"Distinguishing cause from effect using observational data: Methods and benchmarks","volume":"17","author":"Mooij Joris M.","year":"2016","unstructured":"Joris M. Mooij, Jonas Peters, Dominik Janzing, Jakob Zscheischler, and Bernhard Sch\u00f6lkopf. 2016. Distinguishing cause from effect using observational data: Methods and benchmarks. Journal of Machine Learning Research 17, 1 (2016), 32:1\u201332:102.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_32_2","first-page":"p411","article-title":"The population biology of abalone (Haliotis species) in Tasmania. i. blacklip abalone (h. rubra) from the north coast and islands of bass strait","volume":"48","author":"Nash Warwick J.","year":"1994","unstructured":"Warwick J. Nash, Tracy L. Sellers, Simon R. Talbot, Andrew J. Cawthorn, and Wes B. Ford. 1994. The population biology of abalone (Haliotis species) in Tasmania. i. blacklip abalone (h. rubra) from the north coast and islands of bass strait. Sea Fisheries Division, Technical Report 48 (1994), p411.","journal-title":"Sea Fisheries Division, Technical Report"},{"key":"e_1_3_2_33_2","volume-title":"Causality: Models, Reasoning, and Inference","author":"Pearl Judea","year":"2000","unstructured":"Judea Pearl. 2000. Causality: Models, Reasoning, and Inference. Cambridge University Press."},{"key":"e_1_3_2_34_2","volume-title":"Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference","author":"Pearl Judea","year":"2014","unstructured":"Judea Pearl. 2014. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Elsevier."},{"key":"e_1_3_2_35_2","doi-asserted-by":"crossref","unstructured":"Kurt Driessens and Saso Dzeroski. 2005. Combining model-based and instance-based learning for first order regression. Machine Learning Proceedings of the Twenty-Second International Conference (ICML\u201905 Bonn Germany August 7-11 2005) ACM International Conference Proceeding Series Vol. 119 ACM 193\u2013200.","DOI":"10.1145\/1102351.1102376"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1126\/science.1105809"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.5555\/1248547.1248619"},{"key":"e_1_3_2_38_2","volume-title":"Causation, Prediction, and Search","author":"Spirtes Peter","year":"2000","unstructured":"Peter Spirtes, Clark N. Glymour, Richard Scheines, and David Heckerman. 2000. Causation, Prediction, and Search. MIT Press."},{"key":"e_1_3_2_39_2","first-page":"1","article-title":"An algorithm for causal inference in the presence of latent variables and selection bias","volume":"21","author":"Spirtes Peter","year":"1999","unstructured":"Peter Spirtes, Christopher Meek, and Thomas Richardson. 1999. An algorithm for causal inference in the presence of latent variables and selection bias. Computation, Causation, and Discovery 21 (1999), 1\u2013252.","journal-title":"Computation, Causation, and Discovery"},{"key":"e_1_3_2_40_2","volume-title":"Umweltstatistik: Statistische Verarbeitung und Analyse Von Umweltdaten","author":"Stoyan Helga","year":"2013","unstructured":"Helga Stoyan and Uwe Jansen. 2013. Umweltstatistik: Statistische Verarbeitung und Analyse Von Umweltdaten. Springer-Verlag."},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273604"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.2014.2320500"},{"key":"e_1_3_2_43_2","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1007\/978-94-007-6094-3_17","article-title":"Social networks and causal inference","author":"VanderWeele Tyler J.","year":"2013","unstructured":"Tyler J. VanderWeele and Weihua An. 2013. Social networks and causal inference. Handbook of Causal Analysis for Social Research (2013), 353\u2013374.","journal-title":"Handbook of Causal Analysis for Social Research"},{"key":"e_1_3_2_44_2","first-page":"255","volume-title":"Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence","author":"Verma Thomas","year":"1990","unstructured":"Thomas Verma and Judea Pearl. 1990. Equivalence and synthesis of causal models. In Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence. 255\u2013270."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.2019.1686987"},{"issue":"4","key":"e_1_3_2_46_2","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1061\/(ASCE)0899-1561(2006)18:4(597)","article-title":"Analysis of strength of concrete using design of experiments and neural networks","volume":"18","author":"Yeh I-Cheng","year":"2006","unstructured":"I-Cheng Yeh. 2006. Analysis of strength of concrete using design of experiments and neural networks. Journal of Materials in Civil Engineering 18, 4 (2006), 597\u2013604.","journal-title":"Journal of Materials in Civil Engineering"},{"key":"e_1_3_2_47_2","first-page":"647","volume-title":"Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence","author":"Zhang Kun","year":"2009","unstructured":"Kun Zhang and Aapo Hyv\u00e4rinen. 2009. On the identifiability of the post-nonlinear causal model. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. 647\u2013655."},{"key":"e_1_3_2_48_2","first-page":"804","volume-title":"Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence","author":"Zhang Kun","year":"2011","unstructured":"Kun Zhang, Jonas Peters, Dominik Janzing, and Bernhard Sch\u00f6lkopf. 2011. Kernel-based conditional independence test and application in causal discovery. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence. 804\u2013813."},{"issue":"2","key":"e_1_3_2_49_2","first-page":"13:1\u201313:22","article-title":"On estimation of functional causal models: General results and application to the post-nonlinear causal model","volume":"7","author":"Zhang Kun","year":"2016","unstructured":"Kun Zhang, Zhikun Wang, Jiji Zhang, and Bernhard Sch\u00f6lkopf. 2016. On estimation of functional causal models: General results and application to the post-nonlinear causal model. ACM Transactions on Intelligent Systems and Technology 7, 2 (2016), 13:1\u201313:22.","journal-title":"ACM Transactions on Intelligent Systems and Technology"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2014.01.057"},{"key":"e_1_3_2_51_2","first-page":"1937","volume-title":"Proceedings of the 22nd Annual Conference on Neural Information Processing Systems","author":"Zhang Xinhua","year":"2008","unstructured":"Xinhua Zhang, Le Song, Arthur Gretton, and Alexander J. Smola. 2008. Kernel measures of independence for non-iid data. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. 1937\u20131944."}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3586997","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3586997","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:33Z","timestamp":1750178253000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3586997"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,14]]},"references-count":50,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2023,8,31]]}},"alternative-id":["10.1145\/3586997"],"URL":"https:\/\/doi.org\/10.1145\/3586997","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,4,14]]},"assertion":[{"value":"2022-02-15","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-02-27","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-04-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}