{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T05:52:30Z","timestamp":1774072350621,"version":"3.50.1"},"reference-count":102,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2022,3,12]],"date-time":"2022-03-12T00:00:00Z","timestamp":1647043200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,3,12]],"date-time":"2022-03-12T00:00:00Z","timestamp":1647043200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001774","name":"University of Sydney","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001774","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Knowl Inf Syst"],"published-print":{"date-parts":[[2022,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>As an essential component of human cognition, cause\u2013effect relations appear frequently in text, and curating cause\u2013effect relations from text helps in building causal networks for predictive tasks. Existing causality extraction techniques include knowledge-based, statistical machine learning (ML)-based, and deep learning-based approaches. Each method has its advantages and weaknesses. For example, knowledge-based methods are understandable but require extensive manual domain knowledge and have poor cross-domain applicability. Statistical machine learning methods are more automated because of natural language processing (NLP) toolkits. However, feature engineering is labor-intensive, and toolkits may lead to error propagation. 
In the past few years, deep learning techniques have attracted substantial attention from NLP researchers because of their powerful representation learning ability and the rapid increase in computational resources. Their limitations include high computational costs and a lack of adequate annotated training data. In this paper, we conduct a comprehensive survey of causality extraction. We first introduce the primary forms of causality in causality extraction: explicit intra-sentential causality, implicit causality, and inter-sentential causality. Next, we list benchmark datasets and model assessment methods for causal relation extraction. Then, we present a structured overview of the three techniques with their representative systems. Lastly, we highlight open challenges and their potential directions.<\/jats:p>","DOI":"10.1007\/s10115-022-01665-w","type":"journal-article","created":{"date-parts":[[2022,3,12]],"date-time":"2022-03-12T08:05:48Z","timestamp":1647072348000},"page":"1161-1186","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":65,"title":["A survey on extraction of causal relations from natural language text"],"prefix":"10.1007","volume":"64","author":[{"given":"Jie","family":"Yang","sequence":"first","affiliation":[]},{"given":"Soyeon Caren","family":"Han","sequence":"additional","affiliation":[]},{"given":"Josiah","family":"Poon","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,3,12]]},"reference":[{"issue":"11","key":"1665_CR1","doi-asserted-by":"publisher","first-page":"S2","DOI":"10.1186\/1471-2105-9-S11-S2","volume":"9","author":"A Airola","year":"2008","unstructured":"Airola A, Pyysalo S, Bj\u00f6rne J, Pahikkala T, Ginter F, Salakoski T (2008) All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning. BMC Bioinform 9(11):S2. 
https:\/\/doi.org\/10.1186\/1471-2105-9-S11-S2","journal-title":"BMC Bioinform"},{"key":"1665_CR2","unstructured":"Asghar N (2016) Automatic extraction of causal relations from natural language texts: a comprehensive survey. arXiv preprint arXiv:1605.07895"},{"key":"1665_CR3","doi-asserted-by":"publisher","unstructured":"Balashankar A, Chakraborty S, Fraiberger S, Subramanian L (2019) Identifying predictive causal factors from news streams. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), association for computational linguistics, Hong Kong, China, pp 2338\u2013234. https:\/\/doi.org\/10.18653\/v1\/D19-1238","DOI":"10.18653\/v1\/D19-1238"},{"key":"1665_CR4","doi-asserted-by":"publisher","unstructured":"Barik B, Marsi E, Ozturk P (2016) Event causality extraction from natural science literature. Res Comput Sci 117:97\u2013107. https:\/\/doi.org\/10.13053\/rcs-117-1-8","DOI":"10.13053\/rcs-117-1-8"},{"key":"1665_CR5","unstructured":"Beamer B, Rozovskaya A, Girju R (2008) Automatic semantic relation extraction with multiple boundary generation. In: Proceedings of the 23rd national conference on artificial intelligence. AAAI Press, Chicago, Illinois, pp 824\u2013829"},{"key":"1665_CR6","doi-asserted-by":"publisher","unstructured":"Bekoulis G, Deleu J, Demeester T, Develder C (2018a) Adversarial training for multi-context joint entity and relation extraction. In: Proceedings of the 2018 conference on empirical methods in natural language processing, association for computational linguistics. Brussels, Belgium, pp 2830\u20132836. 
https:\/\/doi.org\/10.18653\/v1\/D18-1307","DOI":"10.18653\/v1\/D18-1307"},{"key":"1665_CR7","doi-asserted-by":"publisher","first-page":"34","DOI":"10.1016\/j.eswa.2018.07.032","volume":"114","author":"G Bekoulis","year":"2018","unstructured":"Bekoulis G, Deleu J, Demeester T, Develder C (2018) Joint entity recognition and relation extraction as a multi-head selection problem. Expert Syst Appl 114:34\u201345. https:\/\/doi.org\/10.1016\/j.eswa.2018.07.032","journal-title":"Expert Syst Appl"},{"key":"1665_CR8","doi-asserted-by":"publisher","unstructured":"Beltagy I, Lo K, Cohan A (2019) Scibert: pretrained language model for scientific text. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing, Association for Computational Linguistics, Hong Kong, China, pp 3615\u20133620. https:\/\/doi.org\/10.18653\/v1\/D19-1371","DOI":"10.18653\/v1\/D19-1371"},{"key":"1665_CR9","doi-asserted-by":"crossref","unstructured":"Bethard S, Martin JH (2008) Learning semantic links from a corpus of parallel temporal and causal relations. In: Proceedings of ACL-08: HLT, short papers, association for computational linguistics. Columbus, Ohio, pp 177\u2013180","DOI":"10.3115\/1557690.1557740"},{"key":"1665_CR10","unstructured":"Blanco E, Castell N, Moldovan D (2008) Causal relation extraction. In: Proceedings of the international conference on language resources and evaluation. Marrakech, Morocco, pp 310\u2013313"},{"key":"1665_CR11","unstructured":"Brown PF, Della\u00a0Pietra VJ, deSouza PV, Lai JC, Mercer RL (1992) Class-based n-gram models of natural language. Comput Ling 18(4):467\u2013480. https:\/\/aclanthology.org\/J92-4003"},{"key":"1665_CR12","doi-asserted-by":"publisher","unstructured":"Bui QC, Nuallain OB, Boucher CA, Sloot PM (2010) Extracting causal relations on hiv drug resistance from literature. BMC Bioinform 11(1):101\u2013110. 
https:\/\/doi.org\/10.1186\/1471-2105-11-101","DOI":"10.1186\/1471-2105-11-101"},{"issue":"3","key":"1665_CR13","doi-asserted-by":"publisher","first-page":"662","DOI":"10.1016\/j.ipm.2005.04.004","volume":"42","author":"DS Chang","year":"2006","unstructured":"Chang DS, Choi KS (2006) Incremental cue phrase learning and bootstrapping method for causality extraction using cue phrase and word pair probabilities. Inf Process Manage 42(3):662\u2013678. https:\/\/doi.org\/10.1016\/j.ipm.2005.04.004","journal-title":"Inf Process Manage"},{"key":"1665_CR14","doi-asserted-by":"publisher","unstructured":"Chen D, Li Y, Lei K, Shen Y (2020) Relabel the noise: joint extraction of entities and relations via cooperative multiagents. In: Jurafsky D, Chai J, Schluter N, Tetreault JR (eds) Proceedings of the 58th annual meeting of the association for computational linguistics, ACL 2020. Online, July 5-10, 2020, Association for Computational Linguistics, pp 5940\u20135950. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.527","DOI":"10.18653\/v1\/2020.acl-main.527"},{"key":"1665_CR15","doi-asserted-by":"publisher","unstructured":"Chen J, Zhang Q, Liu P, Qiu X, Huang X (2016) Implicit discourse relation detection via a deep architecture with gated relevance network. In: Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: long papers). Association for Computational Linguistics, Berlin, Germany, pp 1726\u20131735. https:\/\/doi.org\/10.18653\/v1\/P16-1163","DOI":"10.18653\/v1\/P16-1163"},{"issue":"1","key":"1665_CR16","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1186\/s12864-019-6413-7","volume":"21","author":"D Chicco","year":"2020","unstructured":"Chicco D, Jurman G (2020) The advantages of the matthews correlation coefficient (mcc) over f1 score and accuracy in binary classification evaluation. BMC Genomics 21(1):6. 
https:\/\/doi.org\/10.1186\/s12864-019-6413-7","journal-title":"BMC Genomics"},{"key":"1665_CR17","doi-asserted-by":"publisher","unstructured":"Christopoulou F, Miwa M, Ananiadou S (2018) A walk-based model on entity graphs for relation extraction. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 2: short papers). Association for Computational Linguistics, Melbourne, Australia, pp 81\u201388. https:\/\/doi.org\/10.18653\/v1\/P18-2014","DOI":"10.18653\/v1\/P18-2014"},{"key":"1665_CR18","doi-asserted-by":"publisher","unstructured":"Christopoulou F, Miwa M, Ananiadou S (2019) Connecting the dots: document-level neural relation extraction with edge-oriented graphs. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, pp 4927\u20134938. https:\/\/doi.org\/10.18653\/v1\/D19-1498","DOI":"10.18653\/v1\/D19-1498"},{"key":"1665_CR19","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1109\/second.2006.1629336","volume":"2006","author":"SV Cole","year":"2006","unstructured":"Cole SV, Royal MD, Valtorta MG, Huhns MN, Bowles JB (2006) A lightweight tool for automatically extracting causal relationships from text. Proc IEEE SoutheastCon 2006:125\u2013129. https:\/\/doi.org\/10.1109\/second.2006.1629336","journal-title":"Proc IEEE SoutheastCon"},{"issue":"1","key":"1665_CR20","doi-asserted-by":"publisher","first-page":"80","DOI":"10.1145\/234173.234209","volume":"39","author":"J Cowie","year":"1996","unstructured":"Cowie J, Lehnert W (1996) Information extraction. Commun ACM 39(1):80\u201391. 
https:\/\/doi.org\/10.1145\/234173.234209","journal-title":"Commun ACM"},{"key":"1665_CR21","doi-asserted-by":"publisher","unstructured":"Dasgupta T, Saha R, Dey L, Naskar A (2018) Automatic extraction of causal relations from text using linguistically informed deep neural networks. In: Proceedings of the 19th annual SIGdial meeting on discourse and dialogue. Association for Computational Linguistics, Melbourne, Australia, pp 306\u2013316. https:\/\/doi.org\/10.18653\/v1\/W18-5035","DOI":"10.18653\/v1\/W18-5035"},{"key":"1665_CR22","doi-asserted-by":"publisher","unstructured":"Devlin J, Chang MW, Lee K, Toutanova K (2019) Bert: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT, pp 4171\u20134186. https:\/\/doi.org\/10.18653\/v1\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"key":"1665_CR23","doi-asserted-by":"publisher","first-page":"549","DOI":"10.3389\/fgene.2021.669328","volume":"12","author":"GF Dong","year":"2021","unstructured":"Dong GF, Zheng L, Huang SH, Gao J, Zuo YC (2021) Amino acid reduction can help to improve the identification of antimicrobial peptides and their functional activities. Front Genet 12:549. https:\/\/doi.org\/10.3389\/fgene.2021.669328","journal-title":"Front Genet"},{"key":"1665_CR24","doi-asserted-by":"publisher","unstructured":"Ertekin S, Huang J, Bottou L, Giles L (2007) Learning on the border: active learning in imbalanced data classification. In: Proceedings of the sixteenth ACM conference on conference on information and knowledge management. Association for Computing Machinery, New York, NY, USA, CIKM \u201907, pp 127\u2013136. https:\/\/doi.org\/10.1145\/1321440.1321461","DOI":"10.1145\/1321440.1321461"},{"key":"1665_CR25","doi-asserted-by":"publisher","unstructured":"Garcia D, EDF-DER, IMA-TIEM (2006) COATIS, an NLP system to locate expressions of actions connected by causality links, vol 1319. Springer, pp 347\u2013352 (chap BFb0026799). 
https:\/\/doi.org\/10.1007\/BFb0026799","DOI":"10.1007\/BFb0026799"},{"key":"1665_CR26","doi-asserted-by":"publisher","unstructured":"Girju R (2003) Automatic detection of causal relations for question answering. In: Proceedings of the ACL 2003 workshop on multilingual summarization and question answering, Association for Computational Linguistics, USA, MultiSumQA \u201903, vol\u00a012, pp 76\u201383. https:\/\/doi.org\/10.3115\/1119312.1119322","DOI":"10.3115\/1119312.1119322"},{"key":"1665_CR27","doi-asserted-by":"crossref","unstructured":"Girju R, Nakov P, Nastase V, Szpakowicz S, Turney P, Yuret D (2007) Semeval-2007 task 04: classification of semantic relations between nominals. In: Proceedings of the 4th international workshop on semantic evaluations. Association for Computational Linguistics, USA, SemEval \u201907, pp 13\u201318","DOI":"10.3115\/1621474.1621477"},{"issue":"2","key":"1665_CR28","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1007\/s10579-009-9083-2","volume":"43","author":"R Girju","year":"2009","unstructured":"Girju R, Nakov P, Nastase V, Szpakowicz S, Turney P, Yuret D (2009) Classification of semantic relations between nominals. Lang Resour Eval 43(2):105\u2013121. https:\/\/doi.org\/10.1007\/s10579-009-9083-2","journal-title":"Lang Resour Eval"},{"key":"1665_CR29","unstructured":"Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press"},{"issue":"3","key":"1665_CR30","doi-asserted-by":"publisher","first-page":"424","DOI":"10.2307\/1912791","volume":"37","author":"CWJ Granger","year":"1969","unstructured":"Granger CWJ (1969) Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37(3):424\u2013438. https:\/\/doi.org\/10.2307\/1912791","journal-title":"Econometrica"},{"key":"1665_CR31","doi-asserted-by":"publisher","unstructured":"Gu Q, Zhu L, Cai Z (2009) Evaluation measures of the classification performance of imbalanced data sets. 
In: Computational intelligence and intelligent systems, communications in computer and information science, Springer Berlin Heidelberg, Berlin, Heidelberg, pp 461\u2013471. https:\/\/doi.org\/10.1007\/978-3-642-04962-0_53","DOI":"10.1007\/978-3-642-04962-0_53"},{"key":"1665_CR32","doi-asserted-by":"publisher","unstructured":"Guo Z, Zhang Y, Lu W (2019) Attention guided graph convolutional networks for relation extraction. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Association for Computational Linguistics, Florence, Italy, pp 241\u2013251. https:\/\/doi.org\/10.18653\/v1\/P19-1024","DOI":"10.18653\/v1\/P19-1024"},{"issue":"5","key":"1665_CR33","doi-asserted-by":"publisher","first-page":"885","DOI":"10.1016\/j.jbi.2012.04.008","volume":"45","author":"H Gurulingappa","year":"2012","unstructured":"Gurulingappa H, Rajput AM, Roberts A, Fluck J, Hofmann-Apitius M, Toldo L (2012) Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports. J Biomed Inform 45(5):885\u2013892. https:\/\/doi.org\/10.1016\/j.jbi.2012.04.008","journal-title":"J Biomed Inform"},{"key":"1665_CR34","doi-asserted-by":"crossref","unstructured":"Hendrickx I, Kim SN, Kozareva Z, Nakov P, \u00d3\u00a0S\u00e9aghdha D, Pad\u00f3 S, Pennacchiotti M, Romano L, Szpakowicz S (2010) SemEval-2010 task 8: Multi-way classification of semantic relations between pairs of nominals. In: Proceedings of the 5th International Workshop on Semantic Evaluation, Association for Computational Linguistics, Uppsala, Sweden, pp 33\u201338","DOI":"10.3115\/1621969.1621986"},{"key":"1665_CR35","doi-asserted-by":"publisher","unstructured":"Hidey C, McKeown K (2016) Identifying causal relations using parallel Wikipedia articles. In: Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: long papers). 
Association for computational linguistics, Berlin, Germany, pp 1424\u20131433. https:\/\/doi.org\/10.18653\/v1\/P16-1135","DOI":"10.18653\/v1\/P16-1135"},{"key":"1665_CR36","unstructured":"Honnibal M, Montani I (2017) spacy 2: Natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing (to appear)"},{"key":"1665_CR37","doi-asserted-by":"publisher","unstructured":"Huang H, Wong R (2020) Deep embedding for relation extraction on insufficient labelled data. In: 2020 international joint conference on neural networks (IJCNN), pp 1\u20138. https:\/\/doi.org\/10.1109\/IJCNN48605.2020.9207554","DOI":"10.1109\/IJCNN48605.2020.9207554"},{"key":"1665_CR38","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1016\/j.datak.2013.08.004","volume":"88","author":"A Ittoo","year":"2013","unstructured":"Ittoo A, Bouma G (2013) Minimally-supervised learning of domain-specific causal relations using an open-domain corpus as knowledge base. Data Knowl Eng 88:142\u2013163. https:\/\/doi.org\/10.1016\/j.datak.2013.08.004","journal-title":"Data Knowl Eng"},{"key":"1665_CR39","doi-asserted-by":"publisher","first-page":"739","DOI":"10.1007\/978-3-030-47426-3_57","volume":"12084","author":"X Jin","year":"2020","unstructured":"Jin X, Wang X, Luo X, Huang S, Gu S (2020) Inter-sentence and implicit causality extraction from chinese corpus. Pacific-Asia Conf Knowl Discov Data Min Springer 12084:739\u2013751. https:\/\/doi.org\/10.1007\/978-3-030-47426-3_57","journal-title":"Pacific-Asia Conf Knowl Discov Data Min Springer"},{"key":"1665_CR40","doi-asserted-by":"publisher","unstructured":"Jinhyuk L, Wonjin Y, Kim, (2019) Biobert: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36(4):1234\u20131240. 
https:\/\/doi.org\/10.1093\/bioinformatics\/btz682","DOI":"10.1093\/bioinformatics\/btz682"},{"key":"1665_CR41","doi-asserted-by":"publisher","unstructured":"Kadir RA, Bokharaeian B (2013) Overview of biomedical relations extraction using hybrid rule-based approaches. J Indus Intell Inf 1(3):169\u2013173. https:\/\/doi.org\/10.12720\/jiii.1.3.169-173","DOI":"10.12720\/jiii.1.3.169-173"},{"key":"1665_CR42","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1186\/1471-2105-15-64","volume":"15","author":"N Kang","year":"2014","unstructured":"Kang N, Singh B, Bui QC, Afzal Z, van Mulligen EM, Kors J (2014) Knowledge-based extraction of adverse drug events from biomedical text. BMC Bioinform 15:64. https:\/\/doi.org\/10.1186\/1471-2105-15-64","journal-title":"BMC Bioinform"},{"key":"1665_CR43","doi-asserted-by":"publisher","unstructured":"Karagiannopoulos MG, Anyfantis DS, Kotsiantis SB, Pintelas PE (2007) Local cost sensitive learning for handling imbalanced data sets. In: 2007 Mediterranean conference on control automation, pp 1\u20136. https:\/\/doi.org\/10.1109\/MED.2007.4433808","DOI":"10.1109\/MED.2007.4433808"},{"key":"1665_CR44","doi-asserted-by":"publisher","first-page":"398","DOI":"10.1016\/j.jksuci.2014.06.001","volume":"26","author":"I Keskes","year":"2014","unstructured":"Keskes I, Zitoune FB, Belguith L (2014) Learning explicit and implicit arabic discourse relations. J King Saud Univ Comput Inf Sci Arch 26:398\u2013416. https:\/\/doi.org\/10.1016\/j.jksuci.2014.06.001","journal-title":"J King Saud Univ Comput Inf Sci Arch"},{"key":"1665_CR45","doi-asserted-by":"publisher","unstructured":"Khoo C, Chan S, Niu Y (2002) The many facets of the cause-effect relation. The Semantics of Relationships, pp 51\u201370. 
https:\/\/doi.org\/10.1007\/978-94-017-0073-3_4","DOI":"10.1007\/978-94-017-0073-3_4"},{"issue":"4","key":"1665_CR46","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1093\/llc\/13.4.177","volume":"13","author":"CSG Khoo","year":"1998","unstructured":"Khoo CSG, Kornfilt J, Oddy RN, Myaeng SH (1998) Automatic extraction of cause-effect information from newspaper text without knowledge-based inferencing. Literary Ling Comput 13(4):177\u2013186. https:\/\/doi.org\/10.1093\/llc\/13.4.177","journal-title":"Literary Ling Comput"},{"key":"1665_CR47","doi-asserted-by":"publisher","unstructured":"Khoo CSG, Chan S, Niu Y (2000) Extracting causal knowledge from a medical database using graphical patterns. In: Proceedings of the 38th annual meeting of the association for computational linguistics. Association for Computational Linguistics, Hong Kong, pp 336\u2013343. https:\/\/doi.org\/10.3115\/1075218.1075261","DOI":"10.3115\/1075218.1075261"},{"key":"1665_CR48","doi-asserted-by":"publisher","unstructured":"Kim H, Castellanos M, Hsu M, Zhai C, Rietz T, Diermeier D (2013) Mining causal topics in text data: iterative topic modeling with time series feedback. In: CIKM 2013\u2014proceedings of the 22nd ACM international conference on information and knowledge management. International Conference on Information and Knowledge Management, Proceedings, pp 885\u2013890. https:\/\/doi.org\/10.1145\/2505515.2505612","DOI":"10.1145\/2505515.2505612"},{"key":"1665_CR49","doi-asserted-by":"crossref","unstructured":"Kruengkrai C, Torisawa K, Hashimoto C, Kloetzer J, Oh JH, Tanaka M (2017) Improving event causality recognition with multiple background knowledge sources using multi-column convolutional neural networks. In: Proceedings of the thirty-first AAAI conference on artificial intelligence. 
AAAI Press, pp 3466-3473","DOI":"10.1609\/aaai.v31i1.11005"},{"key":"#cr-split#-1665_CR50.1","unstructured":"Kubat M, Matwin S (1997) Addressing the curse of imbalanced training sets: one-sided selection. In: Fisher DH"},{"key":"#cr-split#-1665_CR50.2","unstructured":"(ed) Proceedings of the fourteenth international conference on machine learning (ICML 1997). Nashville, Tennessee, USA, July 8-12, 1997. Morgan Kaufmann, pp 179-186"},{"key":"1665_CR51","doi-asserted-by":"publisher","unstructured":"Kyriakakis M, Androutsopoulos I, Saudabayev A, Gin\u00e9s\u00a0i Ametll\u00e9 J (2019) Transfer learning for causal sentence detection. In: Proceedings of the 18th BioNLP workshop and shared task. Association for Computational Linguistics, Florence, Italy, pp 292\u2013297. https:\/\/doi.org\/10.18653\/v1\/W19-5031","DOI":"10.18653\/v1\/W19-5031"},{"key":"1665_CR52","doi-asserted-by":"publisher","unstructured":"Lan M, Wang J, Wu Y, Niu ZY, Wang H (2017) Multi-task attention-based neural networks for implicit discourse relationship representation and identification. In: Proceedings of the 2017 conference on empirical methods in natural language processing. Association for Computational Linguistics, Copenhagen, Denmark, pp 1299\u20131308. https:\/\/doi.org\/10.18653\/v1\/D17-1134","DOI":"10.18653\/v1\/D17-1134"},{"key":"1665_CR53","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436\u2013444. https:\/\/doi.org\/10.1038\/nature14539","journal-title":"Nature"},{"key":"1665_CR54","doi-asserted-by":"publisher","first-page":"452","DOI":"10.3389\/fphys.2021.658633","volume":"12","author":"A Li","year":"2021","unstructured":"Li A, Deng Y, Tan Y, Chen M (2021) A transfer learning-based approach for lysine propionylation prediction. Front Physiol 12:452. 
https:\/\/doi.org\/10.3389\/fphys.2021.658633","journal-title":"Front Physiol"},{"key":"1665_CR55","doi-asserted-by":"publisher","unstructured":"Li F, Zhang M, Fu G, Ji D (2017) A neural joint model for entity and relation extraction from biomedical text. BMC Bioinform 18. https:\/\/doi.org\/10.1186\/s12859-017-1609-9","DOI":"10.1186\/s12859-017-1609-9"},{"key":"1665_CR56","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1016\/j.neucom.2020.08.078","volume":"423","author":"Z Li","year":"2021","unstructured":"Li Z, Li Q, Zou X, Ren J (2021) Causality extraction based on self-attentive bilstm-crf with transferred embeddings. Neurocomputing 423:207\u2013219. https:\/\/doi.org\/10.1016\/j.neucom.2020.08.078","journal-title":"Neurocomputing"},{"key":"1665_CR57","doi-asserted-by":"crossref","unstructured":"Lin Z, Kan MY, Ng HT (2009) Recognizing implicit discourse relations in the Penn Discourse Treebank. In: Proceedings of the 2009 conference on empirical methods in natural language processing. Association for Computational Linguistics, Singapore, pp 343\u2013351","DOI":"10.3115\/1699510.1699555"},{"key":"1665_CR58","doi-asserted-by":"publisher","first-page":"1783","DOI":"10.3389\/fimmu.2018.01783","volume":"9","author":"B Manavalan","year":"2018","unstructured":"Manavalan B, Shin TH, Kim MO, Lee G (2018) Pip-el: a new ensemble learning method for improved proinflammatory peptide predictions. Front Immunol 9:1783. https:\/\/doi.org\/10.3389\/fimmu.2018.01783","journal-title":"Front Immunol"},{"key":"1665_CR59","doi-asserted-by":"publisher","unstructured":"Manning C, Surdeanu M, Bauer J, Finkel J, Bethard S, McClosky D (2014) The Stanford CoreNLP natural language processing toolkit. In: Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations, association for computational linguistics. Baltimore, Maryland, pp 55\u201360. 
https:\/\/doi.org\/10.3115\/v1\/P14-5010","DOI":"10.3115\/v1\/P14-5010"},{"key":"1665_CR60","doi-asserted-by":"publisher","unstructured":"Marcu D, Echihabi A (2002) An unsupervised approach to recognizing discourse relations. In: Proceedings of the 40th annual meeting of the association for computational linguistics. Association for Computational Linguistics, Philadelphia, Pennsylvania, USA, pp 368\u2013375. https:\/\/doi.org\/10.3115\/1073083.1073145","DOI":"10.3115\/1073083.1073145"},{"key":"1665_CR61","unstructured":"Mart\u00ednez-C\u00e1mara E, Shwartz V, Gurevych I, Dagan I (2017) Neural disambiguation of causal lexical markers based on context. In: IWCS 2017\u201412th international conference on computational semantics\u2014short papers"},{"key":"1665_CR62","doi-asserted-by":"publisher","unstructured":"Matthews B (1975) Comparison of the predicted and observed secondary structure of t4 phage lysozyme. Biochimica et Biophysica Acta (BBA) Prot Struct 405(2):442\u2013451. https:\/\/doi.org\/10.1016\/0005-2795(75)90109-9","DOI":"10.1016\/0005-2795(75)90109-9"},{"issue":"2","key":"1665_CR63","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1475-925X-13-S2-S1","volume":"13","author":"C Mihaila","year":"2014","unstructured":"Mihaila C, Ananiadou S (2014) Semi-supervised learning of causal relations in biomedical scientific discourse. Biomed Eng Online 13(2):1\u201324. https:\/\/doi.org\/10.1186\/1475-925X-13-S2-S1","journal-title":"Biomed Eng Online"},{"key":"1665_CR64","doi-asserted-by":"crossref","unstructured":"Minh\u00a0Tran H, Nguyen MT, Nguyen TH (2020) The dots have their values: exploiting the node-edge connections in graph-based neural models for document-level relation extraction. In: Findings of the association for computational linguistics: EMNLP 2020. Association for Computational Linguistics, Online, pp 4561\u20134567. 
https:\/\/doi.org\/10.18653\/v1\/2020.findings-emnlp.409","DOI":"10.18653\/v1\/2020.findings-emnlp.409"},{"key":"1665_CR65","doi-asserted-by":"crossref","unstructured":"Mintz M, Bills S, Snow R, Jurafsky D (2009) Distant supervision for relation extraction without labeled data. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP. Association for Computational Linguistics, Suntec, Singapore, pp 1003\u20131011","DOI":"10.3115\/1690219.1690287"},{"key":"1665_CR66","unstructured":"Oh JH, Torisawa K, Hashimoto C, Sano M, De\u00a0Saeger S, Ohtake K (2013) Why-question answering using intra-and inter-sentential causal relations. In: ACL 2013\u201451st annual meeting of the association for computational linguistics. Proceedings of the Conference, Sofia, Bulgaria, vol\u00a01, pp 1733\u20131743"},{"key":"1665_CR67","unstructured":"Oh JH, Torisawa K, Hashimoto C, Iida R, Tanaka M, Kloetzer J (2016) A semi-supervised learning approach to why-question answering. In: Proceedings of the thirtieth AAAI conference on artificial intelligence. AAAI Press, AAAI\u201916, pp 3022\u20133029"},{"key":"1665_CR68","doi-asserted-by":"publisher","unstructured":"Oh JH, Torisawa K, Kruengkrai C, Iida R, Kloetzer J (2017) Multi-column convolutional neural networks with causality-attention for why-question answering. In: Proceedings of the Tenth ACM international conference on web search and data mining, pp 415\u2013424. https:\/\/doi.org\/10.1145\/3018661.3018737","DOI":"10.1145\/3018661.3018737"},{"key":"1665_CR69","doi-asserted-by":"publisher","unstructured":"Pakray P, Gelbukh A (2014) An open domain causal relation detection from paired nominal. In: 13th Mexican international conference on artificial intelligence (MICAI-2014). Nature-Inspired Computation and Machine Learning, Chiapas, Mexico, vol 8857, pp 261\u2013271. 
https:\/\/doi.org\/10.1007\/978-3-319-13650-9_24","DOI":"10.1007\/978-3-319-13650-9_24"},{"key":"1665_CR70","unstructured":"Pechsiri C, Kawtrakul A, Piriyakul R (2006) Mining causality knowledge from textual data. In: Proceedings of the 24th IASTED international conference on artificial intelligence and applications. ACTA Press, USA, AIA\u201906, pp 85\u201390"},{"key":"1665_CR71","doi-asserted-by":"crossref","unstructured":"Peng, Yifan, Yan, Shankai (2019) Transfer learning in biomedical natural language processing: an evaluation of bert and elmo on ten benchmarking datasets. In: Proceedings of the BioNLP 2019 workshop. Association for Computational Linguistics, Florence, Italy, pp 58\u201365","DOI":"10.18653\/v1\/W19-5006"},{"key":"1665_CR72","doi-asserted-by":"publisher","unstructured":"Peters ME, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer L (2018) Deep contextualized word representations. In: Proceedings of the 2018 Conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (long papers). Association for Computational Linguistics, New Orleans, Louisiana, pp 2227\u20132237. https:\/\/doi.org\/10.18653\/v1\/N18-1202","DOI":"10.18653\/v1\/N18-1202"},{"key":"1665_CR73","doi-asserted-by":"publisher","unstructured":"Ponti EM, Korhonen A (2017) Event-related features in feedforward neural networks contribute to identifying causal relations in discourse. In: Proceedings of the 2nd workshop on linking models of lexical, sentential and discourse-level semantics. Association for Computational Linguistics, Valencia, Spain, pp 25\u201330. https:\/\/doi.org\/10.18653\/v1\/W17-0903","DOI":"10.18653\/v1\/W17-0903"},{"key":"1665_CR74","unstructured":"Prasad R, Miltsakaki E, Dinesh N, Lee A, Joshi A (2007) The penn discourse treebank 2.0 annotation manual. 
IRCS technical reports series 203 Philadelphia: University of Pennsylvania ScholarlyCommons, p 105"},{"key":"1665_CR75","doi-asserted-by":"crossref","unstructured":"Pyysalo S, Ginter F, Heimonen J, Bj\u00f6rne J, Boberg J, J\u00e4rvinen J, Salakoski T (2007) Bioinfer: a corpus for information extraction in the biomedical domain. BMC Bioinform 8:50. https:\/\/doi.org\/10.1186\/1471-2105-8-50","DOI":"10.1186\/1471-2105-8-50"},{"key":"1665_CR76","doi-asserted-by":"publisher","unstructured":"Qi P, Zhang Y, Zhang Y, Bolton J, Manning CD (2020) Stanza: a python natural language processing toolkit for many human languages. In: Proceedings of the 58th annual meeting of the association for computational linguistics: system demonstrations. Association for Computational Linguistics, Online, pp 101\u2013108. https:\/\/doi.org\/10.18653\/v1\/2020.acl-demos.14","DOI":"10.18653\/v1\/2020.acl-demos.14"},{"key":"1665_CR77","doi-asserted-by":"publisher","unstructured":"Qian L, Zhou G (2016) Chemical-induced disease relation extraction with various linguistic features. Database 2016:baw042. https:\/\/doi.org\/10.1093\/database\/baw042","DOI":"10.1093\/database\/baw042"},{"key":"1665_CR78","doi-asserted-by":"publisher","unstructured":"Qiu J, Xu L, Zhai J, Luo L (2017) Extracting causal relations from emergency cases based on conditional random fields. Procedia Comput Sci 112(C):1623\u20131632. https:\/\/doi.org\/10.1016\/j.procs.2017.08.252","DOI":"10.1016\/j.procs.2017.08.252"},{"key":"1665_CR79","doi-asserted-by":"publisher","unstructured":"Radinsky K, Davidovich S, Markovitch S (2012) Learning causality for news events prediction. WWW\u201912\u2014proceedings of the 21st annual conference on world wide web, pp 909\u2013918. https:\/\/doi.org\/10.1145\/2187836.2187958","DOI":"10.1145\/2187836.2187958"},{"key":"1665_CR80","unstructured":"Rink B, Bejan C, Harabagiu S (2010) Learning textual graph patterns to detect causal event relations. 
In: Proceedings of the 23rd international Florida artificial intelligence research society conference, pp 265\u2013270"},{"key":"1665_CR81","doi-asserted-by":"publisher","unstructured":"Rutherford A, Xue N (2014) Discovering implicit discourse relations through brown cluster pair representation and coreference patterns. In: Proceedings of the 14th conference of the European chapter of the association for computational linguistics. Association for Computational Linguistics, Gothenburg, Sweden, pp 645\u2013654. https:\/\/doi.org\/10.3115\/v1\/E14-1068","DOI":"10.3115\/v1\/E14-1068"},{"key":"1665_CR82","doi-asserted-by":"publisher","unstructured":"Smirnova A, Cudr\u00e9-Mauroux P (2018) Relation extraction using distant supervision: a survey. ACM Comput Surv 51(5). https:\/\/doi.org\/10.1145\/3241741","DOI":"10.1145\/3241741"},{"key":"1665_CR83","unstructured":"Sorgente A, Vettigli G, Mele F (2013) Automatic extraction of cause-effect relations in natural language text. DART@ AI* IA 1109:37\u201348"},{"issue":"10","key":"1665_CR84","doi-asserted-by":"publisher","first-page":"1321","DOI":"10.1109\/TKDE.2007.190623","volume":"19","author":"CT Su","year":"2007","unstructured":"Su CT, Hsiao YH (2007) An evaluation of the robustness of mts for imbalanced data. IEEE Trans Knowl Data Eng 19(10):1321\u20131332. https:\/\/doi.org\/10.1109\/TKDE.2007.190623","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"1665_CR85","doi-asserted-by":"crossref","unstructured":"Voorhees E (2001) Overview of the trec-9 question answering track. Overview of the TREC-9 Question Answering Track, pp 71\u201380","DOI":"10.1145\/502585.502679"},{"key":"1665_CR86","doi-asserted-by":"publisher","unstructured":"Wang D, Hu W, Cao E, Sun W (2020) Global-to-local neural networks for document-level relation extraction. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, Online, pp 3711\u20133721. 
https:\/\/doi.org\/10.18653\/v1\/2020.emnlp-main.303","DOI":"10.18653\/v1\/2020.emnlp-main.303"},{"key":"1665_CR87","doi-asserted-by":"publisher","unstructured":"Wang H, Tan M, Yu M, Chang S, Wang D, Xu K, Guo X, Potdar S (2019) Extracting multiple-relations in one-pass with pre-trained transformers. In: Proceedings of the 57th annual meeting of the association for computational linguistics. Association for Computational Linguistics, Florence, Italy, pp 1371\u20131377. https:\/\/doi.org\/10.18653\/v1\/P19-1132","DOI":"10.18653\/v1\/P19-1132"},{"key":"1665_CR88","doi-asserted-by":"publisher","unstructured":"Wang J, Lu W (2020) Two are better than one: Joint entity and relation extraction with table-sequence encoders. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, Online, pp 1706\u20131721. https:\/\/doi.org\/10.18653\/v1\/2020.emnlp-main.133","DOI":"10.18653\/v1\/2020.emnlp-main.133"},{"key":"1665_CR89","doi-asserted-by":"publisher","unstructured":"Wang L, Cao Z, de\u00a0Melo G, Liu Z (2016) Relation classification via multi-level attention CNNs. In: Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: long papers). Association for Computational Linguistics, Berlin, Germany, pp 1298\u20131307. https:\/\/doi.org\/10.18653\/v1\/P16-1123","DOI":"10.18653\/v1\/P16-1123"},{"key":"1665_CR90","doi-asserted-by":"publisher","unstructured":"Wu R, Yao Y, Han X, Xie R, Liu Z, Lin F, Lin L, Sun M (2019) Open relation extraction: relational knowledge transfer from supervised data to unsupervised data. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, Hong Kong, China, pp 219\u2013228. 
https:\/\/doi.org\/10.18653\/v1\/D19-1021","DOI":"10.18653\/v1\/D19-1021"},{"key":"1665_CR91","doi-asserted-by":"publisher","unstructured":"Xu Y, Mou L, Li G, Chen Y, Peng H, Jin Z (2015) Classifying relations via long short term memory networks along shortest dependency paths. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, Portugal, pp 1785\u20131794. https:\/\/doi.org\/10.18653\/v1\/D15-1206","DOI":"10.18653\/v1\/D15-1206"},{"issue":"16","key":"1665_CR92","doi-asserted-by":"publisher","first-page":"7171","DOI":"10.1016\/j.eswa.2014.05.044","volume":"41","author":"X Yang","year":"2014","unstructured":"Yang X, Mao K (2014) Multi level causal relation identification using extended features. Expert Syst Appl 41(16):7171\u20137181. https:\/\/doi.org\/10.1016\/j.eswa.2014.05.044","journal-title":"Expert Syst Appl"},{"key":"1665_CR93","doi-asserted-by":"crossref","unstructured":"Yao L, Mao C, Luo Y (2019a) Graph convolutional networks for text classification. In: 33rd AAAI conference on artificial intelligence, pp 7370\u20137377","DOI":"10.1609\/aaai.v33i01.33017370"},{"key":"1665_CR94","doi-asserted-by":"publisher","unstructured":"Yao Y, Ye D, Li P, Han X, Lin Y, Liu Z, Liu Z, Huang L, Zhou J, Sun M (2019b) DocRED: a large-scale document-level relation extraction dataset. In: Proceedings of the 57th annual meeting of the association for computational linguistics. Association for Computational Linguistics, Florence, Italy, pp 764\u2013777. https:\/\/doi.org\/10.18653\/v1\/P19-1074","DOI":"10.18653\/v1\/P19-1074"},{"key":"1665_CR95","doi-asserted-by":"publisher","unstructured":"Zeng S, Xu R, Chang B, Li L (2020) Double graph based reasoning for document-level relation extraction. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, Online, pp 1630\u20131640. 
https:\/\/doi.org\/10.18653\/v1\/2020.emnlp-main.127","DOI":"10.18653\/v1\/2020.emnlp-main.127"},{"key":"1665_CR96","doi-asserted-by":"publisher","unstructured":"Zhang Q, Chen M, Liu L (2017) A review on entity relation extraction. In: 2017 second international conference on mechanical, control and computer engineering (ICMCCE), vol\u00a01, pp 178\u2013183. https:\/\/doi.org\/10.1109\/ICMCCE.2017.14","DOI":"10.1109\/ICMCCE.2017.14"},{"key":"1665_CR97","doi-asserted-by":"publisher","unstructured":"Zhang Y, Zhong V, Chen D, Angeli G, Manning CD (2017) Position-aware attention and supervised data improve slot filling. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Copenhagen, Denmark, pp 35\u201345. https:\/\/doi.org\/10.18653\/v1\/D17-1004","DOI":"10.18653\/v1\/D17-1004"},{"key":"1665_CR98","doi-asserted-by":"publisher","unstructured":"Zhang Y, Qi P, Manning CD (2018) Graph convolution over pruned dependency trees improves relation extraction. In: Proceedings of the 2018 conference on empirical methods in natural language processing. Association for Computational Linguistics, Brussels, Belgium, pp 2205\u20132215. https:\/\/doi.org\/10.18653\/v1\/D18-1244","DOI":"10.18653\/v1\/D18-1244"},{"key":"1665_CR99","doi-asserted-by":"publisher","first-page":"1943","DOI":"10.1016\/j.neucom.2015.09.066","volume":"173","author":"S Zhao","year":"2016","unstructured":"Zhao S, Liu T, Zhao S, Chen Y, Nie JY (2016) Event causality extraction based on connectives analysis. Neurocomputing 173:1943\u20131950. https:\/\/doi.org\/10.1016\/j.neucom.2015.09.066","journal-title":"Neurocomputing"},{"key":"1665_CR100","doi-asserted-by":"publisher","unstructured":"Zhao S, Hu M, Cai Z, Liu F (2020) Modeling dense cross-modal interactions for joint entity-relation extraction. In: Bessiere C (ed) Proceedings of the twenty-ninth international joint conference on artificial intelligence, IJCAI-20. 
International Joint Conferences on Artificial Intelligence Organization, pp 4032\u20134038. https:\/\/doi.org\/10.24963\/ijcai.2020\/558","DOI":"10.24963\/ijcai.2020\/558"},{"key":"1665_CR101","doi-asserted-by":"publisher","first-page":"298","DOI":"10.1155\/2014\/298473","volume":"24","author":"D Zhou","year":"2014","unstructured":"Zhou D, Zhong D (2014) Biomedical relation extraction: from binary to complex. Comput Math Methods Med 24:298\u2013473. https:\/\/doi.org\/10.1155\/2014\/298473","journal-title":"Comput Math Methods Med"}],"container-title":["Knowledge and Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-022-01665-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10115-022-01665-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-022-01665-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,28]],"date-time":"2023-01-28T22:28:43Z","timestamp":1674944923000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10115-022-01665-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,12]]},"references-count":102,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,5]]}},"alternative-id":["1665"],"URL":"https:\/\/doi.org\/10.1007\/s10115-022-01665-w","relation":{},"ISSN":["0219-1377","0219-3116"],"issn-type":[{"value":"0219-1377","type":"print"},{"value":"0219-3116","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,12]]},"assertion":[{"value":"23 January 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 
February 2022","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 February 2022","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 March 2022","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}