{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T10:21:43Z","timestamp":1777890103829,"version":"3.51.4"},"reference-count":43,"publisher":"SAGE Publications","issue":"6","license":[{"start":{"date-parts":[[2022,9,26]],"date-time":"2022-09-26T00:00:00Z","timestamp":1664150400000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["SW"],"published-print":{"date-parts":[[2022,9,26]]},"abstract":"<jats:p>The single biggest obstacle in performing comprehensive cross-lingual discourse analysis is the scarcity of multilingual resources. The existing resources are overwhelmingly monolingual, compelling researchers to infer the discourse-level information in the target languages through error-prone automatic means. The current paper aims to provide a more direct insight into the cross-lingual variations in discourse structures by linking the annotated relations of the TED-Multilingual Discourse Bank, which consists of independently annotated six TED talks in seven different languages. It is shown that the linguistic labels over the relations annotated in the texts of these languages can be automatically linked with English with high accuracy, as verified against the relations of three diverse languages semi-automatically linked with relations over English texts. The resulting corpus has a great potential to reveal the divergences in local discourse relations, as well as leading to new resources, as exemplified by the induction of bilingual discourse connective lexicons.<\/jats:p>","DOI":"10.3233\/sw-223011","type":"journal-article","created":{"date-parts":[[2022,6,21]],"date-time":"2022-06-21T13:52:15Z","timestamp":1655819535000},"page":"1081-1102","source":"Crossref","is-referenced-by-count":2,"title":["Linking discourse-level information and the induction of bilingual discourse connective lexicons"],"prefix":"10.1177","volume":"13","author":[{"given":"Sibel","family":"\u00d6zer","sequence":"first","affiliation":[{"name":"Cognitive Science Department, Middle East Technical University, Ankara, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Murathan","family":"Kurfal\u0131","sequence":"additional","affiliation":[{"name":"Linguistics Department, Stockholm University, Stockholm, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Deniz","family":"Zeyrek","sequence":"additional","affiliation":[{"name":"Cognitive Science Department, Middle East Technical University, Ankara, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Am\u00e1lia","family":"Mendes","sequence":"additional","affiliation":[{"name":"Center of Linguistics, School of Arts and Humanities, University of Lisbon, Lisbon, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Giedr\u0117","family":"Val\u016bnait\u0117 Ole\u0161kevi\u010dien\u0117","sequence":"additional","affiliation":[{"name":"Institute of Humanities, Mykolas Romeris University, Vilnius, Lietuva"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","reference":[{"key":"10.3233\/SW-223011_ref1","doi-asserted-by":"publisher","first-page":"597","DOI":"10.1162\/tacl_a_00288","article-title":"Massively multilingual sentence embeddings for zero-shot cross-lingual transfer and beyond","volume":"7","author":"Artetxe","year":"2019","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"10.3233\/SW-223011_ref2","unstructured":"M. Aulamo, U. Sulubacak, S. Virpioja and J. Tiedemann, OpusTools and parallel corpus diagnostics, in: Proceedings of the 12th Language Resources and Evaluation Conference, European Language Resources Association, 2020, pp. 3782\u20133789. https:\/\/www.aclweb.org\/anthology\/2020.lrec-1.467. ISBN 979-10-95546-34-4."},{"key":"10.3233\/SW-223011_ref3","doi-asserted-by":"publisher","DOI":"10.4000\/books.aaccademia.2360"},{"key":"10.3233\/SW-223011_ref4","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-main.505"},{"key":"10.3233\/SW-223011_ref5","doi-asserted-by":"crossref","unstructured":"E. Breindl, A. Volodina and U.H. Wa\u00dfner, Handbuch der Deutschen Konnektoren 2: Semantik der Deutschen Satzverkn\u00fcpfer, Vol. 13, Walter de Gruyter GmbH & Co KG, 2014.","DOI":"10.1515\/9783110341447"},{"key":"10.3233\/SW-223011_ref6","unstructured":"A. Briz, S. Pons and J. Portol\u00e9s, Diccionario de part\u00edculas discursivas del espa\u00f1ol, in: El diccionario como puente entre las lenguas y culturas del mundo. Actas del II Congreso Internacional de Lexicograf\u00eda Hisp\u00e1nica. Alicante, Biblioteca Virtual Cervantes, 2008, pp. 217\u2013227."},{"key":"10.3233\/SW-223011_ref7","doi-asserted-by":"publisher","DOI":"10.4230\/OASIcs.LDK.2021.40"},{"key":"10.3233\/SW-223011_ref8","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-31782-8_2"},{"key":"10.3233\/SW-223011_ref9","doi-asserted-by":"crossref","unstructured":"C. Chiarcos and A. Pareja-Lora, 1: Open data \u2013 linked data \u2013 linked open data \u2013 linguistic linked open data (LLOD): A general introduction, in: Development of Linguistic Linked Open Data Resources for Collaborative Data-Intensive Research in the Language Sciences, 2019, pp. 1\u201317.","DOI":"10.7551\/mitpress\/10990.001.0001"},{"key":"10.3233\/SW-223011_ref10","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-44918-8_6"},{"key":"10.3233\/SW-223011_ref11","doi-asserted-by":"publisher","DOI":"10.4000\/books.aaccademia.1770"},{"key":"10.3233\/SW-223011_ref12","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1016\/j.pragma.2017.10.010","article-title":"Cognitive complexity and the linguistic marking of coherence relations: A parallel corpus study","volume":"121","author":"Hoek","year":"2017","journal-title":"Journal of Pragmatics"},{"key":"10.3233\/SW-223011_ref13","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1162\/tacl_a_00142","article-title":"One vector is not enough: Entity-augmented distributed semantics for discourse relations","volume":"3","author":"Ji","year":"2015","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"10.3233\/SW-223011_ref14","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.codi-1.15"},{"key":"10.3233\/SW-223011_ref16","unstructured":"J.J. Li, M. Carpuat and A. Nenkova, Cross-lingual discourse relation analysis: A corpus study and a semi-supervised classification system, in: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, Dublin City University and Association for Computational Linguistics, Dublin, Ireland, 2014, pp. 577\u2013587. https:\/\/aclanthology.org\/C14-1055."},{"key":"10.3233\/SW-223011_ref17","unstructured":"A.\u00a0Mendes, I.\u00a0del Rio, M.\u00a0Stede and F.\u00a0Dombek, A lexicon of discourse markers for Portuguese \u2013 LDM-PT, in: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), European Language Resources Association (ELRA), Miyazaki, Japan, 2018, pp. 4379\u20134384. https:\/\/aclanthology.org\/L18-1693."},{"key":"10.3233\/SW-223011_ref18","doi-asserted-by":"publisher","DOI":"10.1075\/btl.158.08men"},{"key":"10.3233\/SW-223011_ref19","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1111\/lang.12233","article-title":"Evidence and interpretation in language learning research: Opportunities for collaboration with computational linguistics","volume":"67","author":"Meurers","year":"2017","journal-title":"Language Learning"},{"key":"10.3233\/SW-223011_ref20","unstructured":"T. Meyer, A. Popescu-Belis, N. Hajlaoui and A. Gesmundo, Machine translation of labeled discourse connectives, in: Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, Association for Machine Translation in the Americas, San Diego, California, USA, 2012, https:\/\/aclanthology.org\/2012.amta-papers.20."},{"key":"10.3233\/SW-223011_ref21","unstructured":"J. M\u00edrovsk\u00fd, L. Mladov\u00e1 and \u0160. Zik\u00e1nov\u00e1, Connective-based measuring of the inter-annotator agreement in the annotation of discourse in PDT, in: Coling 2010: Posters, Coling 2010 Organizing Committee, Beijing, China, 2010, pp. 775\u2013781. https:\/\/aclanthology.org\/C10-2089."},{"issue":"1","key":"10.3233\/SW-223011_ref22","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1515\/pralin-2017-0039","article-title":"CzeDLex-A lexicon of Czech discourse connectives","volume":"109","author":"M\u00edrovsk\u1ef3","year":"2017","journal-title":"The Prague Bulletin of Mathematical Linguistics"},{"issue":"1","key":"10.3233\/SW-223011_ref23","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1515\/pralin-2016-0013","article-title":"Efficient word alignment with Markov chain Monte Carlo","volume":"106","author":"\u00d6stling","year":"2016","journal-title":"The Prague Bulletin of Mathematical Linguistics"},{"key":"10.3233\/SW-223011_ref24","unstructured":"S. \u00d6zer and D. Zeyrek, An automatic discourse relation alignment experiment on TED-MDB, in: Proceedings of the 2019 Workshop on Widening NLP, Association for Computational Linguistics, Florence, Italy, 2019, pp. 31\u201334."},{"key":"10.3233\/SW-223011_ref25","unstructured":"J. Park and C. Cardie, Improving implicit discourse relation recognition through feature set optimization, in: Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Association for Computational Linguistics, Seoul, South Korea, 2012, pp. 108\u2013112. https:\/\/aclanthology.org\/W12-1614."},{"key":"10.3233\/SW-223011_ref26","doi-asserted-by":"crossref","unstructured":"R. Pasch, U. Brau\u00dfe, E. Breindl and U.H. Wa\u00dfner, Handbuch der Deutschen Konnektoren: Linguistische Grundlagen der Beschreibung und Syntaktische Merkmale der Deutschen Satzverkn\u00fcpfer (Konjunktionen, Satzadverbien und Partikeln), Vol. 2, Walter de Gruyter, 2003.","DOI":"10.1515\/9783110201666"},{"key":"10.3233\/SW-223011_ref27","unstructured":"L. Pol\u00e1kov\u00e1, K. Rysov\u00e1, M. Rysov\u00e1 and J. M\u00edrovsk\u00fd, GeCzLex: Lexicon of Czech and German anaphoric connectives, in: Proceedings of the 12th Language Resources and Evaluation Conference, European Language Resources Association, Marseille, France, 2020, pp. 1089\u20131096. https:\/\/aclanthology.org\/2020.lrec-1.137."},{"issue":"4","key":"10.3233\/SW-223011_ref28","doi-asserted-by":"publisher","first-page":"921","DOI":"10.1162\/COLI_a_00204","article-title":"Reflections on the penn discourse TreeBank, comparable corpora, and complementary annotation","volume":"40","author":"Prasad","year":"2014","journal-title":"Computational Linguistics"},{"key":"10.3233\/SW-223011_ref29","doi-asserted-by":"crossref","unstructured":"V. Pyatkin and B. Webber, Discourse relations and conjoined VPs: Automated sense recognition, in: Proceedings of the Student Research Workshop at the 15th Conference of the European Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, Valencia, Spain, 2017, pp. 33\u201342. https:\/\/aclanthology.org\/E17-4004.","DOI":"10.18653\/v1\/E17-4004"},{"key":"10.3233\/SW-223011_ref30","doi-asserted-by":"publisher","DOI":"10.4000\/discours.8645"},{"key":"10.3233\/SW-223011_ref31","unstructured":"T. Scheffler and M. Stede, Adding semantic relations to a large-coverage connective lexicon of German, in: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC\u201916), European Language Resources Association (ELRA),, Portoro\u017e, Slovenia, 2016, pp. 1008\u20131013. https:\/\/aclanthology.org\/L16-1160."},{"key":"10.3233\/SW-223011_ref32","unstructured":"M. \u0160krabal and M. Vav\u0159\u00edn, The translation equivalents database (treq) as a lexicographer\u2019s aid, in: Electronic Lexicography in the 21st Century, Proceedings of eLex 2017 Conference, 2017, pp. 124\u2013137."},{"key":"10.3233\/SW-223011_ref33","doi-asserted-by":"publisher","DOI":"10.3115\/1220355.1220416"},{"key":"10.3233\/SW-223011_ref34","doi-asserted-by":"publisher","DOI":"10.4000\/discours.10098"},{"key":"10.3233\/SW-223011_ref35","doi-asserted-by":"crossref","unstructured":"J. Tiedemann, Bitext Alignment, Morgan and Claypool Publishers, an Rafael, California, 2011.","DOI":"10.1007\/978-3-031-02142-8"},{"key":"10.3233\/SW-223011_ref36","unstructured":"J. Tiedemann, Parallel data, tools and interfaces in OPUS, in: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC\u201912), European Language Resources Association (ELRA), Istanbul, Turkey, 2012, pp. 2214\u20132218. http:\/\/www.lrec-conf.org\/proceedings\/lrec2012\/pdf\/463_Paper.pdf."},{"key":"10.3233\/SW-223011_ref37","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1075\/cilt.292.32var","article-title":"Parallel corpora for medium density languages","volume":"292","author":"Varga","year":"2007","journal-title":"Amsterdam Studies in The Theory And History Of Linguistic Science Series 4"},{"key":"10.3233\/SW-223011_ref38","unstructured":"Y. Versley, Discovery of ambiguous and unambiguous discourse connectives via annotation projection, in: Workshop on the Annotation and Exploitation of Parallel Corpora (AEPC), 2010, pp. 83\u201382, http:\/\/hdl.handle.net\/10062\/15953."},{"key":"10.3233\/SW-223011_ref40","doi-asserted-by":"publisher","DOI":"10.3390\/languages5030035"},{"key":"10.3233\/SW-223011_ref41","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-3308"},{"key":"10.3233\/SW-223011_ref42","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-0809"},{"issue":"2","key":"10.3233\/SW-223011_ref43","doi-asserted-by":"publisher","first-page":"587","DOI":"10.1007\/s10579-019-09445-9","article-title":"TED multilingual discourse bank (TED-MDB): A parallel corpus annotated in the PDTB style","volume":"54","author":"Zeyrek","year":"2020","journal-title":"Language Resources and Evaluation"},{"issue":"2","key":"10.3233\/SW-223011_ref44","doi-asserted-by":"publisher","first-page":"264","DOI":"10.1075\/lic.16.2.05zuf","article-title":"Discourse connectives across languages: Factors influencing their explicit or implicit translation","volume":"16","author":"Zufferey","year":"2016","journal-title":"Languages in Contrast"},{"issue":"3","key":"10.3233\/SW-223011_ref45","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1177\/0267658315573349","article-title":"Advanced learners\u2019 comprehension of discourse connectives: The role of L1 transfer across on-line and off-line tasks","volume":"31","author":"Zufferey","year":"2015","journal-title":"Second Language Research"}],"container-title":["Semantic Web"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/SW-223011","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T05:26:35Z","timestamp":1777613195000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/SW-223011"}},"subtitle":[],"editor":[{"given":"Julia","family":"Bosque-Gil","sequence":"additional","affiliation":[{"name":"University of Zaragoza, Spain"}],"role":[{"role":"editor","vocabulary":"crossref"}]},{"given":"Milan","family":"Dojchinovski","sequence":"additional","affiliation":[{"name":"Czech Technical University in Prague, Czech Republic"}],"role":[{"role":"editor","vocabulary":"crossref"}]},{"given":"Philipp","family":"Cimiano","sequence":"additional","affiliation":[{"name":"Bielefeld University, Germany"}],"role":[{"role":"editor","vocabulary":"crossref"}]},{"given":"Julia","family":"Bosque-Gil","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]},{"given":"Philipp","family":"Cimiano","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]},{"given":"Milan","family":"Dojchinovski","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2022,9,26]]},"references-count":43,"journal-issue":{"issue":"6"},"URL":"https:\/\/doi.org\/10.3233\/sw-223011","relation":{},"ISSN":["2210-4968","1570-0844"],"issn-type":[{"value":"2210-4968","type":"electronic"},{"value":"1570-0844","type":"print"}],"subject":[],"published":{"date-parts":[[2022,9,26]]}}}