{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:51:16Z","timestamp":1750308676851,"version":"3.41.0"},"reference-count":59,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2014,6,1]],"date-time":"2014-06-01T00:00:00Z","timestamp":1401580800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Transactions on Asian Language Information Processing"],"published-print":{"date-parts":[[2014,6]]},"abstract":"<jats:p>In this article, we propose the first work that investigates the feasibility of Arabic discourse segmentation into elementary discourse units within the segmented discourse representation theory framework. We first describe our annotation scheme that defines a set of principles to guide the segmentation process. Two corpora have been annotated according to this scheme: elementary school textbooks and newspaper documents extracted from the syntactically annotated Arabic Treebank. Then, we propose a multiclass supervised learning approach that predicts nested units. Our approach uses a combination of punctuation, morphological, lexical, and shallow syntactic features. We investigate how each feature contributes to the learning process. We show that an extensive morphological analysis is crucial to achieve good results in both corpora. In addition, we show that adding chunks does not boost the performance of our system.<\/jats:p>","DOI":"10.1145\/2601401","type":"journal-article","created":{"date-parts":[[2014,6,24]],"date-time":"2014-06-24T12:16:16Z","timestamp":1403612176000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Splitting Arabic Texts into Elementary Discourse Units"],"prefix":"10.1145","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7817-1991","authenticated-orcid":false,"given":"Iskandar","family":"Keskes","sequence":"first","affiliation":[{"name":"Sfax University, Tunisia and IRIT-Toulouse University, France"}]},{"given":"Farah Benamara","family":"Zitoune","sequence":"additional","affiliation":[{"name":"IRIT-Toulouse University, France"}]},{"given":"Lamia Hadrich","family":"Belguith","sequence":"additional","affiliation":[{"name":"Sfax University, Tunisia"}]}],"member":"320","published-online":{"date-parts":[[2014,6]]},"reference":[{"volume-title":"Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912)","author":"Abdul-Mageed M.","key":"e_1_2_1_1_1","unstructured":"Abdul-Mageed , M. , and Diab , M . 2012. AWATIF: A multi-genre corpus for modern standard Arabic subjectivity and sentiment analysis . In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912) . Abdul-Mageed, M., and Diab, M. 2012. AWATIF: A multi-genre corpus for modern standard Arabic subjectivity and sentiment analysis. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912)."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.3844\/jcssp.2013.922.927"},{"volume-title":"Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics - Short Papers (ACLShortPapers\u201913)","author":"Abu-Jbara A.","key":"e_1_2_1_3_1","unstructured":"Abu-Jbara , A. King , B. Diab , M. , and Radev , D . 2013. Identifying opinion subgroups in Arabic online discussions . In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics - Short Papers (ACLShortPapers\u201913) . Abu-Jbara, A. King, B. Diab, M., and Radev, D. 2013. Identifying opinion subgroups in Arabic online discussions. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics - Short Papers (ACLShortPapers\u201913)."},{"volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201910)","author":"Afantenos S. D.","key":"e_1_2_1_4_1","unstructured":"Afantenos , S. D. , Denis , P. , Muller , P. , and Danlos , L . 2010. Learning recursive segments for discourse parsing . In Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201910) . Afantenos, S. D., Denis, P., Muller, P., and Danlos, L. 2010. Learning recursive segments for discourse parsing. In Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201910)."},{"volume-title":"Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912)","author":"Afantenos S.","key":"e_1_2_1_5_1","unstructured":"Afantenos , S. , Asher , N. , Benamara , F. , Bras , M. , Fabre , C. , Ho-Dac , M. , Draoulec , A. L. , Muller , P. , Pery-Woodley , M.-P. , Prevot , L. , Rebeyrolles , J. , Tanguy , L. , Vergez-Couret , M. , and Vieu , L . 2012. An empirical resource for discovering cognitive principles of discourse organisation: The annodis corpus . In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912) . Afantenos, S., Asher, N., Benamara, F., Bras, M., Fabre, C., Ho-Dac, M., Draoulec, A. L., Muller, P., Pery-Woodley, M.-P., Prevot, L., Rebeyrolles, J., Tanguy, L., Vergez-Couret, M., and Vieu, L. 2012. An empirical resource for discovering cognitive principles of discourse organisation: The annodis corpus. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912)."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.3844\/jcssp.2011.1505.1514"},{"volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201910)","author":"Al-Saif A.","key":"e_1_2_1_7_1","unstructured":"Al-Saif , A. , and Markert , K . 2010. The Leeds Arabic discourse treebank: Annotating discourse connectives for Arabic . In Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201910) . Al-Saif, A., and Markert, K. 2010. The Leeds Arabic discourse treebank: Annotating discourse connectives for Arabic. In Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201910)."},{"volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201911)","author":"Al-Saif A.","key":"e_1_2_1_8_1","unstructured":"Al-Saif , A. , and Markert , K . 2011. Modelling discourse relations for Arabic . In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201911) . Al-Saif, A., and Markert, K. 2011. Modelling discourse relations for Arabic. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201911)."},{"key":"e_1_2_1_9_1","unstructured":"Asher N. and Lascarides A. 2003. Logics of Conversation. Cambridge University Press.  Asher N. and Lascarides A. 2003. Logics of Conversation . Cambridge University Press."},{"key":"e_1_2_1_10_1","unstructured":"Bebajiba Y. Rosso P. Abouenour L. Trigui O. Bouzoubaa K. and Belguith HL. 2010. Question answering for semitic languages. In Natural Language Processing Approaches to Semitic Languages Pr. Imed Zitouni Ed. Springer 345--347.  Bebajiba Y. Rosso P. Abouenour L. Trigui O. Bouzoubaa K. and Belguith HL. 2010. Question answering for semitic languages. In Natural Language Processing Approaches to Semitic Languages Pr. Imed Zitouni Ed. Springer 345--347."},{"key":"e_1_2_1_11_1","unstructured":"Belguith H. L. 2009. Document analysis and summarisation: Problems conception and implementation. In Habilitation at Faculty of Economics and Management of SFAX.  Belguith H. L. 2009. Document analysis and summarisation: Problems conception and implementation. In Habilitation at Faculty of Economics and Management of SFAX ."},{"volume-title":"Proceedings of the 12th Conference on Natural Language Processing.","author":"Belguith H. L.","key":"e_1_2_1_12_1","unstructured":"Belguith , H. L. , Baccour , L. , and Mourad , G . 2005. Segmentation de textes arabes bas\u00e9e sur l\u2019analyse contextuelle des signes de ponctuations et de certaines particules . In Proceedings of the 12th Conference on Natural Language Processing. Belguith, H. L., Baccour, L., and Mourad, G. 2005. Segmentation de textes arabes bas\u00e9e sur l\u2019analyse contextuelle des signes de ponctuations et de certaines particules. In Proceedings of the 12th Conference on Natural Language Processing."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/234285.234289"},{"key":"e_1_2_1_14_1","unstructured":"Boudlal A. Lakhouaja A. Mazroui A. Meziane A. and Bebah M. 2011. Alkhalil morpho sys: A morphosyntactic analysis system for Arabic texts. http:\/\/www.itpapers.info\/acit10\/Papers\/f653.pdf.  Boudlal A. Lakhouaja A. Mazroui A. Meziane A. and Bebah M. 2011. Alkhalil morpho sys: A morphosyntactic analysis system for Arabic texts. http:\/\/www.itpapers.info\/acit10\/Papers\/f653.pdf."},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems. 337--342","author":"Boujelben I.","year":"2013","unstructured":"Boujelben , I. Jamoussi , S. , and Ben Hamadou , A. 2013 . Enhancing machine learning results for se-mantic relation extraction . In Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems. 337--342 . Boujelben, I. Jamoussi, S., and Ben Hamadou, A. 2013. Enhancing machine learning results for se-mantic relation extraction. In Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems. 337--342."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118078.1118083"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10590-011-9112-y"},{"volume-title":"Proceedings of the 9th National Computer Science and Engineering Conference.","author":"Charoensuk J.","key":"e_1_2_1_18_1","unstructured":"Charoensuk , J. , Suvakree , T. , and Kawtrakul , A . 2005. Thai element discourse unit segmentation for Thai discourse cues and syntactic information . In Proceedings of the 9th National Computer Science and Engineering Conference. Charoensuk, J., Suvakree, T., and Kawtrakul, A. 2005. Thai element discourse unit segmentation for Thai discourse cues and syntactic information. In Proceedings of the 9th National Computer Science and Engineering Conference."},{"volume-title":"Proceedings of the 9th Mexican International Conference on Advances in Artificial Intelligence (MICAI\u201910)","author":"Da Cunha I.","key":"e_1_2_1_19_1","unstructured":"Da Cunha , I. , San Juan , E. , and Torres M . 2010. Discourse segmentation for Spanish based on shallow parsing . In Proceedings of the 9th Mexican International Conference on Advances in Artificial Intelligence (MICAI\u201910) . Springer, 13--23. Da Cunha, I., San Juan, E., and Torres M. 2010. Discourse segmentation for Spanish based on shallow parsing. In Proceedings of the 9th Mexican International Conference on Advances in Artificial Intelligence (MICAI\u201910). Springer, 13--23."},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL\u201913)","author":"Darwish K.","year":"2013","unstructured":"Darwish , K. 2013 . Named entity recognition using cross-lingual resources: Arabic as an example . In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL\u201913) . Darwish, K. 2013. Named entity recognition using cross-lingual resources: Arabic as an example. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL\u201913)."},{"key":"e_1_2_1_21_1","unstructured":"Debili F. Achour H. and Souissi E. 2002. La langue arabe et l\u2018ordinateur de l\u2018\u00e9tiquetage grammatical \u00e0 la voyellation automatique. Correspondances de I\u2019IRMC 71 10--28.  Debili F. Achour H. and Souissi E. 2002. La langue arabe et l\u2018ordinateur de l\u2018\u00e9tiquetage grammatical \u00e0 la voyellation automatique. Correspondances de I\u2019IRMC 71 10--28."},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 2nd International Conference on Arabic Language Resources and Tools.","author":"Diab M.","year":"2009","unstructured":"Diab , M. 2009 . Second generation Amira tools for Arabic processing: Fast and robust tokenization, pos tagging, and base phrase chunking . In Proceedings of the 2nd International Conference on Arabic Language Resources and Tools. Diab, M. 2009. Second generation Amira tools for Arabic processing: Fast and robust tokenization, pos tagging, and base phrase chunking. In Proceedings of the 2nd International Conference on Arabic Language Resources and Tools."},{"key":"e_1_2_1_23_1","unstructured":"Diab M. Hacioglu K. and Jurafsky D. 2007. Automated methods for processing Arabic text: From tokenization to base phrase chunking. In Arabic Computational Morphology: Knowledge-Based and Empirical Methods. Springer 159--179.  Diab M. Hacioglu K. and Jurafsky D. 2007. Automated methods for processing Arabic text: From tokenization to base phrase chunking. In Arabic Computational Morphology: Knowledge-Based and Empirical Methods . Springer 159--179."},{"volume-title":"Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL\u201908)","author":"Diab M.","key":"e_1_2_1_24_1","unstructured":"Diab , M. , Moschitti , A. , and Pighin , D . 2008. Semantic role labeling systems for Arabic using kernel methods . In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL\u201908) . Diab, M., Moschitti, A., and Pighin, D. 2008. Semantic role labeling systems for Arabic using kernel methods. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL\u201908)."},{"volume-title":"Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL\u201913)","author":"Eskander R.","key":"e_1_2_1_25_1","unstructured":"Eskander , R. Habash , N. Bies , A. Kulick , S. , and Maamouri M . 2013. Automatic correction and extension of morphological annotations . In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL\u201913) . Eskander, R. Habash, N. Bies, A. Kulick, S., and Maamouri M. 2013. Automatic correction and extension of morphological annotations. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL\u201913)."},{"volume-title":"Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics. 488--495","author":"Fisher S.","key":"e_1_2_1_26_1","unstructured":"Fisher , S. and Roark , B . 2007. The utility of parse-derived features for automatic discourse segmentation . In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics. 488--495 . Fisher, S. and Roark, B. 2007. The utility of parse-derived features for automatic discourse segmentation. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics. 488--495."},{"volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics (COLING\u201910)","author":"Green S.","key":"e_1_2_1_27_1","unstructured":"Green , S. and Manning , C . 2010. Better Arabic parsing: Baselines, evaluations, and analysis . In Proceedings of the 23rd International Conference on Computational Linguistics (COLING\u201910) . 394--402. Green, S. and Manning, C. 2010. Better Arabic parsing: Baselines, evaluations, and analysis. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING\u201910). 394--402."},{"volume-title":"Proceedings of the 2nd Workshop on South and Southeast Asian Natural Language Processing (WSSANLP\u201911)","author":"Gridach M.","key":"e_1_2_1_28_1","unstructured":"Gridach , M. and Chenfour , N . 2011. Developing a new system for Arabic morphological analysis and generation . In Proceedings of the 2nd Workshop on South and Southeast Asian Natural Language Processing (WSSANLP\u201911) . 52--57. Gridach, M. and Chenfour, N. 2011. Developing a new system for Arabic morphological analysis and generation. In Proceedings of the 2nd Workshop on South and Southeast Asian Natural Language Processing (WSSANLP\u201911). 52--57."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/12457.12458"},{"volume-title":"Introduction to Arabic Natural Language Processing. Synthesis Lectures on Human Language Technologies","author":"Habash N.","key":"e_1_2_1_30_1","unstructured":"Habash , N. 2010. Introduction to Arabic Natural Language Processing. Synthesis Lectures on Human Language Technologies . G. Hirst Ed., Morgan and Claypool . Habash, N. 2010. Introduction to Arabic Natural Language Processing. Synthesis Lectures on Human Language Technologies. G. Hirst Ed., Morgan and Claypool."},{"volume-title":"Proceedings of the 2nd International Conference on Arabic Language Resources and Tools.","author":"Habash N.","key":"e_1_2_1_31_1","unstructured":"Habash , N. , Owen R. , and Ryan R . 2009. MADA+TOKAN: A toolkit for Arabic tokenization, diacritization, morphological disambiguation, pos tagging, stemming and lemmatization . In Proceedings of the 2nd International Conference on Arabic Language Resources and Tools. Habash, N., Owen R., and Ryan R. 2009. MADA+TOKAN: A toolkit for Arabic tokenization, diacritization, morphological disambiguation, pos tagging, stemming and lemmatization. In Proceedings of the 2nd International Conference on Arabic Language Resources and Tools."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5087\/dad.2010.003"},{"volume-title":"Formal Semantics: The Essential Readings","author":"Kamp H.","key":"e_1_2_1_33_1","unstructured":"Kamp , H. 1981. A theory of truth and semantic representation . In Formal Semantics: The Essential Readings . Wiley , 189--213. Kamp, H. 1981. A theory of truth and semantic representation. In Formal Semantics: The Essential Readings. Wiley, 189--213."},{"volume-title":"Proceedings of the 8th International Conference on Language Resources and Evaluation.","author":"Keskes I.","key":"e_1_2_1_34_1","unstructured":"Keskes , I. , Benamara , F. , and Belguith , H. L . 2012. Clause-based discourse segmentation of Arabic texts . In Proceedings of the 8th International Conference on Language Resources and Evaluation. Keskes, I., Benamara, F., and Belguith, H. L. 2012. Clause-based discourse segmentation of Arabic texts. In Proceedings of the 8th International Conference on Language Resources and Evaluation."},{"key":"e_1_2_1_35_1","first-page":"1","article-title":"Arabic discourse segmentation based on rhetorical methods","volume":"11","author":"Khalifa I.","year":"2011","unstructured":"Khalifa , I. , Feki , Z. , and Farawila , A. 2011 . Arabic discourse segmentation based on rhetorical methods . Int. J. Electric Comput. Sci. 11 , 1 . Khalifa, I., Feki, Z., and Farawila, A. 2011. Arabic discourse segmentation based on rhetorical methods. Int. J. Electric Comput. Sci. 11, 1.","journal-title":"Int. J. Electric Comput. Sci."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220355.1220403"},{"volume-title":"Proceedings of the Conference on Electronic Publishing (ELPUB\u201906)","author":"L\u00fcngen H.","key":"e_1_2_1_37_1","unstructured":"L\u00fcngen , H. , Lobin , H. , B\u00e4renf\u00e4nger , M. , Hilbert , M. , and Puskas , C . 2006. Text parsing of a complex genre . In Proceedings of the Conference on Electronic Publishing (ELPUB\u201906) , B. Martens and M. Dobreva Eds. L\u00fcngen, H., Lobin, H., B\u00e4renf\u00e4nger, M., Hilbert, M., and Puskas, C. 2006. Text parsing of a complex genre. In Proceedings of the Conference on Electronic Publishing (ELPUB\u201906), B. Martens and M. Dobreva Eds."},{"volume-title":"Catalog No. LDC2010T08","author":"Maamouri M.","key":"e_1_2_1_38_1","unstructured":"Maamouri , M. , Graff , D. , Bouziri , B. , Krouna , S. , Bies , A. , and Kulick , S . 2010a. Standard Arabic morphological analyzer (sama) version 3.1. Linguistic Data Consortium , Catalog No. LDC2010T08 . Maamouri, M., Graff, D., Bouziri, B., Krouna, S., Bies, A., and Kulick, S. 2010a. Standard Arabic morphological analyzer (sama) version 3.1. Linguistic Data Consortium, Catalog No. LDC2010T08."},{"volume-title":"Catalog No.: LDC2010T01","author":"Maamouri M.","key":"e_1_2_1_39_1","unstructured":"Maamouri , M. , Bies , A. , Kulick , S. Krouma , S. , Gaddeche , F. , and Zaghouani , W . 2010b. Arabic Treebank (ATB): Part 3 Version 3.2. Linguistic Data Consortium , Catalog No.: LDC2010T01 . Maamouri, M., Bies, A., Kulick, S. Krouma, S., Gaddeche, F., and Zaghouani, W. 2010b. Arabic Treebank (ATB): Part 3 Version 3.2. Linguistic Data Consortium, Catalog No.: LDC2010T01."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1515\/text.1.1988.8.3.243"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00138"},{"volume-title":"Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. 55--64","author":"Mourad A.","key":"e_1_2_1_42_1","unstructured":"Mourad , A. and Darwish , K . 2013. Subjectivity and sentiment analysis of modern standard Arabic and Arabic microblogs . In Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. 55--64 . Mourad, A. and Darwish, K. 2013. Subjectivity and sentiment analysis of modern standard Arabic and Arabic microblogs. In Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. 55--64."},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT\u201907)","author":"Nivre J.","year":"2007","unstructured":"Nivre , J. 2007 . Incremental non-projective dependency parsing . In Proceedings of the Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT\u201907) . 396--403. Nivre, J. 2007. Incremental non-projective dependency parsing. In Proceedings of the Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT\u201907). 396--403."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-28645-5_23"},{"key":"e_1_2_1_45_1","volume-title":"Proceedings of the 6th International Conference on Language Resources and Evaluation.","author":"Prasad A.","year":"2008","unstructured":"Prasad , A. , Miltsakaki , R. , Dinesh , E. , Lee , N. , Joshi , A. , and Webber . 2008 . The penn discourse treebank 2.0 . In Proceedings of the 6th International Conference on Language Resources and Evaluation. Prasad, A., Miltsakaki, R., Dinesh, E., Lee, N., Joshi, A., and Webber. 2008. The penn discourse treebank 2.0. In Proceedings of the 6th International Conference on Language Resources and Evaluation."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1016\/0378-2166(88)90050-1"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.3115\/980491.980576"},{"volume-title":"Proceedings of the 10th Amsterdam Colloquium on Formal Semantics.","author":"Polanyi L.","key":"e_1_2_1_48_1","unstructured":"Polanyi , L. and Van Den Berg, M. 1996. Discourse structure and discourse interpretation . In Proceedings of the 10th Amsterdam Colloquium on Formal Semantics. Polanyi, L. and Van Den Berg, M. 1996. Discourse structure and discourse interpretation. In Proceedings of the 10th Amsterdam Colloquium on Formal Semantics."},{"volume-title":"Proceedings of the 26th Canadian Conference on Artificial Intelligence (AI\u201913)","author":"Sadat F.","key":"e_1_2_1_49_1","unstructured":"Sadat , F. and Mohamed , E . 2013. Improved Arabic-French machine translation through preprocessing schemes and language analysis . In Proceedings of the 26th Canadian Conference on Artificial Intelligence (AI\u201913) . 308--314. Sadat, F. and Mohamed, E. 2013. Improved Arabic-French machine translation through preprocessing schemes and language analysis. In Proceedings of the 26th Canadian Conference on Artificial Intelligence (AI\u201913). 308--314."},{"volume-title":"Proceedings of the 1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA\u201913)","author":"Sawalha M.","key":"e_1_2_1_50_1","unstructured":"Sawalha , M. Atwell , E. S. , and Abushariah M . 2013. SALMA: Standard Arabic language morphological analysis . In Proceedings of the 1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA\u201913) . 1--6. Sawalha, M. Atwell, E. S., and Abushariah M. 2013. SALMA: Standard Arabic language morphological analysis. In Proceedings of the 1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA\u201913). 1--6."},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073445.1073475"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220575.1220608"},{"volume-title":"Proceedings of the 11th Workshop on the Semantics and Pragmatics of Dialogue. 189--190","author":"Subba R.","key":"e_1_2_1_53_1","unstructured":"Subba , R. and Di Eugenio, B. 2007. Automatic discourse segmentation using neural networks . In Proceedings of the 11th Workshop on the Semantics and Pragmatics of Dialogue. 189--190 . Subba, R. and Di Eugenio, B. 2007. Automatic discourse segmentation using neural networks. In Proceedings of the 11th Workshop on the Semantics and Pragmatics of Dialogue. 189--190."},{"volume-title":"Proceedings of the International Conference on 5th Generation Computer Systems. 1133--1140","author":"Sumita K.","key":"e_1_2_1_54_1","unstructured":"Sumita , K. , Ono , K. , Chino , T. , Ukita , T. , and Amano , S . 1992. A discourse structure analyzer for Japanese text . In Proceedings of the International Conference on 5th Generation Computer Systems. 1133--1140 . Sumita, K., Ono, K., Chino, T., Ukita, T., and Amano, S. 1992. A discourse structure analyzer for Japanese text. In Proceedings of the International Conference on 5th Generation Computer Systems. 1133--1140."},{"volume-title":"Proceedings of the ACL-IJCNLP Conference of Short Papers. Association for Computational Linguistics, 77--80","author":"Tofiloski M.","key":"e_1_2_1_55_1","unstructured":"Tofiloski , M. , Brooke , J. , and Taboada , M . 2009. A syntactic and lexical-based discourse segmenter . In Proceedings of the ACL-IJCNLP Conference of Short Papers. Association for Computational Linguistics, 77--80 . Tofiloski, M., Brooke, J., and Taboada, M. 2009. A syntactic and lexical-based discourse segmenter. In Proceedings of the ACL-IJCNLP Conference of Short Papers. Association for Computational Linguistics, 77--80."},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.3923\/itj.2008.1009.1015"},{"volume-title":"Notebook Papers of CLEF LABs and Workshops (CLEF\u201912)","author":"Trigui O.","key":"e_1_2_1_57_1","unstructured":"Trigui , O. , Hadrich-Belguith , L. , Rosso , P. , Ben Amor , H. , and Gafsaoui , B . 2012. IDRAAQ: New Arabic question answering system based on query expansion and passage retrieval . In Notebook Papers of CLEF LABs and Workshops (CLEF\u201912) . Trigui, O., Hadrich-Belguith, L., Rosso, P., Ben Amor, H., and Gafsaoui, B. 2012. IDRAAQ: New Arabic question answering system based on query expansion and passage retrieval. In Notebook Papers of CLEF LABs and Workshops (CLEF\u201912)."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1207\/s15516709cog2805_6"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/1899.001.0001"}],"container-title":["ACM Transactions on Asian Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2601401","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2601401","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T20:00:53Z","timestamp":1750276853000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2601401"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,6]]},"references-count":59,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2014,6]]}},"alternative-id":["10.1145\/2601401"],"URL":"https:\/\/doi.org\/10.1145\/2601401","relation":{},"ISSN":["1530-0226","1558-3430"],"issn-type":[{"type":"print","value":"1530-0226"},{"type":"electronic","value":"1558-3430"}],"subject":[],"published":{"date-parts":[[2014,6]]},"assertion":[{"value":"2013-01-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-06-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}