{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:42:56Z","timestamp":1754156576509,"version":"3.41.2"},"reference-count":26,"publisher":"Emerald","issue":"2","license":[{"start":{"date-parts":[[2016,6,13]],"date-time":"2016-06-13T00:00:00Z","timestamp":1465776000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,6,13]]},"abstract":"<jats:sec>\n               <jats:title content-type=\"abstract-heading\">Purpose<\/jats:title>\n               <jats:p> \u2013 Topic segmentation is one of the active research fields in natural language processing. Also, many topic segmenters have been proposed. However, the current challenge of researchers is the improvement of these segmenters by using external resources. Therefore, the purpose of this paper is to integrate study and evaluate a new external semantic resource in topic segmentation. <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Design\/methodology\/approach<\/jats:title>\n               <jats:p> \u2013 New topic segmenters (TSS-Onto and TSB-Onto) are proposed based on the two well-known segmenters C99 and TextTiling. The proposed segmenters integrate semantic knowledge to the segmentation process by using a domain ontology as an external resource. Subsequently, an evaluation is made to study the effect of this resource on the quality of topic segmentation along with a comparative study with related works. <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Findings<\/jats:title>\n               <jats:p> \u2013 Based on this study, the authors showed that adding semantic knowledge, which is extracted from a domain ontology, improves the quality of topic segmentation. Moreover, TSS-Ont outperforms TSB-Ont in terms of quality of topic segmentation. <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Research limitations\/implications<\/jats:title>\n               <jats:p> \u2013 The main limitation of this study is the used test corpus for the evaluation which is not a benchmark. However, we used a collection of scientific papers from well-known digital libraries (ArXiv and ACM). <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Practical implications<\/jats:title>\n               <jats:p> \u2013 The proposed topic segmenters can be useful in different NLP applications such as information retrieval and text summarizing. <\/jats:p>\n            <\/jats:sec>\n            <jats:sec>\n               <jats:title content-type=\"abstract-heading\">Originality\/value<\/jats:title>\n               <jats:p> \u2013 The primary original contribution of this paper is the improvement of topic segmentation based on semantic knowledge. This knowledge is extracted from an ontological external resource.<\/jats:p>\n            <\/jats:sec>","DOI":"10.1108\/ijicc-01-2016-0001","type":"journal-article","created":{"date-parts":[[2016,7,4]],"date-time":"2016-07-04T06:02:38Z","timestamp":1467612158000},"page":"165-178","source":"Crossref","is-referenced-by-count":3,"title":["Exogenous approach to improve topic segmentation"],"prefix":"10.1108","volume":"9","author":[{"given":"Marwa","family":"Naili","sequence":"first","affiliation":[]},{"given":"Anja","family":"Habacha Chaibi","sequence":"additional","affiliation":[]},{"given":"Henda","family":"Hajjami Ben Ghezala","sequence":"additional","affiliation":[]}],"member":"140","reference":[{"key":"key2020121502255667500_b1","doi-asserted-by":"crossref","unstructured":"Bayomi, M.\n               , \n                  Levacher, K.\n               , \n                  Ghorab, M.R.\n                and \n                  Lawless, S.\n                (2015), \u201cOntoSeg: a novel approach to text segmentation using ontological similarity\u201d, IEEE International Conference on Data Mining Workshop, ICDMW 2015, Atlantic City, NJ, pp. 1274-1283.","DOI":"10.1109\/ICDMW.2015.6"},{"key":"key2020121502255667500_b2","doi-asserted-by":"crossref","unstructured":"Bestgen, Y.\n                (2006), \u201cImproving text segmentation using latent semantic analysis: a reanalysis of Choi, Wiemer-Hastings and Moore\u201d, \n                  Computational Linguistics\n               , Vol. 32 No. 3, pp. 5-12.","DOI":"10.1162\/coli.2006.32.1.5"},{"key":"key2020121502255667500_b3","unstructured":"Bestgen, Y.\n                and \n                  Pierard, S.\n                (2006), \u201cComment evaluer les algorithmes de segmentation thematique? Essai de construction d\u2019un mmateriel de reference\u201d, Actes de TALN: Verbum ex machina, Louvain-La-Neuve, Presse universitaire de Louvain, pp. 407-414."},{"key":"key2020121502255667500_b4","unstructured":"Blei, D.M.\n               , \n                  Ng, A.Y.\n                and \n                  Jordan, M.I.\n                (2003), \u201cLatent dirichlet allocation\u201d, \n                  The Journal of Machine Learning Research\n               , Vol. 3, January, pp. 993-1022."},{"key":"key2020121502255667500_b5","unstructured":"Brants, T.\n               , \n                  Chen, F.\n                and \n                  Farahat, A.\n                (2002), \u201cArabic document topic analysis\u201d, TREC, NIST, Gaithersburg, MD."},{"key":"key2020121502255667500_b6","unstructured":"Choi, F.Y.Y.\n                (2000), \u201cAdvances in domain independent linear text segmentation\u201d, North American Chapter of the Association for Computational Linguistics (NAACL), pp. 26-33."},{"key":"key2020121502255667500_b7","unstructured":"Choi, F.Y.Y.\n               , \n                  Wiemer-Hastings, P.\n                and \n                  Moore, J.\n                (2001), \u201cLatent semantic analysis for text segmentation\u201d, Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 109-117."},{"key":"key2020121502255667500_b8","doi-asserted-by":"crossref","unstructured":"Coustaty, M.\n               , \n                  Bouju, A.\n               , \n                  Bertet, K.\n                and \n                  Louis, G.\n                (2011), \u201cUsing ontologies to reduce the semantic gap between historians and image processing algorithms\u201d, \n                  International Conference on Document Analysis and Recognition (ICDAR)\n               , IEEE, Beijing, pp. 156-160.","DOI":"10.1109\/ICDAR.2011.40"},{"key":"key2020121502255667500_b10","unstructured":"De Almeida Falbo, R.\n               , \n                  Ruy, F.B.\n                and \n                  Dal Moro, R.\n                (2005), \u201cUsing ontologies to add semantics to a software engineering environment\u201d, \n                  17th International Conference on Software Engineering and Knowledge Engineering, Taipei\n               , pp. 151-156."},{"key":"key2020121502255667500_b9","doi-asserted-by":"crossref","unstructured":"Deerwester, S.\n               , \n                  Dumais, S.T.\n               , \n                  Furnas, G.W.\n               , \n                  Landauer, T.K.\n                and \n                  Harshman, R.\n                (1990), \u201cIndexing by latent semantic analysis\u201d, \n                  Journal of the American Society for Information Science\n               , Vol. 41 No. 6, pp. 391-407.","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"key2020121502255667500_b11","unstructured":"Ferret, O.\n                (2009), \u201cImproving text segmentation by combining endogenous and exogenous methods\u201d, International Conference of Recent Advances in Natural Language Processing (RANLP), Borovets, pp. 88-93."},{"key":"key2020121502255667500_b12","doi-asserted-by":"crossref","unstructured":"Gebremeskel, G.B.\n               , \n                  Yi, C.\n               , \n                  He, Z.\n                and \n                  Haile, D.\n                (2016), \u201cCombined data mining techniques based patient data outlier detection for healthcare safety\u201d, \n                  International Journal of Intelligent Computing and Cybernetics\n               , Vol. 9 No. 1, pp. 42-68.","DOI":"10.1108\/IJICC-07-2015-0024"},{"key":"key2020121502255667500_b13","doi-asserted-by":"crossref","unstructured":"Gruber, T.R.\n                (1993), \u201cA translation approach to portable ontology specifications\u201d, \n                  Knowledge Acquisition\n               , Vol. 5 No. 2, pp. 199-220.","DOI":"10.1006\/knac.1993.1008"},{"key":"key2020121502255667500_b14","unstructured":"Habacha, A.C.\n               , \n                  Naili, M.\n                and \n                  Sammoud, S.\n                (2014), \u201cTopic segmentation for textual document written in Arabic language\u201d, International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, KES-2014, Procedia Computer Science, Gdynia, Vol. 35, pp. 437-446."},{"key":"key2020121502255667500_b15","unstructured":"Hearst, M.A.\n                (1997), \u201cTextTiling: segmenting text into multi-paragraph subtopic passages\u201d, \n                  Computational Linguistics\n               , Vol. 23 No. 1, pp. 33-64."},{"key":"key2020121502255667500_b16","unstructured":"Hofmann, T.\n                (1999), \u201cProbabilistic latent semantic analysis\u201d, Proceedings of Uncertainty in Artificial Intelligence, Stockholm, pp. 289-296."},{"key":"key2020121502255667500_b17","doi-asserted-by":"crossref","unstructured":"Labadie, A.\n                and \n                  Prince, V.\n                (2008), \u201cLexical and semantic methods in inner text topic segmentation: a comparison between C99 and Transeg\u201d, NLDB, pp. 347-349.","DOI":"10.1007\/978-3-540-69858-6_40"},{"key":"key2020121502255667500_b20","doi-asserted-by":"crossref","unstructured":"Misra, H.\n               , \n                  Yvon, F.\n               , \n                  Jose, J.M.\n                and \n                  Cappe, O.\n                (2009), \u201cText segmentation via topic modeling: an analytical study\u201d, Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 1553-1556.","DOI":"10.1145\/1645953.1646170"},{"key":"key2020121502255667500_b18","unstructured":"Naili, M.\n               , \n                  Habacha, A.C.\n                and \n                  Ben Ghezala, H.H.\n                (2016), \u201cParameters driving effectiveness of LSA on topic segmentation\u201d, 17th International Conference on Intelligent Text Processing and Computational Linguistics CICLing, Springer LNCS Series, Lecture Notes in Computer Science, Konya, April 3-9."},{"key":"key2020121502255667500_b19","doi-asserted-by":"crossref","unstructured":"Nebhi, K.\n                (2012), \u201cOntology-based information extraction for French newspaper articles\u201d, 35th German Conference on Artificial Intelligence, KI 2012: Advances in Artificial Intelligence, Springer Berlin, Heidelberg, pp. 237-240.","DOI":"10.1007\/978-3-642-33347-7_22"},{"key":"key2020121502255667500_b21","doi-asserted-by":"crossref","unstructured":"Pevzner, L.\n                and \n                  Hearst, M.A.\n                (2002), \u201cA critique and improvement of an evaluation metric for text segmentation\u201d, \n                  Computational Linguistics\n               , Vol. 28 No. 1, pp. 19-36.","DOI":"10.1162\/089120102317341756"},{"key":"key2020121502255667500_b22","doi-asserted-by":"crossref","unstructured":"Porter, M.F.\n                (1980), \u201cAn algorithm for suffix stripping\u201d, \n                  Program\n               , pp. 130-137.","DOI":"10.1108\/eb046814"},{"key":"key2020121502255667500_b23","unstructured":"Reynar, J.\n                (1998), \u201cTopic segmentation: algorithms and application\u201d, PhD thesis, Computer and Information Science, University of Pennsylvania, Pennsylvania."},{"key":"key2020121502255667500_b24","doi-asserted-by":"crossref","unstructured":"Sriharee, G.\n                (2014), \u201cAn ontology-based approach to auto-tagging articles\u201d, \n                  Vietnam Journal of Computer Science\n               , Vol. 2 No. 2, pp. 85-94.","DOI":"10.1007\/s40595-014-0033-6"},{"key":"key2020121502255667500_b25","unstructured":"Stokes, N.\n               , \n                  Carthy, J.\n                and \n                  Smeaton, A.F.\n                (2004), \u201cSeLeCT: a lexical cohesion based news story segmentation system\u201d, \n                  AI Communications\n               , Vol. 17 No. 1, pp. 3-12."},{"key":"key2020121502255667500_b26","doi-asserted-by":"crossref","unstructured":"Uschold, M.\n                and \n                  King, M.\n                (1996), \u201cOntologies: principles, methods and applications\u201d, \n                  Knowledge Engineering Review\n               , Vol. 11 No. 2, pp. 93-155.","DOI":"10.1017\/S0269888900007797"}],"container-title":["International Journal of Intelligent Computing and Cybernetics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/IJICC-01-2016-0001","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/IJICC-01-2016-0001\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/IJICC-01-2016-0001\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:54:01Z","timestamp":1753397641000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/ijicc\/article\/9\/2\/165-178\/135104"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,6,13]]},"references-count":26,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2016,6,13]]}},"alternative-id":["10.1108\/IJICC-01-2016-0001"],"URL":"https:\/\/doi.org\/10.1108\/ijicc-01-2016-0001","relation":{},"ISSN":["1756-378X"],"issn-type":[{"type":"print","value":"1756-378X"}],"subject":[],"published":{"date-parts":[[2016,6,13]]}}}