{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,2]],"date-time":"2025-03-02T06:40:06Z","timestamp":1740897606113,"version":"3.38.0"},"reference-count":43,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2000,10,1]],"date-time":"2000-10-01T00:00:00Z","timestamp":970358400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2000,10]]},"abstract":"<jats:p> This paper proposes a novel approach to automatic text segmentation without a full semantic understanding. In order to analyse the linguistic bonds and determine the degree of coherence that a text may exhibit, the tremendous diversity of textual relations in a discourse network is represented. A corpus of mutual linguistic knowledge that captures the similarity of meaning and causal relations is encoded in the discourse network, which is then subjected to a cluster algorithm. As a result, segments in the text are segregated into clusters according to their textual similarity. Topic boundaries in a text can be identified by observing the shifts of segments from one cluster to another. The experimental results show that the combination of the heterogeneous knowledge is capable of addressing the topic shifts. Comparison with a related method demonstrates that the algorithm is closely related to the topic boundaries. Given the increasing recognition of text structure in the fields of information retrieval in unpartitioned text, this approach provides a quantitative model and an efficient tool in text segmentation. 
<\/jats:p>","DOI":"10.1177\/016555150002600504","type":"journal-article","created":{"date-parts":[[2004,12,18]],"date-time":"2004-12-18T01:36:35Z","timestamp":1103333795000},"page":"313-328","source":"Crossref","is-referenced-by-count":1,"title":["Using heterogeneous linguistic knowledge in local coherence identification for information retrieval"],"prefix":"10.1177","volume":"26","author":[{"given":"Samuel W.K.","family":"Chan","sequence":"first","affiliation":[{"name":"The Chinese University of Hong Kong, Hong Kong, China"}]}],"member":"179","published-online":{"date-parts":[[2000,10,1]]},"reference":[{"key":"atypb1","doi-asserted-by":"publisher","DOI":"10.1207\/s15516709cog1602_3"},{"key":"atypb2","doi-asserted-by":"publisher","DOI":"10.1016\/0004-3702(95)00116-6"},{"key":"atypb3","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198538493.001.0001","volume-title":"Neural Networks for Pattern Recognition","author":"C.M. Bishop","year":"1995"},{"key":"atypb4","doi-asserted-by":"publisher","DOI":"10.1177\/016555159902500203"},{"first-page":"694","volume-title":"Proceedings of IEEE International Conference on Neural Networks","author":"S.W.K. Chan","key":"atypb5"},{"volume-title":"On Human Communication","year":"1975","author":"C. Cherry","key":"atypb6"},{"key":"atypb7","first-page":"21","volume-title":"Connectionist Approaches to Natural Language Processing","author":"M.G. Dyer","year":"1992"},{"key":"atypb8","first-page":"203","volume":"21","author":"B.J. Grosz","year":"1995","journal-title":"Computational Linguistics"},{"key":"atypb9","doi-asserted-by":"publisher","DOI":"10.1177\/016555159602200503"},{"key":"atypb10","first-page":"33","volume":"23","author":"M.A. Hearst","year":"1997","journal-title":"Computational Linguistics"},{"first-page":"41","volume-title":"Proceedings of the NATO Advanced Research Workshop on Burning Issues in Discourse","author":"J.R. 
Hobbs","key":"atypb11"},{"key":"atypb12","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.85.5.363"},{"first-page":"286","volume-title":"Proceedings of the Thirty-first Annual Meeting of the Association for Computational Linguistics","author":"H. Kozima","key":"atypb13"},{"key":"atypb14","doi-asserted-by":"publisher","DOI":"10.1108\/eb026913"},{"first-page":"108","volume-title":"Proceedings of the Thirty-third Annual Meeting of the Association for Computational Linguistics","author":"D.J. Litman","key":"atypb15"},{"volume-title":"Reasoning by Analogy and Causality: A Model and Application","year":"1994","author":"D. Long","key":"atypb16"},{"first-page":"1092","volume-title":"Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence","author":"W. Lowe","key":"atypb17"},{"first-page":"660","volume-title":"Proceedings of the Seventeenth Annual Conference of the Cognitive Science Society","author":"L. Lund","key":"atypb18"},{"issue":"3","key":"atypb19","first-page":"243","volume":"8","author":"W.C. Mann","year":"1988","journal-title":"Text"},{"key":"atypb20","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511620751"},{"key":"atypb21","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.99.3.440"},{"volume-title":"Subsymbolic Natural Language Processing: An Integrated Model of Scripts, Lexicon, and Memory","year":"1993","author":"R. Miikkulainen","key":"atypb22"},{"key":"atypb23","first-page":"21","volume":"17","author":"J. Morris","year":"1991","journal-title":"Computational Linguistics"},{"key":"atypb24","first-page":"228","volume-title":"New Methods in Language Processing","author":"T. Nomoto","year":"1997"},{"volume-title":"Discourse Analysis Monographs","year":"1989","author":"M. 
Phillips","key":"atypb25"},{"key":"atypb26","doi-asserted-by":"publisher","DOI":"10.1177\/016555159902500503"},{"key":"atypb27","doi-asserted-by":"publisher","DOI":"10.1016\/0378-2166(88)90050-1"},{"key":"atypb28","doi-asserted-by":"publisher","DOI":"10.1016\/0004-3702(90)90005-K"},{"first-page":"113","volume-title":"Proceedings of First European Conference on Research and Advanced Technology for Digital Libraries","author":"J.M. Ponte","key":"atypb29"},{"key":"atypb30","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(198909)40:5<304::AID-ASI2>3.0.CO;2-6"},{"key":"atypb31","doi-asserted-by":"publisher","DOI":"10.1109\/21.24528"},{"key":"atypb32","doi-asserted-by":"publisher","DOI":"10.1613\/jair.514"},{"key":"atypb33","doi-asserted-by":"publisher","DOI":"10.1007\/BF00203171"},{"volume-title":"Automatic Text Processing","year":"1989","author":"G. Salton","key":"atypb34"},{"key":"atypb35","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4573(96)00062-3"},{"key":"atypb36","doi-asserted-by":"publisher","DOI":"10.1007\/978-94-017-2388-6_1"},{"volume-title":"Text and Texture: Patterns of Cohesion. Advances in Discourse Processes","year":"1991","author":"S. Stoddard","key":"atypb37"},{"key":"atypb38","doi-asserted-by":"publisher","DOI":"10.1016\/0749-596X(88)90011-3"},{"key":"atypb39","doi-asserted-by":"publisher","DOI":"10.1080\/01638539309544835"},{"first-page":"23","volume-title":"Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics","author":"G. Whittemore","key":"atypb40"},{"first-page":"59","volume-title":"Proceedings of the International Conference of Recent Advances in Natural Language Processing","author":"Y. 
Yaari","key":"atypb41"},{"key":"atypb42","doi-asserted-by":"publisher","DOI":"10.2307\/415076"},{"key":"atypb43","doi-asserted-by":"publisher","DOI":"10.1037\/0278-7393.21.2.386"}],"container-title":["Journal of Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/016555150002600504","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/016555150002600504","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,2]],"date-time":"2025-03-02T05:28:20Z","timestamp":1740893300000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/016555150002600504"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2000,10]]},"references-count":43,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2000,10]]}},"alternative-id":["10.1177\/016555150002600504"],"URL":"https:\/\/doi.org\/10.1177\/016555150002600504","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"type":"print","value":"0165-5515"},{"type":"electronic","value":"1741-6485"}],"subject":[],"published":{"date-parts":[[2000,10]]}}}