{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T22:22:16Z","timestamp":1759962136059,"version":"3.41.2"},"reference-count":26,"publisher":"Emerald","issue":"4","license":[{"start":{"date-parts":[[2001,8,1]],"date-time":"2001-08-01T00:00:00Z","timestamp":996624000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2001,8,1]]},"abstract":"<jats:p>This article investigates how consistent different newspapers are in their choice of words when writing about the same news events. News articles on the same news events were taken from three Finnish newspapers and compared in regard to their central concepts and words representing the concepts in the news texts. Consistency figures were calculated for each set of three articles (the total number of sets was sixty). Inconsistency in words and concepts was found between news articles from different newspapers. The mean value of consistency calculated on the basis of words was 65 per cent; this however depended on the article length. For short news wires consistency was 83 per cent while for long articles it was only 47 per cent. At the concept level, consistency was considerably higher, ranging from 92 per cent to 97 per cent between short and long articles. The articles also represented three categories of topic (event, process and opinion). Statistically significant differences in consistency were found in regard to length but not in regard to the categories of topic. We argue that the expression inconsistency is a clear sign of a retrieval problem and that query expansion based on semantic relationships can significantly improve retrieval performance on free\u2010text sources.<\/jats:p>","DOI":"10.1108\/eum0000000007104","type":"journal-article","created":{"date-parts":[[2002,11,21]],"date-time":"2002-11-21T21:43:51Z","timestamp":1037915031000},"page":"535-548","source":"Crossref","is-referenced-by-count":3,"title":["Consistency of textual expression in newspaper articles: an argument for semantically based query expansion"],"prefix":"10.1108","volume":"57","author":[{"given":"Raija","family":"Lehtokangas","sequence":"first","affiliation":[]},{"given":"Kalervo","family":"J\u00e4rvelin","sequence":"additional","affiliation":[]}],"member":"140","reference":[{"volume-title":"Natural language processing: toward large-scale, robust systems","year":"1996","author":"Haas S.W.","key":"p_1"},{"key":"p_2","unstructured":"Pirkola, A. Studies on linguistic problems and methods in text retrieval. PhD dissertation, University of Tampere.Acta Universitatis Tamperensis672,1999."},{"key":"p_3","unstructured":"Kek\u00e4l\u00e4inen, J. The effects of query complexity, expansion and structure on retrieval performance in probabilistic text retrieval. PhD dissertation, University of Tampere.Acta Universitatis Tamperensis678,1999. http:\/\/ www.info.uta.fi\/research\/postscript_docs\/JK1_99.pdf (visited 16 November 2000)."},{"key":"p_4","doi-asserted-by":"publisher","DOI":"10.1108\/EUM0000000007087"},{"key":"p_5","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(198805)39:3<161::AID-ASI2>3.0.CO;2-0"},{"key":"p_6","doi-asserted-by":"publisher","DOI":"10.1108\/eb026736"},{"issue":"1","key":"p_7","doi-asserted-by":"crossref","first-page":"37","DOI":"10.3233\/ISU-1984-41-204","volume":"4","author":"Cleverdon C","year":"1984","journal-title":"Information Services & Use"},{"key":"p_8","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(95)80034-Q"},{"key":"p_9","unstructured":"Lehtokangas, R. Sananvalinnan yhdenmukaisuus sanomalehtiuutisissa [Expression consistency in newspaper articles]. MSc thesis,Department of Information Studies, Universityof Tampere.1999. (In Finnish)"},{"key":"p_10","doi-asserted-by":"crossref","unstructured":"Sormunen, E. A method for measuring wide range performance of Boolean queries in full-text databases. PhD dissertation, University of Tampere.Acta Electronica Universitatis Tamperensis, 2000. http:\/\/acta.uta.fi\/ pdf\/951-44-4732-8.pdf (visited 16 November 2000).","DOI":"10.1145\/345508.345541"},{"key":"p_11","doi-asserted-by":"publisher","DOI":"10.1515\/9783110852141"},{"volume-title":"News analysis: case studies of international and national news in the press","year":"1988","author":"van Dijk T.A.","key":"p_12"},{"volume-title":"News as discourse","year":"1988","author":"van Dijk T.A.","key":"p_13"},{"volume-title":"Documentation - guidelines for the establishment and development of monolingual thesauri","year":"1986","key":"p_14"},{"key":"p_15","first-page":"183","volume-title":"Helsinki: WSOY","author":"Sivula J.","year":"1989"},{"volume-title":"Helsinki: Finn Lectura","year":"1996","author":"Lep\u00e4smaa A-L.","key":"p_16"},{"volume-title":"WSOY","year":"1978","author":"Vesikansa J.","key":"p_17"},{"volume-title":"Mist\u00e4 sanat tulevat: suomalaista etymologiaa [Where do the words come from: Finnish etymology]. Helsinki: Suomalaisen kirjallisuuden seura","year":"1990","author":"H\u00e4kkinen K.","key":"p_18"},{"key":"p_19","doi-asserted-by":"publisher","DOI":"10.1108\/eb026960"},{"volume-title":"Query expansion","year":"1996","author":"Efthimiadis E.N.","key":"p_20"},{"key":"p_21","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(93)90102-J"},{"key":"p_22","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009983401464"},{"key":"p_23","doi-asserted-by":"publisher","DOI":"10.1145\/345508.345545"},{"key":"p_24","doi-asserted-by":"publisher","DOI":"10.1145\/219717.219748"},{"volume-title":"Information technology: The Sixth Text Retrieval Conference (TREC-6). Gaithersburg, Md.: National Institute of Standards and Technology","year":"1997","author":"Crestani F.","key":"p_25"},{"key":"p_26","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4471-2099-5_7"}],"container-title":["Journal of Documentation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/EUM0000000007104\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/EUM0000000007104\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T00:52:34Z","timestamp":1753404754000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/jd\/article\/57\/4\/535-548\/204613"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2001,8,1]]},"references-count":26,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2001,8,1]]}},"alternative-id":["10.1108\/EUM0000000007104"],"URL":"https:\/\/doi.org\/10.1108\/eum0000000007104","relation":{},"ISSN":["0022-0418"],"issn-type":[{"type":"print","value":"0022-0418"}],"subject":[],"published":{"date-parts":[[2001,8,1]]}}}