{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T19:15:09Z","timestamp":1762542909230,"version":"3.41.2"},"reference-count":40,"publisher":"Emerald","issue":"7","license":[{"start":{"date-parts":[[2018,9,5]],"date-time":"2018-09-05T00:00:00Z","timestamp":1536105600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["OIR"],"published-print":{"date-parts":[[2018,10,16]]},"abstract":"<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title>\n<jats:p>The quality of consumer-oriented health information on the web has been defined and evaluated in several studies. Usually it is based on evaluation criteria identified by the researchers and, so far, there is no agreed standard for the quality indicators to use. Based on such indicators, tools have been developed to evaluate the quality of web information. The HONcode is one of such tools. The purpose of this paper is to investigate the influence of web document features on their quality, using HONcode as ground truth, with the aim of finding whether it is possible to predict the quality of a document using its characteristics.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title>\n<jats:p>The present work uses a set of health documents and analyzes how their characteristics (e.g. web domain, last update, type, mention of places of treatment and prevention strategies) are associated with their quality. Based on these features, statistical models are built which predict whether health-related web documents have certification-level quality. Multivariate analysis is performed, using classification to estimate the probability of a document having quality given its characteristics. 
This approach tells us which predictors are important. Three types of full and reduced logistic regression models are built and evaluated. The first one includes every feature, without any exclusion, the second one disregards the Utilization Review Accreditation Commission variable, due to it being a quality indicator, and the third one excludes the variables related to the HONcode principles, which might also be indicators of quality. The reduced models were built with the aim to see whether they reach similar results with a smaller number of features.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Findings<\/jats:title>\n<jats:p>The prediction models have high accuracy, even without including the characteristics of Health on the Net code principles in the models. The most informative prediction model considers characteristics that can be assessed automatically (e.g. split content, type, process of revision and place of treatment). It has an accuracy of 89 percent.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title>\n<jats:p>This paper proposes models that automatically predict whether a document has quality or not. Some of the used features (e.g. prevention, prognosis or treatment) have not yet been explicitly considered in this context. The findings of the present study may be used by search engines to promote high-quality documents. 
This will improve health information retrieval and may contribute to reduce the problems caused by inaccurate information.<\/jats:p>\n<\/jats:sec>","DOI":"10.1108\/oir-01-2017-0028","type":"journal-article","created":{"date-parts":[[2018,9,5]],"date-time":"2018-09-05T10:43:55Z","timestamp":1536144235000},"page":"1024-1047","source":"Crossref","is-referenced-by-count":8,"title":["Predicting the quality of health web documents using their characteristics"],"prefix":"10.1108","volume":"42","author":[{"given":"Melinda","family":"Oroszl\u00e1nyov\u00e1","sequence":"first","affiliation":[]},{"given":"Carla","family":"Teixeira Lopes","sequence":"additional","affiliation":[]},{"given":"S\u00e9rgio","family":"Nunes","sequence":"additional","affiliation":[]},{"given":"Cristina","family":"Ribeiro","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2018,9,5]]},"reference":[{"issue":"4","key":"key2021041509444695000_ref001","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2382438.2382441","article-title":"Detecting fake medical web sites using recursive trust labeling","volume":"30","year":"2012","journal-title":"ACM Transactions on Information Systems"},{"issue":"20","key":"key2021041509444695000_ref002","doi-asserted-by":"crossref","first-page":"2612","DOI":"10.1001\/jama.285.20.2612","article-title":"Health information on the internet: accessibility, quality, and readability in English and Spanish","volume":"285","year":"2001","journal-title":"Journal of the American Medical Association"},{"issue":"1","key":"key2021041509444695000_ref003","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.ijmedinf.2004.10.001","article-title":"Instruments to assess the quality of health information on the World Wide Web: what can our patients actually use?","volume":"74","year":"2005","journal-title":"International Journal of Medical 
Informatics"},{"key":"key2021041509444695000_ref005","doi-asserted-by":"crossref","first-page":"940","DOI":"10.1016\/j.procs.2017.11.122","article-title":"How to sort trustworthy health online information? Improvements of the automated detection of HONcode criteria","volume":"121","year":"2017","journal-title":"Procedia Computer Science"},{"key":"key2021041509444695000_ref006","first-page":"53","article-title":"Evolution of health web certification through the HON code experience","volume":"169","year":"2011","journal-title":"Stud Health Technology and Inform"},{"key":"key2021041509444695000_ref007","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1016\/j.procs.2015.08.484","article-title":"Language independent tokenization vs. stemming in automated detection of health websites\u2019 HONcode conformity: an evaluation","volume":"64","year":"2015","journal-title":"Procedia Computer Science"},{"issue":"6","key":"key2021041509444695000_ref004","article-title":"Automated detection of HONcode website conformity compared to manual detection: an evaluation","volume":"17","year":"2015","journal-title":"Journal of Medical Internet Research"},{"issue":"4","key":"key2021041509444695000_ref008","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1080\/1369118042000305610","article-title":"Health information seals of approval: what do they signify?","volume":"7","year":"2004","journal-title":"Information, Communication & Society"},{"key":"key2021041509444695000_ref009","first-page":"1","article-title":"Quality of online resources for pancreatic cancer patients","year":"2017","journal-title":"Journal of Cancer Education"},{"issue":"20","key":"key2021041509444695000_ref010","doi-asserted-by":"crossref","first-page":"2691","DOI":"10.1001\/jama.287.20.2691","article-title":"Empirical studies assessing the quality of health information for consumers on the world wide web: a systematic review","volume":"287","year":"2002","journal-title":"Journal of the American Medical 
Association"},{"issue":"7337","key":"key2021041509444695000_ref011","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1136\/bmj.324.7337.573","article-title":"How do consumers search for and appraise health information on the world wide web? Qualitative study using focus groups, usability tests, and in-depth","volume":"324","year":"2002","journal-title":"British Medical Journal"},{"issue":"1","key":"key2021041509444695000_ref012","doi-asserted-by":"crossref","first-page":"24","DOI":"10.4066\/AMJ.2014.1900","article-title":"Quality of patient health information on the internet: reviewing a complete and evolving landscape","volume":"7","year":"2014","journal-title":"Australasian Medical Journal"},{"volume-title":"The Social Life of Health Information","year":"2011","key":"key2021041509444695000_ref013"},{"first-page":"185","article-title":"Automatic retrieval of web pages with standards of ethics and trustworthiness within a medical portal: what a page name tells Us","year":"2007","key":"key2021041509444695000_ref014"},{"article-title":"Machine learning approach for automatic quality criteria detection of health web pages","volume-title":"Studies in Health Technology and Informatics 129(Pt 1)","year":"2007","key":"key2021041509444695000_ref015"},{"issue":"2","key":"key2021041509444695000_ref016","doi-asserted-by":"crossref","first-page":"1","DOI":"10.4018\/IJUDH.2016070101","article-title":"Communication assessment checklist in health: assessment and comparison of web-based health resources","volume":"6","year":"2016","journal-title":"International Journal of User-Driven Healthcare"},{"key":"key2021041509444695000_ref017","unstructured":"HON (2015), \u201cHealth on the Net Foundation\u201d, available at: www.hon.ch\/ (accessed January 2017)."},{"issue":"7098","key":"key2021041509444695000_ref018","doi-asserted-by":"crossref","first-page":"1875","DOI":"10.1136\/bmj.314.7098.1875","article-title":"Reliability of health information for the public on the World 
Wide Web: systematic survey of advice on managing fever in children at home","volume":"314","year":"1997","journal-title":"British Medical Journal"},{"issue":"8","key":"key2021041509444695000_ref019","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1001\/jama.279.8.611","article-title":"Rating health information on the internet: navigating to knowledge or to babel?","volume":"279","year":"1998","journal-title":"The Journal of the American Medical Association"},{"volume-title":"An Introduction to Statistical Learning with Applications in R","year":"2013","key":"key2021041509444695000_ref020"},{"issue":"4","key":"key2021041509444695000_ref021","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1002\/asi.21035","article-title":"Describing and predicting information-seeking behavior on the web","volume":"60","year":"2009","journal-title":"Journal of the American Society for Information Science and Technology"},{"first-page":"205","article-title":"Context effect on query formulation and subjective relevance in health searches","year":"2010","key":"key2021041509444695000_ref022"},{"issue":"5","key":"key2021041509444695000_ref023","doi-asserted-by":"crossref","first-page":"951","DOI":"10.1002\/asi.22812","article-title":"Measuring the value of health query translation: an analysis by user language proficiency","volume":"64","year":"2013","journal-title":"Journal of the American Society for Information Science and Technology"},{"key":"key2021041509444695000_ref024","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1038\/ijir.2017.4","article-title":"Readability, credibility and quality of patient information for hypogonadism and testosterone replacement therapy on the internet","volume":"29","year":"2017","journal-title":"International Journal of Impotence Research"},{"key":"key2021041509444695000_ref025","doi-asserted-by":"crossref","first-page":"771","DOI":"10.1016\/j.procs.2015.08.627","article-title":"The influence of documents, users and tasks 
on the relevance and comprehension of health web documents","volume":"64","year":"2015","journal-title":"Procedia Computer Science"},{"key":"key2021041509444695000_ref026","doi-asserted-by":"crossref","first-page":"1552","DOI":"10.1136\/bmj.311.7019.1552","article-title":"Guide to the internet. The world wide web","volume":"311","year":"1995","journal-title":"British Medical Journal"},{"issue":"12","key":"key2021041509444695000_ref027","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1080\/09513590601012603","article-title":"Assessing the content and quality of information on the treatment of postmenopausal osteoporosis on the World Wide Web","volume":"22","year":"2006","journal-title":"Gynecological Endocrinology"},{"key":"key2021041509444695000_ref028","first-page":"911","article-title":"Filtering web pages for quality indicators: an empirical approach to finding high quality consumer health information on the World Wide Web","year":"1999","journal-title":"American Medical Informatics"},{"issue":"1c","key":"key2021041509444695000_ref029","first-page":"1","article-title":"Evaluation of dengue-related health information on the internet","volume":"9","year":"2012","journal-title":"Perspectives in Health Information Management"},{"key":"key2021041509444695000_ref030","doi-asserted-by":"crossref","first-page":"274","DOI":"10.1016\/j.ipm.2007.02.008","article-title":"Source preferences in the context of seeking problem-specific information","volume":"44","year":"2008","journal-title":"Information Processing and Management"},{"key":"key2021041509444695000_ref031","doi-asserted-by":"crossref","first-page":"1244","DOI":"10.1001\/jama.1997.03540390074039","article-title":"Assessing, controlling, and assuring the quality of medical information on the internet","volume":"277","year":"1997","journal-title":"Journal of the American Medical 
Association"},{"key":"key2021041509444695000_ref032","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1007\/978-3-642-28997-2_19","article-title":"Reliability prediction of webpages in the medical domain","volume":"7224","year":"2012","journal-title":"Advances in Information Retrieval"},{"key":"key2021041509444695000_ref033","unstructured":"Sousa, M.I. (2011), \u201cCharacterization of health web documents\u201d, Master\u2019s thesis, Master in Information Science, University of Porto."},{"key":"key2021041509444695000_ref034","unstructured":"URAC (2015), \u201cUtilization review accreditation commission\u201d, available at: www.urac.org\/ (accessed January 2017)."},{"issue":"5","key":"key2021041509444695000_ref035","first-page":"288","article-title":"Rule-based automatic criteria detection for assessing quality of online health information","volume":"5","year":"2007","journal-title":"Journal on Information Technology in Healthcare"},{"issue":"10","key":"key2021041509444695000_ref036","first-page":"2071","article-title":"Quality of health information for consumers on the web: a systematic review of indicators, criteria, tools, and evaluation results","volume":"66","year":"2015","journal-title":"Journal of the American Society for Information Science and Technology"},{"issue":"5","key":"key2021041509444695000_ref037","first-page":"980","article-title":"Predicting users\u2019 domain knowledge in information retrieval using multiple regression analysis of search behaviors","volume":"66","year":"2014","journal-title":"Journal of the American Society for Information Science and Technology"},{"issue":"12","key":"key2021041509444695000_ref038","first-page":"1","article-title":"Motivations for contributing to health-related articles on Wikipedia: an interview study","volume":"16","year":"2014","journal-title":"Journal of Medical Internet Research"},{"issue":"1","key":"key2021041509444695000_ref039","first-page":"1","article-title":"Wikipedia: a key tool for global 
public health promotion","volume":"13","year":"2011","journal-title":"Journal of Medical Internet Research"},{"issue":"4","key":"key2021041509444695000_ref040","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1197\/jamia.M3059","article-title":"Seeking health information online: does wikipedia matter?","volume":"16","year":"2009","journal-title":"Journal of the American Medical Informatics Association"}],"container-title":["Online Information Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/OIR-01-2017-0028\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/OIR-01-2017-0028\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:42:03Z","timestamp":1753396923000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/oir\/article\/42\/7\/1024-1047\/314625"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,9,5]]},"references-count":40,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2018,9,5]]},"published-print":{"date-parts":[[2018,10,16]]}},"alternative-id":["10.1108\/OIR-01-2017-0028"],"URL":"https:\/\/doi.org\/10.1108\/oir-01-2017-0028","relation":{},"ISSN":["1468-4527"],"issn-type":[{"type":"print","value":"1468-4527"}],"subject":[],"published":{"date-parts":[[2018,9,5]]}}}