{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T19:27:01Z","timestamp":1754162821550,"version":"3.41.2"},"reference-count":42,"publisher":"MIT Press","issue":"2","license":[{"start":{"date-parts":[[2021,2,17]],"date-time":"2021-02-17T00:00:00Z","timestamp":1613520000000},"content-version":"vor","delay-in-days":413,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Online health communications often provide biased interpretations of evidence and have unreliable links to the source research. We tested the feasibility of a tool for matching web pages to their source evidence. From 207,538 eligible vaccination-related PubMed articles, we evaluated several approaches using 3,573 unique links to web pages from Altmetric. We evaluated methods for ranking the source articles for vaccine-related research described on web pages, comparing simple baseline feature representation and dimensionality reduction approaches to those augmented with canonical correlation analysis (CCA). Performance measures included the median rank of the correct source article; the percentage of web pages for which the source article was correctly ranked first (recall@1); and the percentage ranked within the top 50 candidate articles (recall@50). While augmenting baseline methods using CCA generally improved results, no CCA-based approach outperformed a baseline method, which ranked the correct source article first for over one quarter of web pages and in the top 50 for more than half. Tools to help people identify evidence-based sources for the content they access on vaccination-related web pages are potentially feasible and may support the prevention of bias and misrepresentation of research in news and social media.<\/jats:p>","DOI":"10.1162\/qss_a_00030","type":"journal-article","created":{"date-parts":[[2020,3,25]],"date-time":"2020-03-25T13:06:31Z","timestamp":1585141591000},"page":"810-823","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":0,"title":["Recommending research articles to consumers of online vaccination information"],"prefix":"10.1162","volume":"1","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7290-8187","authenticated-orcid":false,"given":"Eliza","family":"Harrison","sequence":"first","affiliation":[{"name":"Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6157-4740","authenticated-orcid":false,"given":"Paige","family":"Martin","sequence":"additional","affiliation":[{"name":"Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2299-2971","authenticated-orcid":false,"given":"Didi","family":"Surian","sequence":"additional","affiliation":[{"name":"Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1720-8209","authenticated-orcid":false,"given":"Adam G.","family":"Dunn","sequence":"additional","affiliation":[{"name":"Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia"},{"name":"Discipline of Biomedical Informatics and Digital Health, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Sydney, Australia"}]}],"member":"281","published-online":{"date-parts":[[2020,6,1]]},"reference":[{"key":"2025073014021045300_bib1","unstructured":"Beam,  A. L., Kompa,  B., Fried,  I., Palmer,  N. P., Shi,  X., Cai,  T., & Kohane,  I. S. (2020). Clinical concept embeddings learned from massive sources of medical data. Pacific Symposium on Biocomputing, 25, 295\u2013306."},{"key":"2025073014021045300_bib2","doi-asserted-by":"crossref","unstructured":"Bean,  S. J.\n           (2011). Emerging and continuing trends in vaccine opposition website content. Vaccine, 29(10), 1874\u20131880. https:\/\/doi.org\/10.1016\/j.vaccine.2011.01.003","DOI":"10.1016\/j.vaccine.2011.01.003"},{"key":"2025073014021045300_bib3","unstructured":"Castell,  S., Charlton,  A., Clemence,  M., Pettigrew,  N., PopeS., Quigley,  A., Shah,  J. N., & Silman,  T. (2014). Public attitudes to science 2014: Main report. URN BIS\/14\/P111, Ipsos MORI."},{"key":"2025073014021045300_bib4","doi-asserted-by":"crossref","unstructured":"Charnock,  D., & Shepperd,  S. (2004). Learning to DISCERN online: Applying an appraisal tool to health websites in a workshop setting. Health Education Research, 19(4), 440\u2013446. https:\/\/doi.org\/10.1093\/her\/cyg046","DOI":"10.1093\/her\/cyg046"},{"key":"2025073014021045300_bib5","doi-asserted-by":"crossref","unstructured":"Charnock,  D., Shepperd,  S., Needham,  G., & Gann,  R. (1999). DISCERN: an instrument for judging the quality of written consumer health information on treatment choices. Journal of Epidemiology and Community Health, 53(2), 105\u2013111. https:\/\/doi.org\/10.1136\/jech.53.2.105","DOI":"10.1136\/jech.53.2.105"},{"key":"2025073014021045300_bib6","doi-asserted-by":"crossref","unstructured":"Cooper Robbins,  S. C., Pang,  C., & Leask,  J. (2012). Australian Newspaper Coverage of Human Papillomavirus Vaccination, October 2006\u2013December 2009. Journal of Health Communication, 17(2), 149\u2013159. https:\/\/doi.org\/10.1080\/10810730.2011.585700","DOI":"10.1080\/10810730.2011.585700"},{"key":"2025073014021045300_bib7","doi-asserted-by":"crossref","unstructured":"Dunn,  A. G., Coiera,  E., & Bourgeois,  F. T. (2018). Unreported links between trial registrations and published articles were identified using document similarity measures in a cross-sectional analysis of ClinicalTrials.gov. Journal of Clinical Epidemiology, 95(Mar), 94\u2013101. https:\/\/doi.org\/10.1016\/j.jclinepi.2017.12.007","DOI":"10.1016\/j.jclinepi.2017.12.007"},{"key":"2025073014021045300_bib8","doi-asserted-by":"crossref","unstructured":"Eysenbach,  G.\n           (2002). How do consumers search for and appraise health information on the world wide web? Qualitative study using focus groups, usability tests, and in-depth interviews. BMJ, 324(7337), 573\u2013577. https:\/\/doi.org\/10.1136\/bmj.324.7337.573","DOI":"10.1136\/bmj.324.7337.573"},{"key":"2025073014021045300_bib9","unstructured":"Fox,  S., & Duggan,  M. (2013). Health Online 2013. Pew Internet & American Life Project. https:\/\/www.pewresearch.org\/internet\/2013\/01\/15\/health-online-2013\/"},{"key":"2025073014021045300_bib10","unstructured":"Fox,  S., & Rainie,  L. (2000). The online health care revolution. Pew Internet & American Life Project: Online Life Report. https:\/\/www.pewresearch.org\/internet\/2000\/11\/26\/the-online-health-care-revolution\/"},{"key":"2025073014021045300_bib11","unstructured":"Fox,  S., & Rainie,  L. (2002). Vital decisions: A Pew Internet Health Report. Pew Internet & American Life Project. https:\/\/www.pewresearch.org\/internet\/2002\/05\/22\/vital-decisions-a-pew-internet-health-report\/"},{"key":"2025073014021045300_bib12","doi-asserted-by":"crossref","unstructured":"Grundy,  Q., Dunn,  A. G., Bourgeois,  F. T., Coiera,  E., & Bero,  L. (2018). Prevalence of disclosed conflicts of interest in biomedical research and associations with journal impact factors and altmetric scores. JAMA, 319(4), 408. https:\/\/doi.org\/10.1001\/jama.2017.20738","DOI":"10.1001\/jama.2017.20738"},{"key":"2025073014021045300_bib13","doi-asserted-by":"crossref","unstructured":"Halko\n              N.\n            , MartinssonP. G., & TroppJ. A. (2011). Finding structure with randomness: probabilistic algorithms for constructing approximate matrix decompositions. SIAM Review, 53(2) (May), 217\u2013288. https:\/\/doi.org\/10.1137\/090771806","DOI":"10.1137\/090771806"},{"key":"2025073014021045300_bib14","doi-asserted-by":"crossref","unstructured":"Haneef,  R., Ravaud,  P., Baron,  G., Ghosn,  L., & Boutron,  I. (2017). Factors associated with online media attention to research: A cohort study of articles evaluating cancer treatments. Research Integrity and Peer Review, 2(9), 1\u20138. https:\/\/doi.org\/10.1186\/s41073-017-0033-z","DOI":"10.1186\/s41073-017-0033-z"},{"key":"2025073014021045300_bib15","doi-asserted-by":"crossref","unstructured":"Hotelling,  H.\n           (1936). Relations between two sets of variates. Biometrika, 28(3\/4), 321. https:\/\/doi.org\/10.2307\/2333955","DOI":"10.2307\/2333955"},{"key":"2025073014021045300_bib16","doi-asserted-by":"crossref","unstructured":"Howard,  J., & Ruder,  S. (2018). Universal language model fine-tuning for text classification. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Melbourne, Australia. https:\/\/doi.org\/10.18653\/v1\/P18-1031","DOI":"10.18653\/v1\/P18-1031"},{"key":"2025073014021045300_bib17","doi-asserted-by":"crossref","unstructured":"Huang,  Z., Li,  X., & Chen,  H. (2005). Link prediction approach to collaborative filtering. In Proceedings of the 5th ACM\/IEEE-CS joint conference on Digital libraries\u2014JCDL \u201905, pp. 141\u2013142. New York, NY: ACM Press. https:\/\/doi.org\/10.1145\/1065385.1065415","DOI":"10.1145\/1065385.1065415"},{"key":"2025073014021045300_bib18","unstructured":"Huskinson,  T., Gilby,  N., Evans,  H., Stevens,  J., & Tipping,  S. (2016). Wellcome Trust Monitor Report Wave 3 Tracking public views on science and biomedical research. Wellcome Trust Monitor: Wave 3. https:\/\/wellcome.ac.uk\/sites\/default\/files\/monitor-wave3-full-wellcome-apr16.pdf"},{"key":"2025073014021045300_bib19","doi-asserted-by":"crossref","unstructured":"Ibrahim,  O. A. S., & Landa-Silva,  D. (2017). ES-Rank: Evolution strategy learning to rank approach. In Proceedings of the Symposium on Applied Computing\u2014SAC \u201917, pp. 944\u2013950. New York, NY: ACM Press. https:\/\/doi.org\/10.1145\/3019612.3019696","DOI":"10.1145\/3019612.3019696"},{"key":"2025073014021045300_bib20","doi-asserted-by":"crossref","unstructured":"Kata,  A.\n           (2010). A postmodern Pandora\u2019s box: Anti-vaccination misinformation on the Internet. Vaccine, 28(7), 1709\u20131716. https:\/\/doi.org\/10.1016\/J.VACCINE.2009.12.022","DOI":"10.1016\/j.vaccine.2009.12.022"},{"key":"2025073014021045300_bib21","doi-asserted-by":"crossref","unstructured":"Kata,  A.\n           (2012). Anti-vaccine activists, Web 2.0, and the postmodern paradigm\u2014An overview of tactics and tropes used online by the anti-vaccination movement. Vaccine, 30(25), 3778\u20133789. https:\/\/doi.org\/10.1016\/J.VACCINE.2011.11.112","DOI":"10.1016\/j.vaccine.2011.11.112"},{"key":"2025073014021045300_bib22","doi-asserted-by":"crossref","unstructured":"Koren,  Y., Bell,  R., & Volinsky,  C. (2009). Matrix factorization techniques for recommender systems. Computer, 42(8), 30\u201337. https:\/\/doi.org\/10.1109\/MC.2009.263","DOI":"10.1109\/MC.2009.263"},{"key":"2025073014021045300_bib23","doi-asserted-by":"crossref","unstructured":"Larson,  H. J.\n           (2018). The biggest pandemic risk? Viral misinformation. Nature, 562(7727), 309\u2013309. https:\/\/doi.org\/10.1038\/d41586-018-07034-4","DOI":"10.1038\/d41586-018-07034-4"},{"key":"2025073014021045300_bib24","doi-asserted-by":"crossref","unstructured":"Larson,  H. J., Cooper,  L. Z., Eskola,  J., Katz,  S. L., & Ratzan,  S. (2011). Addressing the vaccine confidence gap. The Lancet, 378, 526\u2013535. https:\/\/doi.org\/10.1016\/S0140","DOI":"10.1016\/S0140-6736(11)60678-8"},{"key":"2025073014021045300_bib25","doi-asserted-by":"crossref","unstructured":"Lau,  A. Y. S., & Coiera,  E. W. (2007). Do people experience cognitive biases while searching for information?Journal of the American Medical Informatics Association, 14(5), 599\u2013608. https:\/\/doi.org\/10.1197\/jamia.M2411","DOI":"10.1197\/jamia.M2411"},{"key":"2025073014021045300_bib26","doi-asserted-by":"crossref","unstructured":"Liu,  T.-Y.\n           (2009). Learning to rank for information retrieval. Foundations and Trends in Information Retrieval, 3(3), 225\u2013331. https:\/\/doi.org\/10.1561\/1500000016","DOI":"10.1561\/1500000016"},{"key":"2025073014021045300_bib27","doi-asserted-by":"crossref","unstructured":"Magerman,  T., van Looy,  B., & Song,  X. (2010). Exploring the feasibility and accuracy of Latent Semantic Analysis based text mining techniques to detect similarity between patent documents and scientific publications. Scientometrics, 82(2), 289\u2013306. https:\/\/doi.org\/10.1007\/s11192-009-0046-6","DOI":"10.1007\/s11192-009-0046-6"},{"key":"2025073014021045300_bib28","doi-asserted-by":"crossref","unstructured":"Menon,  A. K., Surian,  D., & Chawla,  S. (2015). Cross-modal retrieval: A pairwise classification approach. In Proceedings of the 2015 SIAM International Conference on Data Mining, pp. 199\u2013207. Philadelphia, PA: Society for Industrial and Applied Mathematics. https:\/\/doi.org\/10.1137\/1.9781611974010.23","DOI":"10.1137\/1.9781611974010.23"},{"key":"2025073014021045300_bib29","unstructured":"Mikolov,  T., Sutskever,  I., Chen,  K., Corrado,  G. S., & Dean,  J. (2013). Distributed representations of words and phrases and their compositionality. In NIPS \u201913 Proceedings of the 26th International Conference on Neural Information Processing Systems\u2014Volume 2, pp. 3111\u20133119."},{"key":"2025073014021045300_bib30","doi-asserted-by":"crossref","unstructured":"Moran,  M. B., Lucas,  M., Everhart,  K., Morgan,  A., & Prickett,  E. (2016). What makes anti-vaccine websites persuasive? A content analysis of techniques used by anti-vaccine websites to engender anti-vaccine sentiment. Journal of Communication in Healthcare, 9(3), 151\u2013163. https:\/\/doi.org\/10.1080\/17538068.2016.1235531","DOI":"10.1080\/17538068.2016.1235531"},{"key":"2025073014021045300_bib31","doi-asserted-by":"crossref","unstructured":"Peters,  M. E., Neumann,  M., Iyyer,  M., Gardner,  M., Clark,  C., Lee,  K., & Zettlemoyer,  L. (2018). Deep contextualized word representations. In Proceedings of NAACL-HLT 2018, pp. 2227\u20132237.","DOI":"10.18653\/v1\/N18-1202"},{"key":"2025073014021045300_bib32","doi-asserted-by":"crossref","unstructured":"Rasiwasia,  N., Costa Pereira,  J., Coviello,  E., Doyle,  G., Lanckriet,  G. R. G., Levy,  R., & Vasconcelos,  N. (2010). A new approach to cross-modal multimedia retrieval. In Proceedings of the International Conference on Multimedia\u2014MM \u201910, pp. 251\u2013260. New York, NY: ACM Press. https:\/\/doi.org\/10.1145\/1873951.1873987","DOI":"10.1145\/1873951.1873987"},{"key":"2025073014021045300_bib33","unstructured":"Ramos,  J.\n           (2003). Using TF-IDF to determine word relevance in document queries. In Proceedings of the First Instructional Conference on Machine Learning, Piscataway, NJ. https:\/\/www.cs.rutgers.edu\/~mlittman\/courses\/ml03\/iCML03\/papers\/ramos.pdf"},{"key":"2025073014021045300_bib34","doi-asserted-by":"crossref","unstructured":"Robertson,  S.\n           (2004) Understanding inverse document frequency: On theoretical arguments for IDF. Journal of Documentation, 60(5), 503\u2013520. https:\/\/doi.org\/10.1108\/00220410410560582","DOI":"10.1108\/00220410410560582"},{"key":"2025073014021045300_bib35","doi-asserted-by":"crossref","unstructured":"Selvaraj,  S., Borkar,  D. S., & Prasad,  V. (2014). Media coverage of medical journals: Do the best articles make the news?PLoS ONE, 9(1), e85355. https:\/\/doi.org\/10.1371\/journal.pone.0085355","DOI":"10.1371\/journal.pone.0085355"},{"key":"2025073014021045300_bib36","doi-asserted-by":"crossref","unstructured":"Shah,  Z., Surian,  D., Mandl,  K. D., & Dunn,  A. G. (2019). Automatically applying a credibility appraisal tool to track vaccination-related communications shared on social media. Journal of Medical Internet Research, 21(11), e14007. https:\/\/doi.org\/10.2196\/14007","DOI":"10.2196\/14007"},{"key":"2025073014021045300_bib37","doi-asserted-by":"crossref","unstructured":"Sp\u00e4rck Jones,  K.\n           (1972). A statistical interpretation of term specificity and its application in retrieval, Journal of Documentation, 28(1), 11\u201321. https:\/\/doi.org\/10.1108\/eb026526","DOI":"10.1108\/eb026526"},{"key":"2025073014021045300_bib38","unstructured":"Steffens,  M., Dunn,  A. G., and Leask,  J. (2017). Meeting the challenges of reporting on public health in the new media landscape. Australian Journalism Review, 39(2), 119\u2013132."},{"key":"2025073014021045300_bib39","doi-asserted-by":"crossref","unstructured":"Weaver,  J. B., Thompson,  N. J., Weaver,  S. S., & Hopkins,  G. L. (2009). Healthcare non-adherence decisions and internet health information. Computers in Human Behavior, 25(6), 1373\u20131380. https:\/\/doi.org\/10.1016\/J.CHB.2009.05.011","DOI":"10.1016\/j.chb.2009.05.011"},{"key":"2025073014021045300_bib40","unstructured":"World Health Organization (WHO). (2019). Ten threats to global health in 2019. Retrieved March 1, 2019, from https:\/\/www.who.int\/emergencies\/ten-threats-to-global-health-in-2019"},{"key":"2025073014021045300_bib41","doi-asserted-by":"crossref","unstructured":"Yavchitz,  A., Boutron,  I., Bafeta,  A., Marroun,  I., Charles,  P., Mantz,  J., & Ravaud,  P. (2012). Misrepresentation of randomized controlled trials in press releases and news coverage: A cohort study. PLOS Medicine, 9(9), e1001308. https:\/\/doi.org\/10.1371\/journal.pmed.1001308","DOI":"10.1371\/journal.pmed.1001308"},{"key":"2025073014021045300_bib42","doi-asserted-by":"crossref","unstructured":"Zeraatkar,  D., Obeda,  M., Ginsberg,  J. S., & Hirsh,  J. (2017). The development and validation of an instrument to measure the quality of health research reports in the lay media. BMC Public Health, 17(1), 343. https:\/\/doi.org\/10.1186\/s12889-017-4259-y","DOI":"10.1186\/s12889-017-4259-y"}],"container-title":["Quantitative Science Studies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/qss\/article-pdf\/1\/2\/810\/1885763\/qss_a_00030.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/qss\/article-pdf\/1\/2\/810\/1885763\/qss_a_00030.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T18:02:24Z","timestamp":1753898544000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/qss\/article\/1\/2\/810\/96135\/Recommending-research-articles-to-consumers-of"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020]]},"references-count":42,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,6,1]]}},"URL":"https:\/\/doi.org\/10.1162\/qss_a_00030","relation":{},"ISSN":["2641-3337"],"issn-type":[{"type":"electronic","value":"2641-3337"}],"subject":[],"published-other":{"date-parts":[[2020]]},"published":{"date-parts":[[2020]]}}}