{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,23]],"date-time":"2025-06-23T11:45:37Z","timestamp":1750679137130},"reference-count":16,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2018,2,1]],"date-time":"2018-02-01T00:00:00Z","timestamp":1517443200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"name":"Fondo de Investigaciones Sanitarias","award":["PI15\/00558"],"award-info":[{"award-number":["PI15\/00558"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Objective<\/jats:title>\n                  <jats:p>The most used search engine for scientific literature, PubMed, provides tools to filter results by several fields. When searching for reports on clinical trials, sample size can be among the most important factors to consider. However, PubMed does not currently provide any means of filtering search results by sample size. Such a filtering tool would be useful in a variety of situations, including meta-analyses or state-of-the-art analyses to support experimental therapies. In this work, a tool was developed to filter articles identified by PubMed based on their reported sample sizes.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Materials and Methods<\/jats:title>\n                  <jats:p>A search engine was designed to send queries to PubMed, retrieve results, and compute estimates of reported sample sizes using a combination of syntactical and machine learning methods. The sample size search tool is publicly available for download at http:\/\/ihealth.uemc.es. Its accuracy was assessed against a manually annotated database of 750 random clinical trials returned by PubMed.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Validation tests show that the sample size search tool is able to accurately (1) estimate sample size for 70% of abstracts and (2) classify 85% of abstracts into sample size quartiles.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Conclusions<\/jats:title>\n                  <jats:p>The proposed tool was validated as useful for advanced PubMed searches of clinical trials when the user is interested in identifying trials of a given sample size.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/jamia\/ocx155","type":"journal-article","created":{"date-parts":[[2017,12,20]],"date-time":"2017-12-20T12:39:46Z","timestamp":1513773586000},"page":"774-779","source":"Crossref","is-referenced-by-count":4,"title":["Tool for filtering PubMed search results by sample size"],"prefix":"10.1093","volume":"25","author":[{"given":"Carlos","family":"Baladr\u00f3n","sequence":"first","affiliation":[{"name":"i+HeALTH Research Group, Miguel de Cervantes European University, Higher Polytechnic School, Department of Technical Teachings, Valladolid, Spain"}]},{"given":"Alejandro","family":"Santos-Lozano","sequence":"additional","affiliation":[{"name":"i+HeALTH Research Group, Miguel de Cervantes European University, Faculty of Health Sciences, Department of Health Sciences, Valladolid, Spain"}]},{"given":"Javier M","family":"Aguiar","sequence":"additional","affiliation":[{"name":"Data Engineering Research Group, Universidad de Valladolid, Higher Technical School of Telecommunications Engineering, TSyCeIT Department, Valladolid, Spain"}]},{"given":"Alejandro","family":"Lucia","sequence":"additional","affiliation":[{"name":"Research Institute of Hospital 12 de Octubre and European University, Madrid, Spain"}]},{"given":"Juan","family":"Mart\u00edn-Hern\u00e1ndez","sequence":"additional","affiliation":[{"name":"i+HeALTH Research Group, Miguel de Cervantes European University, Faculty of Health Sciences, Department of Health Sciences, Valladolid, Spain"}]}],"member":"286","published-online":{"date-parts":[[2018,2,1]]},"reference":[{"key":"2020110612384142100_ocx155-B1"},{"issue":"4","key":"2020110612384142100_ocx155-B2","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1108\/LHT-06-2016-0066","article-title":"Advancing PubMed? A comparison of third-party PubMed\/Medline tools","volume":"34","author":"Wildgaard","year":"2016","journal-title":"Libr Hi Tech."},{"key":"2020110612384142100_ocx155-B3","article-title":"PubMed and beyond: a survey of web tools for searching biomedical literature","author":"Lu","journal-title":"Database"},{"issue":"4","key":"2020110612384142100_ocx155-B4","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1007\/s12038-015-9552-2","article-title":"Pubmed.mineR: an R package with text-mining algorithms to analyse PubMed abstracts","volume":"40","author":"Rani","year":"2015","journal-title":"J Biosci."},{"issue":"1","key":"2020110612384142100_ocx155-B5","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1186\/1472-6947-7-16","article-title":"Utilization of the PICO framework to improve searching PubMed for clinical questions","volume":"7","author":"Schardt","year":"2007","journal-title":"BMC Med Inform Decis Mak."},{"issue":"5","key":"2020110612384142100_ocx155-B6","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1016\/j.molcel.2006.02.012","article-title":"Biomedical language processing: what\u2019s beyond PubMed?","volume":"21","author":"Hunter","year":"2006","journal-title":"Mol Cell."},{"issue":"18","key":"2020110612384142100_ocx155-B7","doi-asserted-by":"crossref","first-page":"2886","DOI":"10.1093\/bioinformatics\/btw511","article-title":"HiPub: translating PubMed and PMC texts to networks for knowledge discovery","volume":"32","author":"Lee","year":"2016","journal-title":"Bioinformatics."},{"issue":"8","key":"2020110612384142100_ocx155-B8","doi-asserted-by":"crossref","first-page":"1115","DOI":"10.1007\/s11136-009-9528-5","article-title":"Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments","volume":"18","author":"Terwee","year":"2009","journal-title":"Qual Life Res."},{"issue":"12","key":"2020110612384142100_ocx155-B9","doi-asserted-by":"crossref","first-page":"1244","DOI":"10.1157\/13096592","article-title":"Construcci\u00f3n de un filtro geogr\u00e1fico para la identificaci\u00f3n en PubMed de estudios realizados en Espa\u00f1a","volume":"59","author":"Valderas","year":"2006","journal-title":"Rev Esp Cardiol."},{"issue":"3","key":"2020110612384142100_ocx155-B10","doi-asserted-by":"crossref","first-page":"432","DOI":"10.1093\/bioinformatics\/btv585","article-title":"Automatic semantic classification of scientific literature according to the hallmarks of cancer","volume":"32","author":"Baker","year":"2016","journal-title":"Bioinformatics."},{"issue":"2","key":"2020110612384142100_ocx155-B11","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1089\/jwh.2015.5217","article-title":"Development of a PubMed based search tool for identifying sex and gender specific health literature","volume":"25","author":"Song","year":"2016","journal-title":"J Women\u2019s Health."},{"issue":"Database issue","key":"2020110612384142100_ocx155-B12","first-page":"D7","article-title":"Database resources of the National Center for Biotechnology information","volume":"42","author":"Acland","year":"2014","journal-title":"Nucleic Acids Res."},{"key":"2020110612384142100_ocx155-B13","article-title":"Latent Semantic Analysis","volume-title":"Encyclopedia of Cognitive Science","author":"Landauer","year":"2016"},{"issue":"1","key":"2020110612384142100_ocx155-B14","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1148\/radiology.143.1.7063747","article-title":"The meaning and use of the area under a receiver operating characteristic (ROC) curve","volume":"143","author":"Hanley","year":"1982","journal-title":"Radiology."},{"issue":"3","key":"2020110612384142100_ocx155-B15","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1109\/TPAMI.2009.187","article-title":"Sensitivity analysis of k-fold cross validation in prediction error estimation","volume":"32","author":"Rodriguez","year":"2010","journal-title":"IEEE Trans Pattern Anal Mach Intell."},{"issue":"1\u20132","key":"2020110612384142100_ocx155-B16","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1023\/A:1007515423169","article-title":"An empirical comparison of voting classification algorithms: bagging, boosting, and variants","volume":"36","author":"Bauer","year":"1999","journal-title":"Mach Learn."}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/25\/7\/774\/34150140\/ocx155.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/25\/7\/774\/34150140\/ocx155.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T18:01:57Z","timestamp":1604685717000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/25\/7\/774\/4835460"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,2,1]]},"references-count":16,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2018,2,1]]},"published-print":{"date-parts":[[2018,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocx155","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,7]]},"published":{"date-parts":[[2018,2,1]]}}}