{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T20:16:18Z","timestamp":1775160978365,"version":"3.50.1"},"reference-count":22,"publisher":"Oxford University Press (OUP)","issue":"24","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,12,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Nowadays, publishers of scientific journals face the tough task of selecting high-quality articles that will attract as many readers as possible from a pool of articles. This is due to the growth of scientific output and literature. The possibility of a journal having a tool capable of predicting the citation count of an article within the first few years after publication would pave the way for new assessment systems.<\/jats:p><jats:p>Results: This article presents a new approach based on building several prediction models for the Bioinformatics journal. These models predict the citation count of an article within 4 years after publication (global models). To build these models, tokens found in the abstracts of Bioinformatics papers have been used as predictive features, along with other features like the journal sections and 2-week post-publication periods. To improve the accuracy of the global models, specific models have been built for each Bioinformatics journal section (Data and Text Mining, Databases and Ontologies, Gene Expression, Genetics and Population Analysis, Genome Analysis, Phylogenetics, Sequence Analysis, Structural Bioinformatics and Systems Biology). In these new models, the average success rate for predictions using the naive Bayes and logistic regression supervised classification methods was 89.4% and 91.5%, respectively, within the nine sections and for 4-year time horizon.<\/jats:p><jats:p>Availability: \u00a0Supplementary material on this experimental survey is available at http:\/\/www.dia.fi.upm.es\/~concha\/bioinformatics.html<\/jats:p><jats:p>Contact: \u00a0aibanez@fi.upm.es<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp585","type":"journal-article","created":{"date-parts":[[2009,10,10]],"date-time":"2009-10-10T01:23:47Z","timestamp":1255137827000},"page":"3303-3309","source":"Crossref","is-referenced-by-count":41,"title":["Predicting citation count of<i>Bioinformatics<\/i>papers within four years of publication"],"prefix":"10.1093","volume":"25","author":[{"given":"Alfonso","family":"Ib\u00e1\u00f1ez","sequence":"first","affiliation":[{"name":"Departamento de Inteligencia Artificial, Universidad Polit\u00e9cnica de Madrid, 28660 Madrid, Spain"}]},{"given":"Pedro","family":"Larra\u00f1aga","sequence":"additional","affiliation":[{"name":"Departamento de Inteligencia Artificial, Universidad Polit\u00e9cnica de Madrid, 28660 Madrid, Spain"}]},{"given":"Concha","family":"Bielza","sequence":"additional","affiliation":[{"name":"Departamento de Inteligencia Artificial, Universidad Polit\u00e9cnica de Madrid, 28660 Madrid, Spain"}]}],"member":"286","published-online":{"date-parts":[[2009,10,9]]},"reference":[{"key":"2023013112143882900_B1","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1007\/s11192-008-0216-y","article-title":"Which h-index? A comparison of WoS, Scopus and Google Scholar","volume":"74","author":"Bar-Ilan","year":"2008","journal-title":"Scientometrics"},{"key":"2023013112143882900_B2","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1108\/00220410810844150","article-title":"What do citation counts measure?","volume":"64","author":"Bornmann","year":"2008","journal-title":"J. Doc."},{"key":"2023013112143882900_B3","doi-asserted-by":"crossref","first-page":"1060","DOI":"10.1002\/asi.20373","article-title":"Earlier web usage statistics as predictors of later citation impact","volume":"57","author":"Brody","year":"2005","journal-title":"J. Am. Assoc. Inf. Sci. Technol. (JASIST)"},{"key":"2023013112143882900_B4","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1007\/978-3-540-75530-2_10","article-title":"Estimating the number of citations using author reputation","volume-title":"Proceedings of String Processing and Information Retrieval (SPIRE)","author":"Castillo","year":"2007"},{"key":"2023013112143882900_B5","doi-asserted-by":"crossref","first-page":"e332","DOI":"10.1371\/journal.pone.0000332","article-title":"Statistical reviewers improve reporting in biomedical articles: a randomized trial","volume":"2","author":"Cobo","year":"2007","journal-title":"PLoS ONE"},{"key":"2023013112143882900_B6","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1007\/BF00994110","article-title":"A Bayesian method for the induction of probabilistic networks from data","volume":"9","author":"Cooper","year":"1992","journal-title":"Mach. Learn."},{"key":"2023013112143882900_B7","first-page":"222","article-title":"Models for predicting and explaining citation count of biomedical articles","volume":"2008","author":"Fu","year":"2008","journal-title":"AMIA Annual Symposium Proceedings"},{"key":"2023013112143882900_B8","article-title":"Correlation-based Feature Selection for Machine Learning","volume-title":"PhD Thesis","author":"Hall","year":"1999"},{"key":"2023013112143882900_B9","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1191\/0269216305pm1039ed","article-title":"Peer review in action: the contribution of referees to advancing reliable knowledge","volume":"19","author":"Hanks","year":"2005","journal-title":"Palliat. Med."},{"key":"2023013112143882900_B10","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1109\/TIT.1968.1054155","article-title":"The condensed nearest neighbour rule","volume":"14","author":"Hart","year":"1968","journal-title":"Trans. Inf. Theory"},{"key":"2023013112143882900_B11","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/S0165-6147(00)01618-7","article-title":"Something rotten at the core of science","volume":"22","author":"Horrobin","year":"2001","journal-title":"Trends Pharmacol. Sci."},{"key":"2023013112143882900_B12","doi-asserted-by":"crossref","DOI":"10.1002\/0471722146","volume-title":"Applied Logistic Regression","author":"Hosmer","year":"2000","edition":"2"},{"key":"2023013112143882900_B13","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1016\/S0004-3702(97)00043-X","article-title":"Wrappers for feature subset selection","volume":"97","author":"Kohavi","year":"1997","journal-title":"Artif. Intelli."},{"key":"2023013112143882900_B14","doi-asserted-by":"crossref","first-page":"655","DOI":"10.1136\/bmj.39482.526713.BE","article-title":"Prediction of citation counts for clinical articles at two years using data available within three weeks of publication: retrospective cohort study","volume":"336","author":"Lokker","year":"2008","journal-title":"Br. Med. J."},{"key":"2023013112143882900_B15","doi-asserted-by":"crossref","first-page":"2105","DOI":"10.1002\/asi.20677","article-title":"Impact of data sources on citation counts and rankings of LIS faculty: Web of Science versus Scopus and Google Scholar","volume":"58","author":"Meho","year":"2007","journal-title":"J. Am. Soc. Inf. Sci. Technol."},{"key":"2023013112143882900_B16","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/JRPROC.1961.287775","article-title":"Steps toward artificial intelligence","volume":"49","author":"Minsky","year":"1961","journal-title":"IRE"},{"key":"2023013112143882900_B17","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1016\/j.oraloncology.2004.11.001","article-title":"Is peer review in crisis?","volume":"41","author":"Mulligan","year":"2005","journal-title":"Oral Oncology"},{"key":"2023013112143882900_B18","volume-title":"C4.5: Programs for Machine Learning.","author":"Quinlan","year":"1993"},{"key":"2023013112143882900_B19","doi-asserted-by":"crossref","first-page":"2507","DOI":"10.1093\/bioinformatics\/btm344","article-title":"A review of feature selection techniques in bioinformatics","volume":"23","author":"Saeys","year":"2007","journal-title":"Bioinformatics"},{"key":"2023013112143882900_B20","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1126\/science.1122796","article-title":"Peer review at NIH","volume":"311","author":"Scarpa","year":"2006","journal-title":"Science"},{"key":"2023013112143882900_B21","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1111\/j.2517-6161.1974.tb00994.x","article-title":"Cross-validation choice and assesment of statistical predictions","volume":"36","author":"Stone","year":"1974","journal-title":"J. R. Stat. Soc."},{"key":"2023013112143882900_B22","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques","author":"Witten","year":"2005","edition":"2"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/24\/3303\/48997325\/bioinformatics_25_24_3303.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/24\/3303\/48997325\/bioinformatics_25_24_3303.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,12]],"date-time":"2025-02-12T17:34:21Z","timestamp":1739381661000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/24\/3303\/235842"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,10,9]]},"references-count":22,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2009,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp585","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,12,15]]},"published":{"date-parts":[[2009,10,9]]}}}