{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T04:23:17Z","timestamp":1744258997222,"version":"3.40.4"},"reference-count":32,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>MEDLINE\u00ae\/PubMed\u00ae indexes over 20 million biomedical articles, providing curated annotation of its contents using a controlled vocabulary known as Medical Subject Headings (MeSH). The MeSH vocabulary, developed over 50+ years, provides a broad coverage of topics across biomedical research. Distilling the essential biomedical themes for a topic of interest from the relevant literature is important to both understand the importance of related concepts and discover new relationships.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We introduce a novel method for determining enriched curator-assigned MeSH annotations in a set of papers associated to a topic, such as a gene, an author or a disease. We generate MeSH Over-representation Profiles (MeSHOPs) to quantitatively summarize the annotations in a form convenient for further computational analysis and visualization. Based on a hypergeometric distribution of assigned terms, MeSHOPs statistically account for the prevalence of the associated biomedical annotation while highlighting unusually prevalent terms based on a specified background. MeSHOPs can be visualized using word clouds, providing a succinct quantitative graphical representation of the relative importance of terms. Using the publication dates of articles, MeSHOPs track changing patterns of annotation over time. Since MeSHOPs are quantitative vectors, MeSHOPs can be compared using standard techniques such as hierarchical clustering. The reliability of MeSHOP annotations is assessed based on the capacity to re-derive the subset of the Gene Ontology annotations with equivalent MeSH terms.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>MeSHOPs allows quantitative measurement of the degree of association between any entity and the annotated medical concepts, based directly on relevant primary literature. Comparison of MeSHOPs allows entities to be related based on shared medical themes in their literature. A web interface is provided for generating and visualizing MeSHOPs.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-13-249","type":"journal-article","created":{"date-parts":[[2012,9,30]],"date-time":"2012-09-30T07:47:51Z","timestamp":1348991271000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":24,"title":["Quantitative biomedical annotation using medical subject heading over-representation profiles (MeSHOPs)"],"prefix":"10.1186","volume":"13","author":[{"given":"Warren A","family":"Cheung","sequence":"first","affiliation":[]},{"given":"BF Francis","family":"Ouellette","sequence":"additional","affiliation":[]},{"given":"Wyeth W","family":"Wasserman","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,9,27]]},"reference":[{"key":"5643_CR1","volume-title":"Nucleic Acids Res","author":"E Sayers","year":"2009","unstructured":"Sayers E, Barrett T, Benson D, Bryant S, Canese K, Chetvernin V, Church D, Dicuccio M, Edgar R, Federhen S, Feolo M, Geer L, Helmberg W, Kapustin Y, Landsman D, Lipman D, Madden T, Maglott D, Miller V, Mizrachi I, Ostell J, Pruitt K, Schuler G, Sequeira E, Sherry S, Shumway M, Sirotkin K, Souvorov A, Starchenko G, Tatusova T, et al.: Database resources of the national center for biotechnology information. Nucleic Acids Res 2009., 37:"},{"unstructured":"Chapter 11 Relationships in Medical Subject Headings[http:\/\/www.nlm.nih.gov\/mesh\/meshrels.html]","key":"5643_CR2"},{"key":"5643_CR3","doi-asserted-by":"publisher","first-page":"119","DOI":"10.1038\/nrg1768","volume":"7","author":"LJ Jensen","year":"2006","unstructured":"Jensen LJ, Saric J, Bork P: Literature mining for the biologist: from information retrieval to biological discovery. Nat Rev Genet 2006, 7: 119\u2013129. 10.1038\/nrg1768","journal-title":"Nat Rev Genet"},{"key":"5643_CR4","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1007\/978-0-387-48438-9_4","volume-title":"In Semantic Web","author":"L Hirschman","year":"2007","unstructured":"Hirschman L, Hayes W, Valencia A: Knowledge acquisition from the biomedical literature. In Semantic Web 2007, 53\u201381."},{"key":"5643_CR5","doi-asserted-by":"publisher","first-page":"3324","DOI":"10.1093\/bioinformatics\/bti503","volume":"21","author":"A Djebbari","year":"2005","unstructured":"Djebbari A, Karamycheva S, Howe E, Quackenbush J: MeSHer: identifying biological concepts in microarray assays based on PubMed references and MeSH terms. Bioinformatics (Oxford, England) 2005, 21: 3324\u20136. 10.1093\/bioinformatics\/bti503","journal-title":"Bioinformatics (Oxford, England)"},{"key":"5643_CR6","first-page":"563","volume":"1","author":"IN Sarkar","year":"2009","unstructured":"Sarkar IN, Schenk R, Miller H, Norton CN: LigerCat: using \u201cMeSH clouds\u201d from journal, article, or gene citations to facilitate the identification of relevant biomedical literature. In Information Retrieval . Med Inform Assoc 2009, 1: 563\u2013567.","journal-title":"Med Inform Assoc"},{"key":"5643_CR7","doi-asserted-by":"publisher","first-page":"6097","DOI":"10.1093\/nar\/18.20.6097","volume":"18","author":"TD Schneider","year":"1990","unstructured":"Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Res 1990, 18: 6097\u2013100. 10.1093\/nar\/18.20.6097","journal-title":"Nucleic Acids Res"},{"key":"5643_CR8","doi-asserted-by":"publisher","first-page":"1639","DOI":"10.1101\/gr.092759.109","volume":"19","author":"M Krzywinski","year":"2009","unstructured":"Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA: Circos: an information aesthetic for comparative genomics. Genome Res 2009, 19: 1639\u201345. 10.1101\/gr.092759.109","journal-title":"Genome Res"},{"key":"5643_CR9","doi-asserted-by":"publisher","first-page":"GC1","DOI":"10.1016\/0378-1119(95)00714-8","volume":"167","author":"EL Sonnhammer","year":"1995","unstructured":"Sonnhammer EL, Durbin R: A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Gene 1995, 167: GC1\u201310. 10.1016\/0378-1119(95)00714-8","journal-title":"Gene"},{"key":"5643_CR10","doi-asserted-by":"publisher","first-page":"3518","DOI":"10.1093\/nar\/gkg579","volume":"31","author":"S Schwartz","year":"2003","unstructured":"Schwartz S: MultiPipMaker and supporting tools: alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res 2003, 31: 3518\u20133524. 10.1093\/nar\/gkg579","journal-title":"Nucleic Acids Res"},{"key":"5643_CR11","doi-asserted-by":"publisher","first-page":"3442","DOI":"10.1093\/nar\/28.18.3442","volume":"28","author":"B Snel","year":"2000","unstructured":"Snel B, Lehmann G, Bork P, Huynen MA: STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene. Nucleic Acids Res 2000, 28: 3442\u20134. 10.1093\/nar\/28.18.3442","journal-title":"Nucleic Acids Res"},{"key":"5643_CR12","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1186\/1751-0473-6-15","volume":"6","author":"C Baroukh","year":"2011","unstructured":"Baroukh C, Jenkins S, Dannenfelser R, Ma\u2019ayan A: Genes2WordCloud: a quick way to identify biological themes from gene lists and free text. Source code for biology and medicine 2011, 6: 15. 10.1186\/1751-0473-6-15","journal-title":"Source code for biology and medicine"},{"key":"5643_CR13","first-page":"709","volume":"680","author":"J Desai","year":"2011","unstructured":"Desai J, Flatow JM, Song J, Zhu LJ, Du P, Huang C-c, Lin SM, Kibbe WA: Advances in computational biology. Cancer 2011, 680: 709\u2013715.","journal-title":"Cancer"},{"key":"5643_CR14","doi-asserted-by":"publisher","first-page":"P3","DOI":"10.1186\/gb-2003-4-5-p3","volume":"4","author":"G Dennis","year":"2003","unstructured":"Dennis G, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA: DAVID: Database for annotation, visualization, and integrated discovery. Genome Biol 2003, 4: P3. 10.1186\/gb-2003-4-5-p3","journal-title":"Genome Biol"},{"key":"5643_CR15","doi-asserted-by":"publisher","first-page":"W245","DOI":"10.1093\/nar\/gkm427","volume":"35","author":"SJ Ho Sui","year":"2007","unstructured":"Ho Sui SJ, Fulton DL, Arenillas DJ, Kwon AT, Wasserman WW: oPOSSUM: integrated tools for analysis of regulatory motif over-representation. Nucleic Acids Res 2007, 35: W245\u201352. 10.1093\/nar\/gkm427","journal-title":"Nucleic Acids Res"},{"key":"5643_CR16","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1007\/978-1-61779-027-0_21","volume-title":"Bioinformatics for Omics Data: Methods and Protocols","author":"V Kumar","year":"2011","unstructured":"Kumar V: Omics and literature mining. In Bioinformatics for Omics Data: Methods and Protocols. 719th edition. Edited by: Mayer B. Totowa, NJ: Humana Press; 2011:457\u2013477.","edition":"719th"},{"key":"5643_CR17","doi-asserted-by":"publisher","first-page":"166","DOI":"10.1186\/1471-2105-11-166","volume":"11","author":"SD Jani","year":"2010","unstructured":"Jani SD, Argraves GL, Barth JL, Argraves WS: GeneMesh: a web-based microarray analysis tool for relating differentially expressed genes to MeSH terms. BMC Bioinforma 2010, 11: 166. 10.1186\/1471-2105-11-166","journal-title":"BMC Bioinforma"},{"key":"5643_CR18","doi-asserted-by":"publisher","first-page":"838","DOI":"10.1093\/bioinformatics\/btp049","volume":"25","author":"J Hur","year":"2009","unstructured":"Hur J, Schuyler AD, States DJ, Feldman EL: SciMiner: web-based literature mining tool for target identification and functional enrichment analysis. Bioinformatics (Oxford, England) 2009, 25: 838\u201340. 10.1093\/bioinformatics\/btp049","journal-title":"Bioinformatics (Oxford, England)"},{"key":"5643_CR19","first-page":"689","volume-title":"AMIA Symposium","author":"IN Sarkar","year":"2006","unstructured":"Sarkar IN, Agrawal A: Literature based discovery of gene clusters using phylogenetic methods. AMIA \u2026 Annual symposium proceedings\/AMIA symposium. AMIA Symposium 2006, 689\u201393."},{"key":"5643_CR20","doi-asserted-by":"publisher","first-page":"479","DOI":"10.1093\/bib\/bbn035","volume":"9","author":"P Agarwal","year":"2008","unstructured":"Agarwal P, Searls DB: Literature mining in support of drug discovery. Brief Bioinform 2008, 9: 479\u2013492. 10.1093\/bib\/bbn035","journal-title":"Brief Bioinform"},{"unstructured":"Gene2MeSH[Internet] [http:\/\/gene2mesh.ncibi.org] [Internet]","key":"5643_CR21"},{"issue":"1","key":"5643_CR22","doi-asserted-by":"crossref","first-page":"53","DOI":"10.3233\/ISB-00343","volume":"8","author":"T Nakazato","year":"2007","unstructured":"Nakazato T, Takinaka T, Mizuguchi H, Matsuda H, Bono H, Asogawa M: BioCompass: A novel functional inference tool that utilizes MeSH hierarchy to analyze groups of genes. In Silico Biology 2007, 8(1):53\u201361.","journal-title":"In Silico Biology"},{"issue":"suppl 2","key":"5643_CR23","doi-asserted-by":"publisher","first-page":"W166","DOI":"10.1093\/nar\/gkp483","volume":"37","author":"T Nakazato","year":"2009","unstructured":"Nakazato T, Bono H, Matsuda H, Takagi T: Gendoo: functional profiling of gene and disease features using MeSH vocabulary. Nucleic Acids Res 2009, 37(suppl 2):W166-W166.","journal-title":"Nucleic Acids Res"},{"key":"5643_CR24","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1038\/nrd2973","volume":"8","author":"P Agarwal","year":"2009","unstructured":"Agarwal P, Searls DB: Can literature analysis identify innovation drivers in drug discovery? Nature reviews. Drug discovery 2009, 8: 865\u201378. 10.1038\/nrd2973","journal-title":"Drug discovery"},{"key":"5643_CR25","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1002\/ddr.20416","volume":"72","author":"DK Rajpal","year":"2011","unstructured":"Rajpal DK, Kumar V, Agarwal P: Scientific literature mining for drug discovery: a case study on obesity. Drug Dev Res 2011, 72: 201\u2013208. 10.1002\/ddr.20416","journal-title":"Drug Dev Res"},{"key":"5643_CR26","doi-asserted-by":"publisher","first-page":"e5203","DOI":"10.1371\/journal.pone.0005203","volume":"4","author":"DA Hanauer","year":"2009","unstructured":"Hanauer DA, Rhodes DR, Chinnaiyan AM: Exploring clinical associations using \u201c-omics\u201d based enrichment analyses. PLoS One 2009, 4: e5203. 10.1371\/journal.pone.0005203","journal-title":"PLoS One"},{"key":"5643_CR27","first-page":"797","volume":"2010","author":"R Tirrell","year":"2010","unstructured":"Tirrell R, Evani U, Berman AE, Mooney SD, Musen MA, Shah NH: An ontology-neutral framework for enrichment analysis. AMIA \u2026 Annual symposium proceedings\/AMIA symposium . AMIA Symposium 2010, 2010: 797\u2013801.","journal-title":"AMIA Symposium"},{"unstructured":"Statistical Tracking of Ontological Phrases (STOP)[http:\/\/www.mooneygroup.org\/stop\/input]","key":"5643_CR28"},{"issue":"Suppl 1","key":"5643_CR29","doi-asserted-by":"publisher","first-page":"S31","DOI":"10.1016\/j.jbi.2011.04.007","volume":"44","author":"P LePendu","year":"2011","unstructured":"LePendu P, Musen MA, Shah NH: Enabling enrichment analysis with the human disease ontology. J biomed inform 2011, 44(Suppl 1):S31\u20138.","journal-title":"J biomed inform"},{"key":"5643_CR30","doi-asserted-by":"publisher","first-page":"603","DOI":"10.1186\/1471-2164-12-603","volume":"12","author":"BM Good","year":"2011","unstructured":"Good BM, Howe DG, Lin SM, Kibbe WA, Su AI: Mining the Gene Wiki for functional genomic knowledge. BMC genomics 2011, 12: 603. 10.1186\/1471-2164-12-603","journal-title":"BMC genomics"},{"key":"5643_CR31","doi-asserted-by":"publisher","first-page":"3024","DOI":"10.1093\/bioinformatics\/btm440","volume":"23","author":"S Grossmann","year":"2007","unstructured":"Grossmann S, Bauer S, Robinson PN, Vingron M: Improved detection of overrepresentation of Gene-Ontology annotations with parent child analysis. Bioinformatics (Oxford, England) 2007, 23: 3024\u201331. 10.1093\/bioinformatics\/btm440","journal-title":"Bioinformatics (Oxford, England)"},{"key":"5643_CR32","doi-asserted-by":"publisher","first-page":"664","DOI":"10.1038\/ng0704-664","volume":"36","author":"R Hoffmann","year":"2004","unstructured":"Hoffmann R, Valencia A: A gene network for navigating the literature. Nat Genet 2004, 36: 664. 10.1038\/ng0704-664","journal-title":"Nat Genet"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-13-249.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,9]],"date-time":"2025-04-09T18:40:43Z","timestamp":1744224043000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-13-249"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,9,27]]},"references-count":32,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["5643"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-13-249","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2012,9,27]]},"assertion":[{"value":"23 February 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 September 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 September 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"249"}}