{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,9,13]],"date-time":"2023-09-13T21:31:07Z","timestamp":1694640667036},"reference-count":20,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Journal articles and databases are two major modes of communication in the biological sciences, and thus integrating these critical resources is of urgent importance to increase the pace of discovery. Projects focused on bridging the gap between journals and databases have been on the rise over the last five years and have resulted in the development of automated tools that can recognize entities within a document and link those entities to a relevant database. Unfortunately, automated tools cannot resolve ambiguities that arise from one term being used to signify entities that are quite distinct from one another. Instead, resolving these ambiguities requires some manual oversight. Finding the right balance between the speed and portability of automation and the accuracy and flexibility of manual effort is a crucial goal to making text markup a successful venture.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We have established a journal article mark-up pipeline that links GENETICS journal articles and the model organism database (MOD) WormBase. This pipeline uses a lexicon built with entities from the database as a first step. The entity markup pipeline results in links from over nine classes of objects including genes, proteins, alleles, phenotypes and anatomical terms. New entities and ambiguities are discovered and resolved by a database curator through a manual quality control (QC) step, along with help from authors via a web form that is provided to them by the journal. New entities discovered through this pipeline are immediately sent to an appropriate curator at the database. Ambiguous entities that do not automatically resolve to one link are resolved by hand ensuring an accurate link. This pipeline has been extended to other databases, namely Saccharomyces Genome Database (SGD) and FlyBase, and has been implemented in marking up a paper with links to multiple databases.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>Our semi-automated pipeline hyperlinks articles published in GENETICS to model organism databases such as WormBase. Our pipeline results in interactive articles that are data rich with high accuracy. The use of a manual quality control step sets this pipeline apart from other hyperlinking tools and results in benefits to authors, journals, readers and databases.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-12-175","type":"journal-article","created":{"date-parts":[[2011,5,19]],"date-time":"2011-05-19T19:34:37Z","timestamp":1305833677000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Toward an interactive article: integrating journals and biological databases"],"prefix":"10.1186","volume":"12","author":[{"given":"Arun","family":"Rangarajan","sequence":"first","affiliation":[]},{"given":"Tim","family":"Schedl","sequence":"additional","affiliation":[]},{"given":"Karen","family":"Yook","sequence":"additional","affiliation":[]},{"given":"Juancarlos","family":"Chan","sequence":"additional","affiliation":[]},{"given":"Stephen","family":"Haenel","sequence":"additional","affiliation":[]},{"given":"Lolly","family":"Otis","sequence":"additional","affiliation":[]},{"given":"Sharon","family":"Faelten","sequence":"additional","affiliation":[]},{"given":"Tracey","family":"DePellegrin-Connelly","sequence":"additional","affiliation":[]},{"given":"Ruth","family":"Isaacson","sequence":"additional","affiliation":[]},{"given":"Marek S","family":"Skrzypek","sequence":"additional","affiliation":[]},{"given":"Steven J","family":"Marygold","sequence":"additional","affiliation":[]},{"given":"Raymund","family":"Stefancsik","sequence":"additional","affiliation":[]},{"given":"J Michael","family":"Cherry","sequence":"additional","affiliation":[]},{"given":"Paul W","family":"Sternberg","sequence":"additional","affiliation":[]},{"given":"Hans-Michael","family":"M\u00fcller","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,5,19]]},"reference":[{"issue":"6","key":"4624_CR1","doi-asserted-by":"publisher","first-page":"508","DOI":"10.1038\/nbt0609-508","volume":"27","author":"E Pafilis","year":"2009","unstructured":"Pafilis E, O'Donoghue SI, Jensen LJ, Horn H, Kuhn M, Brown NP, Schneider R: Reflect: augmented browsing for the life scientist. Nat Biotech 2009, 27(6):508\u2013510. 10.1038\/nbt0609-508","journal-title":"Nat Biotech"},{"key":"4624_CR2","unstructured":"Textpresso search engine[http:\/\/www.textpresso.org]"},{"issue":"11","key":"4624_CR3","doi-asserted-by":"publisher","first-page":"e309","DOI":"10.1371\/journal.pbio.0020309","volume":"2","author":"HM M\u00fcller","year":"2004","unstructured":"M\u00fcller HM, Kenny EE, Sternberg PW: Textpresso: an ontology-based information retrieval and extraction system for biological literature. PLoS Biol 2004, 2(11):e309. 10.1371\/journal.pbio.0020309","journal-title":"PLoS Biol"},{"key":"4624_CR4","unstructured":"WormBase - the biology and genome of C. elegans[http:\/\/www.wormbase.org]"},{"key":"4624_CR5","unstructured":"Saccharomyces Genome Database (SGD)[http:\/\/www.yeastgenome.org]"},{"key":"4624_CR6","unstructured":"Flybase - a database of Drosophila genes and genomes[http:\/\/www.flybase.org]"},{"key":"4624_CR7","unstructured":"GENETICS - a publication of the Genetics Society of America (GSA)[http:\/\/www.genetics.org]"},{"key":"4624_CR8","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1534\/genetics.109.105270","volume":"183","author":"M Dorsett","year":"2009","unstructured":"Dorsett M, Westlund B, Schedl T: METT-10, A Putative Methyltransferase, Inhibits Germ Cell Proliferative Fate in Caenorhabditis elegans . Genetics 2009, 183: 233\u2013247. 10.1534\/genetics.109.105270","journal-title":"Genetics"},{"issue":"2","key":"4624_CR9","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1007\/BF00425528","volume":"175","author":"HR Horvitz","year":"1979","unstructured":"Horvitz HR, Brenner S, Hodgkin J, Herman RK: A uniform genetic nomenclature for the nematode C elegans. Mol Gen Genet 1979, 175(2):129\u2013133. 10.1007\/BF00425528","journal-title":"Mol Gen Genet"},{"issue":"2","key":"4624_CR10","doi-asserted-by":"publisher","first-page":"467","DOI":"10.1534\/genetics.110.121996","volume":"187","author":"R Mesa","year":"2011","unstructured":"Mesa R, Luo S, Hoover CM, Miller K, Minniti A, Inestrosa N, Nonet ML: HID-1, a New Component of the Peptidergic Signaling Pathway. Genetics 2011, 187(2):467\u2013483. 10.1534\/genetics.110.121996","journal-title":"Genetics"},{"key":"4624_CR11","doi-asserted-by":"publisher","first-page":"1187","DOI":"10.1534\/genetics.110.121541","volume":"186","author":"LL Maduzia","year":"2010","unstructured":"Maduzia LL, Moreau A, Poullet N, Chaffre S, Zhang Y: The Role of eIF1 in Translation Initiation Codon Selection in Caenorhabditis elegans. Genetics 2010, 186: 1187\u20131196. 10.1534\/genetics.110.121541","journal-title":"Genetics"},{"issue":"9","key":"4624_CR12","doi-asserted-by":"publisher","first-page":"897","DOI":"10.1038\/nbt0910-897","volume":"28","author":"F Leitner","year":"2010","unstructured":"Leitner F, Chatr-aryamontri A, Mardis SA, Ceol A, Krallinger M, Licata L, Hirschman L, Cesareni G, Valencia A: The FEBS Letters \/BioCreative II.5 experiment: making biological information accessible. Nat Biotech 2010, 28(9):897\u2013899. 10.1038\/nbt0910-897","journal-title":"Nat Biotech"},{"issue":"3","key":"4624_CR13","doi-asserted-by":"publisher","first-page":"1022","DOI":"10.1104\/pp.104.900252","volume":"146","author":"DR Ort","year":"2008","unstructured":"Ort DR, Grennan AK: Plant Physiology and TAIR Partnership. Plant Physiol 2008, 146(3):1022. 10.1104\/pp.104.900252","journal-title":"Plant Physiol"},{"issue":"18","key":"4624_CR14","doi-asserted-by":"publisher","first-page":"i568","DOI":"10.1093\/bioinformatics\/btq383","volume":"26","author":"TK Attwood","year":"2010","unstructured":"Attwood TK, Kell DB, McDermott P, Marsh J, Pettifer SR, Thorne D: Utopia documents: linking scholarly literature with research data. Bioinformatics 2010, 26(18):i568-i574. 10.1093\/bioinformatics\/btq383","journal-title":"Bioinformatics"},{"key":"4624_CR15","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1186\/gb-2005-6-7-224","volume":"6","author":"M Krallinger","year":"2005","unstructured":"Krallinger M, Valencia A, Genome Biology: Text-mining and information-retrieval services for molecular biology. Genome Biol 2005, 6: 224. 10.1186\/gb-2005-6-7-224","journal-title":"Genome Biol"},{"key":"4624_CR16","doi-asserted-by":"publisher","first-page":"710","DOI":"10.1016\/j.jbi.2009.04.002","volume":"42","author":"A Louren\u00e7o","year":"2009","unstructured":"Louren\u00e7o A, Carreira R, Carneiro S, Maia P, Glez-Pe\u00f1a D, Fdez-Riverola F, Ferreira EC, Rocha I, Rocha M: @Note: a workbench for biomedical text mining. Journal of biomedical informatics 2009, 42: 710\u2013720. 10.1016\/j.jbi.2009.04.002","journal-title":"Journal of biomedical informatics"},{"issue":"3","key":"4624_CR17","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1042\/BJ20091474","volume":"424","author":"TK Attwood","year":"2009","unstructured":"Attwood TK, Kell DB, McDermott P, Marsh J, Pettifer SR, Thorne D: Calling International Rescue: knowledge lost in literature and data landslide! Biochem J 2009, 424(3):317\u2013333. 10.1042\/BJ20091474","journal-title":"Biochem J"},{"issue":"2","key":"4624_CR18","doi-asserted-by":"publisher","first-page":"248","DOI":"10.1093\/bioinformatics\/bth496","volume":"21","author":"L Chen","year":"2005","unstructured":"Chen L, Liu H, Friedman C: Gene name ambiguity of eukaryotic nomenclatures. Bioinformatics 2005, 21(2):248\u2013256. 10.1093\/bioinformatics\/bth496","journal-title":"Bioinformatics"},{"key":"4624_CR19","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1186\/1471-2105-11-85","volume":"11","author":"M Gerner","year":"2010","unstructured":"Gerner M, Nenadic G, Bergman M: LINNAEUS: A species name identification system for biomedical literature. BMC Bioinformatics 2010, 11: 85. 10.1186\/1471-2105-11-85","journal-title":"BMC Bioinformatics"},{"key":"4624_CR20","first-page":"55","volume-title":"BioCreative III workshop proceedings","author":"S Bhattacharya","year":"2010","unstructured":"Bhattacharya S, Sehgal AK, Srinivasan P: Cross-species gene normalization at the University of Iowa. In BioCreative III workshop proceedings. Bethesda, MD, USA; 2010:55\u201359."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-175.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T14:52:02Z","timestamp":1630507922000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-175"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,5,19]]},"references-count":20,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4624"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-175","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,5,19]]},"assertion":[{"value":"13 August 2010","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 May 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 May 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"175"}}