{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T15:22:07Z","timestamp":1764688927319},"reference-count":47,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":1201,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: To create, verify and maintain pathway models, curators must discover and assess knowledge distributed over the vast body of biological literature. Methods supporting these tasks must understand both the pathway model representations and the natural language in the literature. These methods should identify and order documents by relevance to any given pathway reaction. No existing system has addressed all aspects of this challenge.<\/jats:p>\n               <jats:p>Method: We present novel methods for associating pathway model reactions with relevant publications. Our approach extracts the reactions directly from the models and then turns them into queries for three text mining-based MEDLINE literature search systems. These queries are executed, and the resulting documents are combined and ranked according to their relevance to the reactions of interest. We manually annotate document-reaction pairs with the relevance of the document to the reaction and use this annotation to study several ranking methods, using various heuristic and machine-learning approaches.<\/jats:p>\n               <jats:p>Results: Our evaluation shows that the annotated document-reaction pairs can be used to create a rule-based document ranking system, and that machine learning can be used to rank documents by their relevance to pathway reactions. We find that a Support Vector Machine-based system outperforms several baselines and matches the performance of the rule-based system. The success of the query extraction and ranking methods are used to update our existing pathway search system, PathText.<\/jats:p>\n               <jats:p>Availability: An online demonstration of PathText 2 and the annotated corpus are available for research purposes at http:\/\/www.nactem.ac.uk\/pathtext2\/.<\/jats:p>\n               <jats:p>Contact: \u00a0makoto.miwa@manchester.ac.uk<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btt227","type":"journal-article","created":{"date-parts":[[2013,6,27]],"date-time":"2013-06-27T05:33:26Z","timestamp":1372311206000},"page":"i44-i52","source":"Crossref","is-referenced-by-count":32,"title":["A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text"],"prefix":"10.1093","volume":"29","author":[{"given":"Makoto","family":"Miwa","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tomoko","family":"Ohta","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rafal","family":"Rak","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andrew","family":"Rowley","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Douglas B.","family":"Kell","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sampo","family":"Pyysalo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sophia","family":"Ananiadou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2013,6,19]]},"reference":[{"key":"2023062614284128900_btt227-B1","doi-asserted-by":"crossref","first-page":"571","DOI":"10.1016\/j.tibtech.2006.10.002","article-title":"Text mining and its potential applications in systems biology","volume":"24","author":"Ananiadou","year":"2006","journal-title":"Trends Biotechnol."},{"key":"2023062614284128900_btt227-B2","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1016\/j.tibtech.2010.04.005","article-title":"Event extraction for systems biology by text mining the literature","volume":"28","author":"Ananiadou","year":"2010","journal-title":"Trends Biotechnol."},{"key":"2023062614284128900_btt227-B3","doi-asserted-by":"crossref","first-page":"543","DOI":"10.1038\/msb.2011.77","article-title":"Controlled vocabularies and semantics in systems biology","volume":"7","author":"Courtot","year":"2011","journal-title":"Mol. Syst. Biol."},{"key":"2023062614284128900_btt227-B4","doi-asserted-by":"crossref","first-page":"935","DOI":"10.1038\/nbt.1666","article-title":"The BioPAX community standard for pathway data sharing","volume":"28","author":"Demir","year":"2010","journal-title":"Nat. Biotechnol."},{"key":"2023062614284128900_btt227-B5","author":"Drucker","year":"1996"},{"key":"2023062614284128900_btt227-B6","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1016\/S1478-5382(03)02370-9","article-title":"Celldesigner: a process diagram editor for gene-regulatory and biochemical networks","volume":"1","author":"Funahashi","year":"2003","journal-title":"Biosilico"},{"key":"2023062614284128900_btt227-B7","author":"He","year":"2011"},{"key":"2023062614284128900_btt227-B8","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1038\/nbt1492","article-title":"A consensus yeast metabolic network obtained from a community approach to systems biology","volume":"26","author":"Herrg\u00e5rd","year":"2008","journal-title":"Nat. Biotechnol."},{"key":"2023062614284128900_btt227-B9","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1093\/bioinformatics\/btg015","article-title":"The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models","volume":"19","author":"Hucka","year":"2003","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B10","doi-asserted-by":"crossref","first-page":"422","DOI":"10.1145\/582415.582418","article-title":"Cumulated gain-based evaluation of ir techniques","volume":"20","author":"J\u00e4rvelin","year":"2002","journal-title":"ACM Trans. Inf. Syst."},{"key":"2023062614284128900_btt227-B11","author":"Joachims","year":"2002"},{"key":"2023062614284128900_btt227-B12","doi-asserted-by":"crossref","first-page":"i374","DOI":"10.1093\/bioinformatics\/btq221","article-title":"PathText: a text mining integrator for biological pathway visualizations","volume":"26","author":"Kemper","year":"2010","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B13","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1186\/1471-2105-9-10","article-title":"Corpus annotation for mining biomedical events from literature","volume":"9","author":"Kim","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023062614284128900_btt227-B14","doi-asserted-by":"crossref","first-page":"1662","DOI":"10.1126\/science.1069492","article-title":"Systems biology: a brief overview","volume":"295","author":"Kitano","year":"2002","journal-title":"Science"},{"key":"2023062614284128900_btt227-B15","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1038\/nbt1156","article-title":"Minimum information requested in the annotation of biochemical models (MIRIAM)","volume":"23","author":"Le Novre","year":"2005","journal-title":"Nat. Biotechnol."},{"key":"2023062614284128900_btt227-B16","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1186\/1752-0509-4-92","article-title":"Biomodels database: an enhanced, curated and annotated resource for published quantitative kinetic models","volume":"4","author":"Li","year":"2010","journal-title":"BMC Syst. Biol."},{"key":"2023062614284128900_btt227-B17","doi-asserted-by":"crossref","first-page":"baq036","DOI":"10.1093\/database\/baq036","article-title":"Pubmed and beyond: a survey of web tools for searching biomedical literature","volume":"2011","author":"Lu","year":"2011","journal-title":"Database"},{"key":"2023062614284128900_btt227-B18","doi-asserted-by":"crossref","first-page":"D247","DOI":"10.1093\/nar\/gkl869","article-title":"PANTHER version 6: protein sequence and function evolution data with expanded representation of biological pathways","volume":"35(Suppl. 1)","author":"Mi","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023062614284128900_btt227-B19","doi-asserted-by":"crossref","first-page":"3437","DOI":"10.1093\/bioinformatics\/btr586","article-title":"BioPAX support in CellDesigner","volume":"27","author":"Mi","year":"2011","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B20","doi-asserted-by":"crossref","first-page":"1759","DOI":"10.1093\/bioinformatics\/bts237","article-title":"Boosting automatic event extraction from the literature using domain adaptation and coreference resolution","volume":"28","author":"Miwa","year":"2012","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B21","author":"Miyao","year":"2006"},{"key":"2023062614284128900_btt227-B22","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1162\/coli.2008.34.1.35","article-title":"Feature forest models for probabilistic HPSG parsing","volume":"34","author":"Miyao","year":"2008","journal-title":"Comput. Linguist."},{"key":"2023062614284128900_btt227-B23","author":"Nobata","year":"2008"},{"key":"2023062614284128900_btt227-B24","doi-asserted-by":"crossref","first-page":"D689","DOI":"10.1093\/nar\/gkj092","article-title":"Biomodels database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems","volume":"34(Suppl. 1)","author":"Novere","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023062614284128900_btt227-B25","author":"Ohta","year":"2011"},{"key":"2023062614284128900_btt227-B26","author":"Okanohara","year":"2006"},{"key":"2023062614284128900_btt227-B27","doi-asserted-by":"crossref","first-page":"3089","DOI":"10.1093\/bioinformatics\/btl534","article-title":"Building an abbreviation dictionary using a term recognition approach","volume":"22","author":"Okazaki","year":"2006","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B28","doi-asserted-by":"crossref","first-page":"1246","DOI":"10.1093\/bioinformatics\/btq129","article-title":"Building a high-quality sense inventory for improved abbreviation disambiguation","volume":"26","author":"Okazaki","year":"2010","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B29","first-page":"396","article-title":"Bidirectional incremental parsing for automatic pathway identification with combinatory categorial grammar","volume":"6","author":"Park","year":"2001","journal-title":"Pac. Symp. Biocomput."},{"key":"2023062614284128900_btt227-B30","doi-asserted-by":"crossref","first-page":"788","DOI":"10.1093\/bioinformatics\/bti069","article-title":"Inferring pathways from gene lists using a literature-derived network of biological relationships","volume":"21","author":"Rajagopalan","year":"2005","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B31","author":"Robertson","year":"1999"},{"key":"2023062614284128900_btt227-B32","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1016\/j.jbi.2003.10.001","article-title":"Geneways: a system for extracting, analyzing, visualizing, and integrating molecular pathway data","volume":"37","author":"Rzhetsky","year":"2004","journal-title":"J. Biomed. Inform."},{"key":"2023062614284128900_btt227-B33","doi-asserted-by":"crossref","first-page":"S5","DOI":"10.1186\/1471-2105-9-S11-S5","article-title":"How to make the most of ne dictionaries in statistical NER","volume":"9(Suppl. 11)","author":"Sasaki","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023062614284128900_btt227-B34","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1561\/1500000010","article-title":"Federated search","volume":"5","author":"Shokouhi","year":"2011","journal-title":"Found. Trends Inf. Retr."},{"key":"2023062614284128900_btt227-B35","doi-asserted-by":"crossref","first-page":"14:1","DOI":"10.1145\/1508850.1508852","article-title":"Robust result merging using sample-based score estimates","volume":"27","author":"Shokouhi","year":"2009","journal-title":"ACM Trans. Inf. Syst."},{"key":"2023062614284128900_btt227-B36","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1145\/944012.944017","article-title":"A semisupervised learning method to merge search engine results","volume":"21","author":"Si","year":"2003","journal-title":"ACM Trans. Inf. Syst."},{"key":"2023062614284128900_btt227-B37","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1093\/bioinformatics\/btq675","article-title":"Cytoscape 2.8: new features for data integration and network visualization","volume":"27","author":"Smoot","year":"2011","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B38","doi-asserted-by":"crossref","first-page":"4401","DOI":"10.1093\/bioinformatics\/bti718","article-title":"Representations of molecular pathways: an evaluation of SBML, PSI MI and BioPAX","volume":"21","author":"Str\u00f6mb\u00e4ck","year":"2005","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B39","first-page":"186","article-title":"The subliminal toolbox: automating steps in the reconstruction of metabolic networks","volume":"8","author":"Swainston","year":"2011","journal-title":"Integr. Bioinformatics"},{"key":"2023062614284128900_btt227-B40","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1038\/msb.2010.15","article-title":"Reconstruction annotation jamborees: a community approach to systems biology","volume":"6","author":"Thiele","year":"2010","journal-title":"Mol. Syst. Biol."},{"key":"2023062614284128900_btt227-B41","doi-asserted-by":"crossref","first-page":"2768","DOI":"10.1093\/bioinformatics\/btm393","article-title":"Learning string similarity measures for gene\/protein name dictionary look-up using logistic regression","volume":"23","author":"Tsuruoka","year":"2007","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B42","doi-asserted-by":"crossref","first-page":"i111","DOI":"10.1093\/bioinformatics\/btr214","article-title":"Discovering and visualizing indirect associations between biomedical concepts","volume":"27","author":"Tsuruoka","year":"2011","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B43","volume-title":"Statistical Learning Theory","author":"Vapnik","year":"1998"},{"key":"2023062614284128900_btt227-B44","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1093\/bioinformatics\/btq002","article-title":"Disambiguating the species of biomedical named entities using natural language parsers","volume":"26","author":"Wang","year":"2010","journal-title":"Bioinformatics"},{"key":"2023062614284128900_btt227-B45","author":"Yao","year":"2004"},{"key":"2023062614284128900_btt227-B46","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1186\/1471-2105-7-171","article-title":"Automatic pathway building in biological association networks","volume":"7","author":"Yuryev","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023062614284128900_btt227-B47","doi-asserted-by":"crossref","first-page":"S18","DOI":"10.1186\/1471-2105-10-S11-S18","article-title":"Pathbinder\u2013text empirics and automatic extraction of biomolecular interactions","volume":"10(Suppl. 11)","author":"Zhang","year":"2009","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/13\/i44\/50702723\/bioinformatics_29_13_i44.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/13\/i44\/50702723\/bioinformatics_29_13_i44.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,26]],"date-time":"2023-06-26T15:28:20Z","timestamp":1687793300000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/29\/13\/i44\/195236"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,6,19]]},"references-count":47,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2013,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btt227","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2013,7]]},"published":{"date-parts":[[2013,6,19]]}}}