{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,11,14]],"date-time":"2023-11-14T02:50:50Z","timestamp":1699930250834},"reference-count":58,"publisher":"Oxford University Press (OUP)","issue":"11","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Most experimental evidence on kinetic parameters is buried in the literature, whose manual searching is complex, time consuming and partial. These shortcomings become particularly acute in systems biology, where these parameters need to be integrated into detailed, genome-scale, metabolic models. These problems are addressed by KiPar, a dedicated information retrieval system designed to facilitate access to the literature relevant for kinetic modelling of a given metabolic pathway in yeast. Searching for kinetic data in the context of an individual pathway offers modularity as a way of tackling the complexity of developing a full metabolic model. It is also suitable for large-scale mining, since multiple reactions and their kinetic parameters can be specified in a single search request, rather than one reaction at a time, which is unsuitable given the size of genome-scale models.<\/jats:p>\n               <jats:p>Results: We developed an integrative approach, combining public data and software resources for the rapid development of large-scale text mining tools targeting complex biological information. The user supplies input in the form of identifiers used in relevant data resources to refer to the concepts of interest, e.g. EC numbers, GO and SBO identifiers. By doing so, the user is freed from providing any other knowledge or terminology concerned with these concepts and their relations, since they are retrieved from these and cross-referenced resources automatically. The terminology acquired is used to index the literature by mapping concepts to their synonyms, and then to textual documents mentioning them. The indexing results and the previously acquired knowledge about relations between concepts are used to formulate complex search queries aiming at documents relevant to the user's information needs. The conceptual approach is demonstrated in the implementation of KiPar. Evaluation reveals that KiPar performs better than a Boolean search. The precision achieved for abstracts (60%) and full-text articles (48%) is considerably better than the baseline precision (44% and 24%, respectively). The baseline recall is improved by 36% for abstracts and by 100% for full text. It appears that full-text articles are a much richer source of information on kinetic data than are their abstracts. Finally, the combined results for abstracts and full text compared with the curated literature provide high values for relative recall (88%) and novelty ratio (92%), suggesting that the system is able to retrieve a high proportion of new documents.<\/jats:p>\n               <jats:p>Availability: Source code and documentation are available at: http:\/\/www.mcisb.org\/resources\/kipar\/<\/jats:p>\n               <jats:p>Contact: \u00a0i.spasic@manchester.ac.uk; dbk@manchester.ac.uk<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp175","type":"journal-article","created":{"date-parts":[[2009,4,1]],"date-time":"2009-04-01T00:33:57Z","timestamp":1238546037000},"page":"1404-1411","source":"Crossref","is-referenced-by-count":12,"title":["KiPar, a tool for systematic information retrieval regarding parameters for kinetic modelling of yeast metabolic pathways"],"prefix":"10.1093","volume":"25","author":[{"given":"Irena","family":"Spasi\u0107","sequence":"first","affiliation":[{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"},{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Evangelos","family":"Simeonidis","sequence":"additional","affiliation":[{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"},{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hanan L.","family":"Messiha","sequence":"additional","affiliation":[{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"},{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Norman W.","family":"Paton","sequence":"additional","affiliation":[{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"},{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Douglas B.","family":"Kell","sequence":"additional","affiliation":[{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"},{"name":"1 Manchester Centre for Integrative Systems Biology, 2School of Computer Science, 3School of Chemical Engineering and Analytical Science and 4School of Chemistry, The University of Manchester, Manchester, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2009,3,31]]},"reference":[{"key":"2023013111545369400_B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023013111545369400_B2","doi-asserted-by":"crossref","first-page":"571","DOI":"10.1016\/j.tibtech.2006.10.002","article-title":"Text mining and its potential applications in Systems Biology","volume":"24","author":"Ananiadou","year":"2006","journal-title":"Trends Biotechnol."},{"key":"2023013111545369400_B3","first-page":"485","article-title":"Query expansion using the UMLS Metathesaurus","author":"Aronson","year":"1997","journal-title":"proc of AMIA Annu. Fall Symp."},{"key":"2023013111545369400_B4","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene Ontology: tool for the unification of biology","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023013111545369400_B5","volume-title":"Modern Information Retrieval.","author":"Baeza-Yates","year":"1999"},{"key":"2023013111545369400_B6","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1186\/1471-2105-4-61","article-title":"PubMatrix: a tool for multiplex literature mining","volume":"4","author":"Becker","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023013111545369400_B7","doi-asserted-by":"crossref","first-page":"D267","DOI":"10.1093\/nar\/gkh061","article-title":"The Unified Medical Language System (UMLS): integrating biomedical terminology","volume":"32","author":"Bodenreider","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023013111545369400_B8","author":"ChEBI","year":"2008"},{"key":"2023013111545369400_B9","doi-asserted-by":"crossref","first-page":"W399","DOI":"10.1093\/nar\/gkn296","article-title":"PolySearch: a web-based text mining system for extracting relationships between human diseases, genes, mutations, drugs and metabolites","volume":"36","author":"Cheng","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023013111545369400_B10","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1093\/nar\/26.1.73","article-title":"SGD: Saccharomyces Genome Database","volume":"26","author":"Cherry","year":"1998","journal-title":"Nucleic Acids Res."},{"key":"2023013111545369400_B11","author":"CYGD","year":"2008"},{"key":"2023013111545369400_B12","doi-asserted-by":"crossref","first-page":"D344","DOI":"10.1093\/nar\/gkm791","article-title":"ChEBI: a database and ontology for chemical entities of biological interest","volume":"36","author":"Degtyarenko","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023013111545369400_B13","first-page":"326","article-title":"Mining MEDLINE: abstracts, sentences, or phrases","volume-title":"Proceedings of the 7th Pacific Symposium on Biocomputing (PSB 2002).","author":"Ding","year":"2002"},{"key":"2023013111545369400_B14","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1186\/1471-2105-4-11","article-title":"PreBIND and Textomy: mining the biomedical literature for protein-protein interactions using a support vector machine","volume":"4","author":"Donaldson","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023013111545369400_B15","author":"Entrez","year":"2008"},{"key":"2023013111545369400_B16","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1016\/S0304-3975(99)00224-8","article-title":"A formula for incorporating weights into scoring rules","volume":"239","author":"Fagin","year":"2000","journal-title":"Theor. Comput. Sci."},{"key":"2023013111545369400_B17","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1016\/S1532-0464(03)00012-1","article-title":"Two biomedical sublanguages: a description based on the theories of Zellig Harris","volume":"35","author":"Friedman","year":"2002","journal-title":"J. Biomed. Inform."},{"key":"2023013111545369400_B18","doi-asserted-by":"crossref","first-page":"2463","DOI":"10.1093\/bioinformatics\/bth251","article-title":"Pedro: a configurable data entry tool for XML","volume":"20","author":"Garwood","year":"2004","journal-title":"Bioinformatics"},{"key":"2023013111545369400_B19","author":"GO","year":"2008"},{"key":"2023013111545369400_B20","doi-asserted-by":"crossref","first-page":"D364","DOI":"10.1093\/nar\/gki053","article-title":"CYGD: the Comprehensive Yeast Genome Database","volume":"33","author":"G\u00fcldener","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023013111545369400_B21","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1089\/1536231041388366","article-title":"Finding kinetic parameters using text mining","volume":"8","author":"Hakenberg","year":"2004","journal-title":"OMICS"},{"key":"2023013111545369400_B22","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1016\/S1532-0464(03)00011-X","article-title":"The structure of science information","volume":"35","author":"Harris","year":"2002","journal-title":"J. Biomed. Inform."},{"key":"2023013111545369400_B23","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-21606-5","volume-title":"The Elements of Statistical Learning: Data Mining, Inference and Prediction.","author":"Hastie","year":"2001"},{"key":"2023013111545369400_B24","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1021\/cen-v081n020.p045","article-title":"Systems biology","volume":"81","author":"Henry","year":"2003","journal-title":"Chem. Eng. News"},{"key":"2023013111545369400_B25","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1038\/nbt1492","article-title":"A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology","volume":"26","author":"Herrg\u00e5rd","year":"2008","journal-title":"Nat. Biotechnol."},{"key":"2023013111545369400_B26","doi-asserted-by":"crossref","DOI":"10.3115\/1072017.1072029","article-title":"The generic information extraction system","volume-title":"Fifth Message Understanding Conference (MUC5).","author":"Hobbs","year":"1993"},{"key":"2023013111545369400_B27","doi-asserted-by":"crossref","first-page":"pe21","DOI":"10.1126\/stke.2832005pe21","article-title":"Text mining for metabolic pathways, signaling cascades, and protein networks","volume":"2005","author":"Hoffmann","year":"2005","journal-title":"Sci STKE"},{"key":"2023013111545369400_B28","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1016\/S0047-6374(02)00164-1","article-title":"Systems biology: integrating technology, biology, and computation","volume":"124","author":"Hood","year":"2003","journal-title":"Mech. Ageing Dev."},{"key":"2023013111545369400_B29","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1093\/bioinformatics\/btg015","article-title":"The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models","volume":"19","author":"Hucka","year":"2003","journal-title":"Bioinformatics"},{"key":"2023013111545369400_B30","doi-asserted-by":"crossref","first-page":"e1000204","DOI":"10.1371\/journal.pcbi.1000204","article-title":"Defrosting the digital library: bibliographic tools for the next generation web","volume":"4","author":"Hull","year":"2008","journal-title":"PLoS Comput. Biol."},{"key":"2023013111545369400_B31","first-page":"505","article-title":"Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures","volume-title":"Proceedings of the 5th Pacific Symposium on Biocomputing (PSB 2000).","author":"Humphreys","year":"2000"},{"key":"2023013111545369400_B32","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1038\/nrg1768","article-title":"Literature mining for the biologist: from information retrieval to biological discovery","volume":"7","author":"Jensen","year":"2006","journal-title":"Nat. Rev. Genet."},{"key":"2023013111545369400_B33","doi-asserted-by":"crossref","first-page":"D480","DOI":"10.1093\/nar\/gkm882","article-title":"KEGG for linking genomes to life and the environment","volume":"36","author":"Kanehisa","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023013111545369400_B34"},{"key":"2023013111545369400_B35","doi-asserted-by":"crossref","first-page":"873","DOI":"10.1111\/j.1742-4658.2006.05136.x","article-title":"Metabolomics, modelling and machine learning in systems biology: towards an understanding of the languages of cells. The 2005 Theodor B\u00fccher lecture","volume":"273","author":"Kell","year":"2006","journal-title":"FEBS J."},{"key":"2023013111545369400_B36","doi-asserted-by":"crossref","first-page":"S11","DOI":"10.1186\/1471-2202-7-S1-S11","article-title":"Model storage, exchange and integration","volume":"7","author":"Le Novere","year":"2006","journal-title":"BMC Neurosci."},{"key":"2023013111545369400_B37","volume-title":"Biochemical Pathways: an Atlas of Biochemistry and Molecular Biology.","author":"Michal","year":"1999"},{"key":"2023013111545369400_B38","doi-asserted-by":"crossref","first-page":"e309","DOI":"10.1371\/journal.pbio.0020309","article-title":"Textpresso: an ontology-based information retrieval and extraction system for biological literature","volume":"2","author":"M\u00fcller","year":"2004","journal-title":"PLoS Biol."},{"key":"2023013111545369400_B39"},{"key":"2023013111545369400_B40"},{"key":"2023013111545369400_B41","doi-asserted-by":"crossref","first-page":"3894","DOI":"10.1046\/j.1432-1033.2002.03055.x","article-title":"Schemes of flux control in a model of Saccharomyces cerevisiae glycolysis","volume":"269","author":"Pritchard","year":"2002","journal-title":"Eur. J. Biochem."},{"key":"2023013111545369400_B42"},{"key":"2023013111545369400_B43"},{"key":"2023013111545369400_B44","doi-asserted-by":"crossref","first-page":"e237","DOI":"10.1093\/bioinformatics\/btl302","article-title":"EBIMed\u2014text crunching to gather facts for proteins from Medline","volume":"23","author":"Rebholz-Schuhmann","year":"2007","journal-title":"Bioinformatics"},{"key":"2023013111545369400_B45","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1016\/j.jbi.2003.10.001","article-title":"GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data","volume":"37","author":"Rzhetsky","year":"2004","journal-title":"J. Biomed. Inform."},{"key":"2023013111545369400_B46"},{"key":"2023013111545369400_B47"},{"key":"2023013111545369400_B48"},{"key":"2023013111545369400_B49","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1093\/bib\/6.3.222","article-title":"Hairpins in bookstacks: information retrieval from biomedical text","volume":"6","author":"Shatkay","year":"2005","journal-title":"Brief. Bioinform."},{"key":"2023013111545369400_B50","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1093\/bib\/6.3.239","article-title":"Text mining and ontologies in biomedicine: making sense of raw text","volume":"6","author":"Spasic","year":"2005","journal-title":"Brief. Bioinform."},{"key":"2023013111545369400_B51","doi-asserted-by":"crossref","first-page":"1427","DOI":"10.1002\/asi.20438","article-title":"Ranking indirect connections in literature-based discovery: the role of Medical Subject Headings","volume":"57","author":"Swanson","year":"2006","journal-title":"J. Am. Soc. Inform. Sci. Technol."},{"key":"2023013111545369400_B52","doi-asserted-by":"crossref","first-page":"5313","DOI":"10.1046\/j.1432-1327.2000.01527.x","article-title":"Can yeast glycolysis be understood in terms of in vitro kinetics of the constituent enzymes? Testing biochemistry","volume":"267","author":"Teusink","year":"2000","journal-title":"Eur. J. Biochem."},{"key":"2023013111545369400_B53","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1093\/bioinformatics\/btg375","article-title":"MedBlast: searching articles related to a biological sequence","volume":"20","author":"Tu","year":"2004","journal-title":"Bioinformatics"},{"key":"2023013111545369400_B54"},{"key":"2023013111545369400_B55","doi-asserted-by":"crossref","first-page":"D13","DOI":"10.1093\/nar\/gkm1000","article-title":"Database resources of the National Center for Biotechnology Information","volume":"36","author":"Wheeler","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023013111545369400_B56","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1016\/S1386-5056(97)00094-4","article-title":"Information retrieval: an overview of system characteristics","volume":"47","author":"Wiesman","year":"1997","journal-title":"Int. J. Med. Inform."},{"key":"2023013111545369400_B57","first-page":"94","article-title":"SABIO-RK: integration and curation of reaction kinetics data","volume":"4075","author":"Wittig","year":"2006","journal-title":"Lecture Notes in Bioinformatics"},{"key":"2023013111545369400_B58","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1186\/1471-2105-7-171","article-title":"Automatic pathway building in biological association networks","volume":"7","author":"Yuryev","year":"2006","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/11\/1404\/48987782\/bioinformatics_25_11_1404.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/11\/1404\/48987782\/bioinformatics_25_11_1404.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:00:38Z","timestamp":1675198838000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/11\/1404\/331612"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,3,31]]},"references-count":58,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2009,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp175","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,6,1]]},"published":{"date-parts":[[2009,3,31]]}}}