{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,10,8]],"date-time":"2023-10-08T09:26:51Z","timestamp":1696757211811},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"16","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,8,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Determination of the binding affinity of a protein\u2013ligand complex is important to quantitatively specify whether a particular small molecule will bind to the target protein. Besides, collection of comprehensive datasets for protein\u2013ligand complexes and their corresponding binding affinities is crucial in developing accurate scoring functions for the prediction of the binding affinities of previously unknown protein\u2013ligand complexes. In the past decades, several databases of protein\u2013ligand-binding affinities have been created via visual extraction from literature. However, such approaches are time-consuming and most of these databases are updated only a few times per year. Hence, there is an immediate demand for an automatic extraction method with high precision for binding affinity collection.<\/jats:p>\n               <jats:p>Result: We have created a new database of protein\u2013ligand-binding affinity data, AutoBind, based on automatic information retrieval. We first compiled a collection of 1586 articles where the binding affinities have been marked manually. Based on this annotated collection, we designed four sentence patterns that are used to scan full-text articles as well as a scoring function to rank the sentences that match our patterns. The proposed sentence patterns can effectively identify the binding affinities in full-text articles. Our assessment shows that AutoBind achieved 84.22% precision and 79.07% recall on the testing corpus. Currently, 13 616 protein\u2013ligand complexes and the corresponding binding affinities have been deposited in AutoBind from 17 221 articles.<\/jats:p>\n               <jats:p>Availability: AutoBind is automatically updated on a monthly basis, and it is freely available at http:\/\/autobind.csie.ncku.edu.tw\/ and http:\/\/autobind.mc.ntu.edu.tw\/. All of the deposited binding affinities have been refined and approved manually before being released.<\/jats:p>\n               <jats:p>Contact: \u00a0jchiang@mail.ncku.edu.tw<\/jats:p>\n               <jats:p>Supplementary Information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts367","type":"journal-article","created":{"date-parts":[[2012,7,3]],"date-time":"2012-07-03T03:08:37Z","timestamp":1341284917000},"page":"2162-2168","source":"Crossref","is-referenced-by-count":5,"title":["AutoBind: automatic extraction of protein\u2013ligand-binding affinity data from biological literature"],"prefix":"10.1093","volume":"28","author":[{"given":"Darby Tien-Hao","family":"Chang","sequence":"first","affiliation":[{"name":"1 Department of Electrical Engineering, 2Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, 3School of Pharmacy, National Taiwan University, Taipei 10051 and 4Institute of Biomedical Sciences, Academia Sinica, Taipei 11529, Taiwan"}]},{"given":"Chao-Hsuan","family":"Ke","sequence":"additional","affiliation":[{"name":"1 Department of Electrical Engineering, 2Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, 3School of Pharmacy, National Taiwan University, Taipei 10051 and 4Institute of Biomedical Sciences, Academia Sinica, Taipei 11529, Taiwan"}]},{"given":"Jung-Hsin","family":"Lin","sequence":"additional","affiliation":[{"name":"1 Department of Electrical Engineering, 2Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, 3School of Pharmacy, National Taiwan University, Taipei 10051 and 4Institute of Biomedical Sciences, Academia Sinica, Taipei 11529, Taiwan"},{"name":"1 Department of Electrical Engineering, 2Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, 3School of Pharmacy, National Taiwan University, Taipei 10051 and 4Institute of Biomedical Sciences, Academia Sinica, Taipei 11529, Taiwan"}]},{"given":"Jung-Hsien","family":"Chiang","sequence":"additional","affiliation":[{"name":"1 Department of Electrical Engineering, 2Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, 3School of Pharmacy, National Taiwan University, Taipei 10051 and 4Institute of Biomedical Sciences, Academia Sinica, Taipei 11529, Taiwan"}]}],"member":"286","published-online":{"date-parts":[[2012,7,5]]},"reference":[{"key":"2023012512531949300_B1","doi-asserted-by":"crossref","first-page":"1723","DOI":"10.1093\/bioinformatics\/btr194","article-title":"Figure summarizer browser extensions for PubMed Central","volume":"27","author":"Agarwal","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B2","doi-asserted-by":"crossref","first-page":"D154","DOI":"10.1093\/nar\/gki070","article-title":"The Universal Protein Resource (UniProt)","volume":"33","author":"Bairoch","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012512531949300_B3","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012512531949300_B4","doi-asserted-by":"crossref","first-page":"i120","DOI":"10.1093\/bioinformatics\/btr223","article-title":"MeSH: a window into full text for document summarization","volume":"27","author":"Bhattacharya","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B5","first-page":"60","article-title":"Automatic extraction of biological information from scientific text: protein-protein interactions","volume-title":"Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology.","author":"Blaschke","year":"1999"},{"key":"2023012512531949300_B6","doi-asserted-by":"crossref","first-page":"D522","DOI":"10.1093\/nar\/gkj039","article-title":"AffinDB: a freely accessible database of affinities for protein-ligand complexes from the PDB","volume":"34","author":"Block","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012512531949300_B7","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1093\/bioinformatics\/btq620","article-title":"A hybrid approach to extract protein\u2013protein interactions","volume":"27","author":"Bui","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B8","doi-asserted-by":"crossref","first-page":"D472","DOI":"10.1093\/nar\/gkr940","article-title":"AH-DB: collecting protein structure pairs before and after binding","volume":"40","author":"Chang","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023012512531949300_B9","doi-asserted-by":"crossref","first-page":"e30446","DOI":"10.1371\/journal.pone.0030446","article-title":"Predicting target DNA sequences of DNA-binding proteins based on unbound structures","volume":"7","author":"Chen","year":"2012","journal-title":"PLoS One"},{"key":"2023012512531949300_B10","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1093\/bioinformatics\/18.1.130","article-title":"The Binding Database: data management and interface design","volume":"18","author":"Chen","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B11","doi-asserted-by":"crossref","first-page":"392","DOI":"10.1186\/1471-2105-7-392","article-title":"GeneLibrarian: an effective gene-information summarization and visualization system","volume":"7","author":"Chiang","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012512531949300_B12","doi-asserted-by":"crossref","first-page":"W173","DOI":"10.1093\/nar\/gks564","article-title":"DBD2BS: connecting a DNA-binding protein with its binding sites","volume":"40","author":"Chien","year":"2012","journal-title":"Nucleic Acids Research"},{"key":"2023012512531949300_B13","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1093\/bioinformatics\/btl616","article-title":"RelEx\u2014relation extraction using dependency parse trees","volume":"23","author":"Fundel","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B14","doi-asserted-by":"crossref","first-page":"e4554","DOI":"10.1371\/journal.pone.0004554","article-title":"PPI finder: a mining tool for human protein-protein interactions","volume":"4","author":"He","year":"2009","journal-title":"PLoS One"},{"key":"2023012512531949300_B15","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1186\/1471-2105-11-375","article-title":"KID\u2014an algorithm for fast and efficient text mining used to automatically generate a database containing kinetic information of enzymes","volume":"11","author":"Heinen","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023012512531949300_B16","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1002\/prot.20512","article-title":"Binding MOAD (Mother Of All Databases)","volume":"60","author":"Hu","year":"2005","journal-title":"Prot. Struct. Funct. Bioinformatics"},{"key":"2023012512531949300_B17","doi-asserted-by":"crossref","first-page":"2759","DOI":"10.1093\/bioinformatics\/bti390","article-title":"Literature mining and database annotation of protein phosphorylation using a rule-based system","volume":"21","author":"Hu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B18","doi-asserted-by":"crossref","first-page":"e220","DOI":"10.1093\/bioinformatics\/btl203","article-title":"Finding the evidence for protein-protein interactions from PubMed abstracts","volume":"22","author":"Jang","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B19","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1186\/1758-2946-3-41","article-title":"OSCAR4: a flexible architecture for chemical text-mining","volume":"3","author":"Jessop","year":"2011","journal-title":"J. Cheminform."},{"key":"2023012512531949300_B20","first-page":"1","article-title":"Overview of BioNLP'09 shared task on event extraction","volume-title":"Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task.","author":"Kim","year":"2009"},{"key":"2023012512531949300_B21","first-page":"9","article-title":"PRIME: automatically extracted protein interactions and molecular information database","volume":"5","author":"Koike","year":"2005","journal-title":"In Silico Biol."},{"key":"2023012512531949300_B22","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1186\/gb-2005-6-7-224","article-title":"Text-mining and information-retrieval services for molecular biology","volume":"6","author":"Krallinger","year":"2005","journal-title":"Genome Biol."},{"key":"2023012512531949300_B23","doi-asserted-by":"crossref","first-page":"D198","DOI":"10.1093\/nar\/gkl999","article-title":"BindingDB: a web-accessible database of experimentally determined protein\u2013ligand binding affinities","volume":"35","author":"Liu","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012512531949300_B24","doi-asserted-by":"crossref","first-page":"3370","DOI":"10.1093\/bioinformatics\/bth409","article-title":"Extracting gene pathway relations using a hybrid grammar: the Arizona Relation Parser","volume":"20","author":"McDonald","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B25","doi-asserted-by":"crossref","first-page":"W634","DOI":"10.1093\/nar\/gkh427","article-title":"NLProt: extracting protein names and sequences from papers","volume":"32","author":"Mika","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012512531949300_B26","doi-asserted-by":"crossref","first-page":"D750","DOI":"10.1093\/nar\/gkp889","article-title":"BioNumbers\u2014the database of key numbers in molecular and cell biology","volume":"38","author":"Milo","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012512531949300_B27","doi-asserted-by":"crossref","first-page":"7068","DOI":"10.1073\/pnas.0701356104","article-title":"Connecting protein structure with predictions of regulatory sites","volume":"104","author":"Morozov","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012512531949300_B28","doi-asserted-by":"crossref","first-page":"3306","DOI":"10.1093\/bioinformatics\/btr573","article-title":"Extraction of data deposition statements from the literature: a method for automatically tracking research results","volume":"27","author":"N\u00e9v\u00e9ol","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B29","doi-asserted-by":"crossref","first-page":"1856","DOI":"10.1093\/bioinformatics\/btg243","article-title":"Protein Ligand Database (PLD): additional understanding of the nature and specificity of protein\u2013ligand complexes","volume":"19","author":"Puvanendrampillai","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B30","doi-asserted-by":"crossref","first-page":"188","DOI":"10.3115\/974147.974173","article-title":"Extracting molecular binding relationships from biomedical text","volume-title":"Proceedings of the sixth conference on Applied natural language processing.","author":"Rindflesch","year":"2000"},{"key":"2023012512531949300_B31","doi-asserted-by":"crossref","first-page":"3592","DOI":"10.1021\/jm000467k","article-title":"Ligand-Protein DataBase: linking protein-ligand complex structures to binding data","volume":"44","author":"Roche","year":"2001","journal-title":"J. Med. Chem."},{"key":"2023012512531949300_B32","doi-asserted-by":"crossref","first-page":"1404","DOI":"10.1093\/bioinformatics\/btp175","article-title":"KiPar, a tool for systematic information retrieval regarding parameters for kinetic modelling of yeast metabolic pathways","volume":"25","author":"Spasi\u0107","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B33","first-page":"529","article-title":"Biobibliometrics: information retrieval and visualization from co-occurrences of gene names in medline abstracts","volume-title":"Proceedings of the fifth Pacific Symposium on Biocomputing.","author":"Stapley","year":"2000"},{"key":"2023012512531949300_B34","doi-asserted-by":"crossref","first-page":"i547","DOI":"10.1093\/bioinformatics\/btq382","article-title":"Discovering drug\u2013drug interactions: a text-mining and reasoning approach based on properties of drug metabolism","volume":"26","author":"Tari","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B35","doi-asserted-by":"crossref","first-page":"2977","DOI":"10.1021\/jm030580l","article-title":"The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures","volume":"47","author":"Wang","year":"2004","journal-title":"J. Med. Chem."},{"key":"2023012512531949300_B36","doi-asserted-by":"crossref","first-page":"W623","DOI":"10.1093\/nar\/gkp456","article-title":"PubChem: a public information system for analyzing bioactivities of small molecules","volume":"37","author":"Wang","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012512531949300_B37","doi-asserted-by":"crossref","first-page":"815","DOI":"10.1093\/bioinformatics\/btp071","article-title":"High-performance gene name normalization with GeNo","volume":"25","author":"Wermter","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512531949300_B38","doi-asserted-by":"crossref","first-page":"119","DOI":"10.2307\/3001946","article-title":"Probability tables for individual comparisons by ranking methods","volume":"3","author":"Wilcoxon","year":"1947","journal-title":"Biometrics"},{"key":"2023012512531949300_B39","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1016\/j.jbi.2007.11.008","article-title":"Extracting interactions between proteins from the literature","volume":"41","author":"Zhou","year":"2008","journal-title":"J. Biomed. Informatics"},{"key":"2023012512531949300_B40","doi-asserted-by":"crossref","first-page":"2813","DOI":"10.1093\/bioinformatics\/btl480","article-title":"ADAM: another database of abbreviations in MEDLINE","volume":"22","author":"Zhou","year":"2006","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/16\/2162\/48870763\/bioinformatics_28_16_2162.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/16\/2162\/48870763\/bioinformatics_28_16_2162.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T17:51:34Z","timestamp":1674669094000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/16\/2162\/325343"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,7,5]]},"references-count":40,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2012,8,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts367","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,8,15]]},"published":{"date-parts":[[2012,7,5]]}}}