{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:18Z","timestamp":1772138058129,"version":"3.50.1"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2023,3,8]],"date-time":"2023-03-08T00:00:00Z","timestamp":1678233600000},"content-version":"vor","delay-in-days":7,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Helmholtz School for Data Science in Life, Earth, and Energy"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>TopEnzyme is a database of structural enzyme models created with TopModel and is linked to the SWISS-MODEL repository and AlphaFold Protein Structure Database to provide an overview of structural coverage of the functional enzyme space for over 200\u00a0000 enzyme models. It allows the user to quickly obtain representative structural models for 60% of all known enzyme functions.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We assessed the models with TopScore and contributed 9039 good-quality and 1297 high-quality structures. Furthermore, we compared these models to AlphaFold2 models with TopScore and found that the TopScore differs only by 0.04 on average in favor of AlphaFold2. We tested TopModel and AlphaFold2 for targets not seen in the respective training databases and found that both methods create qualitatively similar structures. When no experimental structures are available, this database will facilitate quick access to structural models across the currently most extensive structural coverage of the functional enzyme space within Swiss-Prot.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>We provide a full web interface to the database at https:\/\/cpclab.uni-duesseldorf.de\/topenzyme\/.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad116","type":"journal-article","created":{"date-parts":[[2023,3,8]],"date-time":"2023-03-08T07:36:14Z","timestamp":1678260974000},"source":"Crossref","is-referenced-by-count":5,"title":["TopEnzyme: a framework and database for structural coverage of the functional enzyme space"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1896-0291","authenticated-orcid":false,"given":"Karel J","family":"van der Weg","sequence":"first","affiliation":[{"name":"John von Neumann Institute for Computing (NIC), J\u00fclich Supercomputing Centre (JSC), and Institute of Bio- and Geosciences (IBG-4: Bioinformatics), Forschungszentrum J\u00fclich GmbH , J\u00fclich 52425, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8613-1447","authenticated-orcid":false,"given":"Holger","family":"Gohlke","sequence":"additional","affiliation":[{"name":"John von Neumann Institute for Computing (NIC), J\u00fclich Supercomputing Centre (JSC), and Institute of Bio- and Geosciences (IBG-4: Bioinformatics), Forschungszentrum J\u00fclich GmbH , J\u00fclich 52425, Germany"},{"name":"Institute for Pharmaceutical and Medicinal Chemistry, Heinrich Heine University D\u00fcsseldorf , D\u00fcsseldorf 40225, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2023,3,8]]},"reference":[{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1093\/nar\/gkt1242","article-title":"SCOP2 prototype: a new approach to protein structure mining","volume":"42","author":"Andreeva","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"871","DOI":"10.1126\/science.abj8754","article-title":"Accurate prediction of protein structures and interactions using a three-track neural network","volume":"373","author":"Baek","year":"2021","journal-title":"Science"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1093\/nar\/28.1.304","article-title":"The ENZYME database in 2000","volume":"28","author":"Bairoch","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1093\/bioinformatics\/btq662","article-title":"Toward the estimation of the absolute quality of individual protein structure models","volume":"27","author":"Benkert","year":"2011","journal-title":"Bioinformatics"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"980","DOI":"10.1038\/nsb1203-980","article-title":"Announcing the worldwide protein data bank","volume":"10","author":"Berman","year":"2003","journal-title":"Nat Struct Biol"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"D313","DOI":"10.1093\/nar\/gkw1132","article-title":"The SWISS-MODEL repository\u2014new features and functionality","volume":"45","author":"Bienert","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"e1003926","DOI":"10.1371\/journal.pcbi.1003926","article-title":"ECOD: an evolutionary classification of protein domains","volume":"10","author":"Cheng","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1107\/S2059798318017242","article-title":"Crystal structure of Mycobacterium tuberculosis FadB2 implicated in mycobacterial \u03b2-oxidation","volume":"75","author":"Cox","year":"2019","journal-title":"Acta Crystallogr D Struct Biol"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"2666","DOI":"10.1021\/ci200168b","article-title":"Pocket-Space maps to identify novel Binding-Site conformations in proteins","volume":"51","author":"Craig","year":"2011","journal-title":"J Chem Inf Model"},{"key":"2023031723192950400_","author":"Evans","year":"2021"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1006\/jmbi.1999.2661","article-title":"The relationship between protein structure and function: a comprehensive survey with application to the yeast genome","volume":"288","author":"Hegyi","year":"1999","journal-title":"J Mol Biol"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1016\/S0022-2836(02)00971-3","article-title":"Sequence variations within protein families are linearly related to structural variations","volume":"323","author":"Koehl","year":"2002","journal-title":"J Mol Biol"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1002\/1097-0134(2000)41:4+<63::AID-PROT60>3.0.CO;2-6","article-title":"Structural stability of binding sites: consequences for binding affinity and allosteric effects","volume":"Suppl 4","author":"Luque","year":"2000","journal-title":"Proteins"},{"key":"2023031723192950400_","first-page":"185","article-title":"Exploring the limits of weakly supervised pretraining","volume":"11206","author":"Mahajan","year":"2018","journal-title":"ECCV"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"2722","DOI":"10.1093\/bioinformatics\/btt473","article-title":"LDDT: a local superposition-free score for comparing protein structures and models using distance difference tests","volume":"29","author":"Mariani","year":"2013","journal-title":"Bioinformatics"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"e1007569","DOI":"10.1371\/journal.pcbi.1007569","article-title":"Functional analysis of rossmann-like domains reveals convergent evolution of topology and reaction pathways","volume":"15","author":"Medvedev","year":"2019","journal-title":"PLoS Comput Biol"},{"key":"2023031723192950400_","author":"Mirdita"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"6117","DOI":"10.1021\/acs.jctc.8b00690","article-title":"TopScore: using deep neural networks and large diverse data sets for accurate protein model quality assessment","volume":"14","author":"Mulnaes","year":"2018","journal-title":"J Chem Theory Comput"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"1953","DOI":"10.1021\/acs.jctc.9b00825","article-title":"TopModel: template-Based protein structure prediction at low sequence identity using Top-Down consensus and deep neural networks","volume":"16","author":"Mulnaes","year":"2020","journal-title":"J Chem Theory Comput"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"741","DOI":"10.1016\/S0022-2836(02)00649-6","article-title":"One fold with many functions: the evolutionary relationships between TIM barrel families based on their sequences, structures and functions","volume":"321","author":"Nagano","year":"2002","journal-title":"J Mol Biol"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1016\/S0959-440X(99)80051-7","article-title":"From protein structure to function","volume":"9","author":"Orengo","year":"1999","journal-title":"Curr Opin Struct Biol"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"eabj3221","DOI":"10.1126\/sciadv.abj3221","article-title":"High-resolution structure and dynamics of mitochondrial complex I-Insights into the proton pumping mechanism","volume":"7","author":"Parey","year":"2021","journal-title":"Sci Adv"},{"key":"2023031723192950400_","first-page":"3.1.1","article-title":"An introduction to sequence similarity (\u2018homology\u2019) searching","volume":"3","author":"Pearson","year":"2013","journal-title":"Curr Protoc Bioinform"},{"key":"2023031723192950400_","author":"Plappert","year":"2018"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1093\/protein\/12.2.85","article-title":"Twilight zone of protein sequence alignments","volume":"12","author":"Rost","year":"1999","journal-title":"Protein Eng"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1093\/nar\/gkaa1079","article-title":"CATH: increased structural coverage of functional space","volume":"49","author":"Sillitoe","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"5195","DOI":"10.1038\/s41467-020-18814-w","article-title":"Specific features and assembly of the plant mitochondrial complex I revealed by Cryo-EM","volume":"11","author":"Soufari","year":"2020","journal-title":"Nat Commun"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"1026","DOI":"10.1038\/nbt.3988","article-title":"MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets","volume":"35","author":"Steinegger","year":"2017","journal-title":"Nat Biotechnol"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1093\/nar\/gkaa1100","article-title":"UniProt: the universal protein knowledgebase in 2021","volume":"49","author":"The UniProt Consortium","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"eabd4420","DOI":"10.1126\/sciadv.abd4420","article-title":"Structure of TFIIK for phosphorylation of CTD of RNA polymerase II","volume":"7","author":"van Eeuwen","year":"2021","journal-title":"Sci Adv"},{"key":"2023031723192950400_","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1093\/nar\/gkab1061","article-title":"AlphaFold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models","volume":"50","author":"Varadi","year":"2022","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad116\/49454319\/btad116.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/3\/btad116\/49555817\/btad116.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/3\/btad116\/49555817\/btad116.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,3,27]],"date-time":"2023-03-27T13:58:43Z","timestamp":1679925523000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad116\/7072462"}},"subtitle":[],"editor":[{"given":"Lenore","family":"Cowen","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2023,3,1]]},"references-count":32,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad116","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2022.06.13.495871","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,3,1]]},"published":{"date-parts":[[2023,3,1]]},"article-number":"btad116"}}