{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T20:15:56Z","timestamp":1780517756114,"version":"3.54.1"},"reference-count":127,"publisher":"Oxford University Press (OUP)","license":[{"start":{"date-parts":[[2022,8,12]],"date-time":"2022-08-12T00:00:00Z","timestamp":1660262400000},"content-version":"vor","delay-in-days":223,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000152","name":"Division of Molecular and Cellular Biosciences","doi-asserted-by":"publisher","award":["MCB- 2129768"],"award-info":[{"award-number":["MCB- 2129768"]}],"id":[{"id":"10.13039\/100000152","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000092","name":"U.S. National Library of Medicine","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000092","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,8,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Over the last 25\u2009years, biology has entered the genomic era and is becoming a science of \u2018big data\u2019. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500\u2009000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3\u20134 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.<\/jats:p>","DOI":"10.1093\/database\/baac062","type":"journal-article","created":{"date-parts":[[2022,8,12]],"date-time":"2022-08-12T21:10:19Z","timestamp":1660338619000},"source":"Crossref","is-referenced-by-count":57,"title":["A roadmap for the functional annotation of protein families: a community perspective"],"prefix":"10.1093","volume":"2022","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9955-3785","authenticated-orcid":false,"given":"Val\u00e9rie","family":"de Cr\u00e9cy-lagard","sequence":"first","affiliation":[{"name":"Department of Microbiology and Cell Sciences, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Rocio","family":"Amorin de Hegedus","sequence":"additional","affiliation":[{"name":"Genetics Institute, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0803-4817","authenticated-orcid":false,"given":"Cecilia","family":"Arighi","sequence":"additional","affiliation":[{"name":"Department of Computer and Information Sciences, University of Delaware , Newark, DE 19713, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jill","family":"Babor","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Cell Sciences, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6982-4660","authenticated-orcid":false,"given":"Alex","family":"Bateman","sequence":"additional","affiliation":[{"name":"European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus , Hinxton CB10 1SD, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1631-3154","authenticated-orcid":false,"given":"Ian","family":"Blaby","sequence":"additional","affiliation":[{"name":"US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory , Berkeley, CA 94720, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Crysten","family":"Blaby-Haas","sequence":"additional","affiliation":[{"name":"Biology Department, Brookhaven National Laboratory , Upton, NY 11973, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2148-9135","authenticated-orcid":false,"given":"Alan J","family":"Bridge","sequence":"additional","affiliation":[{"name":"Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire , Geneva 4 CH-1211, Switzerland"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2487-9713","authenticated-orcid":false,"given":"Stephen K","family":"Burley","sequence":"additional","affiliation":[{"name":"RCSB Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey , Piscataway, NJ 08854, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Stacey","family":"Cleveland","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Cell Sciences, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lucy J","family":"Colwell","sequence":"additional","affiliation":[{"name":"Departmenf of Chemistry, University of Cambridge , Lensfield Road, Cambridge CB2 1EW, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9597-311X","authenticated-orcid":false,"given":"Ana","family":"Conesa","sequence":"additional","affiliation":[{"name":"Spanish National Research Council, Institute for Integrative Systems Biology , Paterna, Valencia 46980, Spain"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4650-6181","authenticated-orcid":false,"given":"Christian","family":"Dallago","sequence":"additional","affiliation":[{"name":"TUM (Technical University of Munich) Department of Informatics, Bioinformatics & Computational Biology , i12, Boltzmannstr. 3, Garching\/Munich 85748, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6350-5001","authenticated-orcid":false,"given":"Antoine","family":"Danchin","sequence":"additional","affiliation":[{"name":"School of Biomedical Sciences, Li KaShing Faculty of Medicine, The University of Hong Kong , 21 Sassoon Road, Pokfulam, SAR Hong Kong 999077, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9034-4119","authenticated-orcid":false,"given":"Anita","family":"de Waard","sequence":"additional","affiliation":[{"name":"Research Collaboration Unit, Elsevier , Jericho, VT 05465, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Adam","family":"Deutschbauer","sequence":"additional","affiliation":[{"name":"Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory , Berkeley, CA 94720, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Raquel","family":"Dias","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Cell Sciences, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8610-0659","authenticated-orcid":false,"given":"Yousong","family":"Ding","sequence":"additional","affiliation":[{"name":"Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida , Gainesville, FL 32610, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Gang","family":"Fang","sequence":"additional","affiliation":[{"name":"NYU-Shanghai , Shanghai 200120, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1789-8000","authenticated-orcid":false,"given":"Iddo","family":"Friedberg","sequence":"additional","affiliation":[{"name":"Department of Veterinary Microbiology and Preventive Medicine, Iowa State University , Ames, IA 50011, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"John","family":"Gerlt","sequence":"additional","affiliation":[{"name":"Institute for Genomic Biology and Departments of Biochemistry and Chemistry, University of Illinois at Urbana-Champaign , Urbana, IL 61801, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Joshua","family":"Goldford","sequence":"additional","affiliation":[{"name":"Physics of Living Systems, Massachusetts Institute of Technology , Cambridge, MA 02139, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mark","family":"Gorelik","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Cell Sciences, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9439-5346","authenticated-orcid":false,"given":"Benjamin M","family":"Gyori","sequence":"additional","affiliation":[{"name":"Laboratory of Systems Pharmacology, Harvard Medical School , Boston, MA 02115, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Christopher","family":"Henry","sequence":"additional","affiliation":[{"name":"Mathematics and Computer Science Division, Argonne National Laboratory , Argonne, IL 60439, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Geoffrey","family":"Hutinet","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Cell Sciences, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Marshall","family":"Jaroch","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Cell Sciences, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Peter D","family":"Karp","sequence":"additional","affiliation":[{"name":"Bioinformatics Research Group, SRI International , Menlo Park, CA 94025, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Liudmyla","family":"Kondratova","sequence":"additional","affiliation":[{"name":"Genetics Institute, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9998-916X","authenticated-orcid":false,"given":"Zhiyong","family":"Lu","sequence":"additional","affiliation":[{"name":"National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH) , 8600 Rockville Pike, Bethesda, MD 20817, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Aron","family":"Marchler-Bauer","sequence":"additional","affiliation":[{"name":"National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH) , 8600 Rockville Pike, Bethesda, MD 20817, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Maria-Jesus","family":"Martin","sequence":"additional","affiliation":[{"name":"European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus , Hinxton CB10 1SD, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Claire","family":"McWhite","sequence":"additional","affiliation":[{"name":"Lewis-Sigler Institute for Integrative Genomics, Princeton University , Princeton, NJ 08540, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Gaurav D","family":"Moghe","sequence":"additional","affiliation":[{"name":"Plant Biology Section, School of Integrative Plant Science, Cornell University , Ithaca, NY 14853, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Paul","family":"Monaghan","sequence":"additional","affiliation":[{"name":"Department of Agricultural Education and Communication, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Anne","family":"Morgat","sequence":"additional","affiliation":[{"name":"Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire , Geneva 4 CH-1211, Switzerland"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6601-2165","authenticated-orcid":false,"given":"Christopher J","family":"Mungall","sequence":"additional","affiliation":[{"name":"Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory , Berkeley, CA 94720, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Darren A","family":"Natale","sequence":"additional","affiliation":[{"name":"Georgetown University Medical Center , Washington, DC 20007, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"William C","family":"Nelson","sequence":"additional","affiliation":[{"name":"Biological Sciences Division, Pacific Northwest National Laboratories , Richland, WA 99354, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Se\u00e1n","family":"O\u2019Donoghue","sequence":"additional","affiliation":[{"name":"School of Biotechnology and Biomolecular Sciences, University of NSW , Sydney, NSW 2052, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Christine","family":"Orengo","sequence":"additional","affiliation":[{"name":"Department of Structural and Molecular Biology, University College London , London WC1E 6BT, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Katherine H","family":"O\u2019Toole","sequence":"additional","affiliation":[{"name":"New England Biolabs , Ipswich, MA 01938, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6769-0793","authenticated-orcid":false,"given":"Predrag","family":"Radivojac","sequence":"additional","affiliation":[{"name":"Khoury College of Computer Sciences, Northeastern University , Boston, MA 02115, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Colbie","family":"Reed","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Cell Sciences, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Richard J","family":"Roberts","sequence":"additional","affiliation":[{"name":"New England Biolabs , Ipswich, MA 01938, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dmitri","family":"Rodionov","sequence":"additional","affiliation":[{"name":"Sanford Burnham Prebys Medical Discovery Institute , La Jolla, CA 92037, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6500-2758","authenticated-orcid":false,"given":"Irina A","family":"Rodionova","sequence":"additional","affiliation":[{"name":"Department of Bioengineering, Division of Engineering, University of California at San Diego , La Jolla, CA 92093-0412, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jeffrey D","family":"Rudolf","sequence":"additional","affiliation":[{"name":"Department of Chemistry, University of Florida , Gainesville, FL 32611, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lana","family":"Saleh","sequence":"additional","affiliation":[{"name":"New England Biolabs , Ipswich, MA 01938, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4223-9947","authenticated-orcid":false,"given":"Gloria","family":"Sheynkman","sequence":"additional","affiliation":[{"name":"Department of Molecular Physiology and Biological Physics, University of Virginia , Charlottesville, VA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Francoise","family":"Thibaud-Nissen","sequence":"additional","affiliation":[{"name":"National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH) , 8600 Rockville Pike, Bethesda, MD 20817, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9074-3507","authenticated-orcid":false,"given":"Paul D","family":"Thomas","sequence":"additional","affiliation":[{"name":"Department of Population and Public Health Sciences, University of Southern California , Los Angeles, CA 90033, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Peter","family":"Uetz","sequence":"additional","affiliation":[{"name":"Center for Biological Data Science, Virginia Commonwealth University , Richmond, VA 23284, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6648-0332","authenticated-orcid":false,"given":"David","family":"Vallenet","sequence":"additional","affiliation":[{"name":"LABGeM, G\u00e9nomique M\u00e9tabolique, CEA, Genoscope, Institut Fran\u00e7ois Jacob, Universit\u00e9 d\u2019\u00c9vry, Universit\u00e9 Paris-Saclay, CNRS , Evry 91057, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Erica Watson","family":"Carter","sequence":"additional","affiliation":[{"name":"Department of Plant Pathology, University of Florida Citrus Research and Education Center , 700 Experiment Station Rd., Lake Alfred, FL 33850, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3696-4541","authenticated-orcid":false,"given":"Peter R","family":"Weigele","sequence":"additional","affiliation":[{"name":"New England Biolabs , Ipswich, MA 01938, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6330-7526","authenticated-orcid":false,"given":"Valerie","family":"Wood","sequence":"additional","affiliation":[{"name":"Department of Biochemistry, University of Cambridge , Cambridge CB2 1GA, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Elisha M","family":"Wood-Charlson","sequence":"additional","affiliation":[{"name":"Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory , Berkeley, CA 94720, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jin","family":"Xu","sequence":"additional","affiliation":[{"name":"Department of Plant Pathology, University of Florida Citrus Research and Education Center , 700 Experiment Station Rd., Lake Alfred, FL 33850, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2022,8,12]]},"reference":[{"key":"2022081221100595600_R1","doi-asserted-by":"publisher","DOI":"10.1155\/2014\/428570","article-title":"Systems biology in the context of big data and networks","volume":"2014","author":"Altaf-Ul-Amin","year":"2014","journal-title":"Biomed. Res. Int."},{"key":"2022081221100595600_R2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pbio.1002195","article-title":"Big data: astronomical or genomical?","volume":"13","author":"Stephens","year":"2015","journal-title":"PLoS Biol."},{"key":"2022081221100595600_R3","doi-asserted-by":"publisher","first-page":"1071","DOI":"10.1093\/bib\/bbx113","article-title":"MicroScope-an integrated resource for community expertise of gene functions and comparative analysis of microbial genomic and metabolic data","volume":"20","author":"M\u00e9digue","year":"2019","journal-title":"Brief. Bioinformat."},{"key":"2022081221100595600_R4","doi-asserted-by":"publisher","DOI":"10.7554\/eLife.67667","article-title":"Unifying the known and unknown microbial coding sequence space","volume":"11","author":"Vanni","year":"2022","journal-title":"Elife"},{"key":"2022081221100595600_R5","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1016\/j.csbj.2019.11.002","article-title":"Long walk to genomics: history and current approaches to genome sequencing and assembly","volume":"18","author":"Giani","year":"2020","journal-title":"Comput. Struct. Biotech. J."},{"key":"2022081221100595600_R6","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1038\/470163a","article-title":"Too many roads not taken","volume":"470","author":"Edwards","year":"2011","journal-title":"Nature"},{"key":"2022081221100595600_R7","doi-asserted-by":"publisher","DOI":"10.1098\/rsob.180241","article-title":"Hidden in plain sight: what remains to be discovered in the eukaryotic proteome?","volume":"9","author":"Wood","year":"2019","journal-title":"Open Biol."},{"key":"2022081221100595600_R8","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1093\/bib\/bbl004","article-title":"Automated protein function prediction\u2014the genomic challenge","volume":"7","author":"Friedberg","year":"2006","journal-title":"Brief Bioinformat."},{"key":"2022081221100595600_R9","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1128\/microbe.11.303.1","article-title":"Quality annotations, a key frontier in the microbial sciences","volume":"11","author":"de Cr\u00e9cy-lagard","year":"2016","journal-title":"Microbe Magazine"},{"key":"2022081221100595600_R10","doi-asserted-by":"publisher","first-page":"2446","DOI":"10.1093\/nar\/gkz030","article-title":"The y-ome defines the 35% of Escherichia coli genes that lack experimental evidence of function","volume":"47","author":"Ghatak","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R11","doi-asserted-by":"publisher","DOI":"10.7554\/eLife.36842","article-title":"Essential metabolism for a minimal cell","volume":"8","author":"Breuer","year":"2019","journal-title":"Elife"},{"key":"2022081221100595600_R12","doi-asserted-by":"publisher","DOI":"10.1099\/mgen.0.000341","article-title":"An assessment of genome annotation coverage across the bacterial tree of life","volume":"6","author":"Lobb","year":"2020","journal-title":"Microb. Genom."},{"key":"2022081221100595600_R13","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1042\/BST20180560","article-title":"Towards functional characterization of archaeal genomic dark matter. Towards functional characterization of archaeal genomic dark matter","volume":"47","author":"Makarova","year":"2019","journal-title":"Biochem. Soc. Trans."},{"key":"2022081221100595600_R14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1042\/BJ20091328","article-title":"\u201cUnknown\u201d proteins and \u201corphan\u201d enzymes: the missing half of the engineering parts list\u2014and how to find it","volume":"425","author":"Hanson","year":"2009","journal-title":"Biochem. J."},{"key":"2022081221100595600_R15","doi-asserted-by":"publisher","first-page":"437","DOI":"10.1093\/bib\/bbw135","article-title":"Plant genome and transcriptome annotations: from misconceptions to simple solutions","volume":"19","author":"Bolger","year":"2018","journal-title":"Brief. Bioinformat."},{"key":"2022081221100595600_R16","article-title":"This report is based on a colloquium sponsored by the American Academy of Microbiology held July 19-20, 2004, in Washington, DC","author":"An Experimental Approach to Genome Annotation","year":"2004"},{"key":"2022081221100595600_R17","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1000605","article-title":"Annotation error in public databases: misannotation of molecular function in enzyme superfamilies","volume":"5","author":"Schnoes","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2022081221100595600_R18","doi-asserted-by":"publisher","DOI":"10.1093\/database\/bat071","article-title":"Ureidoglycolate hydrolase, amidohydrolase, lyase: how errors in biological databases are incorporated in scientific papers and vice versa","volume":"2013","author":"Percudani","year":"2013","journal-title":"Database (Oxford)"},{"key":"2022081221100595600_R19","doi-asserted-by":"crossref","DOI":"10.1098\/rsob.180241","article-title":"Hidden in plain sight: what remains to be discovered in the eukaryotic proteome?","volume":"9","author":"Wood","year":"2019","journal-title":"Open Biol."},{"key":"2022081221100595600_R20","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1016\/j.csbj.2014.05.008","article-title":"Variations in metabolic pathways create challenges for automated metabolic reconstructions: examples from the tetrahydrofolate synthesis pathway","volume":"10","author":"de Cr\u00e9cy-lagard","year":"2014","journal-title":"Comput. Struct. Biotechnol. J."},{"key":"2022081221100595600_R21","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0088889","article-title":"Functionally enigmatic genes: a case study of the brain ignorome","volume":"9","author":"Pandey","year":"2014","journal-title":"PLoS One"},{"key":"2022081221100595600_R22","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pbio.2006643","article-title":"Large-scale investigation of the reasons why potentially important genes are ignored","volume":"16","author":"Stoeger","year":"2018","journal-title":"PLoS Biol."},{"key":"2022081221100595600_R23","doi-asserted-by":"publisher","first-page":"D480","DOI":"10.1093\/nar\/gkaa1100","article-title":"UniProt: the universal protein knowledgebase in 2021","volume":"49","author":"Consortium","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R24","doi-asserted-by":"publisher","first-page":"1429","DOI":"10.1038\/s41588-019-0500-1","article-title":"Gene Ontology Causal Activity Modeling (GO-CAM) moves beyond GO annotations to structured descriptions of biological functions and systems","volume":"51","author":"Thomas","year":"2019","journal-title":"Nat. Genet."},{"key":"2022081221100595600_R25","doi-asserted-by":"publisher","first-page":"1896","DOI":"10.1093\/bioinformatics\/btz817","article-title":"Enzyme annotation in UniProtKB using Rhea","volume":"36","author":"Morgat","year":"2020","journal-title":"Bioinformatics"},{"key":"2022081221100595600_R26","doi-asserted-by":"publisher","first-page":"D445","DOI":"10.1093\/nar\/gkz862","article-title":"The MetaCyc database of metabolic pathways and enzymes - a 2019 update","volume":"48","author":"Caspi","year":"2020","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R27","doi-asserted-by":"publisher","first-page":"D545","DOI":"10.1093\/nar\/gkaa970","article-title":"KEGG: integrating viruses and cellular organisms","volume":"49","author":"Kanehisa","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R28","doi-asserted-by":"publisher","first-page":"D498","DOI":"10.1093\/nar\/gkz1031","article-title":"The Reactome pathway knowledgebase","volume":"48","author":"Jassal","year":"2020","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R29","doi-asserted-by":"publisher","first-page":"D656","DOI":"10.1093\/nar\/gkx1065","article-title":"SABIO-RK: an updated resource for manually curated biochemical reaction kinetics","volume":"46","author":"Wittig","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R30","doi-asserted-by":"publisher","first-page":"D498","DOI":"10.1093\/nar\/gkaa1025","article-title":"BRENDA, the ELIXIR core data resource in 2021: new developments and updates","volume":"49","author":"Chang","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R31","doi-asserted-by":"publisher","first-page":"D325","DOI":"10.1093\/nar\/gkaa1113","article-title":"The Gene Ontology resource: enriching a GOld mine","volume":"49","author":"Consortium","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R32","doi-asserted-by":"publisher","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2022081221100595600_R33","doi-asserted-by":"publisher","first-page":"18820","DOI":"10.1021\/jacs.1c09820","article-title":"The open reaction database","volume":"143","author":"Kearnes","year":"2021","journal-title":"J. Am. Chem. Soc."},{"key":"2022081221100595600_R34","doi-asserted-by":"publisher","first-page":"W352","DOI":"10.1093\/nar\/gkab326","article-title":"LitSuggest: a web-based system for literature recommendation and curation using machine learning","volume":"49","author":"Allot","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R35","doi-asserted-by":"publisher","first-page":"W587","DOI":"10.1093\/nar\/gkz389","article-title":"PubTator central: automated concept annotation for biomedical full text articles","volume":"47","author":"Wei","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R36","doi-asserted-by":"publisher","first-page":"3454","DOI":"10.1093\/bioinformatics\/btx439","article-title":"On expert curation and scalability: UniProtKB\/Swiss-Prot as a case study","volume":"33","author":"Poux","year":"2017","journal-title":"Bioinformatics"},{"key":"2022081221100595600_R37","doi-asserted-by":"publisher","first-page":"D693","DOI":"10.1093\/nar\/gkab1016","article-title":"Rhea, the reaction knowledgebase in 2022","volume":"50","author":"Bansal","year":"2022","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R38","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1006390","article-title":"Scaling up data curation using deep learning: an application to literature triage in genomic variation resources","volume":"14","author":"Lee","year":"2018","journal-title":"PLoS Comput. Biol."},{"key":"2022081221100595600_R39","doi-asserted-by":"publisher","DOI":"10.1093\/genetics\/iyab222","article-title":"Fission stories: using PomBase to understand Schizosaccharomyces pombe biology","volume":"220","author":"Harris","year":"2021","journal-title":"Genetics"},{"key":"2022081221100595600_R40","doi-asserted-by":"publisher","DOI":"10.1093\/database\/baaa028","article-title":"Community curation in PomBase: enabling fission yeast experts to provide detailed, standardized, sharable annotation from research publications","volume":"2020","author":"Lock","year":"2020","journal-title":"Database (Oxford)"},{"key":"2022081221100595600_R41","doi-asserted-by":"publisher","first-page":"1791","DOI":"10.1093\/bioinformatics\/btu103","article-title":"Canto: an online tool for community literature curation","volume":"30","author":"Rutherford","year":"2014","journal-title":"Bioinformatics"},{"key":"2022081221100595600_R42","doi-asserted-by":"publisher","first-page":"932","DOI":"10.1038\/s41587-021-01179-w","article-title":"Using deep learning to annotate the protein universe","volume":"40","author":"Bileschi","year":"2022","journal-title":"Nat. Biotech."},{"key":"2022081221100595600_R43","doi-asserted-by":"publisher","first-page":"4239","DOI":"10.1021\/acs.biochem.8b00705","article-title":"The need for manuscripts to include database identifiers for proteins","volume":"57","author":"Gerlt","year":"2018","journal-title":"Biochemistry"},{"key":"2022081221100595600_R44","doi-asserted-by":"publisher","DOI":"10.1186\/s13321-021-00520-4","article-title":"FAIR chemical structures in the Journal of Cheminformatics","volume":"13","author":"Schymanski","year":"2021","journal-title":"J. Cheminform."},{"key":"2022081221100595600_R45","doi-asserted-by":"publisher","DOI":"10.1186\/s13321-021-00521-3","article-title":"Reply to \u201cFAIR chemical structure in the Journal of Cheminformatics\u201d","volume":"13","author":"Guha","year":"2021","journal-title":"J. Cheminform."},{"key":"2022081221100595600_R46","doi-asserted-by":"publisher","first-page":"8648","DOI":"10.1039\/D1SC02362D","article-title":"Predicting enzymatic reactions with a molecular transformer","volume":"12","author":"Kreutter","year":"2021","journal-title":"Chem. Sci."},{"key":"2022081221100595600_R47","doi-asserted-by":"publisher","first-page":"3316","DOI":"10.1039\/C9SC05704H","article-title":"Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy","volume":"11","author":"Schwaller","year":"2020","journal-title":"Chem. Sci."},{"key":"2022081221100595600_R48","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1009463","article-title":"Crowdsourcing biocuration: the community assessment of community annotation with ontologies (CACAO)","volume":"17","author":"Ramsey","year":"2021","journal-title":"PLoS Comp. Biol."},{"key":"2022081221100595600_R49","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pbio.3001464","article-title":"A crowdsourcing open platform for literature curation in UniProt","volume":"19","author":"Wang","year":"2021","journal-title":"PLoS Biol."},{"key":"2022081221100595600_R50","doi-asserted-by":"crossref","first-page":"D480","DOI":"10.1093\/nar\/gkaa1100","article-title":"UniProt: the universal protein knowledgebase in 2021","volume":"49","author":"Consortium","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R51","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1016\/j.sbi.2017.07.003","article-title":"Evolution of protein specificity: insights from ancestral protein reconstruction","volume":"47","author":"Siddiq","year":"2017","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2022081221100595600_R52","doi-asserted-by":"publisher","first-page":"449","DOI":"10.1093\/bib\/bbr042","article-title":"Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium","volume":"12","author":"Gaudet","year":"2011","journal-title":"Brief. Bioinformat."},{"key":"2022081221100595600_R53","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1016\/j.copbio.2020.12.004","article-title":"Discovery of new enzymatic functions and metabolic pathways using genomic enzymology web tools","volume":"69","author":"Zallot","year":"2021","journal-title":"Curr. Opin. Biotech."},{"key":"2022081221100595600_R54","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1021\/acsbiomedchemau.1c00048","article-title":"RadicalSAM.org: a resource to interpret sequence-function space and discover new radical SAM enzyme chemistry","volume":"2","author":"Oberg","year":"2022","journal-title":"ACS Bio. Med. Chem. Au."},{"key":"2022081221100595600_R55","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-019-2988-x","article-title":"FunFam protein families improve residue level molecular function prediction","volume":"20","author":"Scheibenreif","year":"2019","journal-title":"BMC Bioinform."},{"key":"2022081221100595600_R56","doi-asserted-by":"publisher","first-page":"D266","DOI":"10.1093\/nar\/gkaa1079","article-title":"CATH: increased structural coverage of functional space","volume":"49","author":"Sillitoe","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R57","doi-asserted-by":"publisher","first-page":"3449","DOI":"10.1093\/bioinformatics\/btab371","article-title":"Clustering FunFams using sequence embeddings improves EC purity","volume":"37","author":"Littmann","year":"2021","journal-title":"Bioinformatics"},{"key":"2022081221100595600_R58","doi-asserted-by":"publisher","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2022081221100595600_R59","doi-asserted-by":"publisher","first-page":"449","DOI":"10.1093\/bib\/bbr042","article-title":"Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium","volume":"12","author":"Gaudet","year":"2011","journal-title":"Brief. Bioinformat."},{"key":"2022081221100595600_R60","doi-asserted-by":"publisher","first-page":"D266","DOI":"10.1093\/nar\/gkaa1079","article-title":"CATH: increased structural coverage of functional space","volume":"49","author":"Sillitoe","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R61","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2022081221100595600_R62","doi-asserted-by":"publisher","DOI":"10.3389\/fphys.2022.815874","article-title":"Missing links between gene function and physiology in genomics","volume":"13","author":"Collado-Vides","year":"2022","journal-title":"Front Physiol."},{"key":"2022081221100595600_R63","doi-asserted-by":"crossref","first-page":"4643","DOI":"10.1093\/bioinformatics\/btaa485","article-title":"UniRule: a unified rule resource for automatic annotation in the UniProt Knowledgebase","volume":"36","author":"MacDougall","year":"2020","journal-title":"Bioinformatics"},{"key":"2022081221100595600_R64","doi-asserted-by":"publisher","DOI":"10.1093\/database\/baw110","article-title":"How much does curation cost?","volume":"2016","author":"Karp","year":"2016","journal-title":"Database (Oxford)"},{"key":"2022081221100595600_R65","doi-asserted-by":"publisher","DOI":"10.1093\/database\/baaa006","article-title":"Text mining meets community curation: a newly designed curation platform to improve author experience and participation at WormBase","volume":"2020","author":"Arnaboldi","year":"2020","journal-title":"Database"},{"key":"2022081221100595600_R66","doi-asserted-by":"publisher","DOI":"10.1093\/database\/bas024","article-title":"Directly e-mailing authors of newly published papers encourages community curation","volume":"2012","author":"Bunt","year":"2012","journal-title":"Database"},{"key":"2022081221100595600_R67","article-title":"Multiple routes of functional diversification of the plant BAHD acyltransferase family revealed by comparative biochemical and genomic analyses","author":"Kruse","year":"2021","journal-title":"bioRxiv"},{"key":"2022081221100595600_R68","doi-asserted-by":"publisher","first-page":"D1020","DOI":"10.1093\/nar\/gkaa1105","article-title":"RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation","volume":"49","author":"Li","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R69","doi-asserted-by":"publisher","first-page":"1085","DOI":"10.1093\/bib\/bbx085","article-title":"The BioCyc collection of microbial genomes and metabolic pathways","volume":"20","author":"Karp","year":"2019","journal-title":"Brief. Bioinformat."},{"key":"2022081221100595600_R70","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-020-80786-0","article-title":"Embeddings from deep learning transfer GO annotations beyond homology","volume":"11","author":"Littmann","year":"2021","journal-title":"Sci Rep"},{"key":"2022081221100595600_R71","doi-asserted-by":"publisher","first-page":"W535","DOI":"10.1093\/nar\/gkab354","article-title":"PredictProtein - predicting protein structure and function for 29 years","volume":"49","author":"Bernhofer","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R72","doi-asserted-by":"publisher","DOI":"10.1002\/cpz1.113","article-title":"Learned embeddings from deep learning to visualize and predict protein sets","volume":"1","author":"Dallago","year":"2021","journal-title":"Curr. Protoc."},{"key":"2022081221100595600_R73","article-title":"Multiple routes of functional diversification of the plant BAHD acyltransferase family revealed by comparative biochemical and genomic analyses","author":"Kruse","year":"2021","journal-title":"bioRxiv"},{"key":"2022081221100595600_R74","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-5-76","article-title":"A Bayesian method for identifying missing enzymes in predicted metabolic pathway databases","volume":"5","author":"Green","year":"2004","journal-title":"BMC Bioinform."},{"key":"2022081221100595600_R75","article-title":"ModelSEED 2: high-throughput genome-scale metabolic model reconstruction with enhanced energy biosynthesis pathway prediction","author":"Henry","year":"2022"},{"key":"2022081221100595600_R76","doi-asserted-by":"publisher","DOI":"10.15252\/msb.20177651","article-title":"From word models to executable models of signaling networks using automated assembly","volume":"13","author":"Gyori","year":"2017","journal-title":"Mol. Syst. Biol."},{"key":"2022081221100595600_R77","doi-asserted-by":"publisher","first-page":"935","DOI":"10.1038\/nbt.1666","article-title":"The BioPAX community standard for pathway data sharing","volume":"28","author":"Demir","year":"2010","journal-title":"Nat. Biotech."},{"key":"2022081221100595600_R78","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-11-530","article-title":"Formalization of taxon-based constraints to detect inconsistencies in annotation and ontology development","volume":"11","author":"Deegan N\u00e9e Clark","year":"2010","journal-title":"BMC Bioinfo."},{"key":"2022081221100595600_R79","doi-asserted-by":"publisher","first-page":"D330","DOI":"10.1093\/nar\/gky1055","article-title":"The Gene Ontology Resource: 20 years and still GOing strong","volume":"47","author":"Carbon","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R80","doi-asserted-by":"publisher","DOI":"10.1098\/rsob.200149","article-title":"Term matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns","volume":"10","author":"Wood","year":"2020","journal-title":"Open Biol."},{"key":"2022081221100595600_R81","doi-asserted-by":"publisher","DOI":"10.1186\/1752-0509-4-178","article-title":"Improving the iMM904 S. cerevisiae metabolic model using essentiality and synthetic lethality data","volume":"4","author":"Zomorrodi","year":"2010","journal-title":"BMC Systs. Biol."},{"key":"2022081221100595600_R82","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1000308","article-title":"GrowMatch: an automated method for reconciling in silico\/in vivo growth predictions","volume":"5","author":"Kumar","year":"2009","journal-title":"PLoS Comp. Biol."},{"key":"2022081221100595600_R83","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1009060","article-title":"A gap-filling algorithm for prediction of metabolic interactions in microbial communities","volume":"17","author":"Giannari","year":"2021","journal-title":"PLoS Comp. Biol."},{"key":"2022081221100595600_R84","doi-asserted-by":"publisher","DOI":"10.1128\/mbio.01630-22","article-title":"Metabolite damage and damage-control in a minimal genome","author":"Haas","year":"2022","journal-title":"mBio"},{"key":"2022081221100595600_R85","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbab454","article-title":"A roadmap for multi-omics data integration using deep learning. A roadmap for multi-omics data integration using deep learning","volume":"23","author":"Kang","year":"2022","journal-title":"Brief Bioinfo."},{"key":"2022081221100595600_R86","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-021-23774-w","article-title":"MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification","volume":"12","author":"Wang","year":"2021","journal-title":"Nat. Commun."},{"key":"2022081221100595600_R87","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR Guiding Principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Sci. Data"},{"key":"2022081221100595600_R88","doi-asserted-by":"publisher","first-page":"558","DOI":"10.1038\/d41586-022-00402-1","article-title":"NIH issues a seismic mandate: share data publicly","volume":"602","author":"Kozlov","year":"2022","journal-title":"Nature"},{"key":"2022081221100595600_R89","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1002\/pro.4213","article-title":"RCSB Protein Data Bank: celebrating 50 years of the PDB with new tools for understanding and visualizing biological macromolecules in 3D","volume":"31","author":"Burley","year":"2022","journal-title":"Protein Sci."},{"key":"2022081221100595600_R90","doi-asserted-by":"publisher","DOI":"10.1016\/j.jmb.2022.167599","article-title":"PDBx\/mmCIF ecosystem: foundational semantic tools for structural biology","volume":"434","author":"Westbrook","year":"2022","journal-title":"J. Mol. Biol."},{"key":"2022081221100595600_R91","doi-asserted-by":"publisher","DOI":"10.1016\/j.jmb.2020.11.003","article-title":"RCSB Protein Data Bank: architectural advances towards integrated searching and efficient access to macromolecular structure data from the PDB archive","volume":"433","author":"Rose","year":"2021","journal-title":"J. Mol. Biol."},{"key":"2022081221100595600_R92","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbc.2021.100559","article-title":"Impact of structural biologists and the Protein Data Bank on small-molecule drug discovery and development","volume":"296","author":"Burley","year":"2021","journal-title":"J. Biol. Chem."},{"key":"2022081221100595600_R93","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1002\/pro.4200","article-title":"PDB-101: educational resources supporting molecular explorations through biology and medicine","volume":"31","author":"Zardecki","year":"2022","journal-title":"Protein Sci."},{"key":"2022081221100595600_R94","doi-asserted-by":"publisher","DOI":"10.1093\/bioadv\/vbac034","article-title":"Gilda: biomedical entity text normalization with machine-learned disambiguation as a service","volume":"2","author":"Gyori","year":"2022","journal-title":"Bioinformatics Advances"},{"key":"2022081221100595600_R95","doi-asserted-by":"publisher","DOI":"10.15252\/msb.20177651","article-title":"From word models to executable models of signaling networks using automated assembly","volume":"13","author":"Gyori","year":"2017","journal-title":"Mol. Syst. Biol."},{"key":"2022081221100595600_R96","doi-asserted-by":"publisher","first-page":"D529","DOI":"10.1093\/nar\/gkaa853","article-title":"The Dark Kinase Knowledgebase: an online compendium of knowledge and experimental results of understudied kinases","volume":"49","author":"Berginski","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R97","doi-asserted-by":"publisher","DOI":"10.1101\/2020.04.02.022277","article-title":"A resource for exploring the understudied human kinome for research and therapeutic opportunities","author":"Moret","year":"2021","journal-title":"bioRxiv"},{"key":"2022081221100595600_R98","doi-asserted-by":"publisher","DOI":"10.7554\/eLife.72879","article-title":"Integrating multi-omics data reveals function and therapeutic potential of deubiquitinating enzymes","volume":"11","author":"Doherty","year":"2022","journal-title":"eLife"},{"key":"2022081221100595600_R99","doi-asserted-by":"publisher","DOI":"10.7554\/eLife.68292","article-title":"Author-sourced capture of pathway knowledge in computable form using Biofactoid","volume":"10","author":"Wong","year":"2021","journal-title":"Elife"},{"key":"2022081221100595600_R100","doi-asserted-by":"publisher","DOI":"10.1186\/s13321-015-0068-4","article-title":"InChI, the IUPAC international chemical identifier","volume":"7","author":"Heller","year":"2015","journal-title":"J. Cheminform."},{"key":"2022081221100595600_R101","doi-asserted-by":"publisher","first-page":"12523","DOI":"10.1093\/nar\/gkaa1125","article-title":"On the lifetime of bioinformatics web services","volume":"48","author":"Kern","year":"2020","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R102","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1016\/j.tibtech.2011.01.001","article-title":"Mining high-throughput experimental data to link gene and function","volume":"29","author":"Blaby-Haas","year":"2011","journal-title":"Trends Biotech."},{"key":"2022081221100595600_R103","doi-asserted-by":"publisher","first-page":"605","DOI":"10.1146\/annurev-arplant-050718-095841","article-title":"Comparative and functional algal genomics. comparative and functional algal genomics","volume":"70","author":"Blaby-Haas","year":"2019","journal-title":"Ann. Rev. Plant Biol."},{"key":"2022081221100595600_R104","doi-asserted-by":"publisher","first-page":"D112","DOI":"10.1093\/nar\/gkaa810","article-title":"iModulonDB: a knowledgebase of microbial transcriptional regulation derived from machine learning","volume":"49","author":"Rychel","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R105","doi-asserted-by":"publisher","DOI":"10.1038\/s42003-021-02516-0","article-title":"Identification of a transcription factor, PunR, that regulates the purine and purine nucleoside transporter punC in E. coli","volume":"4","author":"Rodionova","year":"2021","journal-title":"Commun. Biol."},{"key":"2022081221100595600_R106","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.gene.2012.07.083","article-title":"Function of alternative splicing","volume":"514","author":"Kelemen","year":"2013","journal-title":"Gene"},{"key":"2022081221100595600_R107","doi-asserted-by":"publisher","first-page":"D916","DOI":"10.1093\/nar\/gkaa1087","article-title":"GENCODE 2021","volume":"49","author":"Frankish","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022081221100595600_R108","doi-asserted-by":"publisher","first-page":"805","DOI":"10.1016\/j.cell.2016.01.029","article-title":"Widespread expansion of protein interaction capabilities by alternative splicing","volume":"164","author":"Yang","year":"2016","journal-title":"Cell"},{"key":"2022081221100595600_R109","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-020-16174-z","article-title":"ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms","volume":"11","author":"Sheynkman","year":"2020","journal-title":"Nat. Commun."},{"key":"2022081221100595600_R110","doi-asserted-by":"publisher","DOI":"10.1038\/s41592-022-01472-9","article-title":"Enhanced protein isoform characterization","volume":"19","author":"Singh","year":"2022","journal-title":"Nat. Meth."},{"key":"2022081221100595600_R111","doi-asserted-by":"publisher","article-title":"Systematic assessment of long-read RNA-seq methods for transcript identification and quantification","author":"Pardo-Palacios","DOI":"10.21203\/rs.3.rs-777702\/v1"},{"key":"2022081221100595600_R112","doi-asserted-by":"publisher","DOI":"10.1186\/s13059-022-02624-y","article-title":"Enhanced protein isoform characterization through long-read proteogenomics","volume":"23","author":"Miller","year":"2022","journal-title":"Genome Biol."},{"key":"2022081221100595600_R113","doi-asserted-by":"publisher","first-page":"186","DOI":"10.1038\/nmeth.2369","article-title":"Proteoform: a single term describing protein complexity","volume":"10","author":"Smith","year":"2013","journal-title":"Nat. Methods"},{"key":"2022081221100595600_R114","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1038\/nature10575","article-title":"Mapping intact protein isoforms in discovery mode using top-down proteomics","volume":"480","author":"Tran","year":"2011","journal-title":"Nature"},{"key":"2022081221100595600_R115","doi-asserted-by":"publisher","DOI":"10.1126\/sciadv.abk0734","article-title":"Defining the human proteome","volume":"7","author":"Smith","year":"2021","journal-title":"Sci. Adv."},{"key":"2022081221100595600_R116","doi-asserted-by":"publisher","first-page":"623","DOI":"10.1038\/35001009","article-title":"A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae","volume":"403","author":"Uetz","year":"2000","journal-title":"Nature"},{"key":"2022081221100595600_R117","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pgen.1002815","article-title":"RsfA (YbeB) proteins are conserved ribosomal silencing factors","volume":"8","author":"H\u00e4user","year":"2012","journal-title":"PLoS Genet."},{"key":"2022081221100595600_R118","doi-asserted-by":"publisher","first-page":"e00744","DOI":"10.1128\/mBio.00744-13","article-title":"Protein domains of unknown function are essential in bacteria","volume":"5","author":"Goodacre","year":"2014","journal-title":"mBio"},{"key":"2022081221100595600_R119","doi-asserted-by":"publisher","DOI":"10.3390\/proteomes9020016","article-title":"The protein interactome of glycolysis in Escherichia coli","volume":"9","author":"Chowdhury","year":"2021","journal-title":"Proteomes"},{"key":"2022081221100595600_R120","doi-asserted-by":"publisher","first-page":"503","DOI":"10.1038\/s41586-018-0124-0","article-title":"Mutant phenotypes for thousands of bacterial genes of unknown function","volume":"557","author":"Price","year":"2018","journal-title":"Nature"},{"key":"2022081221100595600_R121","doi-asserted-by":"publisher","DOI":"10.3390\/biom11081245","article-title":"Biomolecule and bioentity interaction databases in systems biology: a comprehensive review","volume":"11","author":"Baltoumas","year":"2021","journal-title":"Biomolecules"},{"key":"2022081221100595600_R122","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-020-19942-z","article-title":"Towards a unified open access dataset of molecular interactions","volume":"11","author":"Porras","year":"2020","journal-title":"Nat. Commun."},{"key":"2022081221100595600_R123","doi-asserted-by":"publisher","DOI":"10.1093\/database\/baaa112","article-title":"CEG 2.0: an updated database of clusters of essential genes including eukaryotic organisms","volume":"2020","author":"Liu","year":"2020","journal-title":"Database"},{"key":"2022081221100595600_R124","doi-asserted-by":"publisher","DOI":"10.3389\/fmicb.2017.02331","article-title":"A comprehensive overview of online resources to identify and predict bacterial essential genes","volume":"8","author":"Peng","year":"2017","journal-title":"Front Microbiol"},{"key":"2022081221100595600_R125","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pbio.1001638","article-title":"The COMBREX project: design, methodology, and initial results","volume":"11","author":"Anton","year":"2013","journal-title":"PLoS Biol."},{"key":"2022081221100595600_R126","article-title":"EMBL-EBI Impact Report 2021","author":"Charles Beagrie","year":"2021"},{"key":"2022081221100595600_R127","doi-asserted-by":"publisher","first-page":"3460","DOI":"10.1093\/bioinformatics\/btv398","article-title":"Functional classification of CATH superfamilies: a domain-based approach for protein function annotation","volume":"31","author":"Das","year":"2015","journal-title":"Bioinformatics"}],"container-title":["Database"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/database\/article-pdf\/doi\/10.1093\/database\/baac062\/45407561\/baac062.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/database\/article-pdf\/doi\/10.1093\/database\/baac062\/45407561\/baac062.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,25]],"date-time":"2023-11-25T19:01:30Z","timestamp":1700938890000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/database\/article\/doi\/10.1093\/database\/baac062\/6663924"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,1]]},"references-count":127,"URL":"https:\/\/doi.org\/10.1093\/database\/baac062","relation":{},"ISSN":["1758-0463"],"issn-type":[{"value":"1758-0463","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,1,1]]},"published":{"date-parts":[[2022,1,1]]},"article-number":"baac062"}}