{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T03:56:50Z","timestamp":1775620610338,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1009930","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,4,6]],"date-time":"2022-04-06T00:00:00Z","timestamp":1649203200000}}],"reference-count":36,"publisher":"Public Library of Science (PLoS)","issue":"3","license":[{"start":{"date-parts":[[2022,3,25]],"date-time":"2022-03-25T00:00:00Z","timestamp":1648166400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100020840","name":"Australian Research Data Commons","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100020840","id-type":"DOI","asserted-by":"crossref"}]},{"name":"National Health and Medical Research Council of Australia"}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Protein structure fundamentally underpins the function and processes of numerous biological systems. Fold recognition algorithms offer a sensitive and robust tool to detect structural, and thereby functional, similarities between distantly related homologs. In the era of accurate structure prediction owing to advances in machine learning techniques and a wealth of experimentally determined structures, previously curated sequence databases have become a rich source of biological information. Here, we use bioinformatic fold recognition algorithms to scan the entire AlphaFold structure database to identify novel protein family members, infer function and group predicted protein structures. As an example of the utility of this approach, we identify novel, previously unknown members of various pore-forming protein families, including MACPFs, GSDMs and aerolysin-like proteins.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1009930","type":"journal-article","created":{"date-parts":[[2022,3,25]],"date-time":"2022-03-25T13:45:31Z","timestamp":1648215931000},"page":"e1009930","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":13,"title":["Mining folded proteomes in the era of accurate structure prediction"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7573-7715","authenticated-orcid":true,"given":"Charles","family":"Bayly-Jones","sequence":"first","affiliation":[]},{"given":"James C.","family":"Whisstock","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,3,25]]},"reference":[{"issue":"10","key":"pcbi.1009930.ref001","doi-asserted-by":"crossref","first-page":"1190","DOI":"10.1107\/S1744309110007177","article-title":"Structural classification of proteins and structural genomics: New insights into protein folding and evolution","volume":"66","author":"A Andreeva","year":"2010","journal-title":"Acta Crystallogr Sect F Struct Biol Cryst Commun"},{"issue":"D1","key":"pcbi.1009930.ref002","doi-asserted-by":"crossref","first-page":"D376","DOI":"10.1093\/nar\/gkz1064","article-title":"The SCOP database in 2020: Expanded classification of representative family and superfamily domains of known protein structures","volume":"48","author":"A Andreeva","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009930.ref003","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1093\/nar\/gkq366","article-title":"Dali server: conservation mapping in 3D","volume":"38","author":"L. Holm","year":"2010","journal-title":"Nucleic Acids Res"},{"issue":"5844","key":"pcbi.1009930.ref004","doi-asserted-by":"crossref","first-page":"1548","DOI":"10.1126\/science.1144706","article-title":"A common fold mediates vertebrate defense and bacterial attack","volume":"317","author":"CJ Rosado","year":"2007","journal-title":"Science (80-)"},{"key":"pcbi.1009930.ref005","article-title":"Highly accurate protein structure prediction with AlphaFold","author":"J Jumper","year":"2021","journal-title":"Nature"},{"key":"pcbi.1009930.ref006","first-page":"eabj8754","article-title":"Accurate prediction of protein structures and interactions using a three-track neural network","author":"M Baek","year":"2021","journal-title":"Science (80-)"},{"key":"pcbi.1009930.ref007","article-title":"Highly accurate protein structure prediction for the human proteome","author":"K Tunyasuvunakool","year":"2021","journal-title":"Nature"},{"issue":"7869","key":"pcbi.1009930.ref008","doi-asserted-by":"crossref","first-page":"625","DOI":"10.1038\/d41586-021-02037-0","article-title":"Artificial intelligence in structural biology is here to stay","volume":"595","year":"2021","journal-title":"Nature"},{"key":"pcbi.1009930.ref009","article-title":"The structural coverage of the human proteome before and after AlphaFold","author":"E Porta-Pardo","year":"2021","journal-title":"bioRxiv"},{"key":"pcbi.1009930.ref010","article-title":"ColabFold\u2014Making protein folding accessible to all","author":"M Mirdita","year":"2021","journal-title":"bioRxiv"},{"issue":"1","key":"pcbi.1009930.ref011","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1186\/1471-2105-11-446","article-title":"Fast and accurate protein substructure searching with simulated annealing and GPUs","volume":"11","author":"AD Stivala","year":"2010","journal-title":"BMC Bioinformatics"},{"issue":"D1","key":"pcbi.1009930.ref012","doi-asserted-by":"crossref","first-page":"D412","DOI":"10.1093\/nar\/gkaa913","article-title":"Pfam: The protein families database in 2021","volume":"49","author":"J Mistry","year":"2021","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"pcbi.1009930.ref013","doi-asserted-by":"crossref","first-page":"711","DOI":"10.1038\/s41467-020-14301-4","article-title":"ProtCID: a data resource for structural information on protein interactions","volume":"11","author":"Q Xu","year":"2020","journal-title":"Nat Commun"},{"key":"pcbi.1009930.ref014","article-title":"Tissue-based map of the human proteome","author":"M Uhl\u00e9n","year":"2015","journal-title":"Science (80-)"},{"key":"pcbi.1009930.ref015","article-title":"Prioritization of cancer therapeutic targets using CRISPR\u2013Cas9 screens","author":"FM Behan","year":"2019","journal-title":"Nature"},{"key":"pcbi.1009930.ref016","article-title":"Genetic and Functional Drivers of Diffuse Large B Cell Lymphoma","author":"A Reddy","year":"2017","journal-title":"Cell"},{"key":"pcbi.1009930.ref017","article-title":"Systematic comparison of CRISPR\/Cas9 and RNAi screens for essential genes","author":"DW Morgens","year":"2016","journal-title":"Nat Biotechnol"},{"issue":"1","key":"pcbi.1009930.ref018","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1002\/pro.3978","article-title":"The BioGRID database: A comprehensive biomedical resource of curated protein, genetic, and chemical interactions","volume":"30","author":"R Oughtred","year":"2021","journal-title":"Protein Sci"},{"issue":"7703","key":"pcbi.1009930.ref019","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1038\/s41586-018-0058-6","article-title":"Cryo-EM structure of the gasdermin A3 membrane pore","volume":"557","author":"J Ruan","year":"2018","journal-title":"Nature"},{"issue":"7610","key":"pcbi.1009930.ref020","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1038\/nature18590","article-title":"Pore-forming activity and structural autoinhibition of the gasdermin family","volume":"535","author":"J Ding","year":"2016","journal-title":"Nature"},{"issue":"5","key":"pcbi.1009930.ref021","article-title":"Structure of astrotactin-2: A conserved vertebrate-specific and perforin-like membrane protein involved in neuronal development","volume":"6","author":"T Ni","year":"2016","journal-title":"Open Biol"},{"issue":"12","key":"pcbi.1009930.ref022","doi-asserted-by":"crossref","first-page":"1415","DOI":"10.1038\/s41587-020-0570-8","article-title":"A dual-constriction biological nanopore resolves homonucleotide sequences with high fidelity","volume":"38","author":"SE Van der Verren","year":"2020","journal-title":"Nat Biotechnol"},{"issue":"7530","key":"pcbi.1009930.ref023","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1038\/nature13768","article-title":"Structural and mechanistic insights into the bacterial amyloid secretion channel CsgG","volume":"516","author":"P Goyal","year":"2014","journal-title":"Nature"},{"key":"pcbi.1009930.ref024","article-title":"Infinite re-reading of single proteins at single-amino-acid resolution using nanopore sequencing","author":"H Brinkerhoff","year":"2021","journal-title":"bioRxiv"},{"issue":"2","key":"pcbi.1009930.ref025","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1038\/s41587-019-0401-y","article-title":"Reading amino acids in a nanopore","volume":"38","author":"S Howorka","year":"2020","journal-title":"Nat Biotechnol"},{"issue":"12","key":"pcbi.1009930.ref026","doi-asserted-by":"crossref","first-page":"12365","DOI":"10.1021\/nn5049987","article-title":"Discrimination among protein variants using an unfoldase-coupled nanopore","volume":"8","author":"J Nivala","year":"2014","journal-title":"ACS Nano"},{"key":"pcbi.1009930.ref027","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1038\/s41587-019-0345-2","article-title":"Electrical recognition of the twenty proteinogenic amino acids using an aerolysin nanopore","volume":"Vol. 38","author":"H Ouldali","year":"2020","journal-title":"Nature Biotechnology"},{"issue":"1","key":"pcbi.1009930.ref028","doi-asserted-by":"crossref","DOI":"10.1038\/s41467-020-14943-4","article-title":"Structure of the G protein chaperone and guanine nucleotide exchange factor Ric-8A bound to G\u03b1i1","volume":"11","author":"LJ McClelland","year":"2020","journal-title":"Nat Commun"},{"issue":"1","key":"pcbi.1009930.ref029","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-016-1198-z","article-title":"Fold-specific sequence scoring improves protein sequence matching","volume":"17","author":"SP Leelananda","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"pcbi.1009930.ref030","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1002\/prot.21688","article-title":"Exploring the extremes of sequence\/structure space with ensemble fold recognition in the program Phyre","volume":"Vol. 70","author":"RM Bennett-Lovsey","year":"2008","journal-title":"Proteins: Structure, Function and Genetics"},{"issue":"3","key":"pcbi.1009930.ref031","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1002\/prot.22458","article-title":"Structure is three to ten times more conserved than sequence\u2014A study of structural response in protein cores","volume":"77","author":"K Illerg\u00e5rd","year":"2009","journal-title":"Proteins Struct Funct Bioinforma"},{"issue":"D1","key":"pcbi.1009930.ref032","doi-asserted-by":"crossref","first-page":"D475","DOI":"10.1093\/nar\/gky1134","article-title":"SCOPe: Classification of large macromolecular structures in the structural classification of proteins\u2014Extended database","volume":"47","author":"JM Chandonia","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"pcbi.1009930.ref033","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1003926","article-title":"ECOD: An Evolutionary Classification of Protein Domains","volume":"10","author":"H Cheng","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"pcbi.1009930.ref034","article-title":"Foldseek: fast and accurate protein structure search","author":"M Van Kempen","year":"2022","journal-title":"bioRxiv"},{"issue":"7","key":"pcbi.1009930.ref035","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1007970","article-title":"Real time structural search of the Protein Data Bank","volume":"16","author":"D Guzenko","year":"2020","journal-title":"PLoS Comput Biol"},{"key":"pcbi.1009930.ref036","article-title":"Single-sequence protein structure prediction using language models from deep learning","author":"R Chowdhury","year":"2021","journal-title":"bioRxiv"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1009930","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,4,6]],"date-time":"2022-04-06T00:00:00Z","timestamp":1649203200000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009930","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,4,6]],"date-time":"2022-04-06T13:56:37Z","timestamp":1649253397000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009930"}},"subtitle":[],"editor":[{"given":"Dina","family":"Schneidman-Duhovny","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,3,25]]},"references-count":36,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2022,3,25]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1009930","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.08.24.457439","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,25]]}}}