{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T09:52:41Z","timestamp":1776678761268,"version":"3.51.2"},"reference-count":84,"publisher":"Springer Science and Business Media LLC","issue":"7873","license":[{"start":{"date-parts":[[2021,7,15]],"date-time":"2021-07-15T00:00:00Z","timestamp":1626307200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,7,15]],"date-time":"2021-07-15T00:00:00Z","timestamp":1626307200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Nature"],"published-print":{"date-parts":[[2021,8,26]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort\n                    <jats:sup>1\u20134<\/jats:sup>\n                    , the structures of around 100,000 unique proteins have been determined\n                    <jats:sup>5<\/jats:sup>\n                    , but this represents a small fraction of the billions of known protein sequences\n                    <jats:sup>6,7<\/jats:sup>\n                    . Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence\u2014the structure prediction component of the \u2018protein folding problem\u2019\n                    <jats:sup>8<\/jats:sup>\n                    \u2014has been an important open research problem for more than 50\u00a0years\n                    <jats:sup>9<\/jats:sup>\n                    . Despite recent progress\n                    <jats:sup>10\u201314<\/jats:sup>\n                    , existing methods fall far\u00a0short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14)\n                    <jats:sup>15<\/jats:sup>\n                    , demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm.\n                  <\/jats:p>","DOI":"10.1038\/s41586-021-03819-2","type":"journal-article","created":{"date-parts":[[2021,7,15]],"date-time":"2021-07-15T15:10:38Z","timestamp":1626361838000},"page":"583-589","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":40279,"title":["Highly accurate protein structure prediction with AlphaFold"],"prefix":"10.1038","volume":"596","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6169-6580","authenticated-orcid":false,"given":"John","family":"Jumper","sequence":"first","affiliation":[]},{"given":"Richard","family":"Evans","sequence":"additional","affiliation":[]},{"given":"Alexander","family":"Pritzel","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3227-1505","authenticated-orcid":false,"given":"Tim","family":"Green","sequence":"additional","affiliation":[]},{"given":"Michael","family":"Figurnov","sequence":"additional","affiliation":[]},{"given":"Olaf","family":"Ronneberger","sequence":"additional","affiliation":[]},{"given":"Kathryn","family":"Tunyasuvunakool","sequence":"additional","affiliation":[]},{"given":"Russ","family":"Bates","sequence":"additional","affiliation":[]},{"given":"Augustin","family":"\u017d\u00eddek","sequence":"additional","affiliation":[]},{"given":"Anna","family":"Potapenko","sequence":"additional","affiliation":[]},{"given":"Alex","family":"Bridgland","sequence":"additional","affiliation":[]},{"given":"Clemens","family":"Meyer","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4271-4418","authenticated-orcid":false,"given":"Simon A. A.","family":"Kohl","sequence":"additional","affiliation":[]},{"given":"Andrew J.","family":"Ballard","sequence":"additional","affiliation":[]},{"given":"Andrew","family":"Cowie","sequence":"additional","affiliation":[]},{"given":"Bernardino","family":"Romera-Paredes","sequence":"additional","affiliation":[]},{"given":"Stanislav","family":"Nikolov","sequence":"additional","affiliation":[]},{"given":"Rishub","family":"Jain","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9928-3407","authenticated-orcid":false,"given":"Jonas","family":"Adler","sequence":"additional","affiliation":[]},{"given":"Trevor","family":"Back","sequence":"additional","affiliation":[]},{"given":"Stig","family":"Petersen","sequence":"additional","affiliation":[]},{"given":"David","family":"Reiman","sequence":"additional","affiliation":[]},{"given":"Ellen","family":"Clancy","sequence":"additional","affiliation":[]},{"given":"Michal","family":"Zielinski","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8781-9753","authenticated-orcid":false,"given":"Martin","family":"Steinegger","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2160-6226","authenticated-orcid":false,"given":"Michalina","family":"Pacholska","sequence":"additional","affiliation":[]},{"given":"Tamas","family":"Berghammer","sequence":"additional","affiliation":[]},{"given":"Sebastian","family":"Bodenstein","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5197-2892","authenticated-orcid":false,"given":"David","family":"Silver","sequence":"additional","affiliation":[]},{"given":"Oriol","family":"Vinyals","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2401-5691","authenticated-orcid":false,"given":"Andrew W.","family":"Senior","sequence":"additional","affiliation":[]},{"given":"Koray","family":"Kavukcuoglu","sequence":"additional","affiliation":[]},{"given":"Pushmeet","family":"Kohli","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2812-9917","authenticated-orcid":false,"given":"Demis","family":"Hassabis","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,7,15]]},"reference":[{"key":"3819_CR1","doi-asserted-by":"publisher","first-page":"667","DOI":"10.12688\/f1000research.25097.1","volume":"9","author":"MC Thompson","year":"2020","unstructured":"Thompson, M. C., Yeates, T. O. & Rodriguez, J. A. Advances in methods for atomic resolution macromolecular structure determination. F1000Res. 9, 667 (2020).","journal-title":"F1000Res."},{"key":"3819_CR2","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1016\/j.tibs.2014.10.005","volume":"40","author":"X-C Bai","year":"2015","unstructured":"Bai, X.-C., McMullan, G. & Scheres, S. H. W. How cryo-EM is revolutionizing structural biology. Trends Biochem. Sci. 40, 49\u201357 (2015).","journal-title":"Trends Biochem. Sci."},{"key":"3819_CR3","doi-asserted-by":"publisher","first-page":"3985","DOI":"10.1111\/febs.12796","volume":"281","author":"M Jaskolski","year":"2014","unstructured":"Jaskolski, M., Dauter, Z. & Wlodawer, A. A brief history of macromolecular crystallography, illustrated by a family tree and its Nobel fruits. FEBS J. 281, 3985\u20134009 (2014).","journal-title":"FEBS J."},{"key":"3819_CR4","doi-asserted-by":"publisher","first-page":"923","DOI":"10.1038\/nsb1101-923","volume":"8","author":"K W\u00fcthrich","year":"2001","unstructured":"W\u00fcthrich, K. The way to NMR structures of proteins. Nat. Struct. Biol. 8, 923\u2013925 (2001).","journal-title":"Nat. Struct. Biol."},{"key":"3819_CR5","doi-asserted-by":"publisher","first-page":"D520","DOI":"10.1093\/nar\/gky949","volume":"47","author":"wwPDB Consortium","year":"2018","unstructured":"wwPDB Consortium. Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Res. 47, D520\u2013D528 (2018).","journal-title":"Nucleic Acids Res."},{"key":"3819_CR6","first-page":"D570","volume":"48","author":"AL Mitchell","year":"2020","unstructured":"Mitchell, A. L. et al. MGnify: the microbiome analysis resource in 2020. Nucleic Acids Res. 48, D570\u2013D578 (2020).","journal-title":"Nucleic Acids Res."},{"key":"3819_CR7","doi-asserted-by":"publisher","first-page":"603","DOI":"10.1038\/s41592-019-0437-4","volume":"16","author":"M Steinegger","year":"2019","unstructured":"Steinegger, M., Mirdita, M. & S\u00f6ding, J. Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold. Nat. Methods 16, 603\u2013606 (2019).","journal-title":"Nat. Methods"},{"key":"3819_CR8","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1146\/annurev.biophys.37.092707.153558","volume":"37","author":"KA Dill","year":"2008","unstructured":"Dill, K. A., Ozkan, S. B., Shell, M. S. & Weikl, T. R. The protein folding problem. Annu. Rev. Biophys. 37, 289\u2013316 (2008).","journal-title":"Annu. Rev. Biophys."},{"key":"3819_CR9","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1126\/science.181.4096.223","volume":"181","author":"CB Anfinsen","year":"1973","unstructured":"Anfinsen, C. B. Principles that govern the folding of protein chains. Science 181, 223\u2013230 (1973).","journal-title":"Science"},{"key":"3819_CR10","doi-asserted-by":"publisher","first-page":"706","DOI":"10.1038\/s41586-019-1923-7","volume":"577","author":"AW Senior","year":"2020","unstructured":"Senior, A. W. et al. Improved protein structure prediction using potentials from deep learning. Nature 577, 706\u2013710 (2020).","journal-title":"Nature"},{"key":"3819_CR11","doi-asserted-by":"publisher","first-page":"e1005324","DOI":"10.1371\/journal.pcbi.1005324","volume":"13","author":"S Wang","year":"2017","unstructured":"Wang, S., Sun, S., Li, Z., Zhang, R. & Xu, J. Accurate de novo prediction of protein contact map by ultra-deep learning model. PLOS Comput. Biol. 13, e1005324 (2017).","journal-title":"PLOS Comput. Biol."},{"key":"3819_CR12","doi-asserted-by":"publisher","first-page":"1149","DOI":"10.1002\/prot.25792","volume":"87","author":"W Zheng","year":"2019","unstructured":"Zheng, W. et al. Deep-learning contact-map guided protein structure prediction in CASP13. Proteins 87, 1149\u20131164 (2019).","journal-title":"Proteins"},{"key":"3819_CR13","doi-asserted-by":"publisher","first-page":"1100","DOI":"10.1002\/prot.25787","volume":"87","author":"LA Abriata","year":"2019","unstructured":"Abriata, L. A., Tam\u00f2, G. E. & Dal Peraro, M. A further leap of improvement in tertiary structure prediction in CASP13 prompts new routes for future assessments. Proteins 87, 1100\u20131112 (2019).","journal-title":"Proteins"},{"key":"3819_CR14","doi-asserted-by":"publisher","first-page":"194","DOI":"10.1016\/j.sbi.2021.01.007","volume":"68","author":"R Pearce","year":"2021","unstructured":"Pearce, R. & Zhang, Y. Deep learning techniques have significantly impacted protein structure prediction and protein design. Curr. Opin. Struct. Biol. 68, 194\u2013207 (2021).","journal-title":"Curr. Opin. Struct. Biol."},{"key":"3819_CR15","unstructured":"Moult, J., Fidelis, K., Kryshtafovych, A., Schwede, T. & Topf, M. Critical assessment of techniques for protein structure prediction, fourteenth round. CASP 14 Abstract Book https:\/\/www.predictioncenter.org\/casp14\/doc\/CASP14_Abstracts.pdf (2020)."},{"key":"3819_CR16","doi-asserted-by":"publisher","first-page":"eaaz3041","DOI":"10.1126\/science.aaz3041","volume":"370","author":"E Brini","year":"2020","unstructured":"Brini, E., Simmerling, C. & Dill, K. Protein storytelling through physics. Science 370, eaaz3041 (2020).","journal-title":"Science"},{"key":"3819_CR17","doi-asserted-by":"publisher","first-page":"859","DOI":"10.1016\/S0022-2836(05)80269-4","volume":"213","author":"MJ Sippl","year":"1990","unstructured":"Sippl, M. J. Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J. Mol. Biol. 213, 859\u2013883 (1990).","journal-title":"J. Mol. Biol."},{"key":"3819_CR18","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1006\/jmbi.1993.1626","volume":"234","author":"A \u0160ali","year":"1993","unstructured":"\u0160ali, A. & Blundell, T. L. Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol. 234, 779\u2013815 (1993).","journal-title":"J. Mol. Biol."},{"key":"3819_CR19","doi-asserted-by":"publisher","first-page":"725","DOI":"10.1038\/nprot.2010.5","volume":"5","author":"A Roy","year":"2010","unstructured":"Roy, A., Kucukural, A. & Zhang, Y. I-TASSER: a unified platform for automated protein structure and function prediction. Nat. Protocols 5, 725\u2013738 (2010).","journal-title":"Nat. Protocols"},{"key":"3819_CR20","doi-asserted-by":"publisher","first-page":"693","DOI":"10.1016\/0022-2836(87)90352-4","volume":"193","author":"D Altschuh","year":"1987","unstructured":"Altschuh, D., Lesk, A. M., Bloomer, A. C. & Klug, A. Correlation of co-ordinated amino acid substitutions with function in viruses related to tobacco mosaic virus. J. Mol. Biol. 193, 693\u2013707 (1987).","journal-title":"J. Mol. Biol."},{"key":"3819_CR21","doi-asserted-by":"publisher","first-page":"349","DOI":"10.1093\/protein\/7.3.349","volume":"7","author":"IN Shindyalov","year":"1994","unstructured":"Shindyalov, I. N., Kolchanov, N. A. & Sander, C. Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations? Protein Eng. 7, 349\u2013358 (1994).","journal-title":"Protein Eng."},{"key":"3819_CR22","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1073\/pnas.0805923106","volume":"106","author":"M Weigt","year":"2009","unstructured":"Weigt, M., White, R. A., Szurmant, H., Hoch, J. A. & Hwa, T. Identification of direct residue contacts in protein\u2013protein interaction by message passing. Proc. Natl Acad. Sci. USA 106, 67\u201372 (2009).","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"3819_CR23","doi-asserted-by":"publisher","first-page":"e28766","DOI":"10.1371\/journal.pone.0028766","volume":"6","author":"DS Marks","year":"2011","unstructured":"Marks, D. S. et al. Protein 3D structure computed from evolutionary sequence variation. PLoS ONE 6, e28766 (2011).","journal-title":"PLoS ONE"},{"key":"3819_CR24","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1093\/bioinformatics\/btr638","volume":"28","author":"DT Jones","year":"2012","unstructured":"Jones, D. T., Buchan, D. W. A., Cozzetto, D. & Pontil, M. PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments. Bioinformatics 28, 184\u2013190 (2012).","journal-title":"Bioinformatics"},{"key":"3819_CR25","doi-asserted-by":"publisher","first-page":"ii","DOI":"10.1002\/prot.340230303","volume":"23","author":"J Moult","year":"1995","unstructured":"Moult, J., Pedersen, J. T., Judson, R. & Fidelis, K. A large-scale experiment to assess protein structure prediction methods. Proteins 23, ii\u2013iv (1995).","journal-title":"Proteins"},{"key":"3819_CR26","doi-asserted-by":"publisher","first-page":"1011","DOI":"10.1002\/prot.25823","volume":"87","author":"A Kryshtafovych","year":"2019","unstructured":"Kryshtafovych, A., Schwede, T., Topf, M., Fidelis, K. & Moult, J. Critical assessment of methods of protein structure prediction (CASP)-round XIII. Proteins 87, 1011\u20131020 (2019).","journal-title":"Proteins"},{"key":"3819_CR27","doi-asserted-by":"publisher","first-page":"702","DOI":"10.1002\/prot.20264","volume":"57","author":"Y Zhang","year":"2004","unstructured":"Zhang, Y. & Skolnick, J. Scoring function for automated assessment of protein structure template quality. Proteins 57, 702\u2013710 (2004).","journal-title":"Proteins"},{"key":"3819_CR28","doi-asserted-by":"publisher","first-page":"1744","DOI":"10.1109\/TPAMI.2009.186","volume":"32","author":"Z Tu","year":"2010","unstructured":"Tu, Z. & Bai, X. Auto-context and its application to high-level vision tasks and 3D brain image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1744\u20131757 (2010).","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"3819_CR29","doi-asserted-by":"crossref","unstructured":"Carreira, J., Agrawal, P., Fragkiadaki, K. & Malik, J. Human pose estimation with iterative error feedback. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 4733\u20134742 (2016).","DOI":"10.1109\/CVPR.2016.512"},{"key":"3819_CR30","doi-asserted-by":"publisher","first-page":"e0220182","DOI":"10.1371\/journal.pone.0220182","volume":"14","author":"C Mirabello","year":"2019","unstructured":"Mirabello, C. & Wallner, B. rawMSA: end-to-end deep learning using raw multiple sequence alignments. PLoS ONE 14, e0220182 (2019).","journal-title":"PLoS ONE"},{"key":"3819_CR31","doi-asserted-by":"crossref","unstructured":"Huang, Z. et al. CCNet: criss-cross attention for semantic segmentation. In Proc. IEEE\/CVF International Conference on Computer Vision 603\u2013612 (2019).","DOI":"10.1109\/ICCV.2019.00069"},{"key":"3819_CR32","doi-asserted-by":"publisher","first-page":"712","DOI":"10.1002\/prot.21123","volume":"65","author":"V Hornak","year":"2006","unstructured":"Hornak, V. et al. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins 65, 712\u2013725 (2006).","journal-title":"Proteins"},{"key":"3819_CR33","doi-asserted-by":"publisher","first-page":"3370","DOI":"10.1093\/nar\/gkg571","volume":"31","author":"A Zemla","year":"2003","unstructured":"Zemla, A. LGA: a method for finding 3D similarities in protein structures. Nucleic Acids Res. 31, 3370\u20133374 (2003).","journal-title":"Nucleic Acids Res."},{"key":"3819_CR34","doi-asserted-by":"publisher","first-page":"2722","DOI":"10.1093\/bioinformatics\/btt473","volume":"29","author":"V Mariani","year":"2013","unstructured":"Mariani, V., Biasini, M., Barbato, A. & Schwede, T. lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests. Bioinformatics 29, 2722\u20132728 (2013).","journal-title":"Bioinformatics"},{"key":"3819_CR35","doi-asserted-by":"crossref","unstructured":"Xie, Q., Luong, M.-T., Hovy, E. & Le, Q. V. Self-training with noisy student improves imagenet classification. In Proc. IEEE\/CVF Conference on Computer Vision and Pattern Recognition 10687\u201310698 (2020).","DOI":"10.1109\/CVPR42600.2020.01070"},{"key":"3819_CR36","doi-asserted-by":"publisher","first-page":"D170","DOI":"10.1093\/nar\/gkw1081","volume":"45","author":"M Mirdita","year":"2017","unstructured":"Mirdita, M. et al. Uniclust databases of clustered and deeply annotated protein sequences and alignments. Nucleic Acids Res. 45, D170\u2013D176 (2017).","journal-title":"Nucleic Acids Res."},{"key":"3819_CR37","unstructured":"Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: pre-training of deep bidirectional transformers for language understanding. In Proc. 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 1, 4171\u20134186 (2019)."},{"key":"3819_CR38","unstructured":"Rao, R. et al. MSA transformer. In Proc. 38th International Conference on Machine Learning PMLR 139, 8844\u20138856 (2021)."},{"key":"3819_CR39","doi-asserted-by":"publisher","unstructured":"Tunyasuvunakool, K. et al. Highly accurate protein structure prediction for the human proteome. Nature https:\/\/doi.org\/10.1038\/s41586-021-03828-1 (2021).","DOI":"10.1038\/s41586-021-03828-1"},{"key":"3819_CR40","doi-asserted-by":"publisher","first-page":"681","DOI":"10.1038\/s41580-019-0163-x","volume":"20","author":"B Kuhlman","year":"2019","unstructured":"Kuhlman, B. & Bradley, P. Advances in protein structure prediction and design. Nat. Rev. Mol. Cell Biol. 20, 681\u2013697 (2019).","journal-title":"Nat. Rev. Mol. Cell Biol."},{"key":"3819_CR41","doi-asserted-by":"publisher","first-page":"1072","DOI":"10.1038\/nbt.2419","volume":"30","author":"DS Marks","year":"2012","unstructured":"Marks, D. S., Hopf, T. A. & Sander, C. Protein structure prediction from sequence variation. Nat. Biotechnol. 30, 1072\u20131080 (2012).","journal-title":"Nat. Biotechnol."},{"key":"3819_CR42","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1016\/0022-2836(88)90564-5","volume":"202","author":"N Qian","year":"1988","unstructured":"Qian, N. & Sejnowski, T. J. Predicting the secondary structure of globular proteins using neural network models. J. Mol. Biol. 202, 865\u2013884 (1988).","journal-title":"J. Mol. Biol."},{"key":"3819_CR43","doi-asserted-by":"publisher","first-page":"835","DOI":"10.1093\/protein\/14.11.835","volume":"14","author":"P Fariselli","year":"2001","unstructured":"Fariselli, P., Olmea, O., Valencia, A. & Casadio, R. Prediction of contact maps with neural networks and correlated mutations. Protein Eng. 14, 835\u2013843 (2001).","journal-title":"Protein Eng."},{"key":"3819_CR44","doi-asserted-by":"publisher","first-page":"1496","DOI":"10.1073\/pnas.1914677117","volume":"117","author":"J Yang","year":"2020","unstructured":"Yang, J. et al. Improved protein structure prediction using predicted interresidue orientations. Proc. Natl Acad. Sci. USA 117, 1496\u20131503 (2020).","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"3819_CR45","doi-asserted-by":"publisher","first-page":"e1008865","DOI":"10.1371\/journal.pcbi.1008865","volume":"17","author":"Y Li","year":"2021","unstructured":"Li, Y. et al. Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks. PLOS Comput. Biol. 17, e1008865 (2021).","journal-title":"PLOS Comput. Biol."},{"key":"3819_CR46","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 770\u2013778 (2016).","DOI":"10.1109\/CVPR.2016.90"},{"key":"3819_CR47","doi-asserted-by":"publisher","first-page":"292","DOI":"10.1016\/j.cels.2019.03.006","volume":"8","author":"M AlQuraishi","year":"2019","unstructured":"AlQuraishi, M. End-to-end differentiable learning of protein structure. Cell Syst. 8, 292\u2013301 (2019).","journal-title":"Cell Syst."},{"key":"3819_CR48","doi-asserted-by":"publisher","first-page":"1141","DOI":"10.1002\/prot.25834","volume":"87","author":"AW Senior","year":"2019","unstructured":"Senior, A. W. et al. Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13). Proteins 87, 1141\u20131148 (2019).","journal-title":"Proteins"},{"key":"3819_CR49","unstructured":"Ingraham, J., Riesselman, A. J., Sander, C. & Marks, D. S. Learning protein structure with a differentiable simulator. in Proc. International Conference on Learning Representations (2019)."},{"key":"3819_CR50","unstructured":"Li, J. Universal transforming geometric network. Preprint at https:\/\/arxiv.org\/abs\/1908.00723 (2019)."},{"key":"3819_CR51","doi-asserted-by":"publisher","first-page":"601","DOI":"10.1038\/s42256-021-00348-5","volume":"3","author":"J Xu","year":"2021","unstructured":"Xu, J., McPartlon, M. & Li, J. Improved protein structure prediction by deep learning irrespective of co-evolution information. Nat. Mach. Intell. 3, 601\u2013609 (2021).","journal-title":"Nat. Mach. Intell."},{"key":"3819_CR52","unstructured":"Vaswani, A. et al. Attention is all you need. In Advances in Neural Information Processing Systems 5998\u20136008 (2017)."},{"key":"3819_CR53","doi-asserted-by":"crossref","unstructured":"Wang, H. et al. Axial-deeplab: stand-alone axial-attention for panoptic segmentation. in European Conference on Computer Vision 108\u2013126 (Springer, 2020).","DOI":"10.1007\/978-3-030-58548-8_7"},{"key":"3819_CR54","doi-asserted-by":"publisher","first-page":"1315","DOI":"10.1038\/s41592-019-0598-1","volume":"16","author":"EC Alley","year":"2019","unstructured":"Alley, E. C., Khimulya, G., Biswas, S., AlQuraishi, M. & Church, G. M. Unified rational protein engineering with sequence-based deep representation learning. Nat. Methods 16, 1315\u20131322 (2019).","journal-title":"Nat. Methods"},{"key":"3819_CR55","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-019-3220-8","volume":"20","author":"M Heinzinger","year":"2019","unstructured":"Heinzinger, M. et al. Modeling aspects of the language of life through transfer-learning protein sequences. BMC Bioinformatics 20, 723 (2019).","journal-title":"BMC Bioinformatics"},{"key":"3819_CR56","doi-asserted-by":"publisher","first-page":"e2016239118","DOI":"10.1073\/pnas.2016239118","volume":"118","author":"A Rives","year":"2021","unstructured":"Rives, A. et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl Acad. Sci. USA 118, e2016239118 (2021).","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"3819_CR57","doi-asserted-by":"publisher","DOI":"10.1002\/prot.26171","author":"J Pereira","year":"2021","unstructured":"Pereira, J. et al. High-accuracy protein structure prediction in CASP14. Proteins https:\/\/doi.org\/10.1002\/prot.26171 (2021).","journal-title":"Proteins"},{"key":"3819_CR58","doi-asserted-by":"publisher","unstructured":"Gupta, M. et al. CryoEM and AI reveal a structure of SARS-CoV-2 Nsp2, a multifunctional protein involved in key host processes. Preprint at https:\/\/doi.org\/10.1101\/2021.05.10.443524 (2021).","DOI":"10.1101\/2021.05.10.443524"},{"key":"3819_CR59","unstructured":"Ingraham, J., Garg, V. K., Barzilay, R. & Jaakkola, T. Generative models for graph-based protein design. in Proc. 33rd Conference on Neural Information Processing Systems (2019)."},{"key":"3819_CR60","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-11-431","volume":"11","author":"LS Johnson","year":"2010","unstructured":"Johnson, L. S., Eddy, S. R. & Portugaly, E. Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinformatics 11, 431 (2010).","journal-title":"BMC Bioinformatics"},{"key":"3819_CR61","doi-asserted-by":"publisher","first-page":"173","DOI":"10.1038\/nmeth.1818","volume":"9","author":"M Remmert","year":"2012","unstructured":"Remmert, M., Biegert, A., Hauser, A. & S\u00f6ding, J. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods 9, 173\u2013175 (2012).","journal-title":"Nat. Methods"},{"key":"3819_CR62","doi-asserted-by":"publisher","first-page":"D480","DOI":"10.1093\/nar\/gkaa1100","volume":"49","author":"The UniProt Consortium","year":"2020","unstructured":"The UniProt Consortium. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480\u2013D489 (2020).","journal-title":"Nucleic Acids Res."},{"key":"3819_CR63","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-018-04964-5","volume":"9","author":"M Steinegger","year":"2018","unstructured":"Steinegger, M. & S\u00f6ding, J. Clustering huge protein sequence sets in linear time. Nat. Commun. 9, 2542 (2018).","journal-title":"Nat. Commun."},{"key":"3819_CR64","doi-asserted-by":"publisher","first-page":"1026","DOI":"10.1038\/nbt.3988","volume":"35","author":"M Steinegger","year":"2017","unstructured":"Steinegger, M. & S\u00f6ding, J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol. 35, 1026\u20131028 (2017).","journal-title":"Nat. Biotechnol."},{"key":"3819_CR65","doi-asserted-by":"publisher","DOI":"10.1038\/srep33964","volume":"6","author":"S Deorowicz","year":"2016","unstructured":"Deorowicz, S., Debudaj-Grabysz, A. & Gudy\u015b, A. FAMSA: fast and accurate multiple sequence alignment of huge protein families. Sci. Rep. 6, 33964 (2016).","journal-title":"Sci. Rep."},{"key":"3819_CR66","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-019-3019-7","volume":"20","author":"M Steinegger","year":"2019","unstructured":"Steinegger, M. et al. HH-suite3 for fast remote homology detection and deep protein annotation. BMC Bioinformatics 20, 473 (2019).","journal-title":"BMC Bioinformatics"},{"key":"3819_CR67","doi-asserted-by":"publisher","first-page":"926","DOI":"10.1093\/bioinformatics\/btu739","volume":"31","author":"BE Suzek","year":"2015","unstructured":"Suzek, B. E., Wang, Y., Huang, H., McGarvey, P. B. & Wu, C. H. UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics 31, 926\u2013932 (2015).","journal-title":"Bioinformatics"},{"key":"3819_CR68","doi-asserted-by":"publisher","first-page":"e1002195","DOI":"10.1371\/journal.pcbi.1002195","volume":"7","author":"SR Eddy","year":"2011","unstructured":"Eddy, S. R. Accelerated profile HMM searches. PLOS Comput. Biol. 7, e1002195 (2011).","journal-title":"PLOS Comput. Biol."},{"key":"3819_CR69","doi-asserted-by":"publisher","first-page":"e1005659","DOI":"10.1371\/journal.pcbi.1005659","volume":"13","author":"P Eastman","year":"2017","unstructured":"Eastman, P. et al. OpenMM 7: rapid development of high performance algorithms for molecular dynamics. PLOS Comput. Biol. 13, e1005659 (2017).","journal-title":"PLOS Comput. Biol."},{"key":"3819_CR70","unstructured":"Ashish, A. M. A. et al. TensorFlow: large-scale machine learning on heterogeneous systems. Preprint at https:\/\/arxiv.org\/abs\/1603.04467 (2015)."},{"key":"3819_CR71","unstructured":"Reynolds, M. et al. Open sourcing Sonnet \u2013 a new library for constructing neural networks. DeepMind https:\/\/deepmind.com\/blog\/open-sourcing-sonnet\/ (7 April 2017)."},{"key":"3819_CR72","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1038\/s41586-020-2649-2","volume":"585","author":"CR Harris","year":"2020","unstructured":"Harris, C. R. et al. Array programming with NumPy. Nature 585, 357\u2013362 (2020).","journal-title":"Nature"},{"key":"3819_CR73","unstructured":"Van Rossum, G. & Drake, F. L. Python 3 Reference Manual (CreateSpace, 2009)."},{"key":"3819_CR74","doi-asserted-by":"crossref","unstructured":"Bisong, E. in Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners 59\u201364 (Apress, 2019).","DOI":"10.1007\/978-1-4842-4470-8_7"},{"key":"3819_CR75","unstructured":"TensorFlow. XLA: Optimizing Compiler for TensorFlow. https:\/\/www.tensorflow.org\/xla (2018)."},{"key":"3819_CR76","doi-asserted-by":"publisher","first-page":"1091","DOI":"10.1093\/bioinformatics\/btz679","volume":"36","author":"T Wu","year":"2020","unstructured":"Wu, T., Hou, J., Adhikari, B. & Cheng, J. Analysis of several key factors influencing deep learning-based inter-residue contact prediction. Bioinformatics 36, 1091\u20131098 (2020).","journal-title":"Bioinformatics"},{"key":"3819_CR77","doi-asserted-by":"publisher","first-page":"e1008707","DOI":"10.1371\/journal.ppat.1008707","volume":"16","author":"W Jiang","year":"2020","unstructured":"Jiang, W. et al. MrpH, a new class of metal-binding adhesin, requires zinc to mediate biofilm formation. PLoS Pathog. 16, e1008707 (2020).","journal-title":"PLoS Pathog."},{"key":"3819_CR78","doi-asserted-by":"publisher","unstructured":"Dunne, M., Ernst, P., Sobieraj, A., Pluckthun, A. & Loessner, M. J. The M23 peptidase domain of the Staphylococcal phage 2638A endolysin. PDB https:\/\/doi.org\/10.2210\/pdb6YJ1\/pdb (2020).","DOI":"10.2210\/pdb6YJ1\/pdb"},{"key":"3819_CR79","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1038\/s41586-020-2921-5","volume":"589","author":"AV Drobysheva","year":"2021","unstructured":"Drobysheva, A. V. et al. Structure and function of virion RNA polymerase of a crAss-like phage. Nature 589, 306\u2013309 (2021).","journal-title":"Nature"},{"key":"3819_CR80","doi-asserted-by":"publisher","first-page":"e104129","DOI":"10.15252\/embj.2019104129","volume":"39","author":"N Flaugnatti","year":"2020","unstructured":"Flaugnatti, N. et al. Structural basis for loading and inhibition of a bacterial T6SS phospholipase effector by the VgrG spike. EMBO J. 39, e104129 (2020).","journal-title":"EMBO J."},{"key":"3819_CR81","doi-asserted-by":"publisher","first-page":"2226","DOI":"10.1021\/acssynbio.8b00224","volume":"7","author":"M ElGamacy","year":"2018","unstructured":"ElGamacy, M. et al. An interface-driven design strategy yields a novel, corrugated protein architecture. ACS Synth. Biol. 7, 2226\u20132235 (2018).","journal-title":"ACS Synth. Biol."},{"key":"3819_CR82","doi-asserted-by":"publisher","first-page":"1081","DOI":"10.1126\/science.aaz9649","volume":"368","author":"CJ Lim","year":"2020","unstructured":"Lim, C. J. et al. The structure of human CST reveals a decameric assembly bound to telomeric DNA. Science 368, 1081\u20131085 (2020).","journal-title":"Science"},{"key":"3819_CR83","doi-asserted-by":"publisher","first-page":"829","DOI":"10.1038\/s41594-020-0464-y","volume":"27","author":"V Debruycker","year":"2020","unstructured":"Debruycker, V. et al. An embedded lipid in the multidrug transporter LmrP suggests a mechanism for polyspecificity. Nat. Struct. Mol. Biol. 27, 829\u2013835 (2020).","journal-title":"Nat. Struct. Mol. Biol."},{"key":"3819_CR84","doi-asserted-by":"publisher","first-page":"e2021785118","DOI":"10.1073\/pnas.2021785118","volume":"118","author":"TG Flower","year":"2021","unstructured":"Flower, T. G. et al. Structure of SARS-CoV-2 ORF8, a rapidly evolving immune evasion protein. Proc. Natl Acad. Sci. USA 118, e2021785118 (2021).","journal-title":"Proc. Natl Acad. Sci. USA"}],"container-title":["Nature"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.nature.com\/articles\/s41586-021-03819-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41586-021-03819-2","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41586-021-03819-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,5]],"date-time":"2024-12-05T11:08:10Z","timestamp":1733396890000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.nature.com\/articles\/s41586-021-03819-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,15]]},"references-count":84,"journal-issue":{"issue":"7873","published-print":{"date-parts":[[2021,8,26]]}},"alternative-id":["3819"],"URL":"https:\/\/doi.org\/10.1038\/s41586-021-03819-2","relation":{"has-review":[{"id-type":"doi","id":"10.3410\/f.740477161.793587441","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793588825","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793587548","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793590095","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793587546","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793587468","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793588050","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793590709","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793587507","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793587439","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793587538","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793587604","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793588722","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793596551","asserted-by":"object"},{"id-type":"doi","id":"10.3410\/f.740477161.793587947","asserted-by":"object"}]},"ISSN":["0028-0836","1476-4687"],"issn-type":[{"value":"0028-0836","type":"print"},{"value":"1476-4687","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,15]]},"assertion":[{"value":"11 May 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 July 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 July 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"J.J., R.E., A. Pritzel, T.G., M.F., O.R., R.B., A.B., S.A.A.K., D.R. and A.W.S. have filed non-provisional patent applications 16\/701,070 and PCT\/EP2020\/084238, and provisional patent applications 63\/107,362, 63\/118,917, 63\/118,918, 63\/118,921 and 63\/118,919, each in the name of DeepMind Technologies Limited, each pending, relating to machine learning for predicting protein structures. The other authors declare no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}]}}