{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:05Z","timestamp":1772138045672,"version":"3.50.1"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2021,5,7]],"date-time":"2021-05-07T00:00:00Z","timestamp":1620345600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01GM089753"],"award-info":[{"award-number":["R01GM089753"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,10,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Inter-residue distance prediction by convolutional residual neural network (deep ResNet) has greatly advanced protein structure prediction. Currently, the most successful structure prediction methods predict distance by discretizing it into dozens of bins. Here, we study how well real-valued distance can be predicted and how useful it is for 3D structure modeling by comparing it with discrete-valued prediction based upon the same deep ResNet.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Different from the recent methods that predict only a single real value for the distance of an atom pair, we predict both the mean and standard deviation of a distance and then fold a protein by the predicted mean and deviation. Our findings include: (i) tested on the CASP13 FM (free-modeling) targets, our real-valued distance prediction obtains 81% precision on top L\/5 long-range contact prediction, much better than the best CASP13 results (70%); (ii) our real-valued prediction can predict correct folds for the same number of CASP13 FM targets as the best CASP13 group, despite generating only 20 decoys for each target; (iii) our method greatly outperforms a very new real-valued prediction method DeepDist in both contact prediction and 3D structure modeling and (iv) when the same deep ResNet is used, our real-valued distance prediction has 1\u20136% higher contact and distance accuracy than our own discrete-valued prediction, but less accurate 3D structure models.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>https:\/\/github.com\/j3xugit\/RaptorX-3DModeling.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab333","type":"journal-article","created":{"date-parts":[[2021,4,28]],"date-time":"2021-04-28T16:06:46Z","timestamp":1619626006000},"page":"3197-3203","source":"Crossref","is-referenced-by-count":14,"title":["Study of real-valued distance prediction for protein structure prediction with deep learning"],"prefix":"10.1093","volume":"37","author":[{"given":"Jin","family":"Li","sequence":"first","affiliation":[{"name":"Toyota Technological Institute at Chicago , Chicago, IL 60637, USA"},{"name":"Department of Computer Science, University of Chicago , Chicago, IL 60637, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7111-4839","authenticated-orcid":false,"given":"Jinbo","family":"Xu","sequence":"additional","affiliation":[{"name":"Toyota Technological Institute at Chicago , Chicago, IL 60637, USA"}]}],"member":"286","published-online":{"date-parts":[[2021,5,7]]},"reference":[{"key":"2023051608271140100_btab333-B1","doi-asserted-by":"crossref","first-page":"1100","DOI":"10.1002\/prot.25787","article-title":"A further leap of improvement in tertiary structure prediction in CASP13 prompts new routes for future assessments","volume":"87","author":"Abriata","year":"2019","journal-title":"Proteins"},{"key":"2023051608271140100_btab333-B2","first-page":"292","author":"AlQuraishi","year":"2019"},{"key":"2023051608271140100_btab333-B3","first-page":"3286","author":"Bello","year":"2019"},{"key":"2023051608271140100_btab333-B4","doi-asserted-by":"crossref","first-page":"2728","DOI":"10.1038\/nprot.2007.406","article-title":"Version 1.2 of the Crystallography and NMR system","volume":"2","author":"Brunger","year":"2007","journal-title":"Nat. Protoc"},{"key":"2023051608271140100_btab333-B5","first-page":"1567","volume-title":"Advances in Neural Information Processing Systems","author":"Cao","year":"2019"},{"key":"2023051608271140100_btab333-B6","doi-asserted-by":"crossref","first-page":"689","DOI":"10.1093\/bioinformatics\/btq007","article-title":"PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta","volume":"26","author":"Chaudhury","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051608271140100_btab333-B7","first-page":"352","volume-title":"Advances in Neural Information Processing Systems","author":"Chen","year":"2018"},{"key":"2023051608271140100_btab333-B8","doi-asserted-by":"crossref","first-page":"2001314","DOI":"10.1002\/advs.202001314","article-title":"Predicting the real-valued inter-residue distances for proteins","volume":"7","author":"Ding","year":"2020","journal-title":"Adv. Sci"},{"key":"2023051608271140100_btab333-B9","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1186\/s12859-018-2065-x","article-title":"RaptorX-Angle: real-value prediction of protein backbone dihedral angles through a hybrid method of clustering and deep learning","volume":"19","author":"Gao","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023051608271140100_btab333-B10","doi-asserted-by":"crossref","first-page":"3977","DOI":"10.1038\/s41467-019-11994-0","article-title":"Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints","volume":"10","author":"Greener","year":"2019","journal-title":"Nat. Commun"},{"key":"2023051608271140100_btab333-B11","author":"Ingraham","year":"2019"},{"key":"2023051608271140100_btab333-B12","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1186\/1471-2105-8-113","article-title":"Improved residue contact prediction using support vector machines and a large feature set","volume":"8","author":"Jianlin Cheng","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023051608271140100_btab333-B13","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1186\/1471-2105-11-431","article-title":"Hidden Markov model speed heuristic and iterative HMM search procedure","volume":"11","author":"Johnson","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023051608271140100_btab333-B14","doi-asserted-by":"crossref","first-page":"999","DOI":"10.1093\/bioinformatics\/btu791","article-title":"MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins","volume":"31","author":"Jones","year":"2015","journal-title":"Bioinformatics"},{"key":"2023051608271140100_btab333-B15","author":"Li","year":"2019"},{"key":"2023051608271140100_btab333-B16","author":"Loshchilov","year":"2019"},{"key":"2023051608271140100_btab333-B17","author":"Micikevicius","year":"2018"},{"key":"2023051608271140100_btab333-B18","doi-asserted-by":"crossref","first-page":"D170","DOI":"10.1093\/nar\/gkw1081","article-title":"Uniclust databases of clustered and deeply annotated protein sequences and alignments","volume":"45","author":"Mirdita","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023051608271140100_btab333-B19","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1038\/nmeth.1818","article-title":"HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment","volume":"9","author":"Remmert","year":"2012","journal-title":"Nat. Methods"},{"key":"2023051608271140100_btab333-B20","first-page":"2234","article-title":"Improved techniques for training gans","author":"Salimans","year":"2016","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2023051608271140100_btab333-B21","doi-asserted-by":"crossref","first-page":"3128","DOI":"10.1093\/bioinformatics\/btu500","article-title":"CCMpred\u2014fast and precise prediction of protein residue\u2013residue contacts from correlated mutations","volume":"30","author":"Seemayer","year":"2014","journal-title":"Bioinformatics"},{"key":"2023051608271140100_btab333-B22","doi-asserted-by":"crossref","first-page":"706","DOI":"10.1038\/s41586-019-1923-7","article-title":"Improved protein structure prediction using potentials from deep learning","volume":"577","author":"Senior","year":"2020","journal-title":"Nature"},{"key":"2023051608271140100_btab333-B23","doi-asserted-by":"crossref","first-page":"1058","DOI":"10.1002\/prot.25819","article-title":"Assessing the accuracy of contact predictions in CASP13","volume":"87","author":"Shrestha","year":"2019","journal-title":"Proteins"},{"key":"2023051608271140100_btab333-B24","first-page":"6105","author":"Tan","year":"2019"},{"key":"2023051608271140100_btab333-B25","author":"Tan","year":"2019"},{"key":"2023051608271140100_btab333-B26","first-page":"11534","author":"Wang","year":"2020"},{"key":"2023051608271140100_btab333-B27","doi-asserted-by":"crossref","first-page":"W430","DOI":"10.1093\/nar\/gkw306","article-title":"RaptorX-Property: a web server for protein structure property prediction","volume":"44","author":"Wang","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023051608271140100_btab333-B28","doi-asserted-by":"crossref","first-page":"e1005324","DOI":"10.1371\/journal.pcbi.1005324","article-title":"Accurate de novo prediction of protein contact map by ultra-deep learning model","volume":"13","author":"Wang","year":"2017","journal-title":"PLoS Comput. Biol"},{"key":"2023051608271140100_btab333-B29","author":"Wu","year":"2021"},{"key":"2023051608271140100_btab333-B30","doi-asserted-by":"crossref","first-page":"16856","DOI":"10.1073\/pnas.1821309116","article-title":"Distance-based protein folding powered by deep learning","volume":"116","author":"Xu","year":"2019","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051608271140100_btab333-B31","doi-asserted-by":"crossref","DOI":"10.1038\/s42256-021-00348-5","article-title":"Improved protein structure prediction by deep learning irrespective of co-evolution information","author":"Xu","year":"2021","journal-title":"Nature Machine Intelligence. doi: 10.1101\/2020.10.12.336859."},{"key":"2023051608271140100_btab333-B32","doi-asserted-by":"crossref","first-page":"1069","DOI":"10.1002\/prot.25810","article-title":"Analysis of distance-based protein structure prediction by deep learning in CASP13","volume":"87","author":"Xu","year":"2019","journal-title":"Proteins Struct. Funct. Bioinf"},{"key":"2023051608271140100_btab333-B33","doi-asserted-by":"crossref","first-page":"889","DOI":"10.1093\/bioinformatics\/btq066","article-title":"How significant is a protein structure similarity with TM-score = 0.5?","volume":"26","author":"Xu","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051608271140100_btab333-B34","doi-asserted-by":"crossref","first-page":"1496","DOI":"10.1073\/pnas.1914677117","article-title":"Improved protein structure prediction using predicted interresidue orientations","volume":"117","author":"Yang","year":"2020","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051608271140100_btab333-B35","doi-asserted-by":"crossref","first-page":"1118","DOI":"10.1016\/j.str.2012.04.003","article-title":"A position-specific distance-dependent statistical potential for protein structure and functional study","volume":"20","author":"Zhao","year":"2012","journal-title":"Structure"},{"key":"2023051608271140100_btab333-B36","doi-asserted-by":"crossref","first-page":"i263","DOI":"10.1093\/bioinformatics\/bty278","article-title":"Protein threading using residue co-variation and deep learning","volume":"34","author":"Zhu","year":"2018","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab333\/38379694\/btab333.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/19\/3197\/50338183\/btab333.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/19\/3197\/50338183\/btab333.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T04:41:22Z","timestamp":1684212082000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/19\/3197\/6271411"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,7]]},"references-count":36,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2021,10,11]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab333","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.11.26.400523","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,10,1]]},"published":{"date-parts":[[2021,5,7]]}}}