{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:44:24Z","timestamp":1753875864837,"version":"3.41.2"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2023,8,17]],"date-time":"2023-08-17T00:00:00Z","timestamp":1692230400000},"content-version":"vor","delay-in-days":16,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Office of Biological and Environmental Research (BER) Genomic Science program within the US Department of Energy (DOE) Office of Science","award":["DE-SC0021303","ERKP917"],"award-info":[{"award-number":["DE-SC0021303","ERKP917"]}]},{"name":"Oak Ridge National Laboratory, under the Laboratory Directed Research and Development Program","award":["09832"],"award-info":[{"award-number":["09832"]}]},{"name":"US DOE Joint Genome Institute"},{"name":"DOE Office of Science User Facility","award":["DE-AC02-05CH11231","10.46936\/10.25585\/60001030"],"award-info":[{"award-number":["DE-AC02-05CH11231","10.46936\/10.25585\/60001030"]}]},{"name":"Oak Ridge Leadership Computing Facility"},{"name":"DOE Office of Science User Facility","award":["DE-AC05-00OR22725"],"award-info":[{"award-number":["DE-AC05-00OR22725"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Sphagnum-dominated peatlands store a substantial amount of terrestrial carbon. The genus is undersampled and under-studied. No experimental crystal structure from any Sphagnum species exists in the Protein Data Bank and fewer than 200 Sphagnum-related genes have structural models available in the AlphaFold Protein Structure Database. 
Tools and resources are needed to help bridge these gaps, and to enable the analysis of other structural proteomes now made possible by accurate structure prediction.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We present the predicted structural proteome (25\u00a0134 primary transcripts) of Sphagnum divinum computed using AlphaFold, structural alignment results of all high-confidence models against an annotated nonredundant crystallographic database of over 90,000 structures, a structure-based classification of putative Enzyme Commission (EC) numbers across this proteome, and the computational method to perform this proteome-scale structure-based annotation.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>All data and code are available in public repositories, detailed at https:\/\/github.com\/BSDExabio\/SAFA. The structural models of the S. 
divinum proteome have been deposited in the ModelArchive repository at https:\/\/modelarchive.org\/doi\/10.5452\/ma-ornl-sphdiv.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad511","type":"journal-article","created":{"date-parts":[[2023,8,17]],"date-time":"2023-08-17T16:43:40Z","timestamp":1692290620000},"source":"Crossref","is-referenced-by-count":3,"title":["Predicted structural proteome of <i>Sphagnum divinum<\/i> and proteome-scale annotation"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2364-4039","authenticated-orcid":false,"given":"Russell B","family":"Davidson","sequence":"first","affiliation":[{"name":"Biosciences Division, Oak Ridge National Laboratory , Oak Ridge, TN 37830, United States"}]},{"given":"Mark","family":"Coletti","sequence":"additional","affiliation":[{"name":"Computer Science and Mathematics Division, Oak Ridge National Laboratory , Oak Ridge, TN 37830, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0378-3704","authenticated-orcid":false,"given":"Mu","family":"Gao","sequence":"additional","affiliation":[{"name":"Center for the Study of Systems Biology, School of Biological Sciences, Georgia Institute of Technology , Atlanta, GA 30332, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1334-8431","authenticated-orcid":false,"given":"Bryan","family":"Piatkowski","sequence":"additional","affiliation":[{"name":"Biosciences Division, Oak Ridge National Laboratory , Oak Ridge, TN 37830, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7336-7012","authenticated-orcid":false,"given":"Avinash","family":"Sreedasyam","sequence":"additional","affiliation":[{"name":"Genome Sequencing Center, HudsonAlpha Institute for Biotechnology , Huntsville, AL 35806, United States"}]},{"given":"Farhan","family":"Quadir","sequence":"additional","affiliation":[{"name":"Electrical Engineering and Computer Science, University of Missouri , Columbia, MS 65211, United 
States"}]},{"given":"David J","family":"Weston","sequence":"additional","affiliation":[{"name":"Biosciences Division, Oak Ridge National Laboratory , Oak Ridge, TN 37830, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8062-9172","authenticated-orcid":false,"given":"Jeremy","family":"Schmutz","sequence":"additional","affiliation":[{"name":"Genome Sequencing Center, HudsonAlpha Institute for Biotechnology , Huntsville, AL 35806, United States"},{"name":"Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory , Berkeley, CA 94720, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0305-2853","authenticated-orcid":false,"given":"Jianlin","family":"Cheng","sequence":"additional","affiliation":[{"name":"Electrical Engineering and Computer Science, University of Missouri , Columbia, MS 65211, United States"}]},{"given":"Jeffrey","family":"Skolnick","sequence":"additional","affiliation":[{"name":"Center for the Study of Systems Biology, School of Biological Sciences, Georgia Institute of Technology , Atlanta, GA 30332, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3103-9333","authenticated-orcid":false,"given":"Jerry M","family":"Parks","sequence":"additional","affiliation":[{"name":"Biosciences Division, Oak Ridge National Laboratory , Oak Ridge, TN 37830, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8233-3057","authenticated-orcid":false,"given":"Ada","family":"Sedova","sequence":"additional","affiliation":[{"name":"Biosciences Division, Oak Ridge National Laboratory , Oak Ridge, TN 37830, United States"}]}],"member":"286","published-online":{"date-parts":[[2023,8,17]]},"reference":[{"key":"2023082911044516400_btad511-B1","doi-asserted-by":"crossref","first-page":"1056","DOI":"10.1038\/s41594-022-00849-w","article-title":"A structural biology community assessment of AlphaFold2 applications","volume":"29","author":"Akdel","year":"2022","journal-title":"Nat Struct Mol 
Biol"},{"key":"2023082911044516400_btad511-B2","first-page":"S313","volume":"178s1","author":"Alexander","year":"2021","journal-title":"Br J Pharmacol"},{"key":"2023082911044516400_btad511-B3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2164-7-315","article-title":"High precision multi-genome scale reannotation of enzyme function by EFICAz","volume":"7","author":"Arakaki","year":"2006","journal-title":"BMC Genom"},{"key":"2023082911044516400_btad511-B4","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1093\/nar\/28.1.304","article-title":"The ENZYME database in 2000","volume":"28","author":"Bairoch","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023082911044516400_btad511-B5","doi-asserted-by":"crossref","first-page":"425","DOI":"10.1104\/pp.20.00412","article-title":"A new order through disorder: intrinsically disordered proteins reshape the cytoskeleton under drought stress","volume":"183","author":"Balcerowicz","year":"2020","journal-title":"Plant Physiol"},{"key":"2023082911044516400_btad511-B6","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023082911044516400_btad511-B7","doi-asserted-by":"crossref","first-page":"D523","DOI":"10.1093\/nar\/gkac1052","article-title":"UniProt: the universal protein knowledgebase in 2023","volume":"51","author":"Consortium","year":"2023","journal-title":"Nucleic Acids Res"},{"key":"2023082911044516400_btad511-B8","doi-asserted-by":"crossref","first-page":"2098","DOI":"10.1104\/pp.17.00531","article-title":"The pseudoenzyme PDX1.2 sustains vitamin B6 biosynthesis as a function of heat stress","volume":"174","author":"Dell\u2019Aglio","year":"2017","journal-title":"Plant 
Physiol"},{"first-page":"206","year":"2022","author":"Gao","key":"2023082911044516400_btad511-B9"},{"key":"2023082911044516400_btad511-B10","doi-asserted-by":"crossref","first-page":"D1178","DOI":"10.1093\/nar\/gkr944","article-title":"Phytozome: a comparative platform for green plant genomics","volume":"40","author":"Goodstein","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023082911044516400_btad511-B11","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1146\/annurev.food.080708.100754","article-title":"Anthocyanins: natural colorants with health-promoting properties","volume":"1","author":"He","year":"2010","journal-title":"Annu Rev Food Sci Technol"},{"key":"2023082911044516400_btad511-B12","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1038\/s41477-022-01333-5","article-title":"Newly identified sex chromosomes in the Sphagnum (peat moss) genome alter carbon sequestration and ecosystem dynamics","volume":"9","author":"Healey","year":"2023","journal-title":"Nat Plants"},{"key":"2023082911044516400_btad511-B13","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/0263-7855(96)00018-5","article-title":"VMD: visual molecular dynamics","volume":"14","author":"Humphrey","year":"1996","journal-title":"J Mol Graph"},{"key":"2023082911044516400_btad511-B14","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2023082911044516400_btad511-B15","doi-asserted-by":"crossref","first-page":"8203","DOI":"10.1074\/jbc.M113.540526","article-title":"The pseudoenzyme PDX1.2 boosts vitamin B6 biosynthesis under heat and oxidative stress in Arabidopsis","volume":"289","author":"Moccand","year":"2014","journal-title":"J Biol 
Chem"},{"key":"2023082911044516400_btad511-B16","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1111\/j.1574-6941.2007.00323.x","article-title":"The bryophyte genus Sphagnum is a reservoir for powerful and extraordinary antagonists and potentially facultative human pathogens","volume":"61","author":"Opelt","year":"2007","journal-title":"FEMS Microbiol Ecol"},{"key":"2023082911044516400_btad511-B17","doi-asserted-by":"crossref","first-page":"106904","DOI":"10.1016\/j.ympev.2020.106904","article-title":"Phylogenomics reveals convergent evolution of red-violet coloration in land plants and the origins of the anthocyanin biosynthetic pathway","volume":"151","author":"Piatkowski","year":"2020","journal-title":"Mol Phylogenet Evol"},{"key":"2023082911044516400_btad511-B18","doi-asserted-by":"crossref","first-page":"e1009446","DOI":"10.1371\/journal.pcbi.1009446","article-title":"Experimental and computational investigation of enzyme functional annotations uncovers misannotation in the 
EC 1.1.3.15 enzyme class","volume":"17","author":"Rembeza","year":"2021","journal-title":"PLoS Comput Biol"},{"key":"2023082911044516400_btad511-B19","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1107\/S2059798319002912","article-title":"Crystal structure of the pseudoenzyme PDX1.2 in complex with its cognate enzyme PDX1.3: a total eclipse","volume":"75","author":"Robinson","year":"2019","journal-title":"Acta Crystallogr D Struct Biol"},{"key":"2023082911044516400_btad511-B20","doi-asserted-by":"crossref","first-page":"167208","DOI":"10.1016\/j.jmb.2021.167208","article-title":"AlphaFold and implications for intrinsically disordered proteins","volume":"433","author":"Ruff","year":"2021","journal-title":"J Mol Biol"},{"year":"2015","author":"Schr\u00f6dinger","key":"2023082911044516400_btad511-B21"},{"key":"2023082911044516400_btad511-B22","doi-asserted-by":"crossref","first-page":"1497","DOI":"10.1111\/nph.18429","article-title":"Phylogenomic structure and speciation in an emerging model: the Sphagnum magellanicum complex (Bryophyta)","volume":"236","author":"Shaw","year":"2022","journal-title":"New Phytol"},{"key":"2023082911044516400_btad511-B23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-019-3019-7","article-title":"HH-suite3 for fast remote homology detection and deep protein annotation","volume":"20","author":"Steinegger","year":"2019","journal-title":"BMC Bioinform"},{"key":"2023082911044516400_btad511-B24","doi-asserted-by":"crossref","first-page":"13687","DOI":"10.1073\/pnas.0506228102","article-title":"Vitamin B6 biosynthesis in higher plants","volume":"102","author":"Tambasco-Studart","year":"2005","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023082911044516400_btad511-B25","doi-asserted-by":"crossref","first-page":"590","DOI":"10.1038\/s41586-021-03828-1","article-title":"Highly accurate protein structure prediction for the human 
proteome","volume":"596","author":"Tunyasuvunakool","year":"2021","journal-title":"Nature"},{"key":"2023082911044516400_btad511-B26","doi-asserted-by":"crossref","first-page":"D439","DOI":"10.1093\/nar\/gkab1061","article-title":"AlphaFold protein structure database: massively expanding the structural coverage of protein\u2013sequence space with high-accuracy models","volume":"50","author":"Varadi","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2023082911044516400_btad511-B27","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1111\/nph.14860","article-title":"The sphagnome project: enabling ecological and evolutionary insights through a genus-level sequencing project","volume":"217","author":"Weston","year":"2018","journal-title":"New Phytol"},{"key":"2023082911044516400_btad511-B28","doi-asserted-by":"crossref","first-page":"105218","DOI":"10.1016\/j.isci.2022.105218","article-title":"A unified approach to sequential and non-sequential structure alignment of proteins, RNAs and 
DNAs","volume":"25","author":"Zhang","year":"2022","journal-title":"iScience"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad511\/51135287\/btad511.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/8\/btad511\/51278943\/btad511.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/8\/btad511\/51278943\/btad511.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,29]],"date-time":"2023-08-29T11:05:37Z","timestamp":1693307137000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad511\/7243992"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,8,1]]},"references-count":28,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2023,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad511","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2023,8,1]]},"published":{"date-parts":[[2023,8,1]]},"article-number":"btad511"}}