{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T03:28:17Z","timestamp":1769743697973,"version":"3.49.0"},"reference-count":63,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2018,8,9]],"date-time":"2018-08-09T00:00:00Z","timestamp":1533772800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","award":["DP150104386"],"award-info":[{"award-number":["DP150104386"]}],"id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The cis-defensins are a superfamily of small, cationic, cysteine-rich proteins, sharing a common scaffold, but highly divergent sequences and varied functions from host-defence to signalling. Superfamily members are most abundant in plants (with some genomes containing hundreds of members), but are also found across fungi and invertebrates. However, of the thousands of cis-defensin sequences in databases, only have a handful have solved structures or assigned activities. Non-phylogenetic sequence-analysis methods are therefore necessary to use the relationships within the superfamily to classify members, and to predict and engineer functions.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We show that the generation of a quantitative map of sequence space allows these highly divergent sequences to be usefully analyzed. This information-rich technique can identify natural groupings of sequences with similar biophysical properties, detect interpretable covarying properties, and provide information on typical or intermediate sequences for each cluster. The cis-defensin superfamily contains clearly-defined groups, identifiable based on their biophysical properties and motifs. The organization of sequences within this space also provides a foundation of understanding the ancient evolution of the superfamily.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>A webtool for exploring and querying the space is hosted at TS404.shinyapps.io\/DefSpace.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty697","type":"journal-article","created":{"date-parts":[[2018,8,8]],"date-time":"2018-08-08T11:09:52Z","timestamp":1533726592000},"page":"743-752","source":"Crossref","is-referenced-by-count":27,"title":["A quantitative map of protein sequence space for the cis-defensin superfamily"],"prefix":"10.1093","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2298-7593","authenticated-orcid":false,"given":"Thomas","family":"Shafee","sequence":"first","affiliation":[{"name":"Department of biochemistry and genetics, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, Australia"}]},{"given":"Marilyn A","family":"Anderson","sequence":"additional","affiliation":[{"name":"Department of biochemistry and genetics, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, Australia"}]}],"member":"286","published-online":{"date-parts":[[2018,8,9]]},"reference":[{"key":"2023013107251178800_bty697-B1","author":"Adler","year":"2003"},{"key":"2023013107251178800_bty697-B2","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1093\/bioinformatics\/bti770","article-title":"The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling","volume":"22","author":"Arnold","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013107251178800_bty697-B3","doi-asserted-by":"crossref","first-page":"6395","DOI":"10.1073\/pnas.0408677102","article-title":"Solving the protein sequence metric problem","volume":"102","author":"Atchley","year":"2005","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013107251178800_bty697-B4","doi-asserted-by":"crossref","first-page":"e4345.","DOI":"10.1371\/journal.pone.0004345","article-title":"Using sequence similarity networks for visualization of relationships across diverse protein superfamilies","volume":"4","author":"Atkinson","year":"2009","journal-title":"PLoS One"},{"key":"2023013107251178800_bty697-B5","doi-asserted-by":"crossref","first-page":"6302","DOI":"10.1128\/AAC.01479-16","article-title":"Nicotiana alata Defensin Chimeras Reveal Differences in the Mechanism of Fungal and Tumor Cell Killing and an Enhanced Antifungal Variant","volume":"60","author":"Bleackley","year":"2016","journal-title":"Antimicrob. Agents Chemother"},{"key":"2023013107251178800_bty697-B6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3389\/fmicb.2016.01682","article-title":"Antiplasmodial activity is an ancient and conserved feature of tick defensins","volume":"7","author":"Cabezas-Cruz","year":"2016","journal-title":"Front. Microbiol"},{"key":"2023013107251178800_bty697-B7","doi-asserted-by":"crossref","first-page":"956","DOI":"10.2174\/092986608785849164","article-title":"TOP-IDP-scale: a new amino acid scale measuring propensity for intrinsic disorder","volume":"15","author":"Campen","year":"2008","journal-title":"Protein Pept. Lett"},{"key":"2023013107251178800_bty697-B8","doi-asserted-by":"crossref","first-page":"1972","DOI":"10.1093\/bioinformatics\/btp348","article-title":"trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses","volume":"25","author":"Capella-Guti\u00e9rrez","year":"2009","journal-title":"Bioinformatics"},{"key":"2023013107251178800_bty697-B9","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1038\/nsb0295-171","article-title":"A method to predict functional residues in proteins","volume":"2","author":"Casari","year":"1995","journal-title":"Nat. Struct. Biol"},{"key":"2023013107251178800_bty697-B10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3389\/fevo.2014.00072","article-title":"Sequence similarity network reveals the imprints of major diversification events in the evolution of microbial life","volume":"2","author":"Cheng","year":"2014","journal-title":"Front. Ecol. Evol"},{"key":"2023013107251178800_bty697-B11","first-page":"1695","article-title":"The igraph software package for complex network research","volume":"1695","author":"Cs\u00e1rdi","year":"2006","journal-title":"Int. J. Complex Syst"},{"key":"2023013107251178800_bty697-B12","first-page":"1164","volume-title":"Bioinformatics","author":"Darriba","year":"2011"},{"key":"2023013107251178800_bty697-B14","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1080\/07391102.2006.10507088","article-title":"Amino Acid Principal Component Analysis (AAPCA) and its applications in protein structural class prediction","volume":"23","author":"Du","year":"2006","journal-title":"J. Biomol. Struct. Dyn"},{"key":"2023013107251178800_bty697-B16","doi-asserted-by":"crossref","first-page":"630","DOI":"10.1016\/j.bbrc.2012.08.143","article-title":"Alteration of the mode of antibacterial action of a defensin by the amino-terminal loop substitution","volume":"426","author":"Gao","year":"2012","journal-title":"Biochem. Biophys. Res. Commun"},{"key":"2023013107251178800_bty697-B17","first-page":"575","article-title":"Molecular description of scorpion toxin interaction with voltage-gated sodium channels","volume":"1","author":"Gopalakrishnakone","year":"2015","journal-title":"Scorpion Venoms"},{"key":"2023013107251178800_bty697-B18","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1038\/nrg3540","article-title":"Evolutionary biochemistry: revealing the historical and physical causes of protein properties","volume":"14","author":"Harms","year":"2013","journal-title":"Nat. Rev. Genet"},{"key":"2023013107251178800_bty697-B19","first-page":"15","article-title":"Sequence ordinations: a multivariate analysis approach to analysing large sequence data sets","volume":"8","author":"Higgins","year":"1992","journal-title":"Comput. Appl. Biosci"},{"key":"2023013107251178800_bty697-B20","doi-asserted-by":"crossref","first-page":"W545","DOI":"10.1093\/nar\/gkq366","article-title":"Dali server: conservation mapping in 3D","volume":"38","author":"Holm","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023013107251178800_bty697-B21","first-page":"9","article-title":"Molecular phylogenetics and the perennial problem of homology","volume":"1","author":"Inkpen","year":"2016","journal-title":"J. Mol. Evol"},{"key":"2023013107251178800_bty697-B22","doi-asserted-by":"crossref","first-page":"2411","DOI":"10.1038\/s41467-018-04669-9","article-title":"Molecular basis for the production of cyclic peptides by the plant asparaginyl endopeptidases","volume":"9","author":"Jackson","year":"2018","journal-title":"Nat. Commun"},{"key":"2023013107251178800_bty697-B23","doi-asserted-by":"crossref","first-page":"715","DOI":"10.1038\/35070613","article-title":"Functional proteins from a random-sequence library","volume":"410","author":"Keefe","year":"2001","journal-title":"Nature"},{"key":"2023013107251178800_bty697-B24","doi-asserted-by":"crossref","first-page":"45","DOI":"10.3389\/fchem.2017.00045","article-title":"Structure-activity relationships of insect defensins","volume":"5","author":"Koehbach","year":"2017","journal-title":"Front. Chem"},{"key":"2023013107251178800_bty697-B25","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/0022-2836(82)90515-0","article-title":"A simple method for displaying the hydropathic character of a protein","volume":"157","author":"Kyte","year":"1982","journal-title":"J. Mol. Biol"},{"key":"2023013107251178800_bty697-B26","doi-asserted-by":"crossref","first-page":"W242","DOI":"10.1093\/nar\/gkw290","article-title":"Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees","volume":"44","author":"Letunic","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023013107251178800_bty697-B27","doi-asserted-by":"crossref","first-page":"1563","DOI":"10.1016\/j.jprot.2011.11.029","article-title":"Extreme diversity of scorpion venom peptides and proteins revealed by transcriptomic analysis: implication for proteome evolution of scorpion venom arsenal","volume":"75","author":"Ma","year":"2012","journal-title":"J. Proteomics"},{"key":"2023013107251178800_bty697-B28","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1111\/j.1365-313X.2006.02788.x","article-title":"A putative novel role for plant defensins: a defensin from the zinc hyper-accumulating plant, Arabidopsis halleri, confers zinc tolerance","volume":"47","author":"Mirouze","year":"2006","journal-title":"Plant J"},{"key":"2023013107251178800_bty697-B29","doi-asserted-by":"crossref","first-page":"1310","DOI":"10.1104\/pp.105.060707","article-title":"The SOL Genomics Network: a comparative resource for Solanaceae biology and beyond","volume":"138","author":"Mueller","year":"2005","journal-title":"Plant Physiol"},{"key":"2023013107251178800_bty697-B30","doi-asserted-by":"crossref","first-page":"1095","DOI":"10.1016\/S0969-2126(98)00111-7","article-title":"An excitatory scorpion toxin with a distinctive feature: an additional alpha helix at the C terminus and its implications for interaction with insect sodium channels","volume":"6","author":"Oren","year":"1998","journal-title":"Struct. Fold. Des"},{"key":"2023013107251178800_bty697-B31","doi-asserted-by":"crossref","first-page":"867","DOI":"10.1146\/annurev.biochem.74.082803.133029","article-title":"Protein families and their evolution \u2013 a structural perspective","volume":"74","author":"Orengo","year":"2005","journal-title":"Annu. Rev. Biochem"},{"key":"2023013107251178800_bty697-B32","article-title":"The evolution, function and mechanisms of action for plant defensins","author":"Parisi","year":"2018","journal-title":"Semin. cell Dev. Biol"},{"key":"2023013107251178800_bty697-B33","doi-asserted-by":"crossref","first-page":"1099","DOI":"10.1016\/j.bbamem.2016.02.016","article-title":"The plant defensin NaD1 introduces membrane disorder through a specific interaction with the lipid, phosphatidylinositol 4, 5 bisphosphate","volume":"1858","author":"Payne","year":"2016","journal-title":"Biochim. Biophys. Acta Biomembr"},{"key":"2023013107251178800_bty697-B34","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1016\/j.sbi.2005.05.005","article-title":"The limits of protein sequence comparison?","volume":"15","author":"Pearson","year":"2005","journal-title":"Curr. Opin. Struct. Biol"},{"key":"2023013107251178800_bty697-B35","doi-asserted-by":"crossref","first-page":"e01808","DOI":"10.7554\/eLife.01808","article-title":"Phosphoinositide-mediated oligomerization of a defensin induces cell lysis","volume":"3","author":"Poon","year":"2014","journal-title":"eLife"},{"key":"2023013107251178800_bty697-B13","volume-title":"R: A language and environment for statistical computing. R Foundation for Statistical Computing","year":"2013"},{"key":"2023013107251178800_bty697-B36","doi-asserted-by":"crossref","first-page":"14345","DOI":"10.1073\/pnas.0903433106","article-title":"Sequence physical properties encode the global organization of protein structure space","volume":"106","author":"Rackovsky","year":"2009","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013107251178800_bty697-B37","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1111\/j.2041-210X.2011.00169.x","article-title":"phytools: an R package for phylogenetic comparative biology (and other things)","volume":"3","author":"Revell","year":"2012","journal-title":"Methods Ecol. Evol"},{"key":"2023013107251178800_bty697-B38","doi-asserted-by":"crossref","first-page":"866","DOI":"10.1038\/nrm2805","article-title":"Exploring protein fitness landscapes by directed evolution","volume":"10","author":"Romero","year":"2009","journal-title":"Nat. Rev. Mol. Cell Biol"},{"key":"2023013107251178800_bty697-B39","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1093\/protein\/12.2.85","article-title":"Twilight zone of protein sequence alignments","volume":"12","author":"Rost","year":"1999","journal-title":"Protein Eng"},{"key":"2023013107251178800_bty697-B15","first-page":"289","article-title":"mclust 5: Clustering, classification and density estimation using gaussian finite mixture models","volume-title":"The R journal","author":"Scrucca","year":"2016"},{"key":"2023013107251178800_bty697-B40","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkv318","article-title":"GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters","volume":"43","author":"Sela","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023013107251178800_bty697-B41","doi-asserted-by":"crossref","first-page":"663","DOI":"10.1007\/s00018-016-2344-5","article-title":"Convergent evolution of defensin sequence, structure and function","volume":"74","author":"Shafee","year":"2017","journal-title":"Cell. Mol. Life Sci"},{"key":"2023013107251178800_bty697-B42","doi-asserted-by":"crossref","first-page":"27.","DOI":"10.1186\/s40064-015-1609-z","article-title":"Structural homology guided alignment of cysteine rich proteins","volume":"5","author":"Shafee","year":"2016","journal-title":"Springerplus"},{"key":"2023013107251178800_bty697-B43","doi-asserted-by":"crossref","DOI":"10.1093\/molbev\/msw106","article-title":"The defensins consist of two independent, convergent protein superfamilies","volume":"33","author":"Shafee","year":"2016","journal-title":"Mol. Biol. Evol"},{"key":"2023013107251178800_bty697-B44","author":"Shafee","year":"2018"},{"key":"2023013107251178800_bty697-B45","doi-asserted-by":"crossref","first-page":"434.","DOI":"10.1186\/s12859-016-1300-6","article-title":"AlignStat: a web-tool and R package for statistical comparison of alternative multiple sequence alignments","volume":"17","author":"Shafee","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2023013107251178800_bty697-B46","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1038\/msb.2011.75","article-title":"Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega","volume":"7","author":"Sievers","year":"2014","journal-title":"Mol. Syst. Biol"},{"key":"2023013107251178800_bty697-B47","doi-asserted-by":"crossref","first-page":"600","DOI":"10.1104\/pp.105.060079","article-title":"Genome organization of more than 300 defensin-like genes in Arabidopsis","volume":"138","author":"Silverstein","year":"2005","journal-title":"Plant Physiol"},{"key":"2023013107251178800_bty697-B48","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1111\/j.1365-313X.2007.03136.x","article-title":"Small cysteine-rich peptides resembling antimicrobial peptides have been under-predicted in plants","volume":"51","author":"Silverstein","year":"2007","journal-title":"Plant J"},{"key":"2023013107251178800_bty697-B49","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1038\/225563a0","article-title":"Natural selection and the concept of a protein space","volume":"225","author":"Smith","year":"1970","journal-title":"Nature"},{"key":"2023013107251178800_bty697-B50","doi-asserted-by":"crossref","first-page":"1312","DOI":"10.1093\/bioinformatics\/btu033","article-title":"RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies","volume":"30","author":"Stamatakis","year":"2014","journal-title":"Bioinformatics"},{"key":"2023013107251178800_bty697-B51","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1038\/nbt0695-549","article-title":"Searching sequence space","volume":"13","author":"Stemmer","year":"1995","journal-title":"Nat. Biotechnol"},{"key":"2023013107251178800_bty697-B52","doi-asserted-by":"crossref","first-page":"e1001449.","DOI":"10.1371\/journal.pbio.1001449","article-title":"A species-specific cluster of defensin-like genes encodes diffusible pollen tube attractants in Arabidopsis","volume":"10","author":"Takeuchi","year":"2012","journal-title":"PLoS Biol"},{"key":"2023013107251178800_bty697-B53","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1002\/bies.201500165","article-title":"Toxin structures as evolutionary tools: using conserved 3D folds to study the evolution of rapidly evolving peptides","volume":"38","author":"Undheim","year":"2016","journal-title":"BioEssays"},{"key":"2023013107251178800_bty697-B58","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1016\/j.fbr.2012.08.004","article-title":"Plant defensins: common fold, multiple functions","volume":"26","author":"Van der Weerden","year":"2013","journal-title":"Fungal Biol. Rev"},{"key":"2023013107251178800_bty697-B54","doi-asserted-by":"crossref","first-page":"12280","DOI":"10.3390\/molecules190812280","article-title":"Antifungal plant defensins: mechanisms of action and production","volume":"19","author":"Vriens","year":"2014","journal-title":"Molecules"},{"key":"2023013107251178800_bty697-B55","doi-asserted-by":"crossref","first-page":"543","DOI":"10.1086\/285234","article-title":"Homoplasy: the result of natural selection, or evidence of design limitations","volume":"138","author":"Wake","year":"1991","journal-title":"Am. Nat"},{"key":"2023013107251178800_bty697-B56","doi-asserted-by":"crossref","first-page":"135.","DOI":"10.1186\/1471-2105-8-135","article-title":"Supervised multivariate analysis of sequence groups to identify specificity determining residues","volume":"8","author":"Wallace","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023013107251178800_bty697-B57","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10969-014-9173-2","article-title":"Principal components analysis of protein sequence clusters","volume":"15","author":"Wang","year":"2014","journal-title":"J. Struct. Funct. Genomics"},{"key":"2023013107251178800_bty697-B59","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1093\/oxfordjournals.molbev.a003851","article-title":"A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach","volume":"18","author":"Whelan","year":"2001","journal-title":"Mol. Biol. Evol"},{"key":"2023013107251178800_bty697-B60","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-98141-3","volume-title":"Ggplot2: Elegant Graphics for Data Analysis","author":"Wickham","year":"2009"},{"key":"2023013107251178800_bty697-B61","doi-asserted-by":"crossref","first-page":"828","DOI":"10.1016\/j.molimm.2007.06.354","article-title":"Discovery of six families of fungal defensin-like peptides provides insights into origin and evolution of the CSalphabeta defensins","volume":"45","author":"Zhu","year":"2008","journal-title":"Mol. Immunol"},{"key":"2023013107251178800_bty697-B62","doi-asserted-by":"crossref","first-page":"546","DOI":"10.1093\/molbev\/msu038","article-title":"Experimental conversion of a defensin into a neurotoxin: implications for origin of toxic function","volume":"31","author":"Zhu","year":"2014","journal-title":"Mol. Biol. Evol"},{"key":"2023013107251178800_bty697-B63","doi-asserted-by":"crossref","first-page":"2257","DOI":"10.1007\/s00018-005-5200-6","article-title":"Phylogenetic distribution, functional epitopes and evolution of the CSalphabeta superfamily","volume":"62","author":"Zhu","year":"2005","journal-title":"Cell. Mol. Life Sci"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/5\/743\/48965937\/bioinformatics_35_5_743.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/5\/743\/48965937\/bioinformatics_35_5_743.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T10:20:27Z","timestamp":1675160427000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/5\/743\/5068591"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,8,9]]},"references-count":63,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2019,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty697","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,3,1]]},"published":{"date-parts":[[2018,8,9]]}}}