{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,7]],"date-time":"2026-02-07T15:49:21Z","timestamp":1770479361905,"version":"3.49.0"},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Quantitative descriptions of amino acid similarity, expressed as probabilistic models of evolutionary interchangeability, are central to many mainstream bioinformatic procedures such as sequence alignment, homology searching, and protein structural prediction. Here we present a web-based, user-friendly analysis tool that allows any researcher to quickly and easily visualize relationships between these bioinformatic metrics and to explore their relationships to underlying indices of amino acid molecular descriptors.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We demonstrate the three fundamental types of question that our software can address by taking as a specific example the connections between 49 measures of amino acid biophysical properties (e.g., size, charge and hydrophobicity), a generalized model of amino acid substitution (as represented by the PAM74-100 matrix), and the mutational distance that separates amino acids within the standard genetic code (i.e., the number of point mutations required for interconversion during protein evolution). We show that our software allows a user to recapture the insights from several key publications on these topics in just a few minutes.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>Our software facilitates rapid, interactive exploration of three interconnected topics: (i) the multidimensional molecular descriptors of the twenty proteinaceous amino acids, (ii) the correlation of these biophysical measurements with observed patterns of amino acid substitution, and (iii) the causal basis for differences between any two observed patterns of amino acid substitution. This software acts as an intuitive bioinformatic exploration tool that can guide more comprehensive statistical analyses relating to a diverse array of specific research questions.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-7-329","type":"journal-article","created":{"date-parts":[[2006,7,28]],"date-time":"2006-07-28T19:28:27Z","timestamp":1154114907000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["An interactive visualization tool to explore the biophysical properties of amino acids and their contribution to substitution matrices"],"prefix":"10.1186","volume":"7","author":[{"given":"Blazej","family":"Bulka","sequence":"first","affiliation":[]},{"given":"Marie","family":"desJardins","sequence":"additional","affiliation":[]},{"given":"Stephen J","family":"Freeland","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2006,7,3]]},"reference":[{"key":"1068_CR1","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1002\/prot.340170108","volume":"17","author":"S Henikoff","year":"1993","unstructured":"Henikoff S, Henikoff JG: Performance evaluation of amino acid substitution matrices. Proteins 1993, 17: 49\u201361.","journal-title":"Proteins"},{"key":"1068_CR2","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0968-0004(98)01285-7","volume":"23","author":"F Jeanmougin","year":"1998","unstructured":"Jeanmougin F, Thompson JD, Gouy M, Higgins DG, Gibson TJ: Multiple sequence alignment with Clustal X. Trends Biochem Sci 1998, 23: 403\u2013405.","journal-title":"Trends Biochem Sci"},{"key":"1068_CR3","volume-title":"Proteins","author":"M Tress","year":"2005","unstructured":"Tress M, Ezkurdia I, Grana O, Lopez G, Valencia A: Assessment of predictions submitted for the CASP6 comparative modelling category. Proteins 2005, in press."},{"key":"1068_CR4","doi-asserted-by":"publisher","first-page":"847","DOI":"10.1093\/bioinformatics\/btg492","volume":"20","author":"RB Vilim","year":"2004","unstructured":"Vilim RB, Cunningham RM, Lu B, Kheradpour P, Stevens FJ: Fold-specific substitution matrices for protein classification. Bioinformatics 2004, 20: 847\u2013853.","journal-title":"Bioinformatics"},{"key":"1068_CR5","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1002\/prot.10474","volume":"54","author":"O Teodorescu","year":"2004","unstructured":"Teodorescu O, Galor T, Pillardy J, Elber R: Enriching the sequence substitution matrix by structural information. Proteins 2004, 54: 41\u201348.","journal-title":"Proteins"},{"key":"1068_CR6","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1016\/j.crvi.2005.02.002","volume":"328","author":"O Bastien","year":"2005","unstructured":"Bastien O, Roy S, Marechal E: Construction of non-symmetric substitution matrices derived from proteomes with biased amino acid distributions. C R Biol 2005, 328: 445\u2013453.","journal-title":"C R Biol"},{"key":"1068_CR7","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1016\/0014-5793(94)80429-X","volume":"339","author":"DT Jones","year":"1994","unstructured":"Jones DT, Taylor WR, Thornton JM: A mutation data matrix for transmembrane proteins. FEBS Letters 1994, 339: 269\u2013275.","journal-title":"FEBS Letters"},{"key":"1068_CR8","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1002\/prot.10308","volume":"51","author":"RA Sutormin","year":"2003","unstructured":"Sutormin RA, Rakhmaninova AB, Gelfand MS: BATMAS30: amino acid substitution matrix for alignment of bacterial transporters. Proteins 2003, 51: 85\u201395.","journal-title":"Proteins"},{"key":"1068_CR9","doi-asserted-by":"publisher","first-page":"632","DOI":"10.1016\/j.crvi.2005.03.003","volume":"328","author":"M Pacholczyk","year":"2005","unstructured":"Pacholczyk M, Kimmel M: Analysis of differences in amino acid substitution patterns, using multilevel G-tests. C R Biol 2005, 328: 632\u2013641.","journal-title":"C R Biol"},{"key":"1068_CR10","doi-asserted-by":"publisher","first-page":"902","DOI":"10.1093\/bioinformatics\/bti070","volume":"21","author":"YK Yu","year":"2005","unstructured":"Yu YK, Altschul SF: The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions. Bioinformatics 2005, 21: 902\u2013911.","journal-title":"Bioinformatics"},{"key":"1068_CR11","doi-asserted-by":"publisher","first-page":"459","DOI":"10.1007\/BF02498640","volume":"42","author":"J Adachi","year":"1996","unstructured":"Adachi J, Hasegawa M: Model of amino acid substitution in proteins encoded by mitochondrial DNA. J Mol Evol 1996, 42: 459\u2013468.","journal-title":"J Mol Evol"},{"key":"1068_CR12","doi-asserted-by":"publisher","first-page":"4685","DOI":"10.1016\/j.febslet.2005.07.039","volume":"579","author":"HJ Feldman","year":"2005","unstructured":"Feldman HJ, Dumontier M, Ling S, Haider N, Hogue CW: CO: A chemical ontology for identification of functional groups and semantic comparison of small molecules. FEBS Letters 2005, 579: 4685\u20134691.","journal-title":"FEBS Letters"},{"key":"1068_CR13","doi-asserted-by":"publisher","first-page":"793","DOI":"10.1073\/pnas.0307490100","volume":"101","author":"G Giaever","year":"2004","unstructured":"Giaever G, Flaherty P, Kumm J, Proctor M, Nislow C, Jaramillo DF, Chu AM, Jordan MI, Arkin AP, Davis RW: Chemogenomic profiling: Identifying the functional interactions of small molecules in yeast. PNAS 2004, 101: 793\u2013798.","journal-title":"PNAS"},{"key":"1068_CR14","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1038\/nbt1075","volume":"23","author":"D di Bernardo","year":"2005","unstructured":"di Bernardo D, Thompson MJ, Gardner TS, Chobot SE, Eastwood EL, Wojtovich AP, Elliott SJ, Schaus SE, Collins JJ: Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks. Nature Biotechnology 2005, 23: 377\u2013383.","journal-title":"Nature Biotechnology"},{"key":"1068_CR15","doi-asserted-by":"publisher","first-page":"862","DOI":"10.1126\/science.185.4154.862","volume":"185","author":"R Grantham","year":"1974","unstructured":"Grantham R: Amino acid difference formula to help explain protein evolution. Science 1974, 185: 862\u2013864.","journal-title":"Science"},{"key":"1068_CR16","doi-asserted-by":"publisher","first-page":"1323","DOI":"10.1093\/protein\/7.11.1323","volume":"11","author":"SA Benner","year":"1994","unstructured":"Benner SA, Cohen MA, Gonnet GH: Amino acid substitution during functionally constrained divergent evolution of protein sequences. Protein Eng 1994, 11: 1323\u20131332.","journal-title":"Protein Eng"},{"key":"1068_CR17","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1016\/S0022-2836(66)80258-9","volume":"16","author":"WM Fitch","year":"1966","unstructured":"Fitch WM: An improved method of testing for evolutionary homology. J Mol Biol 1966, 16: 9\u201316.","journal-title":"J Mol Biol"},{"key":"1068_CR18","doi-asserted-by":"publisher","first-page":"134","DOI":"10.1186\/1471-2105-6-134","volume":"6","author":"A Schneider","year":"2005","unstructured":"Schneider A, Cannarozzi GM, Gonnet GH: Empirical codon substitution matrix. BMC Bioinformatics 2005, 6: 134.","journal-title":"BMC Bioinformatics"},{"key":"1068_CR19","volume-title":"Proteins","author":"Y Fujitsuka","year":"2005","unstructured":"Fujitsuka Y, Chikenji G, Takada S: SimFold energy function for de novo protein structure prediction: Consensus with Rosetta. Proteins 2005, in press."},{"key":"1068_CR20","doi-asserted-by":"publisher","first-page":"1459","DOI":"10.1534\/genetics.104.039107","volume":"170","author":"LY Yampolsky","year":"2005","unstructured":"Yampolsky LY, Stoltzfus A: The exchangeability of amino acids in proteins. Genetics 2005, 170: 1459\u20131472.","journal-title":"Genetics"},{"key":"1068_CR21","doi-asserted-by":"publisher","first-page":"686","DOI":"10.1093\/bioinformatics\/17.8.686","volume":"17","author":"Z Dosztanyi","year":"2001","unstructured":"Dosztanyi Z, Torda AE: Amino acid similarity matrices based on force fields. Bioinformatics 2001, 17: 686\u2013699.","journal-title":"Bioinformatics"},{"key":"1068_CR22","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1093\/protein\/2.2.93","volume":"2","author":"K Nakai","year":"1988","unstructured":"Nakai K, Kidera A, Kanehisa M: Cluster analysis of amino acid indices for prediction of protein structure and function. Protein Eng 1988, 2: 93\u2013100.","journal-title":"Protein Eng"},{"key":"1068_CR23","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1093\/protein\/9.1.27","volume":"9","author":"K Tomii","year":"1996","unstructured":"Tomii K, Kanehisa M: Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. Protein Eng 1996, 9: 27\u201336.","journal-title":"Protein Eng"},{"key":"1068_CR24","doi-asserted-by":"publisher","first-page":"374","DOI":"10.1093\/nar\/28.1.374","volume":"28","author":"S Kawashima","year":"2000","unstructured":"Kawashima S, Kanehisa M: AAindex: amino acid index database. Nucleic Acids Res 2000, 28: 374.","journal-title":"Nucleic Acids Res"},{"key":"1068_CR25","doi-asserted-by":"publisher","first-page":"RESEARCH0049","DOI":"10.1186\/gb-2001-2-11-research0049","volume":"2","author":"D Gilis","year":"2001","unstructured":"Gilis D, Massar S, Cerf NJ, Rooman M: Optimality of the genetic code with respect to protein stability and amino-acid frequencies. Genome Biol 2001, 2: RESEARCH0049.","journal-title":"Genome Biol"},{"key":"1068_CR26","volume-title":"Graph Drawing: Algorithms for the Visualization of Graphs","author":"IG Tollis","year":"1998","unstructured":"Tollis IG, Tamassia R, Eades P, Di Battista G: Graph Drawing: Algorithms for the Visualization of Graphs. Pearson Education; 1998."},{"key":"1068_CR27","unstructured":"TouchGraph Website[http:\/\/www.touchgraph.com]"},{"key":"1068_CR28","volume-title":"Introduction to Algorithms","author":"TH Cormen","year":"2001","unstructured":"Cormen TH, Leiserson CE, Rivest RL, Stein C: Introduction to Algorithms. Second edition. Cambridge, MA, London: The MIT Press; Boston, MA, Burr Ridge, IL, Dubuque, IA, Madison, WI, New York, NY, San Francisco, CA, St. Louis, MO, Montreal, Toronto: McGraw-Hill Book Company; 2001.","edition":"Second"},{"key":"1068_CR29","volume-title":"Machine Learning","author":"TM Mitchell","year":"1997","unstructured":"Mitchell TM: Machine Learning. McGraw-Hill Companies; 1997."},{"key":"1068_CR30","unstructured":"AAindex Website[http:\/\/www.genome.ad.jp\/dbget\/aaindex.html]"},{"key":"1068_CR31","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1007\/BF00592854","volume":"60","author":"CR Woese","year":"1973","unstructured":"Woese CR: Evolution of the genetic code. Naturwissenschaften 1973, 60: 447\u2013459.","journal-title":"Naturwissenschaften"},{"key":"1068_CR32","doi-asserted-by":"publisher","first-page":"412","DOI":"10.1007\/BF02103132","volume":"33","author":"D Haig","year":"1991","unstructured":"Haig D, Hurst LD: A quantitative measure of error minimisation within the genetic code. J Mol Evol 1991, 33: 412\u2013417.","journal-title":"J Mol Evol"},{"key":"1068_CR33","doi-asserted-by":"publisher","first-page":"238","DOI":"10.1007\/PL00006381","volume":"47","author":"SJ Freeland","year":"1998","unstructured":"Freeland SJ, Hurst LD: The genetic code is one in a million. J Mol Evol 1998, 47: 238\u2013248.","journal-title":"J Mol Evol"},{"key":"1068_CR34","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1016\/j.gene.2005.08.005","volume":"362","author":"H Goodarzi","year":"2005","unstructured":"Goodarzi H, Shateri Najafabadi H, Torabi N: On the coevolution of genes and genetic code. Gene 2005, 362: 133\u2013140.","journal-title":"Gene"},{"key":"1068_CR35","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1023\/A:1025771327614","volume":"33","author":"SJ Freeland","year":"2003","unstructured":"Freeland SJ, Wu T, Keulmann N: The case for an Error Minimizing Standard Genetic Code. Orig Life Evol Biosph 2003, 33: 457\u2013477.","journal-title":"Orig Life Evol Biosph"},{"key":"1068_CR36","doi-asserted-by":"publisher","first-page":"723","DOI":"10.1101\/SQB.1966.031.01.093","volume":"31","author":"CR Woese","year":"1966","unstructured":"Woese CR, Dugre DH, Saxinger WC, Dugre SA: On the fundamental nature and evolution of the genetic code. Cold Spring Harb Symp Quant Biol 1966, 31: 723\u2013736.","journal-title":"Cold Spring Harb Symp Quant Biol"},{"key":"1068_CR37","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1016\/0022-2836(82)90515-0","volume":"157","author":"J Kyte","year":"1982","unstructured":"Kyte J, Doolittle RF: A simple measure for displaying the hydropathic character of a protein. J Mol Biol 1982, 157: 105\u2013132.","journal-title":"J Mol Biol"},{"key":"1068_CR38","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1006\/jtbi.2000.2206","volume":"208","author":"M Di Giulio","year":"2001","unstructured":"Di Giulio M: The origin of the genetic code cannot be studied using measurements based on the PAM matrix because this matrix reflects the code itself, making any such analyses tautologous. J Theor Biol 2001, 208: 141\u2013144.","journal-title":"J Theor Biol"},{"key":"1068_CR39","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1007\/BF00178593","volume":"35","author":"E Szathmary","year":"1992","unstructured":"Szathmary E, Zintzaras E: A statistical test of hypotheses on the organization and origin of the genetic code. J Mol Evol 1992, 35: 185\u2013189.","journal-title":"J Mol Evol"},{"key":"1068_CR40","doi-asserted-by":"publisher","first-page":"708","DOI":"10.1007\/PL00006591","volume":"49","author":"D Haig","year":"1999","unstructured":"Haig D, Hurst LD: A quantitative measure of error minimization in the genetic code. J Mol Evol 1999, 49: 708.","journal-title":"J Mol Evol"},{"key":"1068_CR41","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/PL00006356","volume":"47","author":"DH Ardell","year":"1998","unstructured":"Ardell DH: On error minimization in a sequential origin of the standard genetic code. J Mol Evol 1998, 47: 1\u201313.","journal-title":"J Mol Evol"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-329.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T11:00:54Z","timestamp":1630494054000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-329"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,7,3]]},"references-count":41,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["1068"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-329","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,7,3]]},"assertion":[{"value":"12 December 2005","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 July 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 July 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"329"}}