{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T07:53:01Z","timestamp":1770969181757,"version":"3.50.1"},"reference-count":50,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2024,1,26]],"date-time":"2024-01-26T00:00:00Z","timestamp":1706227200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001665","name":"Agence Nationale de la Recherche","doi-asserted-by":"publisher","award":["ANR-17-CE11-0039"],"award-info":[{"award-number":["ANR-17-CE11-0039"]}],"id":[{"id":"10.13039\/501100001665","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>The current richness of sequence data needs efficient methodologies to display and analyze the complexity of the information in a compact and readable manner. Traditionally, phylogenetic trees and sequence similarity networks have been used to display and analyze sequences of protein families. These methods aim to shed light on key computational biology problems such as sequence classification and functional inference. Here, we present a new methodology, AlignScape, based on self-organizing maps. AlignScape is applied to three large families of proteins: the kinases and GPCRs from human, and bacterial T6SS proteins. AlignScape provides a map of the similarity landscape and a tree representation of multiple sequence alignments These representations are useful to display, cluster, and classify sequences as well as identify functional trends. The efficient GPU implementation of AlignScape allows the analysis of large MSAs in a few minutes. Furthermore, we show how the AlignScape analysis of proteins belonging to the T6SS complex can be used to predict coevolving partners.<\/jats:p>","DOI":"10.3389\/fbinf.2024.1321508","type":"journal-article","created":{"date-parts":[[2024,1,26]],"date-time":"2024-01-26T04:48:55Z","timestamp":1706244535000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["AlignScape, displaying sequence similarity using self-organizing maps"],"prefix":"10.3389","volume":"4","author":[{"given":"Isaac","family":"Filella-Merce","sequence":"first","affiliation":[]},{"given":"Vincent","family":"Mallet","sequence":"additional","affiliation":[]},{"given":"Eric","family":"Durand","sequence":"additional","affiliation":[]},{"given":"Michael","family":"Nilges","sequence":"additional","affiliation":[]},{"given":"Guillaume","family":"Bouvier","sequence":"additional","affiliation":[]},{"given":"Riccardo","family":"Pellarin","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2024,1,26]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1109\/ICIAFS.2008.4783969","article-title":"Classification of protein sequences using the growing self-organizing map","volume":"2008","author":"Ahmad","year":"2008","journal-title":"Int. Conf. Inf. Automation Sustain."},{"key":"B2","doi-asserted-by":"publisher","first-page":"539","DOI":"10.1016\/j.jsps.2021.04.015","article-title":"GPCRs: the most promiscuous druggable receptor of the mankind","volume":"29","author":"Alhosaini","year":"2021","journal-title":"Saudi Pharm. J. SPJ"},{"key":"B3","doi-asserted-by":"publisher","first-page":"e13153","DOI":"10.1111\/cmi.13153","article-title":"Causalities of war: the connection between type VI secretion system and microbiota","volume":"22","author":"Allsopp","year":"2020","journal-title":"Cell. Microbiol."},{"key":"B4","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol."},{"key":"B5","doi-asserted-by":"publisher","first-page":"e4345","DOI":"10.1371\/journal.pone.0004345","article-title":"Using sequence similarity networks for visualization of relationships across diverse protein superfamilies","volume":"4","author":"Atkinson","year":"2009","journal-title":"PLOS ONE"},{"key":"B6","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1111\/j.1758-2229.2012.00394.x","article-title":"Distribution and diversity of bacterial secretion systems across metagenomic datasets","volume":"5","author":"Barret","year":"","journal-title":"Environ. Microbiol. Rep."},{"key":"B7","doi-asserted-by":"publisher","first-page":"D529","DOI":"10.1093\/nar\/gkaa853","article-title":"The Dark Kinase Knowledgebase: an online compendium of knowledge and experimental results of understudied kinases","volume":"49","author":"Berginski","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"B8","doi-asserted-by":"publisher","first-page":"104","DOI":"10.1186\/1471-2164-10-104","article-title":"Dissecting the bacterial type VI secretion system by a genome wide in silico analysis: what can be learned from available microbial genomic resources?","volume":"10","author":"Boyer","year":"2009","journal-title":"BMC Genomics"},{"key":"B9","doi-asserted-by":"publisher","first-page":"1404","DOI":"10.1038\/s41564-018-0260-1","article-title":"Biogenesis and structure of a type VI secretion baseplate","volume":"3","author":"Cherrak","year":"2018","journal-title":"Nat. Microbiol."},{"key":"B10","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1186\/s12859-022-04643-9","article-title":"Clustering biological sequences with dynamic sequence similarity threshold","volume":"23","author":"Chiu","year":"2022","journal-title":"BMC Bioinforma."},{"key":"B11","doi-asserted-by":"publisher","first-page":"e1005735","DOI":"10.1371\/journal.ppat.1005735","article-title":"VgrG and PAAR proteins define distinct versions of a functional type VI secretion system","volume":"12","author":"Cianfanelli","year":"2016","journal-title":"PLOS Pathog."},{"key":"B12","doi-asserted-by":"publisher","first-page":"4651","DOI":"10.1021\/acs.biochem.8b00473","article-title":"Revealing unexplored sequence-function space using sequence similarity networks","volume":"57","author":"Copp","year":"2018","journal-title":"Biochemistry"},{"key":"B13","doi-asserted-by":"publisher","first-page":"4112","DOI":"10.1111\/1462-2920.14976","article-title":"The <i>Vibrio cholerae<\/i> type VI secretion system: toxins, regulators and consequences","volume":"22","author":"Crisan","year":"2020","journal-title":"Environ. Microbiol."},{"key":"B14","doi-asserted-by":"publisher","first-page":"e1004805","DOI":"10.1371\/journal.pcbi.1004805","article-title":"Structure-based sequence alignment of the transmembrane domains of all human GPCRs: phylogenetic, structural and functional implications","volume":"12","author":"Cvicek","year":"2016","journal-title":"PLoS Comput. Biol."},{"key":"B15","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1038\/nrg3414","article-title":"Emerging methods in protein co-evolution","volume":"14","author":"de Juan","year":"2013","journal-title":"Nat. Rev. Genet."},{"key":"B16","doi-asserted-by":"publisher","first-page":"372","DOI":"10.1016\/j.tim.2020.01.005","article-title":"The evolution of protein secretion systems by Co-option and tinkering of cellular machineries","volume":"28","author":"Denise","year":"2020","journal-title":"Trends Microbiol."},{"key":"B17","doi-asserted-by":"publisher","first-page":"14157","DOI":"10.1074\/jbc.M111.338731","article-title":"Structural characterization and oligomerization of the TssL protein, a component shared by bacterial type VI and type IVb secretion systems","volume":"287","author":"Durand","year":"2012","journal-title":"J. Biol. Chem."},{"key":"B18","doi-asserted-by":"publisher","first-page":"2460","DOI":"10.1093\/bioinformatics\/btq461","article-title":"Search and clustering orders of magnitude faster than BLAST","volume":"26","author":"Edgar","year":"2010","journal-title":"Bioinforma. Oxf. Engl."},{"key":"B19","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1007\/978-1-60327-429-6_6","article-title":"Inferring function from homology","volume":"453","author":"Emes","year":"2008","journal-title":"Methods Mol. Biol. Clifton N. J."},{"key":"B20","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1007\/BF00204658","article-title":"Topological maps of protein sequences","volume":"65","author":"Ferr\u00e1n","year":"1991","journal-title":"Biol. Cybern."},{"key":"B21","doi-asserted-by":"publisher","first-page":"507","DOI":"10.1002\/pro.5560030316","article-title":"Self-organized neural maps of human protein sequences","volume":"3","author":"Ferr\u00e1n","year":"1994","journal-title":"Protein Sci. Publ. Protein Soc."},{"key":"B22","doi-asserted-by":"publisher","first-page":"1256","DOI":"10.1124\/mol.63.6.1256","article-title":"The G-protein-coupled receptors in the human genome form five main families. Phylogenetic analysis, paralogon groups, and fingerprints","volume":"63","author":"Fredriksson","year":"2003","journal-title":"Mol. Pharmacol."},{"key":"B23","doi-asserted-by":"publisher","first-page":"3150","DOI":"10.1093\/bioinformatics\/bts565","article-title":"CD-HIT: accelerated for clustering the next-generation sequencing data","volume":"28","author":"Fu","year":"2012","journal-title":"Bioinformatics"},{"key":"B24","doi-asserted-by":"publisher","first-page":"12317","DOI":"10.1074\/jbc.M110.193045","article-title":"Type VI secretion system in Pseudomonas aeruginosa: secretion and multimerization of VgrG proteins","volume":"286","author":"Hachani","year":"2011","journal-title":"J. Biol. Chem."},{"key":"B25","doi-asserted-by":"publisher","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc. Natl. Acad. Sci."},{"key":"B26","doi-asserted-by":"publisher","first-page":"1451","DOI":"10.2217\/fmb-2019-0194","article-title":"Type VI secretion system: a modular toolkit for bacterial dominance","volume":"14","author":"Jana","year":"2019","journal-title":"Future Microbiol."},{"key":"B27","doi-asserted-by":"publisher","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"B28","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1007\/BF00337288","article-title":"Self-organized formation of topologically correct feature maps","volume":"43","author":"Kohonen","year":"1982","journal-title":"Biol. Cybern."},{"key":"B29","doi-asserted-by":"publisher","first-page":"e29817","DOI":"10.1371\/journal.pone.0029817","article-title":"The origin of GPCRs: identification of mammalian like rhodopsin, adhesion, glutamate and frizzled GPCRs in fungi","volume":"7","author":"Krishnan","year":"2012","journal-title":"PLOS ONE"},{"key":"B30","doi-asserted-by":"publisher","first-page":"952","DOI":"10.1016\/j.cell.2015.01.037","article-title":"Structure of the type VI secretion system contractile sheath","volume":"160","author":"Kudryashev","year":"2015","journal-title":"Cell."},{"key":"B31","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1038\/nrd2518","article-title":"Structural diversity of G protein-coupled receptors and significance for drug discovery","volume":"7","author":"Lagerstr\u00f6m","year":"2008","journal-title":"Nat. Rev. Drug Discov."},{"key":"B32","doi-asserted-by":"publisher","first-page":"282","DOI":"10.1093\/bioinformatics\/17.3.282","article-title":"Clustering of highly homologous sequences to reduce the size of large protein databases","volume":"17","author":"Li","year":"2001","journal-title":"Bioinformatics"},{"key":"B33","doi-asserted-by":"publisher","first-page":"e1010116","DOI":"10.1371\/journal.ppat.1010116","article-title":"VgrG-dependent effectors and chaperones modulate the assembly of the type VI secretion system","volume":"17","author":"Liang","year":"2021","journal-title":"PLOS Pathog."},{"key":"B34","doi-asserted-by":"publisher","first-page":"1912","DOI":"10.1126\/science.1075762","article-title":"The protein kinase complement of the human genome","volume":"298","author":"Manning","year":"2002","journal-title":"Science"},{"key":"B35","doi-asserted-by":"publisher","first-page":"19790","DOI":"10.1038\/s41598-019-56499-4","article-title":"A structurally-validated multiple sequence alignment of 497 human protein kinase domains","volume":"9","author":"Modi","year":"2019","journal-title":"Sci. Rep."},{"key":"B36","unstructured":"Spring 2004|BLASTLab2023"},{"key":"B37","doi-asserted-by":"publisher","first-page":"523","DOI":"10.1007\/978-1-59745-398-1_31","article-title":"Prediction of protein interaction based on similarity of phylogenetic trees","volume":"484","author":"Pazos","year":"2008","journal-title":"Methods Mol. Biol. Clifton N. J."},{"key":"B38","doi-asserted-by":"publisher","first-page":"609","DOI":"10.1093\/protein\/14.9.609","article-title":"Similarity of phylogenetic trees as indicator of protein-protein interaction","volume":"14","author":"Pazos","year":"2001","journal-title":"Protein Eng."},{"key":"B39","doi-asserted-by":"publisher","first-page":"e00712","DOI":"10.1128\/mBio.00712-15","article-title":"Internalization of Pseudomonas aeruginosa strain PAO1 into epithelial cells is promoted by interaction of a T6SS effector with the microtubule network","volume":"6","author":"Sana","year":"2015","journal-title":"mBio"},{"key":"B40","doi-asserted-by":"publisher","first-page":"e80942","DOI":"10.7554\/eLife.80942","article-title":"ProteInfer, deep neural networks for protein functional inference","volume":"12","author":"Sanderson","year":"2023","journal-title":"eLife"},{"key":"B41","doi-asserted-by":"publisher","first-page":"2498","DOI":"10.1101\/gr.1239303","article-title":"Cytoscape: a software environment for integrated models of biomolecular interaction networks","volume":"13","author":"Shannon","year":"2003","journal-title":"Genome Res."},{"key":"B42","doi-asserted-by":"publisher","first-page":"D1083","DOI":"10.1093\/nar\/gks960","article-title":"IUPHAR-DB: updated database content and new features","volume":"41","author":"Sharman","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"B43","doi-asserted-by":"publisher","first-page":"473","DOI":"10.1186\/s12859-019-3019-7","article-title":"HH-suite3 for fast remote homology detection and deep protein annotation","volume":"20","author":"Steinegger","year":"2019","journal-title":"BMC Bioinforma."},{"key":"B44","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1111\/mmi.13921","article-title":"Contractile injection systems of bacteriophages and related systems","volume":"108","author":"Taylor","year":"2018","journal-title":"Mol. Microbiol."},{"key":"B45","doi-asserted-by":"publisher","first-page":"167918","DOI":"10.1016\/j.jmb.2022.167918","article-title":"Coevolution-guided mapping of the type VI secretion membrane complex-baseplate interface","volume":"435","author":"Vanlio\u011flu","year":"2023","journal-title":"J. Mol. Biol."},{"key":"B46","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1186\/1471-2105-13-174","article-title":"A novel hierarchical clustering algorithm for gene sequences","volume":"13","author":"Wei","year":"2012","journal-title":"BMC Bioinforma."},{"key":"B47","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1038\/s41392-020-00435-w","article-title":"G protein-coupled receptors: structure- and function-based drug discovery","volume":"6","author":"Yang","year":"2021","journal-title":"Signal Transduct. Target. Ther."},{"key":"B48","doi-asserted-by":"publisher","first-page":"402","DOI":"10.2174\/138920209789177575","article-title":"Hidden Markov models and their applications in biological sequence analysis","volume":"10","author":"Yoon","year":"2009","journal-title":"Curr. Genomics"},{"key":"B49","first-page":"229","article-title":"G protein-coupled receptors: abnormalities in signal transmission, disease states and pharmacotherapy","volume":"71","author":"Zalewska","year":"2014","journal-title":"Acta Pol. Pharm."},{"key":"B50","doi-asserted-by":"publisher","first-page":"e23876","DOI":"10.1371\/journal.pone.0023876","article-title":"Genetic analysis of anti-amoebae and anti-bacterial activities of the type VI secretion system in Vibrio cholerae","volume":"6","author":"Zheng","year":"2011","journal-title":"PLOS ONE"}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2024.1321508\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,1,26]],"date-time":"2024-01-26T04:49:01Z","timestamp":1706244541000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2024.1321508\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,26]]},"references-count":50,"alternative-id":["10.3389\/fbinf.2024.1321508"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2024.1321508","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,26]]},"article-number":"1321508"}}