{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,4,23]],"date-time":"2023-04-23T05:54:04Z","timestamp":1682229244243},"reference-count":55,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to <jats:italic>\u03b1<\/jats:italic>-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94%) appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1) a dimer interface motif found in voltage-gated chloride channels, (2) a proton transfer motif found in heme-copper oxidases, and (3) a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome <jats:italic>b<\/jats:italic>.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-11-204","type":"journal-article","created":{"date-parts":[[2010,4,27]],"date-time":"2010-04-27T06:14:42Z","timestamp":1272348882000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["Structural fragment clustering reveals novel structural and functional motifs in \u03b1-helical transmembrane proteins"],"prefix":"10.1186","volume":"11","author":[{"given":"Annalisa","family":"Marsico","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andreas","family":"Henschel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christof","family":"Winter","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anne","family":"Tuukkanen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Boris","family":"Vassilev","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kerstin","family":"Scheubert","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael","family":"Schroeder","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2010,4,26]]},"reference":[{"key":"3661_CR1","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1016\/S0014-5793(98)00095-7","volume":"423","author":"DT Jones","year":"1998","unstructured":"Jones DT: Do transmembrane protein superfolds exist? FEBS Lett 1998, 423: 281\u2013285. 10.1016\/S0014-5793(98)00095-7","journal-title":"FEBS Lett"},{"issue":"7068","key":"3661_CR2","doi-asserted-by":"publisher","first-page":"581","DOI":"10.1038\/nature04395","volume":"438","author":"JU Bowie","year":"2005","unstructured":"Bowie JU: Solving the membrane protein folding problem. Nature 2005, 438(7068):581\u2013589. 10.1038\/nature04395","journal-title":"Nature"},{"key":"3661_CR3","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1146\/annurev.biochem.76.052705.163539","volume":"76","author":"A Elofsson","year":"2007","unstructured":"Elofsson A, vonHeijne G: Membrane Protein Structure: Prediction vs Reality. Annu Rev Biochem 2007, 76: 125\u2013140. 10.1146\/annurev.biochem.76.052705.163539","journal-title":"Annu Rev Biochem"},{"key":"3661_CR4","doi-asserted-by":"publisher","first-page":"375","DOI":"10.1146\/annurev.biophys.32.110601.142520","volume":"32","author":"S Filipek","year":"2003","unstructured":"Filipek S, Teller DC, Palczewski K, Stenkamp R: The crystallographic model of rhodopsin and its use in studies of other G protein-coupled receptors. Annu Rev Biophys Biomol Struct 2003, 32: 375\u2013397. 10.1146\/annurev.biophys.32.110601.142520","journal-title":"Annu Rev Biophys Biomol Struct"},{"issue":"10","key":"3661_CR5","doi-asserted-by":"publisher","first-page":"2759","DOI":"10.1021\/bi027224+","volume":"42","author":"T Mirzadegan","year":"2003","unstructured":"Mirzadegan T, Benko G, Filipek S, Palczewski K: Sequence analyses of G-protein coupled receptors: similarities to rhodopsin. Biochemistry 2003, 42(10):2759\u20132767. 10.1021\/bi027224+","journal-title":"Biochemistry"},{"issue":"19","key":"3661_CR6","doi-asserted-by":"publisher","first-page":"7246","DOI":"10.1073\/pnas.0401429101","volume":"101","author":"AJ Rader","year":"2004","unstructured":"Rader AJ, Anderson G, Isin B, Khorana HG, Bahar I, Klein-Seetharaman J: Identification of core amino acids stabilizing rhodopsin. Proc Natl Acad Sci USA 2004, 101(19):7246\u20137251. 10.1073\/pnas.0401429101","journal-title":"Proc Natl Acad Sci USA"},{"issue":"33","key":"3661_CR7","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1146\/annurev.biophys.33.110502.140348","volume":"8","author":"C Sanders","year":"2004","unstructured":"Sanders C, Myers J: Disease-Related Misassembly of Membrane Proteins. Annu Rev Biophys Biomol Struct 2004, 8(33):25\u201351. 10.1146\/annurev.biophys.33.110502.140348","journal-title":"Annu Rev Biophys Biomol Struct"},{"key":"3661_CR8","doi-asserted-by":"publisher","first-page":"1587","DOI":"10.1002\/pro.5560060723","volume":"6","author":"K Han","year":"1997","unstructured":"Han K, Bystroff C, Baker D: Three-dimensional structures and contexts associated with recurrent amino acid sequence patterns. Protein Sci 1997, 6: 1587\u201390. 10.1002\/pro.5560060723","journal-title":"Protein Sci"},{"key":"3661_CR9","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1006\/jmbi.2001.5227","volume":"315","author":"J Watson","year":"2002","unstructured":"Watson J, Milne-White J: A novel main-chain anion-binding site in proteins: the nest. A particular combination of phi, psi values in successive residues give rise to anion-binding sites that occur commonly and are found often at functionally important regions. J Mol Biol 2002, 315: 171\u2013182. 10.1006\/jmbi.2001.5227","journal-title":"J Mol Biol"},{"key":"3661_CR10","doi-asserted-by":"crossref","unstructured":"Bystroff C, Baker D: Prediction of Local Structure in Proteins Using a Library of Sequence-Structure Motifs. J Mol Biol 1998, (281):565\u2013577. 10.1006\/jmbi.1998.1943","DOI":"10.1006\/jmbi.1998.1943"},{"key":"3661_CR11","doi-asserted-by":"publisher","first-page":"297","DOI":"10.1016\/S0022-2836(02)00942-7","volume":"223","author":"P Kolodny","year":"2002","unstructured":"Kolodny P, Koehl P, Guibas L, Levitt M: Small Libraries of Protein Fragments Model Native Protein Structures Accurately. J Mol Biol 2002, 223: 297\u2013307. 10.1016\/S0022-2836(02)00942-7","journal-title":"J Mol Biol"},{"key":"3661_CR12","doi-asserted-by":"publisher","first-page":"D218","DOI":"10.1093\/nar\/gkm794","volume":"36","author":"G Pugalenthi","year":"2008","unstructured":"Pugalenthi G, Suganthan PN, Sowdhamini R, Chakrabarti S: MegaMotifBase: a database of structural motifs in protein families and superfamilies. Nucleic Acids Res 2008, 36: D218\u201321. 10.1093\/nar\/gkm794","journal-title":"Nucleic Acids Res"},{"key":"3661_CR13","doi-asserted-by":"publisher","first-page":"D211","DOI":"10.1093\/nar\/gkh078","volume":"32","author":"A Golovin","year":"2004","unstructured":"Golovin A, Oldfield TJ, Tate JG, Velankar S, Barton GJ, Boutselakis H, Dimitropoulos D, Fillon J, Hussain A, Ionides JMC, John M, Keller PA, Krissinel E, McNeil P, Naim A, Newman R, Pajon A, Pineda J, Rachedi A, Copeland J, Sitnov A, Sobhany S, Suarez-Uruena A, Swaminathan GJ, Tagari M, Tromm S, Vranken W, Henrick K: E-MSD: an integrated data resource for bioinformatics. Nucleic Acids Res 2004, 32: D211\u20136. 10.1093\/nar\/gkh078","journal-title":"Nucleic Acids Res"},{"issue":"3","key":"3661_CR14","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1093\/bib\/3.3.265","volume":"3","author":"C Sigrist","year":"2002","unstructured":"Sigrist C, Cerutti L, Hulo N, Gattiker A, Falquet L, Pagni M, Bairoch A, Bucher F: PROSITE: A documented database using patterns and profiles as motif descriptors. Brief Bioinform 2002, 3(3):265\u2013274. 10.1093\/bib\/3.3.265","journal-title":"Brief Bioinform"},{"key":"3661_CR15","doi-asserted-by":"crossref","unstructured":"Laskowski R, Watson J, Thornton J: ProFunc: a server for predicting protein function from 3D structure. Nucleic Acids Res 2005, (33 Web Server):W89-W93. 10.1093\/nar\/gki414","DOI":"10.1093\/nar\/gki414"},{"key":"3661_CR16","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1002\/pro.5560070103","volume":"7","author":"R Aurora","year":"1998","unstructured":"Aurora R, Rose G: Helix capping. Protein Sci 1998, 7: 21\u201338.","journal-title":"Protein Sci"},{"key":"3661_CR17","doi-asserted-by":"publisher","first-page":"6260","DOI":"10.1093\/emboj\/18.22.6260","volume":"18","author":"DK Ghosh","year":"1999","unstructured":"Ghosh DK, Crane BR, Ghosh S, Wolan D, Gachhui R, Crooks C, Presta A, Tainer JA, Getzoff ED, Stuehr DJ: Inducible nitric oxide synthase: role of the N-terminal beta-hairpin hook and pterin-binding segment in dimerization and tetrahydrobiopterin interaction. EMBO J 1999, 18: 6260\u20136270. 10.1093\/emboj\/18.22.6260","journal-title":"EMBO J"},{"key":"3661_CR18","doi-asserted-by":"publisher","first-page":"591","DOI":"10.1016\/j.jmb.2006.06.037","volume":"361","author":"H Viklund","year":"2006","unstructured":"Viklund H, Granseth E, Elofsson A: Structural Classification and Prediction of Reentrant Regions in alpha-Helical Transmembrane Proteins: application to Complete Genomes. J Mol Biol 2006, 361: 591\u2013603. 10.1016\/j.jmb.2006.06.037","journal-title":"J Mol Biol"},{"key":"3661_CR19","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1016\/j.jmb.2004.11.036","volume":"346","author":"E Granseth","year":"2005","unstructured":"Granseth E, von Heijne G, Elofsson A: A study of the membrane-water interface region of membrane proteins. J Mol Biol 2005, 346: 377\u2013385. 10.1016\/j.jmb.2004.11.036","journal-title":"J Mol Biol"},{"key":"3661_CR20","doi-asserted-by":"publisher","first-page":"13658","DOI":"10.1073\/pnas.0605878103","volume":"103","author":"RFS Walters","year":"2006","unstructured":"Walters RFS, DeGrado WF: Helix-packing motifs in membrane proteins. Proc Natl Acad Sci USA 2006, 103: 13658\u201313663. 10.1073\/pnas.0605878103","journal-title":"Proc Natl Acad Sci USA"},{"issue":"4","key":"3661_CR21","doi-asserted-by":"publisher","first-page":"959","DOI":"10.1073\/pnas.0306077101","volume":"101","author":"S Yohannan","year":"2003","unstructured":"Yohannan S, Faham S, Yang D, Whitelegge P, Bowie J: The evolution of transmembrane helix kinks and the structural diverstity of G protein-coupled receptors. Proc Natl Acad Sci USA 2003, 101(4):959\u2013963. 10.1073\/pnas.0306077101","journal-title":"Proc Natl Acad Sci USA"},{"key":"3661_CR22","doi-asserted-by":"publisher","first-page":"1469","DOI":"10.1093\/bioinformatics\/btn202","volume":"24","author":"GE Tusn\u00e1dy","year":"2008","unstructured":"Tusn\u00e1dy GE, Kalm\u00e1r L, Hegyi H, Tompa P, Simon I: TOPDOM: database of domains and motifs with conservative location in transmembrane proteins. Bioinformatics 2008, 24: 1469\u20131470. 10.1093\/bioinformatics\/btn202","journal-title":"Bioinformatics"},{"key":"3661_CR23","doi-asserted-by":"publisher","first-page":"611","DOI":"10.1016\/j.jmb.2004.02.047","volume":"338","author":"AV Tendulkar","year":"2004","unstructured":"Tendulkar AV, Joshi AA, Sohoni MA, Wangikar PP: Clustering of protein structural fragments reveals modular building block approach of nature. J Mol Biol 2004, 338: 611\u2013629. 10.1016\/j.jmb.2004.02.047","journal-title":"J Mol Biol"},{"key":"3661_CR24","doi-asserted-by":"publisher","first-page":"719","DOI":"10.1089\/cmb.2006.13.719","volume":"13","author":"S Ferr\u00e9","year":"2006","unstructured":"Ferr\u00e9 S, King RD: Finding motifs in protein secondary structure for use in function prediction. J Comput Biol 2006, 13: 719\u2013731. 10.1089\/cmb.2006.13.719","journal-title":"J Comput Biol"},{"key":"3661_CR25","doi-asserted-by":"publisher","first-page":"2237","DOI":"10.1093\/bioinformatics\/btl382","volume":"22","author":"J Espadaler","year":"2006","unstructured":"Espadaler J, Querol E, Aviles FX, Oliva B: Identification of function-associated loop motifs and application to protein function prediction. Bioinformatics 2006, 22: 2237\u20132243. 10.1093\/bioinformatics\/btl382","journal-title":"Bioinformatics"},{"issue":"9","key":"3661_CR26","first-page":"R52","volume":"1","author":"M Karuppasamy","year":"2008","unstructured":"Karuppasamy M, Pal D, Suryanarayanarao R, Brener N, Iyengar S, Seetharaman G: Functionally important segments in proteins dissected using Gene Ontology and geometric clustering of peptide fragments. Genome Biol 2008, 1(9):R52.","journal-title":"Genome Biol"},{"issue":"6869","key":"3661_CR27","doi-asserted-by":"publisher","first-page":"287","DOI":"10.1038\/415287a","volume":"415","author":"R Dutzler","year":"2002","unstructured":"Dutzler R, Campbell E, Cadene M, Chait B, MacKinnon R: X-ray structure of a ClC chloride channel at 3.0 A reveals the molecular basis of anion selectivity. Nature 2002, 415(6869):287\u201394. 10.1038\/415287a","journal-title":"Nature"},{"issue":"2","key":"3661_CR28","doi-asserted-by":"publisher","first-page":"836","DOI":"10.1016\/S0006-3495(04)74159-4","volume":"86","author":"J Cohen","year":"2004","unstructured":"Cohen J, Schulten K: Mechanism of anionic conduction across ClC. Biophys J 2004, 86(2):836\u201345. 10.1016\/S0006-3495(04)74159-4","journal-title":"Biophys J"},{"key":"3661_CR29","doi-asserted-by":"crossref","unstructured":"Winter C, Henschel A, Kim W, Schroeder M: SCOPPI: a structural classification of protein-rptoein interfaces. Nucleic Acids Res 2006, (34 Database):D310-D314. 10.1093\/nar\/gkj099","DOI":"10.1093\/nar\/gkj099"},{"issue":"2-3","key":"3661_CR30","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1016\/S0005-2728(01)00169-4","volume":"1505","author":"MM Pereira","year":"2001","unstructured":"Pereira MM, Santana M, Teixeira M: A novel scenario for the evolution of haem-copper oxygen reductases. Biochim Biophys Acta 2001, 1505(2\u20133):185\u2013208. 10.1016\/S0005-2728(01)00169-4","journal-title":"Biochim Biophys Acta"},{"issue":"50","key":"3661_CR31","doi-asserted-by":"publisher","first-page":"16502","DOI":"10.1021\/bi0511336","volume":"44","author":"C Ribacka","year":"2005","unstructured":"Ribacka C, Verkhovsky MI, Belevich I, Bloch DA, Puustinen A, Wikstr\u00f6m M: An elementary reaction step of the proton pump is revealed by mutation of tryptophan-164 to phenylalanine in cytochrome c oxidase from Paracoccus denitrificans. Biochemistry 2005, 44(50):16502\u201316512. 10.1021\/bi0511336","journal-title":"Biochemistry"},{"key":"3661_CR32","first-page":"387","volume-title":"Nature","author":"O Boudker","year":"2007","unstructured":"Boudker O, Ryan R, Yernool D, Shimamoto K, Gouaux E: Coupling substrate and ion binding to extracellular gate of a sodium-dependent aspartate transporter. Nature 2007, 387\u2013393. advanced online publication advanced online publication 10.1038\/nature05455"},{"key":"3661_CR33","first-page":"179","volume-title":"Nature","author":"Y Wang","year":"2006","unstructured":"Wang Y, Zhang Y, Ha Y: Crystal structure of a rhomboid family intramembrane protease. Nature 2006, 179\u2013180. advanced online publication advanced online publication 10.1038\/nature05255"},{"issue":"17","key":"3661_CR34","doi-asserted-by":"publisher","first-page":"2964","DOI":"10.1093\/bioinformatics\/bth340","volume":"20","author":"G Tusnady","year":"2004","unstructured":"Tusnady G, Dosztanyi Z, Simon I: Transmembrane proteins in the Protein Data Bank: identification and classification. Bioinformatics 2004, 20(17):2964\u20132972. 10.1093\/bioinformatics\/bth340","journal-title":"Bioinformatics"},{"issue":"13","key":"3661_CR35","doi-asserted-by":"publisher","first-page":"1605","DOI":"10.1002\/jcc.20084","volume":"25","author":"E Pettersen","year":"2004","unstructured":"Pettersen E, Goddard T, Huang C, Couch G, Greenblatt D, Meng E, Ferrin T: UCSF Chimera-a visualization system for exploratory research and analysis. J Comput Chem 2004, 25(13):1605\u201312. 10.1002\/jcc.20084","journal-title":"J Comput Chem"},{"key":"3661_CR36","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1038\/72454","volume":"7","author":"JU Bowie","year":"2000","unstructured":"Bowie JU: Understanding membrane protein structure by design. Nature Structural Biology 2000, 7: 91\u201394. 10.1038\/72454","journal-title":"Nature Structural Biology"},{"key":"3661_CR37","doi-asserted-by":"publisher","first-page":"155","DOI":"10.1002\/prot.340060206","volume":"6","author":"M Karpen","year":"1989","unstructured":"Karpen M, de Haseth P, Neet K: Comparing Short Protein Substructures by a Method Based on Backbone Torsion Angles. Proteins 1989, 6: 155\u2013167. 10.1002\/prot.340060206","journal-title":"Proteins"},{"key":"3661_CR38","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","volume":"25","author":"M Ashburner","year":"2000","unstructured":"Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J, Davis A, Dolinski K, Dwight S, Eppig J, Harris M, Hill D, Issel-Tarver L, Kasarskis A, Lewis S, Matese J, Richardson J, Ringwald M, Rubin G, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25: 25\u20139. 10.1038\/75556","journal-title":"Nat Genet"},{"key":"3661_CR39","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"H Berman","year":"2000","unstructured":"Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H, Shindyalov I, Bourne P: The Protein Data Bank. Nucleic Acids Res 2000, 28: 235\u201342. 10.1093\/nar\/28.1.235","journal-title":"Nucleic Acids Res"},{"key":"3661_CR40","doi-asserted-by":"crossref","unstructured":"Bairoch A, Apweiler R, Wu C, Barker W, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin M, Natale D, O'Donovan C, Redaschi N, Yeh L: The Universal Protein Resource (UniProt). Nucleic Acids Res 2005, (33 Database):D154\u20139.","DOI":"10.1093\/nar\/gki070"},{"key":"3661_CR41","doi-asserted-by":"crossref","unstructured":"Hulo N, Bairoch A, Bulliard V, Cerutti L, De CE, Langendijk-Genevaux P, Pagni M, Sigrist C: The PROSITE database. Nucleic Acids Res 2006, (34 Database):D227\u201330. 10.1093\/nar\/gkj063","DOI":"10.1093\/nar\/gkj063"},{"key":"3661_CR42","first-page":"28","volume-title":"Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology","author":"T Bailey","year":"1994","unstructured":"Bailey T, Elkan C: Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology. AAAI Press; 1994:28\u201336."},{"key":"3661_CR43","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1016\/S0968-0004(03)00026-4","volume":"28","author":"J Torres","year":"2003","unstructured":"Torres J, Stevens TJ, Sams\u00f3 M: Membrane proteins: the 'Wild West' of structural biology. Trends in biochemical sciences 2003, 28: 137\u2013144. 10.1016\/S0968-0004(03)00026-4","journal-title":"Trends in biochemical sciences"},{"key":"3661_CR44","doi-asserted-by":"publisher","first-page":"344","DOI":"10.1038\/nature08142","volume":"459","author":"SH White","year":"2009","unstructured":"White SH: Biophysical dissection of membrane proteins. Nature 2009, 459: 344\u2013346. 10.1038\/nature08142","journal-title":"Nature"},{"key":"3661_CR45","doi-asserted-by":"publisher","first-page":"D281","DOI":"10.1093\/nar\/gkm960","volume":"36","author":"RD Finn","year":"2007","unstructured":"Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz HR, Ceric G, Forslund K, Eddy SR, Sonnhammer ELL, Bateman A: The Pfam protein families database. Nucleic Acids Res 2007, 36: D281\u20138. 10.1093\/nar\/gkm960","journal-title":"Nucleic Acids Res"},{"key":"3661_CR46","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1016\/j.sbi.2009.02.001","volume":"19","author":"D Petrey","year":"2009","unstructured":"Petrey D, Honig B: Is protein classification necessary?: Toward alternative approaches to function annotation. Curr Opin Struct Biol 2009, 19: 363\u2013368. 10.1016\/j.sbi.2009.02.001","journal-title":"Curr Opin Struct Biol"},{"key":"3661_CR47","doi-asserted-by":"publisher","first-page":"546","DOI":"10.1016\/j.neurobiolaging.2005.03.031","volume":"27","author":"H Janovjak","year":"2006","unstructured":"Janovjak H, Kedrov A, Cisneros D, Sapra K, Struckmeier J, Mulle D: Imaging and detecting molecular interactions of single transmembrane proteins. Neurobiol Aging 2006, 27: 546\u2013561. 10.1016\/j.neurobiolaging.2005.03.031","journal-title":"Neurobiol Aging"},{"issue":"13","key":"3661_CR48","doi-asserted-by":"publisher","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","volume":"22","author":"W Li","year":"2006","unstructured":"Li W, Godzik A: Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22(13):1658\u20131659. 10.1093\/bioinformatics\/btl158","journal-title":"Bioinformatics"},{"key":"3661_CR49","doi-asserted-by":"publisher","first-page":"607","DOI":"10.1007\/BF00134183","volume":"22","author":"J Mills","year":"1996","unstructured":"Mills J, Dean P: Three-dimensional hydrogen-bond geometry and probability information from a crystal survey. J Comput-Aided Mol Des 1996, 22: 607. 10.1007\/BF00134183","journal-title":"J Comput-Aided Mol Des"},{"key":"3661_CR50","doi-asserted-by":"crossref","unstructured":"Tusnay G, Dosztanyi Z, Simon I: PDBTM: selection and membrane localization of transmembrane proteins in the protein data bank. Nucleic Acids Res 2005, (33 Database):D275-D278.","DOI":"10.1093\/nar\/gki002"},{"key":"3661_CR51","doi-asserted-by":"publisher","first-page":"D234","DOI":"10.1093\/nar\/gkm751","volume":"36","author":"GE Tusn\u00e1dy","year":"2008","unstructured":"Tusn\u00e1dy GE, Kalm\u00e1r L, Simon I: TOPDB: topology data bank of transmembrane proteins. Nucleic Acids Res 2008, 36: D234\u20139. 10.1093\/nar\/gkm751","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"3661_CR52","doi-asserted-by":"publisher","first-page":"1587","DOI":"10.1002\/pro.5560040817","volume":"4","author":"I Jonassen","year":"1995","unstructured":"Jonassen I, Collins J, Higgins D: Finding flexible patterns in unaligned protein sequences. Protein Sci 1995, 4(8):1587\u20131595. 10.1002\/pro.5560040817","journal-title":"Protein Sci"},{"issue":"23","key":"3661_CR53","doi-asserted-by":"publisher","first-page":"4297","DOI":"10.1093\/bioinformatics\/bti694","volume":"21","author":"A Martin","year":"2005","unstructured":"Martin A: Mapping PDB chains to UniProtKB entries. Bioinformatics 2005, 21(23):4297\u20134301. 10.1093\/bioinformatics\/bti694","journal-title":"Bioinformatics"},{"key":"3661_CR54","doi-asserted-by":"crossref","unstructured":"Camon E, Magrane M, Barrel D, Lee V, Dimmer E, Maslen J, Binns D, Harte N, Lopez R, Apweiler R: The Gene Ontology Annotation (GOA) Database: sharin knowledge in Uniprot with Gene Ontology. Nucleic Acids Res 2004, (32 Database):D262-D266. 10.1093\/nar\/gkh021","DOI":"10.1093\/nar\/gkh021"},{"issue":"3","key":"3661_CR55","doi-asserted-by":"publisher","first-page":"921","DOI":"10.1006\/jmbi.1999.3488","volume":"296","author":"A Senes","year":"2000","unstructured":"Senes A, Gerstein M, Engleman DM: Statistical analysis of Amino Acid Patterns in Transmembrane Helices: The GxxxG Motif Occurs Frequently and in association with beta-branched Residues at Neighboring Positions. J Mol Biol 2000, 296(3):921\u2013936. 10.1006\/jmbi.1999.3488","journal-title":"J Mol Biol"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-204.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T12:13:44Z","timestamp":1630498424000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-11-204"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,4,26]]},"references-count":55,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["3661"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-11-204","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,4,26]]},"assertion":[{"value":"18 August 2009","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 April 2010","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 April 2010","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"204"}}