{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T14:43:39Z","timestamp":1776696219608,"version":"3.51.2"},"reference-count":53,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>The heterokonts are a particularly interesting group of eukaryotic organisms; they include many key species of planktonic and coastal algae and several important pathogens. To understand the biology of these organisms, it is necessary to be able to predict the subcellular localisation of their proteins but this is not straightforward, particularly in photosynthetic heterokonts which possess a complex chloroplast, acquired as the result of a secondary endosymbiosis. This is because the bipartite target peptides that deliver proteins to these chloroplasts can be easily confused with the signal peptides of secreted proteins, causing currently available algorithms to make erroneous predictions. HECTAR, a subcellular targeting prediction method which takes into account the specific properties of heterokont proteins, has been developed to address this problem.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>HECTAR is a statistical prediction method designed to assign proteins to five different categories of subcellular targeting: Signal peptides, type II signal anchors, chloroplast transit peptides, mitochondrion transit peptides and proteins which do not possess any N-terminal target peptide. The recognition rate of HECTAR is 96.3%, with Matthews correlation coefficients ranging from 0.67 to 0.95. The method is based on a hierarchical architecture which implements the divide and conquer approach to identify the different possible target peptides one at a time. At each node of the hierarchy, the most relevant outputs of various existing subcellular prediction methods are combined by a Support Vector Machine.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>The HECTAR method is able to predict the subcellular localisation of heterokont proteins with high accuracy. It also efficiently predicts the subcellular localisation of proteins from cryptophytes, a group that is phylogenetically close to the heterokonts. A variant of HECTAR, called HECTAR<jats:sup>\n                <jats:italic>SEC<\/jats:italic>\n              <\/jats:sup>, can be used to identify signal peptide and type II signal anchor sequences in proteins from any eukaryotic organism. Both HECTAR and HECTAR<jats:sup>\n                <jats:italic>SEC<\/jats:italic>\n              <\/jats:sup>are available as a web application at the following address: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/www.sb-roscoff.fr\/hectar\/\" ext-link-type=\"uri\">http:\/\/www.sb-roscoff.fr\/hectar\/<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-9-393","type":"journal-article","created":{"date-parts":[[2008,9,23]],"date-time":"2008-09-23T18:13:57Z","timestamp":1222193637000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":211,"title":["HECTAR: A method to predict subcellular targeting in heterokonts"],"prefix":"10.1186","volume":"9","author":[{"given":"Bernhard","family":"Gschloessl","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yann","family":"Guermeur","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"J Mark","family":"Cock","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2008,9,23]]},"reference":[{"key":"2378_CR1","doi-asserted-by":"publisher","first-page":"475","DOI":"10.1007\/s10811-006-9054-6","volume":"18","author":"H Yoon","year":"2006","unstructured":"Yoon H, Hackett J, Bhattacharya D: A Genomic and Phylogenetic Perspective on Endosymbiosis and Algal Origin. Journal of Applied Phycology 2006, 18: 475\u2013481. 10.1007\/s10811-006-9054-6","journal-title":"Journal of Applied Phycology"},{"key":"2378_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.thbio.2005.04.001","volume":"124","author":"U Kutschera","year":"2005","unstructured":"Kutschera U, Niklas KJ: Endosymbiosis, cell evolution, and speciation. Theory in Biosciences 2005, 124: 1\u201324. 10.1016\/j.thbio.2005.04.001","journal-title":"Theory in Biosciences"},{"key":"2378_CR3","doi-asserted-by":"publisher","first-page":"1027","DOI":"10.1016\/j.jmb.2004.03.016","volume":"338","author":"L K\u00e4ll","year":"2004","unstructured":"K\u00e4ll L, Krogh A, Sonnhammer E: A combined transmembrane topology and signal peptide prediction method. Journal of Molecular Biology 2004, 338: 1027\u20131036. 10.1016\/j.jmb.2004.03.016","journal-title":"Journal of Molecular Biology"},{"issue":"3","key":"2378_CR4","doi-asserted-by":"publisher","first-page":"567","DOI":"10.1006\/jmbi.2000.4315","volume":"305","author":"A Krogh","year":"2001","unstructured":"Krogh A, Larsson B, von Heijne G, Sonnhammer E: Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. Journal of Molecular Biology 2001, 305(3):567\u2013580. 10.1006\/jmbi.2000.4315","journal-title":"Journal of Molecular Biology"},{"issue":"9","key":"2378_CR5","doi-asserted-by":"publisher","first-page":"849","DOI":"10.1093\/bioinformatics\/17.9.849","volume":"17","author":"G Tusn\u00e1dy","year":"2001","unstructured":"Tusn\u00e1dy G, Simon I: The HMMTOP transmembrane topology prediction server. Bioinformatics 2001, 17(9):849\u2013850. 10.1093\/bioinformatics\/17.9.849","journal-title":"Bioinformatics"},{"issue":"6","key":"2378_CR6","doi-asserted-by":"publisher","first-page":"1581","DOI":"10.1002\/pmic.200300776","volume":"4","author":"I Small","year":"2004","unstructured":"Small I, Peeters N, Legeai F, Lurin C: Predotar: A tool for rapidly screening proteomes for N-terminal targeting sequences. Proteomics 2004, 4(6):1581\u20131590. 10.1002\/pmic.200300776","journal-title":"Proteomics"},{"issue":"4","key":"2378_CR7","doi-asserted-by":"publisher","first-page":"783","DOI":"10.1016\/j.jmb.2004.05.028","volume":"340","author":"J Bendtsen","year":"2004","unstructured":"Bendtsen J, Nielsen H, von Heijne G, Brunak S: Improved prediction of signal peptides: SignalP 3.0. Journal of Molecular Biology 2004, 340(4):783\u2013795. 10.1016\/j.jmb.2004.05.028","journal-title":"Journal of Molecular Biology"},{"issue":"4","key":"2378_CR8","doi-asserted-by":"publisher","first-page":"1005","DOI":"10.1006\/jmbi.2000.3903","volume":"300","author":"O Emanuelsson","year":"2000","unstructured":"Emanuelsson O, Nielsen H, Brunak S, von Heijne G: Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. Journal of Molecular Biology 2000, 300(4):1005\u20131016. 10.1006\/jmbi.2000.3903","journal-title":"Journal of Molecular Biology"},{"key":"2378_CR9","volume-title":"BMC Bioinformatics","author":"T Tamura","year":"2007","unstructured":"Tamura T, Akutsu T: Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition. BMC Bioinformatics 2007., 8(466):"},{"issue":"14","key":"2378_CR10","doi-asserted-by":"publisher","first-page":"e408","DOI":"10.1093\/bioinformatics\/btl222","volume":"22","author":"A Pierleoni","year":"2006","unstructured":"Pierleoni A, Martelli PL, Fariselli P, Casadio R: BaCelLo: a balanced subcellular localization predictor. Bioinformatics 2006, 22(14):e408-e416. 10.1093\/bioinformatics\/btl222","journal-title":"Bioinformatics"},{"issue":"13","key":"2378_CR11","doi-asserted-by":"publisher","first-page":"1656","DOI":"10.1093\/bioinformatics\/btg222","volume":"19","author":"KJ Park","year":"2003","unstructured":"Park KJ, Kanehisa M: Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs. Bioinformatics 2003, 19(13):1656\u201363. 10.1093\/bioinformatics\/btg222","journal-title":"Bioinformatics"},{"issue":"8","key":"2378_CR12","doi-asserted-by":"publisher","first-page":"721","DOI":"10.1093\/bioinformatics\/17.8.721","volume":"17","author":"S Hua","year":"2001","unstructured":"Hua S, Sun Z: Support vector machine approach for protein subcellular localization prediction. Bioinformatics 2001, 17(8):721\u2013728. 10.1093\/bioinformatics\/17.8.721","journal-title":"Bioinformatics"},{"issue":"5","key":"2378_CR13","doi-asserted-by":"publisher","first-page":"R68","DOI":"10.1186\/gb-2007-8-5-r68","volume":"8","author":"BR King","year":"2007","unstructured":"King BR, Guda C: ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes. Genome Biol 2007, 8(5):R68. 10.1186\/gb-2007-8-5-r68","journal-title":"Genome Biol"},{"key":"2378_CR14","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1093\/bioinformatics\/btg366","volume":"20","author":"Y Huang","year":"2004","unstructured":"Huang Y, Li Y: Prediction of protein subcellular locations using fuzzy k-NN method. Bioinformatics 2004, 20: 21\u201328. 10.1093\/bioinformatics\/btg366","journal-title":"Bioinformatics"},{"issue":"3","key":"2378_CR15","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1111\/j.1432-1033.1996.00779.x","volume":"241","author":"M Claros","year":"1996","unstructured":"Claros M, Vincens P: Computational method to predict mitochondrially imported proteins and their targeting sequences. European Journal of Biochemistry 1996, 241(3):779\u2013786. 10.1111\/j.1432-1033.1996.00779.x","journal-title":"European Journal of Biochemistry"},{"key":"2378_CR16","doi-asserted-by":"publisher","first-page":"W375","DOI":"10.1093\/nar\/gkh378","volume":"32","author":"K Hiller","year":"2004","unstructured":"Hiller K, Grote A, Scheer M, M\u00fcnch R, Jahn D: PrediSi: prediction of signal peptides and their cleavage positions. Nucleic Acids Research 2004, 32: W375-W379. 10.1093\/nar\/gkh378","journal-title":"Nucleic Acids Research"},{"issue":"2","key":"2378_CR17","doi-asserted-by":"publisher","first-page":"298","DOI":"10.1093\/bioinformatics\/18.2.298","volume":"18","author":"H Bannai","year":"2002","unstructured":"Bannai H, Tamada Y, Maruyama O, Nakai K, Miyano S: Extensive feature detection of N-terminal protein sorting signals. Bioinformatics 2002, 18(2):298\u2013305. 10.1093\/bioinformatics\/18.2.298","journal-title":"Bioinformatics"},{"issue":"5753","key":"2378_CR18","doi-asserted-by":"publisher","first-page":"1452","DOI":"10.1126\/science.1113752","volume":"310","author":"W Wickner","year":"2005","unstructured":"Wickner W, Schekman R: Protein translocation across biological membranes. Science 2005, 310(5753):1452\u20131456. 10.1126\/science.1113752","journal-title":"Science"},{"issue":"5785","key":"2378_CR19","doi-asserted-by":"publisher","first-page":"314","DOI":"10.1126\/science.1127895","volume":"313","author":"P Dolezal","year":"2006","unstructured":"Dolezal P, Likic V, Tachezy J, Lithgow T: Evolution of the molecular machines for protein import into mitochondria. Science 2006, 313(5785):314\u2013318. 10.1126\/science.1127895","journal-title":"Science"},{"key":"2378_CR20","doi-asserted-by":"publisher","first-page":"663","DOI":"10.1038\/nature06384","volume":"450","author":"TA Rapoport","year":"2007","unstructured":"Rapoport TA: Protein translocation across the eukaryotic endoplasmic reticulum and bacterial plasma membranes. Nature 2007, 450: 663\u2013669. 10.1038\/nature06384","journal-title":"Nature"},{"issue":"12","key":"2378_CR21","doi-asserted-by":"publisher","first-page":"670","DOI":"10.1016\/j.tree.2005.09.005","volume":"20","author":"PJ Keeling","year":"2005","unstructured":"Keeling PJ, Burger G, Durnford DG, Lang BF, Lee RW, Pearlman RE, Roger AJ, Gray MW: The tree of eukaryotes. Trends in ecology & evolution 2005, 20(12):670\u2013676. 10.1016\/j.tree.2005.09.005","journal-title":"Trends in ecology & evolution"},{"issue":"1429","key":"2378_CR22","doi-asserted-by":"publisher","first-page":"109","DOI":"10.1098\/rstb.2002.1194","volume":"358","author":"T Cavalier-Smith","year":"2003","unstructured":"Cavalier-Smith T: Genomic reduction and evolution of novel genetic membranes and protein-targeting machinery in eukaryote-eukaryote chimaeras (meta-algae). Philosophical transactions of the Royal Society of London. Series B, Biological sciences 2003, 358(1429):109\u2013133. 10.1098\/rstb.2002.1194","journal-title":"Philosophical transactions of the Royal Society of London. Series B, Biological sciences"},{"key":"2378_CR23","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1242\/jcs.35.1.253","volume":"35","author":"SP Gibbs","year":"1979","unstructured":"Gibbs SP: The route of entry of cytoplasmically synthesized proteins into chloroplasts of algae possessing chloroplast ER. Journal of cell science 1979, 35: 253\u2013266.","journal-title":"Journal of cell science"},{"issue":"2","key":"2378_CR24","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1111\/j.1365-313X.2004.02294.x","volume":"41","author":"O Kilian","year":"2005","unstructured":"Kilian O, Kroth PG: Identification and characterization of a new conserved motif within the presequence of proteins targeted into complex diatom plastids. The Plant journal: for cell and molecular biology 2005, 41(2):175\u2013183.","journal-title":"The Plant journal: for cell and molecular biology"},{"key":"2378_CR25","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/protein\/10.1.1","volume":"10","author":"H Nielsen","year":"1997","unstructured":"Nielsen H, Engelbrecht J, Brunak S, von Heijne G: Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein engineering 1997, 10: 1\u20136. 10.1093\/protein\/10.1.1","journal-title":"Protein engineering"},{"issue":"5","key":"2378_CR26","doi-asserted-by":"publisher","first-page":"519","DOI":"10.1007\/s11103-007-9171-x","volume":"64","author":"A Gruber","year":"2007","unstructured":"Gruber A, Vugrinec S, Hempel F, Gould SB, Maier UG, Kroth PG: Protein targeting into complex diatom plastids: functional characterisation of a specific targeting motif. Plant Molecular Biology 2007, 64(5):519\u2013530. 10.1007\/s11103-007-9171-x","journal-title":"Plant Molecular Biology"},{"key":"2378_CR27","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1016\/S1672-0229(06)60016-8","volume":"4","author":"E Petsalaki","year":"2006","unstructured":"Petsalaki E, Bagos P, Litou Z, Hamodrakas S: PredSL: a tool for the N-terminal sequence-based prediction of protein subcellular localization. Genomics Proteomics Bioinformatics 2006, 4: 48\u201355. 10.1016\/S1672-0229(06)60016-8","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"2378_CR28","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1083\/jcb.113.1.25","volume":"113","author":"S High","year":"1991","unstructured":"High S, Flint N, Dobberstein B: Requirements for the membrane insertion of signal-anchor type proteins. The Journal of Cell Biology 1991, 113: 25\u201334. 10.1083\/jcb.113.1.25","journal-title":"The Journal of Cell Biology"},{"key":"2378_CR29","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1073\/pnas.89.1.16","volume":"89","author":"M Sakaguchi","year":"1992","unstructured":"Sakaguchi M, Tomiyoshi R, Kuroiwa T, Mihara K, Omura T: Functions of signal and signal-anchor sequences are determined by the balance between the hydrophobic segment and the N-terminal charge. Proceedings of the National Academy of Sciences of the United States of America 1992, 89: 16\u201319. 10.1073\/pnas.89.1.16","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"2","key":"2378_CR30","doi-asserted-by":"publisher","first-page":"489","DOI":"10.1006\/jmbi.1998.2107","volume":"283","author":"G Tusn\u00e1dy","year":"1998","unstructured":"Tusn\u00e1dy G, Simon I: Principles governing amino acid composition of integral membrane proteins: application to topology prediction. Journal of Molecular Biology 1998, 283(2):489\u2013506. 10.1006\/jmbi.1998.2107","journal-title":"Journal of Molecular Biology"},{"key":"2378_CR31","doi-asserted-by":"publisher","first-page":"978","DOI":"10.1110\/ps.8.5.978","volume":"8","author":"O Emanuelsson","year":"1999","unstructured":"Emanuelsson O, Nielsen H, von Heijne G: ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites. Protein Science: A Publication of the Protein Society 1999, 8: 978\u2013984.","journal-title":"Protein Science: A Publication of the Protein Society"},{"key":"2378_CR32","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1057\/jors.1969.103","volume":"20","author":"J Bates","year":"1969","unstructured":"Bates J, Granger C: The combination of forecasts. Operational Research Quaterly 1969, 20: 451\u2013468.","journal-title":"Operational Research Quaterly"},{"key":"2378_CR33","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1002\/for.3980080303","volume":"8","author":"C Granger","year":"1989","unstructured":"Granger C: Combining Forecasts \u2013 Twenty Years Later. Journal of Forecasting 1989, 8: 167\u2013173. 10.1002\/for.3980080303","journal-title":"Journal of Forecasting"},{"issue":"2","key":"2378_CR34","first-page":"123","volume":"24","author":"L Breiman","year":"1996","unstructured":"Breiman L: Bagging Predictors. Machine Learning 1996, 24(2):123\u2013140.","journal-title":"Machine Learning"},{"issue":"5","key":"2378_CR35","doi-asserted-by":"publisher","first-page":"1651","DOI":"10.1214\/aos\/1024691352","volume":"26","author":"R Schapire","year":"1998","unstructured":"Schapire R, Freund Y, Bartlett P, Lee W: Boosting the Margin: A New Explanation for the Effectiveness of Voting Methods. The Annals of Statistics 1998, 26(5):1651\u20131686. 10.1214\/aos\/1024691352","journal-title":"The Annals of Statistics"},{"issue":"2","key":"2378_CR36","doi-asserted-by":"publisher","first-page":"168","DOI":"10.1007\/s100440200015","volume":"5","author":"Y Guermeur","year":"2002","unstructured":"Guermeur Y: Combining discriminant models with new multi-class SVMs. Pattern Analysis and Applications 2002, 5(2):168\u2013179. 10.1007\/s100440200015","journal-title":"Pattern Analysis and Applications"},{"key":"2378_CR37","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1016\/j.neucom.2003.10.004","volume":"56C","author":"Y Guermeur","year":"2004","unstructured":"Guermeur Y, Pollastri G, Elisseeff A, Zelus D, Paugam-Moisy H, Baldi P: Combining Protein Secondary Structure Prediction Models with Ensemble Methods of Optimal Complexity. Neurocomputing 2004, 56C: 305\u2013327. 10.1016\/j.neucom.2003.10.004","journal-title":"Neurocomputing"},{"key":"2378_CR38","volume-title":"Multi-class Support Vector Machines","author":"J Weston","year":"1998","unstructured":"Weston J, Watkins C: Multi-class Support Vector Machines. Tech. Rep. CSD-TR-98\u201304, Royal Holloway, University of London, Department of Computer Science; 1998."},{"issue":"2","key":"2378_CR39","doi-asserted-by":"publisher","first-page":"442","DOI":"10.1016\/0005-2795(75)90109-9","volume":"405","author":"B Matthews","year":"1975","unstructured":"Matthews B: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochimica et Biophysica Acta 1975, 405(2):442\u2013451.","journal-title":"Biochimica et Biophysica Acta"},{"key":"2378_CR40","doi-asserted-by":"publisher","first-page":"241","DOI":"10.1016\/S0893-6080(05)80023-1","volume":"5","author":"D Wolpert","year":"1992","unstructured":"Wolpert D: Stacked Generalization. Neural Networks 1992, 5: 241\u2013259. 10.1016\/S0893-6080(05)80023-1","journal-title":"Neural Networks"},{"issue":"6","key":"2378_CR41","doi-asserted-by":"publisher","first-page":"674","DOI":"10.1007\/s00239-005-0099-y","volume":"62","author":"SB Gould","year":"2006","unstructured":"Gould SB, Sommer MS, Hadfi K, Zauner S, Kroth PG, Maier UG: Protein targeting into the complex plastid of cryptophytes. Journal of molecular evolution 2006, 62(6):674\u2013681. 10.1007\/s00239-005-0099-y","journal-title":"Journal of molecular evolution"},{"key":"2378_CR42","doi-asserted-by":"publisher","first-page":"954","DOI":"10.1128\/EC.00380-05","volume":"5","author":"P Deschamps","year":"2006","unstructured":"Deschamps P, Haferkamp I, Dauvill\u00e9e D, Haebel S, Steup M, Bul\u00e9on A, Putaux JL, Colleoni C, d'Hulst C, Plancke C, Gould S, Maier U, Neuhaus HE, Ball S: Nature of the periplastidial pathway of starch synthesis in the cryptophyte Guillardia theta. Eukaryotic cell 2006, 5: 954\u2013963. 10.1128\/EC.00380-05","journal-title":"Eukaryotic cell"},{"issue":"6","key":"2378_CR43","doi-asserted-by":"publisher","first-page":"931","DOI":"10.1007\/s00425-003-1058-8","volume":"217","author":"KD Belanger","year":"2003","unstructured":"Belanger KD, Wyman AJ, Sudol MN, Singla-Pareek SL, Quatrano RS: A signal peptide secretion screen in Fucus distichus embryos reveals expression of glucanase, EGF domain-containing, and LRR receptor kinase-like polypeptides during asymmetric cell growth. Planta 2003, 217(6):931\u2013950. 10.1007\/s00425-003-1058-8","journal-title":"Planta"},{"issue":"4786","key":"2378_CR44","doi-asserted-by":"publisher","first-page":"312","DOI":"10.1126\/science.3541205","volume":"235","author":"CA Kaiser","year":"1987","unstructured":"Kaiser CA, Preuss D, Grisafi P, Botstein D: Many random sequences functionally replace the secretion signal sequence of yeast invertase. Science 1987, 235(4786):312\u2013317. 10.1126\/science.3541205","journal-title":"Science"},{"issue":"5693","key":"2378_CR45","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1126\/science.1101156","volume":"306","author":"EV Armbrust","year":"2004","unstructured":"Armbrust EV, Berges JA, Bowler C, Green BR, Martinez D, Putnam NH, Zhou S, Allen AE, Apt KE, Bechner M, Brzezinski MA, Chaal BK, Chiovitti A, Davis AK, Demarest MS, Detter JC, Glavina T, Goodstein D, Hadi MZ, Hellsten U, Hildebrand M, Jenkins BD, Jurka J, Kapitonov VV, Kroger N, Lau WWY, Lane TW, Larimer FW, Lippmeier JC, Lucas S, Medina M, Montsant A, Obornik M, Parker MS, Palenik B, Pazour GJ, Richardson PM, Rynearson TA, Saito MA, Schwartz DC, Thamatrakoln K, Valentin K, Vardi A, Wilkerson FP, Rokhsar DS: The Genome of the Diatom Thalassiosira Pseudonana: Ecology, Evolution, and Metabolism. Science 2004, 306(5693):79\u201386. 10.1126\/science.1101156","journal-title":"Science"},{"issue":"5791","key":"2378_CR46","doi-asserted-by":"publisher","first-page":"1261","DOI":"10.1126\/science.1128796","volume":"313","author":"BM Tyler","year":"2006","unstructured":"Tyler BM, Tripathy S, Zhang X, Dehal P, Jiang RHY, Aerts A, Arredondo FD, Baxter L, Bensasson D, Beynon JL, Chapman J, Damasceno CMB, Dorrance AE, long Dou D, Dickerman AW, Dubchak IL, Garbelotto M, Gijzen M, Gordon SG, Govers F, us J Grunwald N, Huang W, Ivors KL, Jones RW, Kamoun S, Krampis K, Lamour KH, Lee MK, McDonald WH, Medina M, Meijer HJG, Nordberg EK, Maclean DJ, do MDOG, Morris PF, Phuntumart V, Putnam NH, Rash S, Rose JKC, Sakihama Y, ov AAS, Savidor A, Scheuring CF, Smith BM, Sobral BWS, Terry A, Torto-Alalibo TA, Win J, Xu Z, Zhang H, Grigoriev IV, Rokhsar DS, Boore JL: Phytophthora genome sequences uncover evolutionary origins and mechanisms of pathogenesis. Science 2006, 313(5791):1261\u20131266. 10.1126\/science.1128796","journal-title":"Science"},{"issue":"3","key":"2378_CR47","doi-asserted-by":"publisher","first-page":"409","DOI":"10.1002\/pro.5560010313","volume":"1","author":"U Hobohm","year":"1992","unstructured":"Hobohm U, Scharf M, Schneider R, Sander C: Selection of representative protein data sets. Protein Science: A Publication of the Protein Society 1992, 1(3):409\u2013417.","journal-title":"Protein Science: A Publication of the Protein Society"},{"key":"2378_CR48","doi-asserted-by":"publisher","first-page":"379","DOI":"10.1002\/j.1538-7305.1948.tb01338.x","volume":"27","author":"C Shannon","year":"1948","unstructured":"Shannon C: A Mathematical Theory of Communication. The Bell System Technical Journal 1948, 27: 379\u2013423.","journal-title":"The Bell System Technical Journal"},{"issue":"20","key":"2378_CR49","doi-asserted-by":"publisher","first-page":"6097","DOI":"10.1093\/nar\/18.20.6097","volume":"18","author":"TD Schneider","year":"1990","unstructured":"Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Research 1990, 18(20):6097\u20136100. 10.1093\/nar\/18.20.6097","journal-title":"Nucleic Acids Research"},{"key":"2378_CR50","first-page":"1391","volume":"5","author":"T Hastie","year":"2004","unstructured":"Hastie T, Rosset S, Tibshirani R, Zhu J: The Entire Regularization Path for the Support Vector Machine. Journal of Machine Learning Research 2004, 5: 1391\u20131415.","journal-title":"Journal of Machine Learning Research"},{"key":"2378_CR51","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1051\/ps:2005018","volume":"9","author":"S Boucheron","year":"2005","unstructured":"Boucheron S, Bousquet O, Lugosi G: Theory of Classification: A Survey of Some Recent Advances. ESAIM: Probability and Statistics 2005, 9: 323\u2013375. 10.1051\/ps:2005018","journal-title":"ESAIM: Probability and Statistics"},{"key":"2378_CR52","volume-title":"Habilitation \u00e1 diriger des recherches, UHP","author":"Y Guermeur","year":"2007","unstructured":"Guermeur Y: SVM multiclasses, th\u00e9orie et applications. Habilitation \u00e1 diriger des recherches, UHP 2007. [In French]"},{"issue":"6","key":"2378_CR53","doi-asserted-by":"publisher","first-page":"1188","DOI":"10.1101\/gr.849004","volume":"14","author":"GE Crooks","year":"2004","unstructured":"Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Research 2004, 14(6):1188\u20131190. 10.1101\/gr.849004","journal-title":"Genome Research"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-393.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T11:00:18Z","timestamp":1630494018000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-393"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,9,23]]},"references-count":53,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2378"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-393","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,9,23]]},"assertion":[{"value":"21 March 2008","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 September 2008","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 September 2008","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"393"}}