{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T02:59:34Z","timestamp":1772679574735,"version":"3.50.1"},"reference-count":54,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2013,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Small peptides encoded as one- or two-exon genes in plants have recently been shown to affect multiple aspects of plant development, reproduction and defense responses. However, popular similarity search tools and gene prediction techniques generally fail to identify most members belonging to this class of genes. This is largely due to the high sequence divergence among family members and the limited availability of experimentally verified small peptides to use as training sets for homology search and <jats:italic>ab initio<\/jats:italic> prediction. Consequently, there is an urgent need for both experimental and computational studies in order to further advance the accurate prediction of small peptides.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We present here a homology-based gene prediction program to accurately predict small peptides at the genome level. Given a high-quality profile alignment, SPADA identifies and annotates nearly all family members in tested genomes with better performance than all general-purpose gene prediction programs surveyed. We find numerous mis-annotations in the current <jats:italic>Arabidopsis thaliana<\/jats:italic> and <jats:italic>Medicago truncatula<\/jats:italic> genome databases using SPADA, most of which have RNA-Seq expression support. We also show that SPADA works well on other classes of small secreted peptides in plants (e.g., self-incompatibility protein homologues) as well as non-secreted peptides outside the plant kingdom (e.g., the alpha-amanitin toxin gene family in the mushroom, <jats:italic>Amanita bisporigera<\/jats:italic>).<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>SPADA is a free software tool that accurately identifies and predicts the gene structure for short peptides with one or two exons. SPADA is able to incorporate information from profile alignments into the model prediction process and makes use of it to score different candidate models. SPADA achieves high sensitivity and specificity in predicting small plant peptides such as the cysteine-rich peptide families. A systematic application of SPADA to other classes of small peptides by research communities will greatly improve the genome annotation of different protein families in public genome databases.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-14-335","type":"journal-article","created":{"date-parts":[[2013,11,20]],"date-time":"2013-11-20T17:43:38Z","timestamp":1384969418000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":93,"title":["Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)"],"prefix":"10.1186","volume":"14","author":[{"given":"Peng","family":"Zhou","sequence":"first","affiliation":[]},{"given":"Kevin AT","family":"Silverstein","sequence":"additional","affiliation":[]},{"given":"Liangliang","family":"Gao","sequence":"additional","affiliation":[]},{"given":"Jonathan D","family":"Walton","sequence":"additional","affiliation":[]},{"given":"Sumitha","family":"Nallu","sequence":"additional","affiliation":[]},{"given":"Joseph","family":"Guhlin","sequence":"additional","affiliation":[]},{"given":"Nevin D","family":"Young","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2013,11,20]]},"reference":[{"issue":"7","key":"6202_CR1","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1038\/35080529","volume":"2","author":"L Stein","year":"2001","unstructured":"Stein L: Genome annotation: from sequence to biology. Nat Rev Genet. 2001, 2 (7): 493-503. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/11433356,","journal-title":"Nat Rev Genet"},{"issue":"3","key":"6202_CR2","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1007\/s11103-005-0271-1","volume":"57","author":"H Yao","year":"2005","unstructured":"Yao H, Guo L, Fu Y, Borsuk La, Wen TJ, Skibbe DS, Cui X, Scheffler BE, Cao J, Emrich SJ, Ashlock Da, Schnable PS: Evaluation of five ab initio gene prediction programs for the discovery of maize genes. Plant Mol Biol. 2005, 57 (3): 445-460. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/15830133,","journal-title":"Plant Mol Biol"},{"key":"6202_CR3","first-page":"768","volume-title":"Genome Res","author":"M Basrai","year":"1997","unstructured":"Basrai M, Hieter P, Boeke J: Small open reading frames : beautiful needles in the haystack. Genome Res. 1997, 768-771. http:\/\/genome.cshlp.org\/content\/7\/8\/768.short,"},{"issue":"3","key":"6202_CR4","doi-asserted-by":"publisher","first-page":"831","DOI":"10.1104\/pp.106.086041","volume":"142","author":"Ka Lease","year":"2006","unstructured":"Lease Ka, Walker JC: The Arabidopsis unannotated secreted peptide database, a resource for plant peptidomics. Plant Physiol. 2006, 142 (3): 831-838. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=1630735&tool=pmcentrez&rendertype=abstract,","journal-title":"Plant Physiol"},{"key":"6202_CR5","doi-asserted-by":"publisher","first-page":"632","DOI":"10.1101\/gr.5836207","volume":"517","author":"K Hanada","year":"2007","unstructured":"Hanada K, Zhang X, Borevitz J: A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and\/or under purifying selection. Genome Res. 2007, 517: 632-640. http:\/\/genome.cshlp.org\/content\/17\/5\/632.short,","journal-title":"Genome Res"},{"key":"6202_CR6","first-page":"634","volume-title":"Genome Res","author":"X Yang","year":"2011","unstructured":"Yang X, Tschaplinski T, Hurst G: Discovery and annotation of small proteins using genomics, proteomics, and computational approaches. Genome Res. 2011, 634-641. http:\/\/genome.cshlp.org\/content\/21\/4\/634.short,"},{"key":"6202_CR7","first-page":"1","volume":"15","author":"B Pan","year":"2012","unstructured":"Pan B, Sheng J, Sun W, Zhao Y, Hao P, Li X: OrysPSSP: a comparative platform for small secreted proteins from rice and other plants. Nucleic Acids Res. 2012, 15: 1-7. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/23203890,","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"6202_CR8","first-page":"D290\u2014D301","volume":"40","author":"M Punta","year":"2012","unstructured":"Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer ELL, Eddy SR, Bateman A, Finn RD: The Pfam protein families database. Nucleic Acids Res. 2012, 40 (Database issue): D290\u2014D301-http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3245129&tool=pmcentrez&rendertype=abstract,","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"6202_CR9","first-page":"D306\u2014D312","volume":"40","author":"S Hunter","year":"2012","unstructured":"Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, Bernard T, Binns D, Bork P, Burge S, de Castro E, Coggill P, Corbett M, Das U, Daugherty L, Duquenne L, Finn RD, Fraser M, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, et al: InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res. 2012, 40 (Database issue): D306\u2014D312-http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3245097&tool=pmcentrez&rendertype=abstract,","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"6202_CR10","doi-asserted-by":"publisher","first-page":"D344","DOI":"10.1093\/nar\/gks1067","volume":"41","author":"E deCastro Sigrist CJa","year":"2013","unstructured":"Sigrist CJa deCastro, Cerutti L, Cuche Ba Hulo, Bridge A, Bougueleret L, Xenarios I: New and continuing developments at PROSITE. Nucleic Acids Res. 2013, 41 (Database issue): D344-D347. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3531220&tool=pmcentrez&rendertype=abstract,","journal-title":"Nucleic Acids Res"},{"issue":"2","key":"6202_CR11","doi-asserted-by":"publisher","first-page":"262","DOI":"10.1111\/j.1365-313X.2007.03136.x","volume":"51","author":"K Silverstein","year":"2007","unstructured":"Silverstein K, Moskal W, Wu H, Underwood B, Graham M, Town CD, VandenBosch KA: Small cysteine-rich peptides resembling antimicrobial peptides have been under-predicted in plants. Plant J. 2007, 51 (2): 262-280. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/17565583,","journal-title":"Plant J"},{"issue":"4","key":"6202_CR12","doi-asserted-by":"publisher","first-page":"e60355","DOI":"10.1371\/journal.pone.0060355","volume":"8","author":"S Nallu","year":"2013","unstructured":"Nallu S, Silverstein KaT, Samac Da, Bucciarelli B, Vance CP, VandenBosch Ka: Regulatory patterns of a large family of defensin-like genes expressed in nodules of Medicago Truncatula. PloS ONE. 2013, 8 (4): e60355-http:\/\/dx.plos.org\/10.1371\/journal.pone.0060355,","journal-title":"PloS ONE"},{"issue":"3","key":"6202_CR13","doi-asserted-by":"publisher","first-page":"e58992","DOI":"10.1371\/journal.pone.0058992","volume":"8","author":"M Tesfaye","year":"2013","unstructured":"Tesfaye M, Silverstein KA, Nallu S, Wang L, Botanga CJ, Gomez SK, Costa LM, Harrison MJ, Samac DA, Glazebrook J, Katagiri F, Gutierrez-Marcos JF, Vandenbosch KA: Spatio-temporal expression patterns of Arabidopsis thaliana and Medicago truncatula defensin-like genes. PloS ONE. 2013, 8 (3): e58992-http:\/\/scholar.google.com\/scholar?hl=en&btnG=Search&q=intitle: Spatio-Temporal+Expression+Patterns+of+Arabidopsis+thaliana+and+Medicago+truncatula+Defensin-Like+Genes#0, http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3601123&tool=pmcentrez&rendertype=abstract,","journal-title":"PloS ONE"},{"issue":"16","key":"6202_CR14","doi-asserted-by":"publisher","first-page":"2878","DOI":"10.1093\/bioinformatics\/bth315","volume":"20","author":"WH Majoros","year":"2004","unstructured":"Majoros WH, Pertea M, Salzberg SL: TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics. 2004, 20 (16): 2878-2879. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/15145805,","journal-title":"Bioinformatics"},{"issue":"20","key":"6202_CR15","doi-asserted-by":"publisher","first-page":"6494","DOI":"10.1093\/nar\/gki937","volume":"33","author":"A Lomsadze","year":"2005","unstructured":"Lomsadze A, Ter-Hovhannisyan V, Chernoff YO, Borodovsky M: Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 2005, 33 (20): 6494-6506. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=1298918&tool=pmcentrez&rendertype=abstract,","journal-title":"Nucleic Acids Res"},{"key":"6202_CR16","first-page":"1","volume-title":"Curr Protoc Bioinformatics","author":"E Blanco","year":"2007","unstructured":"Blanco E, Parra G, Guig\u00f3 R: Using geneid to identify genes. Curr Protoc Bioinformatics. 2007, 1-28. http:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/0471250953.bi0403s18\/full,"},{"issue":"6","key":"6202_CR17","doi-asserted-by":"publisher","first-page":"757","DOI":"10.1093\/bioinformatics\/btr010","volume":"27","author":"O Keller","year":"2011","unstructured":"Keller O, Kollmar M, Stanke M, Waack S: A novel hybrid gene prediction method employing protein multiple sequence alignments. Bioinformatics. 2011, 27 (6): 757-763. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/21216780,","journal-title":"Bioinformatics"},{"key":"6202_CR18","volume-title":"Ab initio gene finding in drosophila genomic DNA","author":"AA Salamov","year":"2000","unstructured":"Salamov AA: Ab initio gene finding in drosophila genomic DNA. Genome Res. 2000, 10 (516-522): http:\/\/www.genome.org\/cgi\/doi\/10.1101\/gr.10.4.516,"},{"key":"6202_CR19","doi-asserted-by":"publisher","first-page":"988","DOI":"10.1101\/gr.1865504","volume":"4","author":"E Birney","year":"2004","unstructured":"Birney E, Clamp M, Durbin R: GeneWise and genomewise. Genome Res. 2004, 4: 988-995. http:\/\/genome.cshlp.org\/content\/14\/5\/988.short,","journal-title":"Genome Res"},{"key":"6202_CR20","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1186\/1471-2105-6-31","volume":"6","author":"GSC Slater","year":"2005","unstructured":"Slater GSC, Birney E: Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics. 2005, 6: 31-http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=553969&tool=pmcentrez&rendertype=abstract,","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"6202_CR21","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"SF Altschul","year":"1990","unstructured":"Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/2231712,","journal-title":"J Mol Biol"},{"issue":"9","key":"6202_CR22","doi-asserted-by":"publisher","first-page":"755","DOI":"10.1093\/bioinformatics\/14.9.755","volume":"14","author":"S Eddy","year":"1998","unstructured":"Eddy S: Profile hidden Markov models. Bioinformatics Rev. 1998, 14 (9): 755-http:\/\/bioinformatics.oxfordjournals.org\/content\/14\/9\/755.short,","journal-title":"Bioinformatics Rev"},{"issue":"7","key":"6202_CR23","doi-asserted-by":"publisher","first-page":"1157","DOI":"10.1093\/bioinformatics\/bth058","volume":"20","author":"V Brendel","year":"2004","unstructured":"Brendel V, Xing L, Zhu W: Gene structure prediction from consensus spliced alignment of multiple ESTs matching the same genomic locus. Bioinformatics. 2004, 20 (7): 1157-1169. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/14764557,","journal-title":"Bioinformatics"},{"issue":"10","key":"6202_CR24","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1038\/nmeth.1701","volume":"8","author":"TN Petersen","year":"2011","unstructured":"Petersen TN, Brunak Sr, von Heijne G, Nielsen H: SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011, 8 (10): 785-786. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/21959131,","journal-title":"Nat Methods"},{"issue":"539","key":"6202_CR25","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1038\/msb.2011.75","volume":"7","author":"F Sievers","year":"2011","unstructured":"Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, S\u00f6ding J, Thompson JD, Higgins DG: Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011, 7 (539): 539-http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/21988835,","journal-title":"Mol Syst Biol"},{"issue":"15","key":"6202_CR26","doi-asserted-by":"publisher","first-page":"1972","DOI":"10.1093\/bioinformatics\/btp348","volume":"25","author":"S Capella-guti\u00e9rrez","year":"2009","unstructured":"Capella-guti\u00e9rrez S, Silla-mart\u00ednez JM, Gabald\u00f3n T: trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009, 25 (15): 1972-1973.","journal-title":"Bioinformatics"},{"issue":"10","key":"6202_CR27","doi-asserted-by":"publisher","first-page":"e1002195","DOI":"10.1371\/journal.pcbi.1002195","volume":"7","author":"SR Eddy","year":"2011","unstructured":"Eddy SR: Accelerated Profile HMM Searches. PLoS Comput Biol. 2011, 7 (10): e1002195-http:\/\/dx.plos.org\/10.1371\/journal.pcbi.1002195, http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3197634&tool=pmcentrez&rendertype=abstract,","journal-title":"PLoS Comput Biol"},{"issue":"5956","key":"6202_CR28","doi-asserted-by":"publisher","first-page":"1112","DOI":"10.1126\/science.1178534","volume":"326","author":"PS Schnable","year":"2009","unstructured":"Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang J, Fulton L, Graves TA, Minx P, Reily AD, Courtney L, Kruchowski SS, Tomlinson C, Strong C, Delehaunty K, Fronick C, Courtney B, Rock SM, Belter E, Du F, Kim K, Abbott RM, Cotton M, Levy A, Marchetto P, Ochoa K, Jackson SM, Gillam B, et al: The, B73 maize genome: complexity, diversity, and dynamics. Science. 2009, 326 (5956): 1112-1115. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/19965430,","journal-title":"Science"},{"issue":"4","key":"6202_CR29","doi-asserted-by":"publisher","first-page":"420","DOI":"10.1016\/j.pbi.2010.04.004","volume":"13","author":"M Rep","year":"2010","unstructured":"Rep M, Kistler HC: The genomic organization of plant pathogenicity in Fusarium species. Curr Opin Plant Biol. 2010, 13 (4): 420-426. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/20471307,","journal-title":"Curr Opin Plant Biol"},{"issue":"22","key":"6202_CR30","doi-asserted-by":"publisher","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","volume":"89","author":"S Henikoff","year":"1992","unstructured":"Henikoff S, Henikoff JG: Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci. 1992, 89 (22): 10915-10919. [http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=50453&tool=pmcentrez&rendertype=abstract],","journal-title":"Proc Natl Acad Sci"},{"issue":"7378","key":"6202_CR31","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1038\/nature10625","volume":"480","author":"ND Young","year":"2011","unstructured":"Young ND, Debell\u00e9 F, Oldroyd GED, Geurts R, Cannon SB, Udvardi MK, Benedito VA, Mayer KFX, Gouzy J, Schoof H, Van de Peer Y, Proost S, Cook DR, Meyers BC, Spannagl M, Cheung F, De Mita S, Krishnakumar V, Gundlach H, Zhou S, Mudge J, Bharti AK, Murray JD, Naoumkina MA, Rosen B, Silverstein KAT, Tang H, Rombauts S, Zhao PX, Zhou P, et al: The, Medicago genome provides insight into the evolution of rhizobial symbioses. Nature. 2011, 480 (7378): 520-524. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3272368&tool=pmcentrez&rendertype=abstract, http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/22089132,","journal-title":"Nature"},{"issue":"Database issue","key":"6202_CR32","first-page":"D19\u2014D21","volume":"39","author":"R Leinonen","year":"2011","unstructured":"Leinonen R, Sugawara H, Shumway M: The sequence read archive. Nucleic Acids Res. 2011, 39 (Database issue): D19\u2014D21-http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3013647&tool=pmcentrez&rendertype=abstract,","journal-title":"Nucleic Acids Res"},{"issue":"3","key":"6202_CR33","doi-asserted-by":"publisher","first-page":"353","DOI":"10.1006\/geno.1996.0298","volume":"34","author":"M Burset","year":"1996","unstructured":"Burset M, Guig\u00f3 R: Evaluation of gene structure prediction programs. Genomics. 1996, 34 (3): 353-367. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/8786136,","journal-title":"Genomics"},{"issue":"3","key":"6202_CR34","doi-asserted-by":"publisher","first-page":"562","DOI":"10.1038\/nprot.2012.016","volume":"7","author":"C Trapnell","year":"2012","unstructured":"Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L: Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012, 7 (3): 562-578. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/22383036,","journal-title":"Nat Protoc"},{"issue":"2","key":"6202_CR35","doi-asserted-by":"publisher","first-page":"178","DOI":"10.1093\/bib\/bbs017","volume":"14","author":"H Thorvaldsd\u00f3ttir","year":"2013","unstructured":"Thorvaldsd\u00f3ttir H, Robinson JT, Mesirov JP: Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinformatics. 2013, 14 (2): 178-192. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3603213&tool=pmcentrez&rendertype=abstract,","journal-title":"Brief Bioinformatics"},{"key":"6202_CR36","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1186\/1741-7007-3-7","volume":"3","author":"BJ Haas","year":"2005","unstructured":"Haas BJ, Wortman JR, Ronning CM, Hannick LI, Smith RK, Maiti R, Chan AP, Yu C, Farzad M, Wu D, White O, Town CD: Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release. BMC Biol. 2005, 3: 7-http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=1082884&tool=pmcentrez&rendertype=abstract, http:\/\/www.biomedcentral.com\/1741-7007\/3\/7,","journal-title":"BMC Biol"},{"issue":"6","key":"6202_CR37","doi-asserted-by":"publisher","first-page":"2265","DOI":"10.1073\/pnas.91.6.2265","volume":"91","author":"H Foote","year":"1994","unstructured":"Foote H, Ride J: Cloning and expression of a distinctive class of self-incompatibility (S) gene from Papaver rhoeas L. Proc Natl Acad Sci. 1994, 91 (6): 2265-2269. http:\/\/www.pnas.org\/content\/91\/6\/2265.short,","journal-title":"Proc Natl Acad Sci"},{"issue":"Database issue","key":"6202_CR38","doi-asserted-by":"publisher","first-page":"D1202","DOI":"10.1093\/nar\/gkr1090","volume":"40","author":"P Lamesch","year":"2012","unstructured":"Lamesch P, Berardini TZ, Li D, Swarbreck D, Wilks C, Sasidharan R, Muller R, Dreher K, Alexander DL, Garcia-Hernandez M, Karthikeyan AS, Lee CH, Nelson WD, Ploetz L, Singh S, Wensel A, Huala E: The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res. 2012, 40 (Database issue): D1202-D1210. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=3245047&tool=pmcentrez&rendertype=abstract,","journal-title":"Nucleic Acids Res"},{"issue":"48","key":"6202_CR39","doi-asserted-by":"publisher","first-page":"19097","DOI":"10.1073\/pnas.0707340104","volume":"104","author":"HE Hallen","year":"2007","unstructured":"Hallen HE, Luo H, Scott-Craig JS, Walton JD: Gene family encoding the major toxins of lethal Amanita mushrooms. Proc Natl Acad Sci. 2007, 104 (48): 19097-19101. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=2141914&tool=pmcentrez&rendertype=abstract,","journal-title":"Proc Natl Acad Sci"},{"issue":"2","key":"6202_CR40","doi-asserted-by":"publisher","first-page":"600","DOI":"10.1104\/pp.105.060079","volume":"138","author":"KAT Silverstein","year":"2005","unstructured":"Silverstein KAT, Graham MA, Paape TD, VandenBosch KA: Genome organization of more than 300 defensin-like genes in Arabidopsis. Plant Physiol. 2005, 138 (2): 600-610. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=1150381&tool=pmcentrez&rendertype=abstract,","journal-title":"Plant Physiol"},{"issue":"10","key":"6202_CR41","doi-asserted-by":"publisher","first-page":"1848","DOI":"10.1371\/journal.pgen.0030171","volume":"3","author":"MW Jones-Rhoades","year":"2007","unstructured":"Jones-Rhoades MW, Borevitz JO, Preuss D: Genome-wide expression profiling of the Arabidopsis female gametophyte identifies families of small, secreted proteins. PLoS Genet. 2007, 3 (10): 1848-1861. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=2014789&tool=pmcentrez&rendertype=abstract,","journal-title":"PLoS Genet"},{"issue":"12","key":"6202_CR42","doi-asserted-by":"publisher","first-page":"e1001449","DOI":"10.1371\/journal.pbio.1001449","volume":"10","author":"H Takeuchi","year":"2012","unstructured":"Takeuchi H, Higashiyama T: A species-specific cluster of defensin-like genes encodes diffusible pollen tube attractants in arabidopsis. PLoS Biol. 2012, 10 (12): e1001449-http:\/\/dx.plos.org\/10.1371\/journal.pbio.1001449,","journal-title":"PLoS Biol"},{"key":"6202_CR43","first-page":"1","volume-title":"Nature","author":"X Gan","year":"2011","unstructured":"Gan X, Stegle O, Behr J, Steffen JG, Drewe P, Hildebrand KL, Lyngsoe R, Schultheiss SJ, Osborne EJ, Sreedharan VT, Kahles A, Bohnert R, Jean G, Derwent P, Kersey P, Belfield EJ, Harberd NP, Kemen E, Toomajian C, Kover PX, Clark RM, R\u00e4tsch G, Mott R: Multiple reference genomes and transcriptomes for Arabidopsis thaliana. Nature. 2011, 1-5. http:\/\/www.nature.com\/doifinder\/10.1038\/nature10414,"},{"issue":"5","key":"6202_CR44","doi-asserted-by":"publisher","first-page":"e1000940","DOI":"10.1371\/journal.pgen.1000940","volume":"6","author":"B Brachi","year":"2010","unstructured":"Brachi B, Faure N, Horton M, Flahauw E, Vazquez A, Nordborg M, Bergelson J, Cuguen J, Roux F: Linkage and association mapping of Arabidopsis thaliana flowering time in nature. PLoS Genet. 2010, 6 (5): e1000940-http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=2865524&tool=pmcentrez&rendertype=abstract,","journal-title":"PLoS Genet"},{"issue":"2","key":"6202_CR45","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1038\/ng.747","volume":"43","author":"KL Kump","year":"2011","unstructured":"Kump KL, Bradbury PJ, Wisser RJ, Buckler ES, Belcher AR, Oropeza-Rosas MA, Zwonitzer JC, Kresovich S, McMullen MD, Ware D, Balint-Kurti PJ, Holland JB: Genome-wide association study of quantitative resistance to southern leaf blight in the maize nested association mapping population. Nat Genet. 2011, 43 (2): 163-168. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/21217757,","journal-title":"Nat Genet"},{"issue":"5","key":"6202_CR46","doi-asserted-by":"publisher","first-page":"e65688","DOI":"10.1371\/journal.pone.0065688","volume":"8","author":"J Stanton-Geddes","year":"2013","unstructured":"Stanton-Geddes J, Paape T, Epstein B, Briskine R, Yoder J, Mudge J, Bharti AK, Farmer AD, Zhou P, Denny R, May GD, Erlandson S, Yakub M, Sugawara M, Sadowsky MJ, Young ND, Tiffin P: Candidate genes and genetic architecture of symbiotic and agronomic traits revealed by whole-genome, sequence-based association genetics in Medicago truncatula. PLoS ONE. 2013, 8 (5): e65688-http:\/\/dx.plos.org\/10.1371\/journal.pone.0065688,","journal-title":"PLoS ONE"},{"issue":"2","key":"6202_CR47","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1016\/j.pbi.2006.01.002","volume":"9","author":"KAT Silverstein","year":"2006","unstructured":"Silverstein KAT, Graham MA, VandenBosch KA: Novel paralogous gene families with potential function in legume nodules and seeds. Curr Opin Plant Biol. 2006, 9 (2): 142-146. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/16459131,","journal-title":"Curr Opin Plant Biol"},{"issue":"5969","key":"6202_CR48","doi-asserted-by":"publisher","first-page":"1126","DOI":"10.1126\/science.1184096","volume":"327","author":"D Wang","year":"2010","unstructured":"Wang D, Griffitts J, Starker C, Fedorova E, Limpens E, Ivanov S, Bisseling T, Long S: A nodule-specific protein secretory pathway required for nitrogen-fixing symbiosis. Science. 2010, 327 (5969): 1126-1129. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/20185723,","journal-title":"Science"},{"issue":"5969","key":"6202_CR49","doi-asserted-by":"publisher","first-page":"1122","DOI":"10.1126\/science.1184057","volume":"327","author":"W Van de Velde","year":"2010","unstructured":"Van de Velde W, Zehirov G, Szatmari A, Debreczeny M, Ishihara H, Kevei Z, Farkas A, Mikulass K, Nagy A, Tiricz H, Satiat-Jeunema\u00eetre B, Alunni B, Bourge M, Kucho Ki Abe, Kereszt A, Maroti G, Uchiumi T, Kondorosi E, Mergaert P: Plant peptides govern terminal differentiation of bacteria in symbiosis. Science. 2010, 327 (5969): 1122-1126. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/20185722,","journal-title":"Science"},{"key":"6202_CR50","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1104\/pp.102.018192","volume":"132","author":"P Mergaert","year":"2003","unstructured":"Mergaert P, Nikovics K, Kelemen Z, Maunoury N, Vaubert D, Kondorosi A, Kondorosi E: A novel family in Medicago truncatula consisting of more than 300 nodule-specific genes coding for small, secreted polypeptides with conserved cysteine motifs. Plant Physiol. 2003, 132: 161-173. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=166962&tool=pmcentrez&rendertype=abstract,","journal-title":"Plant Physiol"},{"issue":"12","key":"6202_CR51","doi-asserted-by":"publisher","first-page":"1620","DOI":"10.1101\/gr.3767105","volume":"15","author":"JE Galagan","year":"2005","unstructured":"Galagan JE, Henn MR, Ma LJ, Cuomo Ca, Birren B: Genomics of the fungal kingdom: insights into eukaryotic biology. Genome Res. 2005, 15 (12): 1620-1631. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/16339359,","journal-title":"Genome Res"},{"key":"6202_CR52","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1093\/nar\/28.1.185","volume":"28","author":"S Saxonov","year":"2000","unstructured":"Saxonov S, Daizadeh I, Fedorov A, Gilbert W: EID: the Exon-Intron Database-an exhaustive database of protein-coding intron-containing genes. Nucleic Acids Res. 2000, 28: 185-190. http:\/\/www.pubmedcentral.nih.gov\/articlerender.fcgi?artid=102483&tool=pmcentrez&rendertype=abstract,","journal-title":"Nucleic Acids Res"},{"issue":"7262","key":"6202_CR53","doi-asserted-by":"publisher","first-page":"393","DOI":"10.1038\/nature08358","volume":"461","author":"BJ Haas","year":"2009","unstructured":"Haas BJ, Kamoun S, Zody MC, Jiang RHY, Handsaker RE, Cano LM, Grabherr M, Kodira CD, Raffaele S, Torto-Alalibo T, Bozkurt TO, Ah-Fong AMV, Alvarado L, Anderson VL, Armstrong MR, Avrova A, Baxter L, Beynon J, Boevink PC, Bollmann SR, Bos JIB, Bulone V, Cai G, Cakir C, Carrington JC, Chawner M, Conti L, Costanzo S, Ewan R, et al: Genome sequence and analysis of the, Irish potato famine pathogen Phytophthora infestans. Nature. 2009, 461 (7262): 393-398. http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/19741609,","journal-title":"Nature"},{"issue":"6010","key":"6202_CR54","doi-asserted-by":"publisher","first-page":"1543","DOI":"10.1126\/science.1194573","volume":"330","author":"PD Spanu","year":"2010","unstructured":"Spanu PD, Abbott JC, Amselem J, Burgis Ta, Soanes DM, Stuber K, Loren van Themaat EV, Brown JKM, Butcher Sa, Gurr SJ, Lebrun MH, Ridout CJ, Schulze-Lefert P, Talbot NJ, Ahmadinejad N, Ametz C, Barton GR, Benjdia M, Bidzinski P, Bindschedler LV, Both M, Brewer MT, Cadle-Davidson L, Cadle-Davidson MM, Collemare J, Cramer R, Frenkel O, Godfrey D, Harriman J, Hoede C, et al: Genome expansion and gene loss in powdery mildew fungi reveal tradeoffs in extreme parasitism. Science. 2010, 330 (6010): 1543-1546. http:\/\/www.sciencemag.org\/cgi\/doi\/10.1126\/science.1194573,","journal-title":"Science"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-14-335.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,2]],"date-time":"2021-09-02T02:10:25Z","timestamp":1630548625000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-14-335"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,11,20]]},"references-count":54,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,12]]}},"alternative-id":["6202"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-14-335","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,11,20]]},"assertion":[{"value":"2 August 2013","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 November 2013","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 November 2013","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"335"}}