{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T15:31:49Z","timestamp":1759332709007},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2007,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Cowpea [<jats:italic>Vigna unguiculata<\/jats:italic> (L.) Walp.] is one of the most important food and forage legumes in the semi-arid tropics because of its ability to tolerate drought and grow on poor soils. It is cultivated mostly by poor farmers in developing countries, with 80% of production taking place in the dry savannah of tropical West and Central Africa. Cowpea is largely an underexploited crop with relatively little genomic information available for use in applied plant breeding. The goal of the Cowpea Genomics Initiative (CGI), funded by the Kirkhouse Trust, a UK-based charitable organization, is to leverage modern molecular genetic tools for gene discovery and cowpea improvement. One aspect of the initiative is the sequencing of the gene-rich region of the cowpea genome (termed the genespace) recovered using methylation filtration technology and providing annotation and analysis of the sequence data.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Description<\/jats:title>\n            <jats:p>CGKB, Cowpea Genespace\/Genomics Knowledge Base, is an annotation knowledge base developed under the CGI. The database is based on information derived from 298,848 cowpea genespace sequences (GSS) isolated by methylation filtering of genomic DNA. 
The CGKB consists of three knowledge bases: GSS annotation and comparative genomics knowledge base, GSS enzyme and metabolic pathway knowledge base, and GSS simple sequence repeats (SSRs) knowledge base for molecular marker discovery. A homology-based approach was applied for annotations of the GSS, mainly using BLASTX against four public FASTA formatted protein databases (NCBI GenBank Proteins, UniProtKB-Swiss-Prot, UniProtKB-PIR (Protein Information Resource), and UniProtKB-TrEMBL). Comparative genome analysis was done by BLASTX searches of the cowpea GSS against four plant proteomes from <jats:italic>Arabidopsis thaliana, Oryza sativa, Medicago truncatula<\/jats:italic>, and <jats:italic>Populus trichocarpa<\/jats:italic>. The possible exons and introns on each cowpea GSS were predicted using the HMM-based Genscan gene prediction program and the potential domains on annotated GSS were analyzed using the HMMER package against the Pfam database. The annotated GSS were also assigned Gene Ontology annotation terms and integrated with 228 curated plant metabolic pathways from the <jats:italic>Arabidopsis<\/jats:italic> Information Resource (TAIR) knowledge base. The UniProtKB-Swiss-Prot ENZYME database was used to assign putative enzymatic function to each GSS. Each GSS was also analyzed with the Tandem Repeats Finder (TRF) program in order to identify potential SSRs for molecular marker discovery. The raw sequence data, processed annotation, and SSR results were stored in relational tables designed in key-value pair fashion using a PostgreSQL relational database management system. The biological knowledge derived from the sequence data and processed results is represented as views or materialized views in the relational database management system. All materialized views are indexed for quick data access and retrieval. Data processing and analysis pipelines were implemented using the Perl programming language. 
The web interface was implemented in JavaScript and Perl CGI running on an Apache web server. The CPU intensive data processing and analysis pipelines were run on a computer cluster of more than 30 dual-processor Apple XServes. A job management system called Vela was created as a robust way to submit large numbers of jobs to the Portable Batch System (PBS).<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>CGKB is an integrated and annotated resource for cowpea GSS with features of homology-based and HMM-based annotations, enzyme and pathway annotations, GO term annotation, toolkits, and a large number of other facilities to perform complex queries. The cowpea GSS, chloroplast sequences, mitochondrial sequences, retroelements, and SSR sequences are available as FASTA formatted files and downloadable at CGKB. This database and web interface are publicly accessible at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/cowpeagenomics.med.virginia.edu\/CGKB\/\" ext-link-type=\"uri\">http:\/\/cowpeagenomics.med.virginia.edu\/CGKB\/<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-8-129","type":"journal-article","created":{"date-parts":[[2007,5,2]],"date-time":"2007-05-02T15:38:43Z","timestamp":1178120323000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":44,"title":["CGKB: an annotation knowledge base for cowpea (Vigna unguiculata L.) 
methylation filtered genomic genespace sequences"],"prefix":"10.1186","volume":"8","author":[{"given":"Xianfeng","family":"Chen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas W","family":"Laudeman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Paul J","family":"Rushton","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas A","family":"Spraggins","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael P","family":"Timko","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2007,4,19]]},"reference":[{"key":"1501_CR1","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1201\/9780203489284","volume-title":"Genetic Resources, Chromosome Engineering and Crop Improvement","author":"BB Singh","year":"2005","unstructured":"Singh BB: Cowpea [Vigna unguiculata (L.) Walp.]. In Genetic Resources, Chromosome Engineering and Crop Improvement. Volume 1. Edited by: Singh RJ, Jauhar PP. Boca Raton, FL: CRC Press; 2005:117\u2013162."},{"key":"1501_CR2","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1007\/978-3-540-34516-9_3","volume-title":"Genome Mapping and Molecular Breeding in Plants Pulses, Sugar and Tuber Crops","author":"MP Timko","year":"2007","unstructured":"Timko MP, Ehlers JD, Roberts PA: Cowpea. In Genome Mapping and Molecular Breeding in Plants Pulses, Sugar and Tuber Crops. Volume 3. Edited by: Kole C. Berlin Heidelberg: Springer-Verlag; 2007:49\u201368."},{"key":"1501_CR3","doi-asserted-by":"publisher","first-page":"208","DOI":"10.1007\/BF02672069","volume":"9","author":"K Arumuganathan","year":"1991","unstructured":"Arumuganathan K, Earle ED: Nuclear DNA content of some important plant species. 
Plant Mol Biol Rep 1991, 9: 208\u2013218.","journal-title":"Plant Mol Biol Rep"},{"key":"1501_CR4","doi-asserted-by":"publisher","first-page":"565","DOI":"10.1139\/g94-081","volume":"37","author":"JL Bennetzen","year":"1994","unstructured":"Bennetzen JL, Schrick K, Springer PS, Brown WE, SanMiguel P: Active maize genes are unmodified and flanked by diverse classes of modified, highly repetitive DNA. Genome 1994, 37: 565\u2013576.","journal-title":"Genome"},{"key":"1501_CR5","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1016\/S0168-9525(98)01518-2","volume":"14","author":"R Martienssen","year":"1998","unstructured":"Martienssen R: Transposons, DNA methylation and gene control. Trends Genet 1998, 14: 263\u2013264. 10.1016\/S0168-9525(98)01518-2","journal-title":"Trends Genet"},{"key":"1501_CR6","doi-asserted-by":"publisher","first-page":"765","DOI":"10.1126\/science.274.5288.765","volume":"274","author":"P SanMiguel","year":"1996","unstructured":"SanMiguel P, Tikhonov A, Jin YK, Motchoulskaia N, Zakharov D, MelakeBerhan A, Springer PS, Edwards KJ, Lee M, Avramova Z, Bennetzen JL: Nested retrotransposons in the intergenic regions of the maize genome. Science 1996, 274: 765\u2013768. 10.1126\/science.274.5288.765","journal-title":"Science"},{"key":"1501_CR7","doi-asserted-by":"publisher","first-page":"11792","DOI":"10.1073\/pnas.91.25.11792","volume":"91","author":"SE White","year":"1994","unstructured":"White SE, Habera LF, Wessler SR: Retrotransposons in the flanking regions of normal plant genes: A role for copia-like elements in the evolution of gene structure and expression. Proc Natl Acad Sci USA 1994, 91: 11792\u201311796. 10.1073\/pnas.91.25.11792","journal-title":"Proc Natl Acad Sci USA"},{"key":"1501_CR8","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1146\/annurev.arplant.55.031903.141641","volume":"55","author":"J Bender","year":"2004","unstructured":"Bender J: DNA methylation and epigenetics. 
Annu Rev Plant Physiol Plant Mol Biol 2004, 55: 41\u201368.","journal-title":"Annu Rev Plant Physiol Plant Mol Biol"},{"key":"1501_CR9","doi-asserted-by":"publisher","first-page":"3207","DOI":"10.1093\/nar\/20.12.3207","volume":"20","author":"LM Montero","year":"1992","unstructured":"Montero LM, Filipski J, Gil P, Capel J, Martinez-Zapater JM, Salinas J: The distribution of 5-methylcytosine in the nuclear genome of plants. Nucleic Acids Res 1992, 20: 3207\u20133210. 10.1093\/nar\/20.12.3207","journal-title":"Nucleic Acids Res"},{"key":"1501_CR10","doi-asserted-by":"publisher","first-page":"686","DOI":"10.1016\/j.gde.2004.09.009","volume":"14","author":"SH Rangwala","year":"2004","unstructured":"Rangwala SH, Richards EJ: The value-added genome: building and maintaining genomic cytosine methylation landscapes. Curr Opin Genetics & Development 2004, 14: 686\u2013691. 10.1016\/j.gde.2004.09.009","journal-title":"Curr Opin Genetics & Development"},{"key":"1501_CR11","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1023\/A:1020936229771","volume":"10","author":"O Mathieu","year":"2002","unstructured":"Mathieu O, Picard G, Tourmente S: Methylation of a euchromatin-heterochromatin transition region in Arabidopsis thaliana chromosome 5 left arm. Chromosome Res 2002, 10: 455\u201366. 10.1023\/A:1020936229771","journal-title":"Chromosome Res"},{"key":"1501_CR12","doi-asserted-by":"publisher","first-page":"1431","DOI":"10.1101\/gr.4100405","volume":"15","author":"PD Rabinowicz","year":"2005","unstructured":"Rabinowicz PD, Citek R, Budiman MA, Nunberg A, Bedell JA, Lakey N, O'Shaughnessy AL, Nascimento LU, McCombie WR, Martienssen RA: Differential methylation of genes and repeats in land plants. Genome Research 2005, 15: 1431\u20131440. 
10.1101\/gr.4100405","journal-title":"Genome Research"},{"key":"1501_CR13","doi-asserted-by":"publisher","first-page":"e13","DOI":"10.1371\/journal.pbio.0030013","volume":"3","author":"JA Bedell","year":"2005","unstructured":"Bedell JA, Budiman MA, Nunberg A, Citek RW, Robbins D, Jones J, Flick E, Rohlfing T, Fries J, Bradford K, McMenamy J, Smith M, Holeman H, Roe BA, Wiley G, Korf IF, Rabinowicz PD, Lakey N, McCombie WR, Jeddeloh JA, Martienssen RA: Sorghum genome sequencing by methylation filtration. PLoS Biol 2005, 3: e13. 10.1371\/journal.pbio.0030013","journal-title":"PLoS Biol"},{"key":"1501_CR14","doi-asserted-by":"publisher","first-page":"2115","DOI":"10.1126\/science.1091265","volume":"302","author":"LE Palmer","year":"2003","unstructured":"Palmer LE, Rabinowicz PD, O'Shaughnessy AL, Balija VS, Nascimento LU, Dike S, de la Bastide M, Martienssen RA, McCombie WR: Maize genome sequencing by methylation filtration. Science 2003, 302: 2115\u20132117. 10.1126\/science.1091265","journal-title":"Science"},{"key":"1501_CR15","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1038\/15479","volume":"23","author":"PD Rabinowicz","year":"1999","unstructured":"Rabinowicz PD, Schutz K, Dedhia N, Yordan C, Parnell LD, Stein L, McCombie WR, Martienssen RA: Differential methylation of genes and retrotransposons facilitates shotgun sequencing of the maize genome. Nature Genetics 1999, 23: 305\u2013308. 
10.1038\/15479","journal-title":"Nature Genetics"},{"key":"1501_CR16","doi-asserted-by":"publisher","first-page":"2118","DOI":"10.1126\/science.1090047","volume":"302","author":"CA Whitelaw","year":"2003","unstructured":"Whitelaw CA, Barbazuk WB, Pertea G, Chan AP, Cheung F, Lee Y, Zheng L, van Heeringen S, Karamycheva S, Bennetzen JL, SanMiguel P, Lakey N, Bedell J, Yuan Y, Budiman MA, Resnick A, Van Aken S, Utterback T, Riedmuller S, Williams M, Feldblyum T, Schubert K, Beachy R, Fraser CM, Quackenbush J: Enrichment of gene-coding sequences in maize by genome filtration. Science 2003, 302: 2118\u20132120. 10.1126\/science.1090047","journal-title":"Science"},{"key":"1501_CR17","unstructured":"Orion Genomics[http:\/\/www.oriongenomics.com\/]"},{"key":"1501_CR18","unstructured":"The Kirkhouse Trust[http:\/\/www.kirkhousetrust.org\/]"},{"key":"1501_CR19","unstructured":"The Perl Foundation[http:\/\/www.perl.org\/]"},{"key":"1501_CR20","unstructured":"Portable Batch System[http:\/\/www.openpbs.org\/]"},{"key":"1501_CR21","unstructured":"The Arabidopsis Information Resource[http:\/\/www.arabidopsis.org\/]"},{"key":"1501_CR22","unstructured":"The International Rice Genome Sequencing Project[http:\/\/rgp.dna.affrc.go.jp\/IRGSP\/]"},{"key":"1501_CR23","unstructured":"The Medicago truncatula Genome Project[http:\/\/www.tigr.org\/tdb\/e2k1\/mta1\/]"},{"key":"1501_CR24","doi-asserted-by":"publisher","first-page":"1596","DOI":"10.1126\/science.1128691","volume":"313","author":"GA Tuskan","year":"2006","unstructured":"Tuskan GA, DiFazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, Schein J, Sterck L, Aerts A, Bhalerao RR, Bhalerao RP, Blaudez D, Boerjan W, Brun A, Brunner A, Busov V, Campbell M, Carlson J, Chalot M, Chapman J, Chen GL, Cooper D, Coutinho PM, Couturier J, Covert S, Cronk Q, Cunningham R, Davis J, Degroeve S, D\u00e9jardin A, dePamphilis C, Detter J, Dirks B, Dubchak I, Duplessis S, Ehlting J, Ellis B, Gendler K, 
Goodstein D, Gribskov M, Grimwood J, Groover A, Gunter L, Hamberger B, Heinze B, Helariutta Y, Henrissat B, Holligan D, Holt R, Huang W, Islam-Faridi N, Jones S, Jones-Rhoades M, Jorgensen R, Joshi C, Kangasj\u00e4rvi J, Karlsson J, Kelleher C, Kirkpatrick R, Kirst M, Kohler A, Kalluri U, Larimer F, Leebens-Mack J, Lepl\u00e9 J-C, Locascio P, Lou Y, Lucas S, Martin F, Montanini B, Napoli C, Nelson DR, Nelson C, Nieminen K, Nilsson O, Pereda V, Peter G, Philippe R, Pilate G, Poliakov A, Razumovskaya J, Richardson P, Rinaldi C, Ritland K, Rouz\u00e9 P, Ryaboy D, Schmutz J, Schrader J, Segerman B, Shin H, Siddiqui A, Sterky F, Terry A, Tsai C-J, Uberbacher E, Unneberg P, Vahala J, Wall K, Wessler S, Yang G, Yin T, Douglas C, Marra M, Sandberg G, Van de Peer Y, Rokhsar D: The genome of black cottonwood Populus trichocarpa (Torr. & Gray). Science 2006, 313: 1596\u20131604. 10.1126\/science.1128691","journal-title":"Science"},{"key":"1501_CR25","unstructured":"ENZYME enzyme nomenclature database[http:\/\/ca.expasy.org\/enzyme\/]"},{"key":"1501_CR26","unstructured":"UniProtKB\/TrEMBL[http:\/\/www.ebi.ac.uk\/trembl\/]"},{"key":"1501_CR27","unstructured":"UniProtKB\/Swiss-Prot[http:\/\/www.ebi.ac.uk\/swissprot\/]"},{"key":"1501_CR28","unstructured":"FTP directory\/genbank\/at ftp.ncbi.nih.gov[ftp:\/\/ftp.ncbi.nih.gov\/genbank\/]"},{"key":"1501_CR29","unstructured":"The Protein Information Resource[http:\/\/pir.georgetown.edu\/]"},{"key":"1501_CR30","unstructured":"HMMER[http:\/\/hmmer.janelia.org\/]"},{"key":"1501_CR31","doi-asserted-by":"publisher","first-page":"D138","DOI":"10.1093\/nar\/gkh121","volume":"32","author":"A Bateman","year":"2004","unstructured":"Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL: The Pfam protein families database. Nucleic Acids Res 2004, 32: D138-D141. 
10.1093\/nar\/gkh121","journal-title":"Nucleic Acids Res"},{"key":"1501_CR32","unstructured":"The Pfam database of protein families and HMMs[http:\/\/www.sanger.ac.uk\/Software\/Pfam\/]"},{"key":"1501_CR33","unstructured":"Tandem repeats finder[http:\/\/tandem.bu.edu\/trf\/trf.html]"},{"key":"1501_CR34","doi-asserted-by":"publisher","first-page":"573","DOI":"10.1093\/nar\/27.2.573","volume":"27","author":"G Benson","year":"1999","unstructured":"Benson G: Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res 1999, 27: 573\u2013580. 10.1093\/nar\/27.2.573","journal-title":"Nucleic Acids Res"},{"key":"1501_CR35","unstructured":"PostgreSQL[http:\/\/www.postgresql.org\/]"},{"key":"1501_CR36","unstructured":"The Apache Software Foundation[http:\/\/www.apache.org\/]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-8-129.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T10:21:49Z","timestamp":1630491709000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-8-129"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,4,19]]},"references-count":36,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2007,12]]}},"alternative-id":["1501"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-8-129","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,4,19]]},"assertion":[{"value":"6 November 2006","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 April 2007","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 April 
2007","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"129"}}