{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T15:07:09Z","timestamp":1767971229386,"version":"3.49.0"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"S12","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Studies on the relationship between disease and genetic variations such as single nucleotide polymorphisms (SNPs) are important. Genetic variations can cause disease by influencing important biological regulation processes. Despite the needs for analyzing SNP and disease correlation, most existing databases provide information only on functional variants at specific locations on the genome, or deal with only a few genes associated with disease. There is no combined resource to widely support gene-, SNP-, and disease-related information, and to capture relationships among such data. Therefore, we developed an integrated database-pipeline system for studying SNPs and diseases.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>To implement the pipeline system for the integrated database, we first unified complicated and redundant disease terms and gene names using the Unified Medical Language System (UMLS) for classification and noun modification, and the HUGO Gene Nomenclature Committee (HGNC) and NCBI gene databases. Next, we collected and integrated representative databases for three categories of information. For genes and proteins, we examined the NCBI mRNA, UniProt, UCSC Table Track and MitoDat databases. For genetic variants we used the dbSNP, JSNP, ALFRED, and HGVbase databases. For disease, we employed OMIM, GAD, and HGMD databases. The database-pipeline system provides a disease thesaurus, including genes and SNPs associated with disease. The search results for these categories are available on the web page <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/diseasome.kobic.re.kr\/\" ext-link-type=\"uri\">http:\/\/diseasome.kobic.re.kr\/<\/jats:ext-link>, and a genome browser is also available to highlight findings, as well as to permit the convenient review of potentially deleterious SNPs among genes strongly associated with specific diseases and clinical phenotypes.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>Our system is designed to capture the relationships between SNPs associated with disease and disease-causing genes. The integrated database-pipeline provides a list of candidate genes and SNP markers for evaluation in both epidemiological and molecular biological approaches to diseases-gene association studies. Furthermore, researchers then can decide semi-automatically the data set for association studies while considering the relationships between genetic variation and diseases. The database can also be economical for disease-association studies, as well as to facilitate an understanding of the processes which cause disease. Currently, the database contains 14,674 SNP records and 109,715 gene records associated with human diseases and it is updated at regular intervals.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-9-s12-s19","type":"journal-article","created":{"date-parts":[[2008,12,12]],"date-time":"2008-12-12T19:14:15Z","timestamp":1229109255000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["An integrated database-pipeline system for studying single nucleotide polymorphisms and diseases"],"prefix":"10.1186","volume":"9","author":[{"given":"Jin Ok","family":"Yang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sohyun","family":"Hwang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jeongsu","family":"Oh","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jong","family":"Bhak","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tae-Kwon","family":"Sohn","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2008,12,12]]},"reference":[{"issue":"2","key":"2724_CR1","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1007\/s10048-008-0119-3","volume":"9","author":"M Matarin","year":"2008","unstructured":"Matarin M, Simon-Sanchez J, Fung HC, Scholz S, Gibbs JR, Hernandez DG, Crews C, Britton A, Wavrant De Vrieze F, Brott TG, et al.: Structural genomic variation in ischemic stroke. Neurogenetics 2008,9(2):101\u2013108.","journal-title":"Neurogenetics"},{"key":"2724_CR2","volume-title":"Biochem Biophys Res Commun","author":"JS Bae","year":"2008","unstructured":"Bae JS, Cheong HS, Kim JO, Lee SO, Kim EM, Lee HW, Kim S, Kim JW, Cui T, Inoue I, et al.: Identification of SNP markers for common CNV regions and association analysis of risk of subarachnoid aneurysmal hemorrhage in Japanese population. Biochem Biophys Res Commun 2008."},{"issue":"1","key":"2724_CR3","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1016\/j.neuron.2006.09.027","volume":"52","author":"JA Lee","year":"2006","unstructured":"Lee JA, Lupski JR: Genomic rearrangements and gene copy-number alterations as a cause of nervous system disorders. Neuron 2006,52(1):103\u2013121.","journal-title":"Neuron"},{"issue":"Suppl 1","key":"2724_CR4","doi-asserted-by":"publisher","first-page":"S2","DOI":"10.1186\/1471-2105-9-S1-S2","volume":"9","author":"BC Kim","year":"2008","unstructured":"Kim BC, Kim WY, Park D, Chung WH, Shin KS, Bhak J: SNP@Promoter: a database of human SNPs (single nucleotide polymorphisms) within the putative promoter regions. BMC Bioinformatics 2008,9(Suppl 1):S2.","journal-title":"BMC Bioinformatics"},{"key":"2724_CR5","doi-asserted-by":"crossref","unstructured":"Han A, Kang HJ, Cho Y, Lee S, Kim YJ, Gong S: SNP@Domain: a web resource of single nucleotide polymorphisms (SNPs) within protein domain structures and sequences. Nucleic Acids Res 2006, (34 Web Server):W642\u2013644.","DOI":"10.1093\/nar\/gkl323"},{"issue":"5507","key":"2724_CR6","doi-asserted-by":"publisher","first-page":"1304","DOI":"10.1126\/science.1058040","volume":"291","author":"JC Venter","year":"2001","unstructured":"Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, et al.: The sequence of the human genome. Science 2001,291(5507):1304\u20131351.","journal-title":"Science"},{"issue":"Suppl","key":"2724_CR7","doi-asserted-by":"publisher","first-page":"228","DOI":"10.1038\/ng1090","volume":"33","author":"D Botstein","year":"2003","unstructured":"Botstein D, Risch N: Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease. Nat Genet 2003,33(Suppl):228\u2013237.","journal-title":"Nat Genet"},{"key":"2724_CR8","doi-asserted-by":"crossref","unstructured":"Bodenreider O: The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res 2004, (32 Database):D267\u2013270.","DOI":"10.1093\/nar\/gkh061"},{"key":"2724_CR9","doi-asserted-by":"crossref","unstructured":"Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 2005, (33 Database):D514\u2013517.","DOI":"10.1093\/nar\/gki033"},{"issue":"5","key":"2724_CR10","doi-asserted-by":"publisher","first-page":"431","DOI":"10.1038\/ng0504-431","volume":"36","author":"KG Becker","year":"2004","unstructured":"Becker KG, Barnes KC, Bright TJ, Wang SA: The genetic association database. Nat Genet 2004,36(5):431\u2013432.","journal-title":"Nat Genet"},{"issue":"1","key":"2724_CR11","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1093\/nar\/26.1.285","volume":"26","author":"DN Cooper","year":"1998","unstructured":"Cooper DN, Ball EV, Krawczak M: The human gene mutation database. Nucleic Acids Res 1998,26(1):285\u2013287.","journal-title":"Nucleic Acids Res"},{"key":"2724_CR12","doi-asserted-by":"crossref","unstructured":"Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, Dicuccio M, Edgar R, Federhen S, et al.: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2008, (36 Database):D13\u201321.","DOI":"10.1093\/nar\/gkm1000"},{"key":"2724_CR13","doi-asserted-by":"crossref","unstructured":"Eyre TA, Ducluzeau F, Sneddon TP, Povey S, Bruford EA, Lush MJ: The HUGO Gene Nomenclature Database, 2006 updates. Nucleic Acids Res 2006, (34 Database):D319\u2013321.","DOI":"10.1093\/nar\/gkj147"},{"key":"2724_CR14","first-page":"89","volume":"406","author":"E Boutet","year":"2007","unstructured":"Boutet E, Lieberherr D, Tognolli M, Schneider M, Bairoch A: UniProtKB\/Swiss-Prot: The Manually Annotated Section of the UniProt KnowledgeBase. Methods Mol Biol 2007, 406: 89\u2013112.","journal-title":"Methods Mol Biol"},{"key":"2724_CR15","doi-asserted-by":"crossref","unstructured":"Kuhn RM, Karolchik D, Zweig AS, Trumbower H, Thomas DJ, Thakkapallayil A, Sugnet CW, Stanke M, Smith KE, Siepel A, et al.: The UCSC genome browser database: update 2007. Nucleic Acids Res 2007, (35 Database):D668\u2013673.","DOI":"10.1093\/nar\/gkl928"},{"issue":"3","key":"2724_CR16","doi-asserted-by":"publisher","first-page":"566","DOI":"10.1002\/elps.1150170327","volume":"17","author":"PF Lemkin","year":"1996","unstructured":"Lemkin PF, Chipperfield M, Merril C, Zullo S: A World Wide Web (WWW) server database engine for an organelle database, MitoDat. Electrophoresis 1996,17(3):566\u2013572.","journal-title":"Electrophoresis"},{"issue":"1","key":"2724_CR17","doi-asserted-by":"publisher","first-page":"352","DOI":"10.1093\/nar\/28.1.352","volume":"28","author":"EM Smigielski","year":"2000","unstructured":"Smigielski EM, Sirotkin K, Ward M, Sherry ST: dbSNP: a database of single nucleotide polymorphisms. Nucleic Acids Res 2000,28(1):352\u2013355.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2724_CR18","doi-asserted-by":"publisher","first-page":"158","DOI":"10.1093\/nar\/30.1.158","volume":"30","author":"M Hirakawa","year":"2002","unstructured":"Hirakawa M, Tanaka T, Hashimoto Y, Kuroda M, Takagi T, Nakamura Y: JSNP: a database of common gene variations in the Japanese population. Nucleic Acids Res 2002,30(1):158\u2013162.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2724_CR19","doi-asserted-by":"publisher","first-page":"270","DOI":"10.1093\/nar\/gkg043","volume":"31","author":"H Rajeevan","year":"2003","unstructured":"Rajeevan H, Osier MV, Cheung KH, Deng H, Druskin L, Heinzen R, Kidd JR, Stein S, Pakstis AJ, Tosches NP, et al.: ALFRED: the ALelle FREquency Database. Update. Nucleic Acids Res 2003,31(1):270\u2013271.","journal-title":"Nucleic Acids Res"},{"key":"2724_CR20","first-page":"D516","volume-title":"Nucleic Acids Res","author":"D Fredman","year":"2004","unstructured":"Fredman D, Munns G, Rios D, Sjoholm F, Siegfried M, Lenhard B, Lehvaslaiho H, Brookes AJ: HGVbase: a curated resource describing human DNA variation and phenotype relationships. Nucleic Acids Res 2004, (32 Database):D516\u2013519."},{"key":"2724_CR21","doi-asserted-by":"publisher","first-page":"450","DOI":"10.1186\/1471-2105-8-450","volume":"8","author":"J Tian","year":"2007","unstructured":"Tian J, Wu N, Guo X, Guo J, Zhang J, Fan Y: Predicting the phenotypic effects of non-synonymous single nucleotide polymorphisms based on support vector machines. BMC Bioinformatics 2007, 8: 450.","journal-title":"BMC Bioinformatics"},{"issue":"13","key":"2724_CR22","doi-asserted-by":"publisher","first-page":"3812","DOI":"10.1093\/nar\/gkg509","volume":"31","author":"PC Ng","year":"2003","unstructured":"Ng PC, Henikoff S: SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res 2003,31(13):3812\u20133814.","journal-title":"Nucleic Acids Res"},{"issue":"3","key":"2724_CR23","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"SF Altschul","year":"1990","unstructured":"Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990,215(3):403\u2013410.","journal-title":"J Mol Biol"},{"key":"2724_CR24","doi-asserted-by":"publisher","first-page":"1599","DOI":"10.1101\/gr.403602","volume":"12","author":"LD Stein","year":"2002","unstructured":"Stein LD, Mungall C, Shu S, Caudy M, Mangone M, Day A, Nickerson E, Stajich JE, Harris TW, Arva A, et al.: The Generic Genome Browser: A Building Block for a Model Organism System Database. Genome Res 2002, 12: 1599\u20131610.","journal-title":"Genome Res"},{"issue":"12","key":"2724_CR25","doi-asserted-by":"publisher","first-page":"5283","DOI":"10.1167\/iovs.06-0206","volume":"47","author":"A Abu","year":"2006","unstructured":"Abu A, Frydman M, Marek D, Pras E, Stolovitch C, Aviram-Goldring A, Rienstein S, Reznik-Wolf H, Pras E: Mapping of a gene causing brittle cornea syndrome in Tunisian jews to 16q24. Investigative ophthalmology & visual science 2006,47(12):5283\u20135287.","journal-title":"Investigative ophthalmology & visual science"},{"issue":"6","key":"2724_CR26","doi-asserted-by":"publisher","first-page":"1544","DOI":"10.2337\/diabetes.52.6.1544","volume":"52","author":"N Stefan","year":"2003","unstructured":"Stefan N, Kovacs P, Stumvoll M, Hanson RL, Lehn-Stefan A, Permana PA, Baier LJ, Tataranni PA, Silver K, Bogardus C: Metabolic effects of the Gly1057Asp polymorphism in IRS-2 and interactions with obesity. Diabetes 2003,52(6):1544\u20131550.","journal-title":"Diabetes"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-S12-S19.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T23:21:56Z","timestamp":1630452116000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-S12-S19"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,12]]},"references-count":26,"journal-issue":{"issue":"S12","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2724"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-s12-s19","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,12]]},"assertion":[{"value":"12 December 2008","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S19"}}