{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,11]],"date-time":"2026-01-11T01:47:48Z","timestamp":1768096068254,"version":"3.49.0"},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"9","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Identification of the genetic variation underlying complex traits is challenging. The wealth of information publicly available about the biology of complex traits and the function of individual genes permits the development of informatics-assisted methods for the selection of candidate genes for these traits.<\/jats:p><jats:p>Results: We have developed a computational system named CAESAR that ranks all annotated human genes as candidates for a complex trait by using ontologies to semantically map natural language descriptions of the trait with a variety of gene-centric information sources. In a test of its effectiveness, CAESAR successfully selected 7 out of 18 (39%) complex human trait susceptibility genes within the top 2% of ranked candidates genome-wide, a subset that represents roughly 1% of genes in the human genome and provides sufficient enrichment for an association study of several hundred human genes. This approach can be applied to any well-documented mono- or multi-factorial trait in any organism for which an annotated gene set exists.<\/jats:p><jats:p>Availability: CAESAR scripts and test data can be downloaded from http:\/\/visionlab.bio.unc.edu\/caesar\/<\/jats:p><jats:p>Contact: \u00a0kgaulton@email.unc.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm001","type":"journal-article","created":{"date-parts":[[2007,1,20]],"date-time":"2007-01-20T01:12:50Z","timestamp":1169255570000},"page":"1132-1140","source":"Crossref","is-referenced-by-count":68,"title":["A computational system to select candidate genes for complex human traits"],"prefix":"10.1093","volume":"23","author":[{"given":"Kyle J.","family":"Gaulton","sequence":"first","affiliation":[{"name":"1 Curriculum in Genetics and Molecular Biologly, 2Bioinformatics and Computational Biology Training Program, Departments of 3Genetics and 4Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27516"},{"name":"1 Curriculum in Genetics and Molecular Biologly, 2Bioinformatics and Computational Biology Training Program, Departments of 3Genetics and 4Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27516"},{"name":"1 Curriculum in Genetics and Molecular Biologly, 2Bioinformatics and Computational Biology Training Program, Departments of 3Genetics and 4Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27516"}]},{"given":"Karen L.","family":"Mohlke","sequence":"additional","affiliation":[{"name":"1 Curriculum in Genetics and Molecular Biologly, 2Bioinformatics and Computational Biology Training Program, Departments of 3Genetics and 4Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27516"}]},{"given":"Todd J.","family":"Vision","sequence":"additional","affiliation":[{"name":"1 Curriculum in Genetics and Molecular Biologly, 2Bioinformatics and Computational Biology Training Program, Departments of 3Genetics and 4Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27516"}]}],"member":"286","published-online":{"date-parts":[[2007,1,19]]},"reference":[{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1093\/bioinformatics\/btk031","article-title":"Suspects: enabling fast and effective prioritization of positional candidates","volume":"22","author":"Adie","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1038\/nbt1203","article-title":"Gene prioritization through genomic data fusion","volume":"24","author":"Aerts","year":"2006","journal-title":"Nat. Biotechnol"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D418","DOI":"10.1093\/nar\/gki051","article-title":"The biomolecular interaction database and related tools 2005 update","volume":"33","author":"Alfarano","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1093\/bioinformatics\/16.12.1145","article-title":"Interpro-an integrated documentation resource for protein families, domains and functional sites","volume":"16","author":"Apweiler","year":"2000","journal-title":"Bioinformatics"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D154","DOI":"10.1093\/nar\/gki070","article-title":"The universal protein resource (Uniprot)","volume":"33","author":"Bairoch","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1038\/ng0504-431","article-title":"The genetic association database","volume":"36","author":"Becker","year":"2004","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1086\/422827","article-title":"A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis","volume":"75","author":"Begovich","year":"2004","journal-title":"Am. J. Hum. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D556","DOI":"10.1093\/nar\/gkj133","article-title":"Ensembl 2006","volume":"34","author":"Birney","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1093\/nar\/gkg047","article-title":"MGD: the mouse genome database","volume":"31","author":"Blake","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1038\/ng1323","article-title":"A functional variant of lymphoid tyrosine phosphatase is associated with type 1 diabetes","volume":"36","author":"Bottini","year":"2004","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"662","DOI":"10.1101\/gr.461403","article-title":"The gene ontology annotation (GOA) project: implementation of GO in swiss-prot, trembl and interpro","volume":"13","author":"Camon","year":"2003","journal-title":"Genome Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1002\/humu.10259","article-title":"Approaches to identify genes for complex human diseases: lessons from mendelian disorders","volume":"22","author":"Dean","year":"2003","journal-title":"Hum. Mutat"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"1011","DOI":"10.1086\/504300","article-title":"Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes","volume":"78","author":"Franke","year":"2006","journal-title":"Am. J. Hum. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"S110","DOI":"10.1093\/bioinformatics\/18.suppl_2.S110","article-title":"A similarity-based method for genome-wide prediction of disease-relevant human genes","volume":"18","author":"Freudenberg","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1038\/sj.mp.4001498","article-title":"Association of the homeobox transcription factor, ENGRAILED 2, 3, with autism spectrum disorder","volume":"5","author":"Gharani","year":"2004","journal-title":"Mol. Psychiatry"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"458","DOI":"10.1038\/ng1750","article-title":"Variation in factor B (BF) and complement component 2 (C2) genes is associated with age-related macular degeneration","volume":"38","author":"Gold","year":"2006","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1038\/ng1732","article-title":"Variant of transcription factor 7-like 2 (TCF7L2) gene confers risk of type 2 diabetes","volume":"38","author":"Grant","year":"2006","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1086\/498851","article-title":"A scan of chromosome 10 identifies a novel locus showing strong association with late-onset alzheimer disease","volume":"78","author":"Grupe","year":"2006","journal-title":"Am. J. Hum. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"837","DOI":"10.1038\/ng1391","article-title":"A functional variant of SUMO4, a new I kappa B alpha modifier, is associated with type 1 diabetes","volume":"36","author":"Guo","year":"2004","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D514","DOI":"10.1093\/nar\/gki033","article-title":"Online mendelian inheritance in man (OMIM), a knowledgebase of human genes and genetic disorders","volume":"33","author":"Hamosh","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D258","DOI":"10.1093\/nar\/gkh036","article-title":"The Gene Ontology (GO) database and informatics resource","volume":"32","author":"Harris","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/ng1692","article-title":"A variant of the gene encoding leukotrine A4 hydrolase confers ethnicity-specific risk of myocardial infarction","volume":"38","author":"Helgadottir","year":"2006","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"S11","DOI":"10.1186\/1471-2105-6-S1-S11","article-title":"Overview of BioCreAtIvE task 1B: normalized gene lists","volume":"6","author":"Hirschman","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D277","DOI":"10.1093\/nar\/gkh063","article-title":"The KEGG resource for deciphering the genome","volume":"32","author":"Kanehisa","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"1222","DOI":"10.1101\/gr.985203","article-title":"eVOC: a controlled vocabulary for unifying gene expression data","volume":"13","author":"Kelso","year":"2003","journal-title":"Genome Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1126\/science.1109557","article-title":"Complement factor H polymorphism in age-related macular degeneration","volume":"308","author":"Klein","year":"2005","journal-title":"Science"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"478","DOI":"10.1038\/ng1540","article-title":"A functional variant in FCRL3, encoding fc receptor-like 3, is associated with rheumatoid arthritis and several autoimmunities","volume":"37","author":"Kochi","year":"2005","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"300","DOI":"10.1126\/science.1090010","article-title":"Characterization of a common susceptibility locus for asthma-related traits","volume":"304","author":"Laitinen","year":"2004","journal-title":"Science"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D54","DOI":"10.1093\/nar\/gki031","article-title":"Entrez gene: gene-centric information at NCBI","volume":"33","author":"Maglott","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1086\/496902","article-title":"High-resolution whole-genome association study of parkinson's disease","volume":"77","author":"Maraganore","year":"2005","journal-title":"Am. J. Hum. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"1341","DOI":"10.1038\/ng1680","article-title":"Myosin IXB variant increases the risk of celiac disease and points toward a primary intestinal barrier defect","volume":"37","author":"Monsuur","year":"2005","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1016\/j.mrfmmm.2005.01.006","article-title":"Genetic association studies of complex traits: design and analysis issues","volume":"573","author":"Newton-Cheh","year":"2005","journal-title":"Mutat. Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"1224","DOI":"10.1126\/science.291.5507.1224","article-title":"Genomics and medicine: dissecting human disease in the postgenomic era","volume":"291","author":"Peltonen","year":"2001","journal-title":"Science"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1038\/ng895","article-title":"Association of genes to genetically inherited diseases using data mining","volume":"31","author":"Perez-Iratxeta","year":"2002","journal-title":"Nat. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D497","DOI":"10.1093\/nar\/gkh070","article-title":"Human protein reference database as a discovery resource for proteomics","volume":"32","author":"Peri","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"902","DOI":"10.1086\/430095","article-title":"The epsin 4 gene of chromosome 5q, which encodes the clathrin-associated protein enthoprotin, is involved in the genetic susceptibility to schizophrenia","volume":"76","author":"Pimm","year":"2005","journal-title":"Am. J. Hum. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"3227","DOI":"10.1093\/hmg\/ddi353","article-title":"Hypothetical LOC387715 is a second major susceptibility gene for age-related macular degeneration, contributing independently of complement factor H to disease risk","volume":"14","author":"Rivera","year":"2005","journal-title":"Hum. Mol. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1145\/361219.361220","article-title":"A Vector Space Model for Automatic Indexing","volume":"18","author":"Salton","year":"1975","journal-title":"Commun. ACM"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"R7","DOI":"10.1186\/gb-2004-6-1-r7","article-title":"The mammalian phenotype ontology as a tool for annotating, analyzing and comparing phenotypic information","volume":"6","author":"Smith","year":"2005","journal-title":"Genome Biol"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1158\/1055-9965.EPI-06-0146","article-title":"Are we ready for genome-wide association studies?","volume":"15","author":"Thomas","year":"2006","journal-title":"Cancer Epidemiol. Biomarkers Prev"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"R75","DOI":"10.1186\/gb-2003-4-11-r75","article-title":"POCUS: mining genomic sequence annotation to predict disease genes","volume":"4","author":"Turner","year":"2003","journal-title":"Genome Biol"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"503","DOI":"10.1038\/nature01621","article-title":"Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease","volume":"423","author":"Ueda","year":"2003","journal-title":"Nature"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/sj.ejhg.5200918","article-title":"A new web-based data mining tool for the identification of candidate genes for human genetic disorders","volume":"11","author":"van Driel","year":"2003","journal-title":"Eur. J. Hum. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1086\/429843","article-title":"Localization of a type 1 diabetes locus in the IL2RA\/CD25 region by use of tag single-nucleotide polymorphisms","volume":"75","author":"Vella","year":"2005","journal-title":"Am. J. Hum. Genet"},{"key":"2023041209112526100_","doi-asserted-by":"crossref","first-page":"D173","DOI":"10.1093\/nar\/gkj158","article-title":"Database resources of the national center for biotechnology information","volume":"22","author":"Wheeler","year":"2006","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/9\/1132\/49858715\/bioinformatics_23_9_1132.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/9\/1132\/49858715\/bioinformatics_23_9_1132.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,10]],"date-time":"2024-02-10T09:43:00Z","timestamp":1707558180000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/9\/1132\/271969"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,1,19]]},"references-count":45,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2007,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm001","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,5,1]]},"published":{"date-parts":[[2007,1,19]]}}}