{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,4]],"date-time":"2026-03-04T08:07:35Z","timestamp":1772611655081,"version":"3.50.1"},"reference-count":23,"publisher":"Oxford University Press (OUP)","issue":"13","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: A few years ago, FlyBase undertook to design a new database schema to store Drosophila data. It would fully integrate genomic sequence and annotation data with bibliographic, genetic, phenotypic and molecular data from the literature representing a distillation of the first 100 years of research on this major animal model system. In developing this new integrated schema, FlyBase also made a commitment to ensure that its design was generic, extensible and available as open source, so that it could be employed as the core schema of any model organism data repository, thereby avoiding redundant software development and potentially increasing interoperability. Our question was whether we could create a relational database schema that would be successfully reused.<\/jats:p>\n               <jats:p>Results: Chado is a relational database schema now being used to manage biological knowledge for a wide variety of organisms, from human to pathogens, especially the classes of information that directly or indirectly can be associated with genome sequences or the primary RNA and protein products encoded by a genome. Biological databases that conform to this schema can interoperate with one another, and with application software from the Generic Model Organism Database (GMOD) toolkit. Chado is distinctive because its design is driven by ontologies. The use of ontologies (or controlled vocabularies) is ubiquitous across the schema, as they are used as a means of typing entities. The Chado schema is partitioned into integrated subschemas (modules), each encapsulating a different biological domain, and each described using representations in appropriate ontologies. To illustrate this methodology, we describe here the Chado modules used for describing genomic sequences.<\/jats:p>\n               <jats:p>Availability: GMOD is a collaboration of several model organism database groups, including FlyBase, to develop a set of open-source software for managing model organism data. The Chado schema is freely distributed under the terms of the Artistic License (http:\/\/www.opensource.org\/licenses\/artistic-license.php) from GMOD (www.gmod.org).<\/jats:p>\n               <jats:p>Contact: cjm@fruitfly.org or emmert@morgan.harvard.edu.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm189","type":"journal-article","created":{"date-parts":[[2007,7,23]],"date-time":"2007-07-23T16:13:46Z","timestamp":1185207226000},"page":"i337-i346","source":"Crossref","is-referenced-by-count":210,"title":["A Chado case study: an ontology-based modular schema for representing genome-associated biological information"],"prefix":"10.1093","volume":"23","author":[{"given":"Christopher J.","family":"Mungall","sequence":"first","affiliation":[{"name":"1 Lawrence Berkeley National Laboratory, Lawrence Berkeley National Lab, Mail Stop 64R0121, Berkeley, CA 94720 and 2Harvard University, Molecular and Cell Biology: FlyBase, 16 Divinity Avenue, Cambridge, MA 02138, USA"}]},{"given":"David B.","family":"Emmert","sequence":"additional","affiliation":[{"name":"1 Lawrence Berkeley National Laboratory, Lawrence Berkeley National Lab, Mail Stop 64R0121, Berkeley, CA 94720 and 2Harvard University, Molecular and Cell Biology: FlyBase, 16 Divinity Avenue, Cambridge, MA 02138, USA"}]},{"name":"The FlyBase Consortium","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2007,7,1]]},"reference":[{"key":"2023062708512586500_B1","doi-asserted-by":"crossref","first-page":"D439","DOI":"10.1093\/nar\/gkl777","article-title":"ParameciumDB: a community resource that integrates the Paramecium tetraurelia genome sequence with genetic data","volume":"35","author":"Arnaiz","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023062708512586500_B2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The gene ontology consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet"},{"key":"2023062708512586500_B3","doi-asserted-by":"crossref","first-page":"R21","DOI":"10.1186\/gb-2005-6-2-r21","article-title":"An ontology for cell types","volume":"6","author":"Bard","year":"2005","journal-title":"Genome Biol"},{"key":"2023062708512586500_B4","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1073\/pnas.27.11.499","article-title":"Genetic control of biochemical reactions in neurospora","volume":"27","author":"Beadle","year":"1941","journal-title":"Proc. Natl Acad. Sci"},{"key":"2023062708512586500_B5","article-title":"Resource description framework (RDF) schema specification 1.0","volume-title":"W3C Candidate Recommendation","author":"Brickley","year":"2000"},{"key":"2023062708512586500_B6","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1093\/bib\/5.1.59","article-title":"Globally distributed object identification for biological knowledgebases","volume":"5","author":"Clark","year":"2004","journal-title":"Brief Bioinform"},{"key":"2023062708512586500_B7","article-title":"ACeDB","volume-title":"Computational Methods in Genome Research","author":"Durbin","year":"1994"},{"key":"2023062708512586500_B8","doi-asserted-by":"crossref","first-page":"642","DOI":"10.1002\/cfg.446","article-title":"Sequence ontology annotation guide","volume":"5","author":"Eilbeck","year":"2004","journal-title":"Comp. Funct. Genomics"},{"key":"2023062708512586500_B9","doi-asserted-by":"crossref","first-page":"R44","DOI":"10.1186\/gb-2005-6-5-r44","article-title":"The sequence ontology: a tool for the unification of genome annotations","volume":"6","author":"Eilbeck","year":"2005","journal-title":"Genome Biol"},{"key":"2023062708512586500_B10","doi-asserted-by":"crossref","first-page":"D258","DOI":"10.1093\/nar\/gkh036","article-title":"The Gene Ontology (GO) database and informatics resource","volume":"32","author":"Harris","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023062708512586500_B11","doi-asserted-by":"crossref","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","article-title":"CLUSTALW: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice","volume":"22","author":"Higgins","year":"1994","journal-title":"Nucleic Acids Res"},{"key":"2023062708512586500_B12","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-12-research0085","article-title":"Heterochromatic sequences in a Drosophila whole-genome shotgun assembly","volume":"3","author":"Hoskins","year":"2002","journal-title":"Genome Biol"},{"key":"2023062708512586500_B13","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1093\/nar\/29.1.106","article-title":"The ARKdb: genome databases for farmed and other animals","volume":"29","author":"Hu","year":"2001","journal-title":"Nucleic Acids Res"},{"key":"2023062708512586500_B14","doi-asserted-by":"crossref","first-page":"D447","DOI":"10.1093\/nar\/gki138","article-title":"Ensembl","volume":"33","author":"Hubbard","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023062708512586500_B15","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-12-research0082","article-title":"Apollo: a sequence annotation editor","volume":"3","author":"Lewis","year":"2002","journal-title":"Genome Biol"},{"key":"2023062708512586500_B16","doi-asserted-by":"crossref","first-page":"715","DOI":"10.1086\/278863","article-title":"The cause of gynandromorphism in insects","volume":"41","author":"Morgan","year":"1907","journal-title":"Am. Nat"},{"key":"2023062708512586500_B17","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-12-research0081","article-title":"An integrated computational pipeline and database to support whole-genome sequence annotation","volume":"3","author":"Mungall","year":"2002","journal-title":"Genome Biol"},{"key":"2023062708512586500_B18","doi-asserted-by":"crossref","first-page":"R46","DOI":"10.1186\/gb-2005-6-5-r46","article-title":"Relations in biomedical ontologies","volume":"6","author":"Smith","year":"2005","journal-title":"Genome Biol"},{"key":"2023062708512586500_B19","doi-asserted-by":"crossref","first-page":"1611","DOI":"10.1101\/gr.361602","article-title":"The Bioperl toolkit: Perl modules for the life sciences","volume":"12","author":"Stajich","year":"2002","journal-title":"Genome Res"},{"key":"2023062708512586500_B20","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1093\/bib\/bbl026","article-title":"Open source tools and toolkits for bioinformatics: significance, and where are we?","volume":"7","author":"Stajich","year":"2006","journal-title":"Brief Bioinform"},{"key":"2023062708512586500_B21","doi-asserted-by":"crossref","first-page":"1599","DOI":"10.1101\/gr.403602","article-title":"The generic genome browser: a building block for a model organism system database","volume":"12","author":"Stein","year":"2002","journal-title":"Genome Res"},{"key":"2023062708512586500_B22","doi-asserted-by":"crossref","first-page":"D476","DOI":"10.1093\/nar\/gkl776","article-title":"BeetleBase: the model organism database for Tribolium castaneum","volume":"35","author":"Wang","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023062708512586500_B23","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.0020015","article-title":"Large-scale trends in the evolution of gene structures within 11 animal genomes","volume":"2","author":"Yandell","year":"2006","journal-title":"PLoS Comput. Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/13\/i337\/50714901\/bioinformatics_23_13_i337.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/13\/i337\/50714901\/bioinformatics_23_13_i337.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,27]],"date-time":"2023-06-27T08:51:59Z","timestamp":1687855919000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/13\/i337\/229507"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,7,1]]},"references-count":23,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2007,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm189","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,7]]},"published":{"date-parts":[[2007,7,1]]}}}