{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T15:18:24Z","timestamp":1764688704089},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3015,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: We have to cope with both a deluge of new genome sequences and a huge amount of data produced by high-throughput approaches used to exploit these genomic features. Crossing and comparing such heterogeneous and disparate data will help improving functional annotation of genomes. This requires designing elaborate integration systems such as warehouses for storing and querying these data.<\/jats:p>\n               <jats:p>Results: We have designed a relational genomic warehouse with an original multi-layer architecture made of a databases layer and an entities layer. We describe a new querying module, GenoQuery, which is based on this architecture. We use the entities layer to define mixed queries. These mixed queries allow searching for instances of biological entities and their properties in the different databases, without specifying in which database they should be found. Accordingly, we further introduce the central notion of alternative queries. Such queries have the same meaning as the original mixed queries, while exploiting complementarities yielded by the various integrated databases of the warehouse.<\/jats:p>\n               <jats:p>We explain how GenoQuery computes all the alternative queries of a given mixed query. We illustrate how useful this querying module is by means of a thorough example.<\/jats:p>\n               <jats:p>Availability: \u00a0http:\/\/www.lri.fr\/~lemoine\/GenoQuery\/<\/jats:p>\n               <jats:p>Contact: \u00a0chris@lri.fr, lemoine@lri.fr<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn159","type":"journal-article","created":{"date-parts":[[2008,6,27]],"date-time":"2008-06-27T07:43:13Z","timestamp":1214552593000},"page":"i322-i329","source":"Crossref","is-referenced-by-count":8,"title":["GenoQuery: a new querying module for functional annotation in a genomic warehouse"],"prefix":"10.1093","volume":"24","author":[{"given":"Fr\u00e9d\u00e9ric","family":"Lemoine","sequence":"first","affiliation":[{"name":"1 Institut de G\u00e9n\u00e9tique et Microbiologie and 2Laboratoire de Recherche en Informatique, Universit\u00e9 Paris-Sud XI, 91405 Orsay Cedex, France"},{"name":"1 Institut de G\u00e9n\u00e9tique et Microbiologie and 2Laboratoire de Recherche en Informatique, Universit\u00e9 Paris-Sud XI, 91405 Orsay Cedex, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bernard","family":"Labedan","sequence":"additional","affiliation":[{"name":"1 Institut de G\u00e9n\u00e9tique et Microbiologie and 2Laboratoire de Recherche en Informatique, Universit\u00e9 Paris-Sud XI, 91405 Orsay Cedex, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christine","family":"Froidevaux","sequence":"additional","affiliation":[{"name":"1 Institut de G\u00e9n\u00e9tique et Microbiologie and 2Laboratoire de Recherche en Informatique, Universit\u00e9 Paris-Sud XI, 91405 Orsay Cedex, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,7,1]]},"reference":[{"key":"2023020210412591200_B1","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol"},{"key":"2023020210412591200_B2","doi-asserted-by":"crossref","first-page":"D154","DOI":"10.1093\/nar\/gki070","article-title":"The universal protein resource (UniProt)","volume":"33","author":"Bairoch","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023020210412591200_B3","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1186\/1471-2105-7-70","article-title":"BIOZON: a system for unification, management and analysis of heterogeneous biological data","volume":"7","author":"Birkland","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020210412591200_B4","doi-asserted-by":"crossref","first-page":"3533","DOI":"10.1093\/nar\/gkl471","article-title":"AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system","volume":"34","author":"Bryson","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023020210412591200_B5","first-page":"343","article-title":"Biomediator data integration and inference for functional annotation of anonymous sequences. In","author":"Cadag","year":"2007","journal-title":"Pacific Symposium on Biocomputing"},{"key":"2023020210412591200_B6","first-page":"187","article-title":"Genopage : a database of all protein modules encoded by completely sequenced genomes. In","author":"Cohen-Boulakia","year":"2002","journal-title":"Actes de JOBIM2002, Journ\u00e9es Ouvertes, Biologie, Informatique et Math\u00e9matiques"},{"key":"2023020210412591200_B7","doi-asserted-by":"crossref","first-page":"1069","DOI":"10.1142\/S0219720006002375","article-title":"Path-based systems to guide scientists in the maze of biological data sources","volume":"4","author":"Cohen-Boulakia","year":"2006","journal-title":"J. Bioinform. Comput. Biol"},{"key":"2023020210412591200_B8","doi-asserted-by":"crossref","first-page":"1301","DOI":"10.1093\/bioinformatics\/btm088","article-title":"BioGuideSRS: querying multiple sources with a user-centric perspective","volume":"23","author":"Cohen-Boulakia","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020210412591200_B9","doi-asserted-by":"crossref","first-page":"512","DOI":"10.1147\/sj.402.0512","article-title":"K2\/kleisli and gus: experiments in integrated access to genomic data sources","volume":"40","author":"Davidson","year":"2001","journal-title":"IBM Syst. J."},{"key":"2023020210412591200_B10","doi-asserted-by":"crossref","first-page":"3439","DOI":"10.1093\/bioinformatics\/bti525","article-title":"Biomart and bioconductor: a powerful link between biological databases and microarray data analysis","volume":"21","author":"Durinck","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020210412591200_B11","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1016\/S0076-6879(96)66010-8","article-title":"SRS: information retrieval system for molecular biology data banks","volume":"266","author":"Etzold","year":"1996","journal-title":"Methods Enzymol"},{"key":"2023020210412591200_B12","doi-asserted-by":"crossref","first-page":"3784","DOI":"10.1093\/nar\/gkg563","article-title":"Expasy: the proteomics server for in-depth protein knowledge and analysis","volume":"31","author":"Gasteiger","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023020210412591200_B13","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1093\/bioinformatics\/16.2.101","article-title":"Darwin v. 2.0: an interpreted computer language for the biosciences","volume":"16","author":"Gonnet","year":"2000","journal-title":"Bioinformatics"},{"key":"2023020210412591200_B14","first-page":"1131","article-title":"Orchestra: facilitating collaborative data sharing. In","author":"Green","year":"2007"},{"key":"2023020210412591200_B15","first-page":"158","article-title":"Integrating and warehousing liver gene expression data and related biomedical resources in GEDAW. In","volume-title":"Second International Workshop on Data Integration in the Life Sciences (DILS)","author":"Gu\u00e9rin","year":"2005"},{"key":"2023020210412591200_B16","first-page":"505","article-title":"Schema mediation in peer data management systems. In","author":"Halevy","year":"2003"},{"key":"2023020210412591200_B17","doi-asserted-by":"crossref","first-page":"D354","DOI":"10.1093\/nar\/gkj102","article-title":"From genomics to chemical genomics: new developments in KEGG","volume":"34","author":"Kanehisa","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023020210412591200_B18","doi-asserted-by":"crossref","first-page":"6083","DOI":"10.1093\/nar\/gki892","article-title":"Expansion of the BioCyc collection of pathway\/genome databases to 160 genomes","volume":"33","author":"Karp","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023020210412591200_B19","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1101\/gr.1645104","article-title":"Ensmart: a generic system for fast and flexible access to biological data","volume":"14","author":"Kasprzyk","year":"2004","journal-title":"Genome Res."},{"key":"2023020210412591200_B20","unstructured":"Klyne\n              J\n            \n          \n          Resource description framework (RDF): concepts and Abstract Syntax, recommendation\n          2004\n          \n            W3C: http:\/\/www.w3.org\/TR\/rdf-concepts\/"},{"key":"2023020210412591200_B21","doi-asserted-by":"crossref","first-page":"1961","DOI":"10.1101\/gr.393902","article-title":"A strategy to retrieve the whole set of protein modules in microbial proteomes","volume":"12","author":"Le Bouder-Langevin","year":"2002","journal-title":"Genome Res"},{"key":"2023020210412591200_B22","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1186\/1471-2105-7-170","article-title":"BioWarehouse: a bioinformatics database warehouse toolkit","volume":"7","author":"Lee","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020210412591200_B23","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1186\/1471-2148-7-237","article-title":"Assessing the evolutionary rate of positional orthologous genes in prokaryotes using synteny data","volume":"7","author":"Lemoine","year":"2007","journal-title":"BMC Evol. Biol"},{"key":"2023020210412591200_B24","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1186\/1471-2105-7-436","article-title":"ORENZA: a web resource for studying ORphan ENZyme activities","volume":"7","author":"Lespinet","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020210412591200_B25","first-page":"391","article-title":"Improved data retrieval using semantic transformation. In","volume-title":"DEXA, Lecture Notes in Computer Science","author":"Lowden","year":"2004"},{"key":"2023020210412591200_B26","article-title":"Using ontologies for database query reformulation. In","author":"Necib","year":"2004","journal-title":"ADBIS (Local Proceedings)"},{"key":"2023020210412591200_B27","doi-asserted-by":"crossref","first-page":"c2001.1","DOI":"10.1186\/gb-2002-3-2-comment2001","article-title":"The past, present and future of genome-wide re-annotation","volume":"3","author":"Ouzounis","year":"2002","journal-title":"Genome Biol"},{"key":"2023020210412591200_B28","doi-asserted-by":"crossref","first-page":"1598","DOI":"10.1126\/science.319.5870.1598","article-title":"DNA data. Proposal to \u2018Wikify\u2019 GenBank meets stiff resistance","volume":"319","author":"Pennisi","year":"2008","journal-title":"Science"},{"key":"2023020210412591200_B29","article-title":"SPARQL Query Language for RDF","author":"Prud'hommeaux","year":"2007"},{"key":"2023020210412591200_B30","doi-asserted-by":"crossref","first-page":"D61","DOI":"10.1093\/nar\/gkl842","article-title":"NCBI Reference Sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins","volume":"35","author":"Pruitt","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023020210412591200_B31","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1089\/omi.2006.10.114","article-title":"Genome reviews: standardizing content and representation of information about complete genomes","volume":"10","author":"Sterk","year":"2006","journal-title":"OMICS"},{"key":"2023020210412591200_B32","doi-asserted-by":"crossref","first-page":"184","DOI":"10.1093\/bioinformatics\/16.2.184","article-title":"TAMBIS: transparent access to multiple bioinformatics information sources","volume":"16","author":"Stevens","year":"2000","journal-title":"Bioinformatics"},{"key":"2023020210412591200_B33","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1186\/1471-2105-6-81","article-title":"Columba: an integrated database of proteins, structures, and annotations","volume":"6","author":"Trissl","year":"2005","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i322\/49054359\/bioinformatics_24_13_i322.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i322\/49054359\/bioinformatics_24_13_i322.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T12:27:49Z","timestamp":1675340869000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/13\/i322\/231381"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,7,1]]},"references-count":33,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2008,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn159","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,7,1]]},"published":{"date-parts":[[2008,7,1]]}}}