{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T15:15:12Z","timestamp":1764688512974,"version":"3.41.0"},"reference-count":41,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2004,9,1]],"date-time":"2004-09-01T00:00:00Z","timestamp":1093996800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGMOD Rec."],"published-print":{"date-parts":[[2004,9]]},"abstract":"<jats:p>This paper surveys the area of biological and genomic sources integration, which has recently become a major focus of the data integration research field. The challenges that an integration system for biological sources must face are due to several factors such as the variety and amount of data available, the representational heterogeneity of the data in the different sources, and the autonomy and differing capabilities of the sources.<\/jats:p>\n          <jats:p>This survey describes the main integration approaches that have been adopted. They include warehouse integration, mediator-based integration, and navigational integration. Then we look at the four major existing integration systems that have been developed for the biological domain: SRS, BioKleisli, TAMBIS, and DiscoveryLink. After analyzing these systems and mentioning a few others, we identify the pros and cons of the current approaches and systems and discuss what an integration system for biologists ought to be.<\/jats:p>","DOI":"10.1145\/1031570.1031583","type":"journal-article","created":{"date-parts":[[2005,11,9]],"date-time":"2005-11-09T22:23:27Z","timestamp":1131575007000},"page":"51-60","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":82,"title":["Integration of biological sources"],"prefix":"10.1145","volume":"33","author":[{"given":"Thomas","family":"Hernandez","sequence":"first","affiliation":[{"name":"Arizona State University, Tempe, AZ"}]},{"given":"Subbarao","family":"Kambhampati","sequence":"additional","affiliation":[{"name":"Arizona State University, Tempe, AZ"}]}],"member":"320","published-online":{"date-parts":[[2004,9]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the Sixth International Conference on Intelligent Systems for Molecular Biology (ISMB98)","author":"Baker P.","year":"1998","unstructured":"P. Baker , A. Brass , S. Bechhofer , C. Goble , N. Paton , and R. Stevens . TAMBIS: Transparent Access to Multiple Bioinformatics Information Sources . In Proceedings of the Sixth International Conference on Intelligent Systems for Molecular Biology (ISMB98) , 1998 . P. Baker, A. Brass, S. Bechhofer, C. Goble, N. Paton, and R. Stevens. TAMBIS: Transparent Access to Multiple Bioinformatics Information Sources. In Proceedings of the Sixth International Conference on Intelligent Systems for Molecular Biology (ISMB98), 1998."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkg120"},{"volume-title":"http:\/\/kilimanjaro.eas.asu.edu","year":"2004","key":"e_1_2_1_3_1","unstructured":"BibFinder. http:\/\/kilimanjaro.eas.asu.edu , 2004 . BibFinder. http:\/\/kilimanjaro.eas.asu.edu, 2004."},{"volume-title":"http:\/\/phanxipan.eas.asu.edu","year":"2003","key":"e_1_2_1_4_1","unstructured":"BioHavasu. http:\/\/phanxipan.eas.asu.edu , 2003 . BioHavasu. http:\/\/phanxipan.eas.asu.edu, 2003."},{"key":"e_1_2_1_5_1","volume-title":"http:\/\/www.bionavigator.com","author":"Solutions BioNavigator","year":"2003","unstructured":"BioNavigator Solutions . http:\/\/www.bionavigator.com , 2003 . BioNavigator Solutions. http:\/\/www.bionavigator.com, 2003."},{"key":"e_1_2_1_6_1","volume-title":"http:\/\/www.biosift.com\/products\/radia\/radia.asp","author":"Radia BioSift","year":"2003","unstructured":"BioSift Radia . http:\/\/www.biosift.com\/products\/radia\/radia.asp , 2003 . BioSift Radia. http:\/\/www.biosift.com\/products\/radia\/radia.asp, 2003."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/637411.637431"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the 21st VLDB Conference.","author":"Buneman P.","year":"1995","unstructured":"P. Buneman , S. Davidson , K. Hart , C. Overton , L. Wong . A Data Transformation System for Biological Data Sources . In Proceedings of the 21st VLDB Conference. 1995 . P. Buneman, S. Davidson, K. Hart, C. Overton, L. Wong. A Data Transformation System for Biological Data Sources. In Proceedings of the 21st VLDB Conference. 1995."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/637411.637421"},{"key":"e_1_2_1_10_1","volume-title":"G. De Giacomo and M. Lenzerini. On the Expressive Power of Data Integration Systems. In Proceedings of the 21st International Conference on Conceptual Modeling (ER 2002","author":"Cali A.","year":"2002","unstructured":"A. Cali , D. Calvanese , G. De Giacomo and M. Lenzerini. On the Expressive Power of Data Integration Systems. In Proceedings of the 21st International Conference on Conceptual Modeling (ER 2002 ). 2002 . A. Cali, D. Calvanese, G. De Giacomo and M. Lenzerini. On the Expressive Power of Data Integration Systems. In Proceedings of the 21st International Conference on Conceptual Modeling (ER 2002). 2002."},{"key":"e_1_2_1_11_1","unstructured":"DBCAT The Public Catalog of Databases. Infobiogen. http:\/\/www.infobiogen.fr\/services\/dbcat 2003.  DBCAT The Public Catalog of Databases. Infobiogen. http:\/\/www.infobiogen.fr\/services\/dbcat 2003."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1089\/cmb.1995.2.557"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1147\/sj.402.0512"},{"key":"e_1_2_1_14_1","volume-title":"The Kleisli Approach to Data Transformation and Integration","author":"Davidson S.","year":"2001","unstructured":"S. Davidson and L. Wong . The Kleisli Approach to Data Transformation and Integration . 2001 . S. Davidson and L. Wong. The Kleisli Approach to Data Transformation and Integration. 2001."},{"key":"e_1_2_1_15_1","volume-title":"http:\/\/www3.ebi.ac.uk\/Services\/DBStats","author":"Nucleotide Sequence Database Statistics EMBL","year":"2003","unstructured":"EMBL Nucleotide Sequence Database Statistics , http:\/\/www3.ebi.ac.uk\/Services\/DBStats , 2003 . EMBL Nucleotide Sequence Database Statistics, http:\/\/www3.ebi.ac.uk\/Services\/DBStats, 2003."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5555\/791212.791290"},{"key":"e_1_2_1_17_1","unstructured":"Entigen. BioNavigator - BioNode & BioNodeSA: Overview. http:\/\/www.entigen.com\/library 2001.  Entigen. BioNavigator - BioNode & BioNodeSA: Overview. http:\/\/www.entigen.com\/library 2001."},{"key":"e_1_2_1_18_1","volume-title":"http:\/\/www.ncbi.nlm.nih.gov\/Entrez","author":"Search Entrez","year":"2003","unstructured":"Entrez - Search and Retrieval System . http:\/\/www.ncbi.nlm.nih.gov\/Entrez , 2003 . Entrez - Search and Retrieval System. http:\/\/www.ncbi.nlm.nih.gov\/Entrez, 2003."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/290593.290605"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the National Conference on Artificial Intelligence (AAAI), 67--73","author":"Friedman M.","year":"1999","unstructured":"M. Friedman , A. Levy , and T. Millstein . Navigational Plans For Data Integration . In Proceedings of the National Conference on Artificial Intelligence (AAAI), 67--73 , 1999 . M. Friedman, A. Levy, and T. Millstein. Navigational Plans For Data Integration. In Proceedings of the National Conference on Artificial Intelligence (AAAI), 67--73, 1999."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/SSDM.2000.869777"},{"key":"e_1_2_1_22_1","volume-title":"http:\/\/www.gusdb.org","author":"Functional Genomics The GUS","year":"2003","unstructured":"The GUS Platform for Functional Genomics . http:\/\/www.gusdb.org , 2003 . The GUS Platform for Functional Genomics. http:\/\/www.gusdb.org, 2003."},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the 2003 CIDR Conference.","author":"Hammer J.","year":"2003","unstructured":"J. Hammer and M. Schneider . Genomics Algebra: A New, Integrating Data Model, Language, and Tool Processing and Querying Genomic Information . In Proceedings of the 2003 CIDR Conference. 2003 . J. Hammer and M. Schneider. Genomics Algebra: A New, Integrating Data Model, Language, and Tool Processing and Querying Genomic Information. In Proceedings of the 2003 CIDR Conference. 2003."},{"issue":"1","key":"e_1_2_1_24_1","first-page":"31","volume":"22","author":"Haas L. M.","year":"1999","unstructured":"L. M. Haas , R. J. Miller , B. Niswonger , M. Tork Roth , P. M. Schwarz , and E. L. Wimmers . Transforming Heterogeneous Data with Database Middleware: Beyond Integration. IEEE Data Engineering Bulletin , 22 ( 1 ), 31 -- 36 , 1999 . L. M. Haas, R. J. Miller, B. Niswonger, M. Tork Roth, P. M. Schwarz, and E. L. Wimmers. Transforming Heterogeneous Data with Database Middleware: Beyond Integration. IEEE Data Engineering Bulletin, 22(1), 31--36, 1999.","journal-title":"Transforming Heterogeneous Data with Database Middleware: Beyond Integration. IEEE Data Engineering Bulletin"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/791211.791268"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1147\/sj.402.0489"},{"issue":"1","key":"e_1_2_1_27_1","first-page":"145","article-title":"The ARIADNE Approach to Web-based Information Integration. The Journal on Intelligent Cooperative Information Systems (IJCIS) Special Issue on Intelligent Information Agents","volume":"10","author":"Knoblock C.","year":"2001","unstructured":"C. Knoblock , S. Minton , J. L. Ambite , N. Ashish , I. Muslea , A. G. Philpot , and S. Tejada . The ARIADNE Approach to Web-based Information Integration. The Journal on Intelligent Cooperative Information Systems (IJCIS) Special Issue on Intelligent Information Agents : Theory and Applications , 10 ( 1\/2 ), 145 -- 169 , 2001 . C. Knoblock, S. Minton, J. L. Ambite, N. Ashish, I. Muslea, A. G. Philpot, and S. Tejada. The ARIADNE Approach to Web-based Information Integration. The Journal on Intelligent Cooperative Information Systems (IJCIS) Special Issue on Intelligent Information Agents: Theory and Applications, 10(1\/2), 145--169, 2001.","journal-title":"Theory and Applications"},{"key":"e_1_2_1_28_1","volume-title":"M-E. Vidal. Exploring Life Sciences Data Sources. Proceedings of IJCAI-03 Workshop on Information Integration on the Web","author":"Lacroix Z.","year":"2003","unstructured":"Z. Lacroix , F. Naumann , L. Raschid , and M-E. Vidal. Exploring Life Sciences Data Sources. Proceedings of IJCAI-03 Workshop on Information Integration on the Web , 2003 . Z. Lacroix, F. Naumann, L. Raschid, and M-E. Vidal. Exploring Life Sciences Data Sources. Proceedings of IJCAI-03 Workshop on Information Integration on the Web, 2003."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/543613.543644"},{"key":"e_1_2_1_30_1","unstructured":"R. Lopez. SRS - Sequence Retrieval System. Presentation. http:\/\/www.pdg.cnb.uam.es\/cursos\/BioInfo2001\/pages\/SRS\/. Universidad Autonoma de Madrid 2001.  R. Lopez. SRS - Sequence Retrieval System. Presentation. http:\/\/www.pdg.cnb.uam.es\/cursos\/BioInfo2001\/pages\/SRS\/. Universidad Autonoma de Madrid 2001."},{"key":"e_1_2_1_31_1","volume-title":"Model-Based Mediation with Domain Maps. 17th Intl. Conference on Data Engineering (ICDE)","author":"Ludascher B.","year":"2001","unstructured":"B. Ludascher , A. Gupta , M. E. Martone . Model-Based Mediation with Domain Maps. 17th Intl. Conference on Data Engineering (ICDE) , 2001 . B. Ludascher, A. Gupta, M. E. Martone. Model-Based Mediation with Domain Maps. 17th Intl. Conference on Data Engineering (ICDE), 2001."},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the 27th VLDB Conference.","author":"Manolescu I.","year":"2001","unstructured":"I. Manolescu , D. Florescu , and D. Kossmann . Answering XML Queries over Heterogeneous Data Sources . In Proceedings of the 27th VLDB Conference. 2001 . I. Manolescu, D. Florescu, and D. Kossmann. Answering XML Queries over Heterogeneous Data Sources. In Proceedings of the 27th VLDB Conference. 2001."},{"key":"e_1_2_1_33_1","volume-title":"A Decentralized Approach to the Integration of Life Science Web Databases. Informatica. 27(1)","author":"Miled Z. Ben","year":"2003","unstructured":"Z. Ben Miled , N. Li , M. Baumgartner and Y. Liu . A Decentralized Approach to the Integration of Life Science Web Databases. Informatica. 27(1) , 2003 . Z. Ben Miled, N. Li, M. Baumgartner and Y. Liu. A Decentralized Approach to the Integration of Life Science Web Databases. Informatica. 27(1), 2003."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the Symposium of the American Medical Informatics Association","author":"Mork P.","year":"2001","unstructured":"P. Mork , A. Halevy , and P. Tarczy-Hornoch . A Model for Data Integration Systems of Biomedical Data Applied to Online Genetic Databases . In Proceedings of the Symposium of the American Medical Informatics Association , 2001 . P. Mork, A. Halevy, and P. Tarczy-Hornoch. A Model for Data Integration Systems of Biomedical Data Applied to Online Genetic Databases. In Proceedings of the Symposium of the American Medical Informatics Association, 2001."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/502585.502623"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.5555\/977401.978117"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/SSDM.1999.787629"},{"key":"e_1_2_1_39_1","volume-title":"Information Management for Genome Level Bioinformatics. VLDB 2001 Tutorial.","author":"Paton N.","year":"2001","unstructured":"N. Paton and C. Goble . Information Management for Genome Level Bioinformatics. VLDB 2001 Tutorial. 2001 . N. Paton and C. Goble. Information Management for Genome Level Bioinformatics. VLDB 2001 Tutorial. 2001."},{"key":"e_1_2_1_40_1","unstructured":"SRS Documentation. SRS at the European Bioinformatics Institute. http:\/\/srs.ebi.ac.uk\/ 2003.  SRS Documentation. SRS at the European Bioinformatics Institute. http:\/\/srs.ebi.ac.uk\/ 2003."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/96602.96604"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.5555\/565771.565776"}],"container-title":["ACM SIGMOD Record"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1031570.1031583","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1031570.1031583","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T16:25:03Z","timestamp":1750263903000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1031570.1031583"}},"subtitle":["current systems and challenges ahead"],"short-title":[],"issued":{"date-parts":[[2004,9]]},"references-count":41,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2004,9]]}},"alternative-id":["10.1145\/1031570.1031583"],"URL":"https:\/\/doi.org\/10.1145\/1031570.1031583","relation":{},"ISSN":["0163-5808"],"issn-type":[{"type":"print","value":"0163-5808"}],"subject":[],"published":{"date-parts":[[2004,9]]},"assertion":[{"value":"2004-09-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}