{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T08:43:58Z","timestamp":1762505038980,"version":"3.41.0"},"reference-count":37,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2005,2,1]],"date-time":"2005-02-01T00:00:00Z","timestamp":1107216000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Internet Technol."],"published-print":{"date-parts":[[2005,2]]},"abstract":"<jats:p>XML is emerging as a major standard for representing data on the World Wide Web. Recently, many XML storage models have been proposed to manage XML data. In order to assess an XML database's abilities to deal with XML queries, several benchmarks have also been proposed, including XMark and XMach. However, no reported studies using those benchmarks were found that can provide users with insights on the impacts of a variety of storage models on XML query performance. In this article, we report our first set of results on benchmarking a set of XML database implementations using two XML benchmarks. The selected implementations represent a wide range of approaches, including RDBMS-based systems with document-independent and document-dependent XML-relational schema mapping approaches, and XML native engines based on an Object-Oriented Model and the Document Object Model. Comprehensive experiments were conducted to study relative performance of different approaches and the important issues that affect XML query performance, such as path expression query processing, effectiveness of various partitioning, label-path, and indexing structures.<\/jats:p>","DOI":"10.1145\/1052934.1052940","type":"journal-article","created":{"date-parts":[[2005,8,3]],"date-time":"2005-08-03T08:30:55Z","timestamp":1123057855000},"page":"154-194","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["What makes the differences: benchmarking XML database implementations"],"prefix":"10.1145","volume":"5","author":[{"given":"Hongjun","family":"Lu","sequence":"first","affiliation":[{"name":"The Hong Kong University of Science and Technology, Hong Kong, China"}]},{"given":"Jeffrey Xu","family":"Yu","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Hong Kong, China"}]},{"given":"Guoren","family":"Wang","sequence":"additional","affiliation":[{"name":"Northeastern University, Shenyang, China"}]},{"given":"Shihui","family":"Zheng","sequence":"additional","affiliation":[{"name":"Fudan University, Shanghai, China"}]},{"given":"Haifeng","family":"Jiang","sequence":"additional","affiliation":[{"name":"The Hong Kong University of Science and Technology, Hong Kong, China"}]},{"given":"Ge","family":"Yu","sequence":"additional","affiliation":[{"name":"Northeastern University, Shenyang, China"}]},{"given":"Aoying","family":"Zhou","sequence":"additional","affiliation":[{"name":"Fudan University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2005,2]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/s007990050005"},{"volume-title":"Proceedings of Datenbanksysteme in B\u00fcro, Technik und Wissenschaft (BTW)","author":"B\u00f6hme T.","key":"e_1_2_1_2_1","unstructured":"B\u00f6hme , T. and Rahm , E . 2001. XMach-1: A benchmark for XML data management . In Proceedings of Datenbanksysteme in B\u00fcro, Technik und Wissenschaft (BTW) . Oldenburg, Germany, 264--273. B\u00f6hme, T. and Rahm, E. 2001. XMach-1: A benchmark for XML data management. In Proceedings of Datenbanksysteme in B\u00fcro, Technik und Wissenschaft (BTW). Oldenburg, Germany, 264--273."},{"key":"e_1_2_1_3_1","unstructured":"Berglund A. Boag S. Chamberlin D. Fernandez M. F. Kay M. Robie J. and Simeon J. 2002. XML path language (XPath) 2.0. Tech. rep.  Berglund A. Boag S. Chamberlin D. Fernandez M. F. Kay M. Robie J. and Simeon J. 2002. XML path language (XPath) 2.0. Tech. rep."},{"key":"e_1_2_1_4_1","unstructured":"Boag S. Chamberlin D. Fernandez M. F. Florescu D. Robie J. and Simeon J. 2002. XQuery 1.0: An XML query language. In W3C Working Draft 16 August 2002.  Boag S. Chamberlin D. Fernandez M. F. Florescu D. Robie J. and Simeon J. 2002. XQuery 1.0: An XML query language. In W3C Working Draft 16 August 2002."},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of VLDB 2002 Workshop EEXTT Lecture Notes in Computer Science","volume":"2590","author":"B\u00f6hme T.","unstructured":"B\u00f6hme , T. and Rahm , E . 2002. Multi-user evaluation of XML data management systems with xmach-1 . In Proceedings of VLDB 2002 Workshop EEXTT Lecture Notes in Computer Science , vol. 2590 . 148--159. B\u00f6hme, T. and Rahm, E. 2002. Multi-user evaluation of XML data management systems with xmach-1. In Proceedings of VLDB 2002 Workshop EEXTT Lecture Notes in Computer Science, vol. 2590. 148--159."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/344788.344822"},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science","volume":"147","author":"Brassan S.","unstructured":"Brassan , S. , Lee , M. L. , Li , Y. G. , Lacroix , Z. , and Nambiar , U . 2002. The XOO7 benchmark . In Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science Vol. 2590). 146-- 147 . Brassan, S., Lee, M. L., Li, Y. G., Lacroix, Z., and Nambiar, U. 2002. The XOO7 benchmark. In Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science Vol. 2590). 146--147."},{"volume-title":"Proceedings of the 8th International World Wide Web Conference.","author":"Ceri S.","key":"e_1_2_1_8_1","unstructured":"Ceri , S. , Comai , S. , Damiani , E. , Fraternali , P. , Paraboschi , S. , and Tanca , L . 1999. XML-GL: a graphical language for querying and restructuring XML documents . In Proceedings of the 8th International World Wide Web Conference. ( Toronto, Canada). 1171--1187. Ceri, S., Comai, S., Damiani, E., Fraternali, P., Paraboschi, S., and Tanca, L. 1999. XML-GL: a graphical language for querying and restructuring XML documents. In Proceedings of the 8th International World Wide Web Conference. (Toronto, Canada). 1171--1187."},{"volume-title":"WebDB (Informal Proceedings). 53--62","author":"Chamberlin D. D.","key":"e_1_2_1_9_1","unstructured":"Chamberlin , D. D. , Robie , J. , and Florescu , D . 2000. Quilt: An XML query language for heterogeneous data sources . In WebDB (Informal Proceedings). 53--62 . Chamberlin, D. D., Robie, J., and Florescu, D. 2000. Quilt: An XML query language for heterogeneous data sources. In WebDB (Informal Proceedings). 53--62."},{"volume-title":"Proceedings of the 28th International Conference on Very Large Data Bases","author":"Chien S. Y.","key":"e_1_2_1_10_1","unstructured":"Chien , S. Y. , Vagena , Z. , Zhang , D. , Tsotras , V. , and Zaniolo , C . 2002. Efficient structural joins on indexed XML documents . In Proceedings of the 28th International Conference on Very Large Data Bases . Hong Kong, China. 263--274. Chien, S. Y., Vagena, Z., Zhang, D., Tsotras, V., and Zaniolo, C. 2002. Efficient structural joins on indexed XML documents. In Proceedings of the 28th International Conference on Very Large Data Bases. Hong Kong, China. 263--274."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/191839.191901"},{"volume-title":"Proceedings of the 8th International World Wide Web Conference","author":"Deutsch A.","key":"e_1_2_1_12_1","unstructured":"Deutsch , A. , Fernandez , M. , and Florescu , D . 1999. A query language for XML . In Proceedings of the 8th International World Wide Web Conference . Toronto, Canada. 1155--1169. Deutsch, A., Fernandez, M., and Florescu, D. 1999. A query language for XML. In Proceedings of the 8th International World Wide Web Conference. Toronto, Canada. 1155--1169."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/304182.304220"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/276304.276341"},{"key":"e_1_2_1_15_1","unstructured":"Florescu D. and Kossmann D. 1999. A performance evaluation of alternative mapping schemes for storing XML data in a relational database. Survey report.  Florescu D. and Kossmann D. 1999. A performance evaluation of alternative mapping schemes for storing XML data in a relational database. Survey report."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/564691.564705"},{"volume-title":"Proceedings of the 13th Australasian Database Conference (ADC2002)","author":"Jiang H.","key":"e_1_2_1_17_1","unstructured":"Jiang , H. , Lu , H. , Wang , W. , and Yu , J. X . 2002a. Path materialization revisited: An efficient storage model for XML data . In Proceedings of the 13th Australasian Database Conference (ADC2002) . (Melbourne, Australia). 85--94. Jiang, H., Lu, H., Wang, W., and Yu, J. X. 2002a. Path materialization revisited: An efficient storage model for XML data. In Proceedings of the 13th Australasian Database Conference (ADC2002). (Melbourne, Australia). 85--94."},{"volume-title":"Proceedings of the 18th International Conference on Data Engineering.","author":"Jiang H.","key":"e_1_2_1_18_1","unstructured":"Jiang , H. , Lu , H. , Wang , W. , and Yu , J. X . 2002b. XParent: An efficient RDBMS-based XML database system . In Proceedings of the 18th International Conference on Data Engineering. ( San Jose, CA). 335--336. Jiang, H., Lu, H., Wang, W., and Yu, J. X. 2002b. XParent: An efficient RDBMS-based XML database system. In Proceedings of the 18th International Conference on Data Engineering. (San Jose, CA). 335--336."},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science","volume":"161","author":"Kanda Runapongsa M.","unstructured":"Kanda Runapongsa , Jignesh M. Patel , H. J. and Al-Khalifa , S . 2002. The Michigan benchmark: A microbenchmark for XML query processing systems . In Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science Vol. 2590). 160-- 161 . Kanda Runapongsa, Jignesh M. Patel, H. J. and Al-Khalifa, S. 2002. The Michigan benchmark: A microbenchmark for XML query processing systems. In Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science Vol. 2590). 160--161."},{"volume-title":"Proceedings of the 16th International Conference on Data Engineering.","author":"Kanne C. C.","key":"e_1_2_1_20_1","unstructured":"Kanne , C. C. and Moerkotte , G . 2000. Efficient storage of XML data . In Proceedings of the 16th International Conference on Data Engineering. ( San Diego, CA). 198--198. Kanne, C. C. and Moerkotte, G. 2000. Efficient storage of XML data. In Proceedings of the 16th International Conference on Data Engineering. (San Diego, CA). 198--198."},{"volume-title":"Proceedings of the 17th International Conference on Data Engineering.","author":"Kha D. D.","key":"e_1_2_1_21_1","unstructured":"Kha , D. D. , Yoshikawa , M. , and Uemura , S . 2001. An XML indexing structure with relative region coordinate . In Proceedings of the 17th International Conference on Data Engineering. ( Heidelberg, Germany). 313--320. Kha, D. D., Yoshikawa, M., and Uemura, S. 2001. An XML indexing structure with relative region coordinate. In Proceedings of the 17th International Conference on Data Engineering. (Heidelberg, Germany). 313--320."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/362084.362140"},{"volume-title":"Proceedings of the 27th International Conference on Very Large Data Bases.","author":"Li Q.","key":"e_1_2_1_23_1","unstructured":"Li , Q. and Moon , B . 2001. Indexing and querying XML data for regular path expressions . In Proceedings of the 27th International Conference on Very Large Data Bases. ( Rome, Italy). 361--370. Li, Q. and Moon, B. 2001. Indexing and querying XML data for regular path expressions. In Proceedings of the 27th International Conference on Very Large Data Bases. (Rome, Italy). 361--370."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/564691.564785"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 3rd International Conference on Web-Age Information Management. (Lecture Notes in Computer Science","volume":"24","author":"Lv J.","unstructured":"Lv , J. , Wang , G. , Yu , J. X. , Yu , G. , Lu , H. , and Sun , B . 2002. Performance evaluation of a Dom-based xml database: Storage, indexing, and query optimization . In Proceedings of the 3rd International Conference on Web-Age Information Management. (Lecture Notes in Computer Science Vol. 2419). 13-- 24 . Lv, J., Wang, G., Yu, J. X., Yu, G., Lu, H., and Sun, B. 2002. Performance evaluation of a Dom-based xml database: Storage, indexing, and query optimization. In Proceedings of the 3rd International Conference on Web-Age Information Management. (Lecture Notes in Computer Science Vol. 2419). 13--24."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/262762.262770"},{"volume-title":"International Workshop on the Web and Databases (Informal Proceedings). 47--52","author":"Schmidt A.","key":"e_1_2_1_27_1","unstructured":"Schmidt , A. , Kersten , M. L. , Windhouwer , M. , and Waas , F . 2000. Efficient relational storage and retrieval of XML documents . In International Workshop on the Web and Databases (Informal Proceedings). 47--52 . Schmidt, A., Kersten, M. L., Windhouwer, M., and Waas, F. 2000. Efficient relational storage and retrieval of XML documents. In International Workshop on the Web and Databases (Informal Proceedings). 47--52."},{"volume-title":"Proceedings of the 28th International Conference on Very Large Data Bases.","author":"Schmidt A.","key":"e_1_2_1_28_1","unstructured":"Schmidt , A. , Waas , F. , Kersten , M. , Carey , M. J. , Manolescu , I. , and Busse , R . 2002. Xmark: A benchmark for XML data management . In Proceedings of the 28th International Conference on Very Large Data Bases. ( Hong Kong, China). 974--985. Schmidt, A., Waas, F., Kersten, M., Carey, M. J., Manolescu, I., and Busse, R. 2002. Xmark: A benchmark for XML data management. In Proceedings of the 28th International Conference on Very Large Data Bases. (Hong Kong, China). 974--985."},{"key":"e_1_2_1_29_1","unstructured":"Schmidt A. Waas F. Kersten M. Florescu D. Manolescu L. Carey M. J. and Busse R. 2001. The XML benchmark project. Tech. rep. CWI Amsterdam The Netherlands.   Schmidt A. Waas F. Kersten M. Florescu D. Manolescu L. Carey M. J. and Busse R. 2001. The XML benchmark project. Tech. rep. CWI Amsterdam The Netherlands."},{"volume-title":"Proceedings of the 25th International Conference on Very Large Data Bases.","author":"Shanmugasundaram J.","key":"e_1_2_1_30_1","unstructured":"Shanmugasundaram , J. , Tufte , K. , Zhang , C. , Gang , H. , DeWitt , D. J. , and Naughton , J. F . 1999. Relational databases for querying XML documents: Limitations and opportunities . In Proceedings of the 25th International Conference on Very Large Data Bases. ( Edinburgh, Scotland) UK. 302--314. Shanmugasundaram, J., Tufte, K., Zhang, C., Gang, H., DeWitt, D. J., and Naughton, J. F. 1999. Relational databases for querying XML documents: Limitations and opportunities. In Proceedings of the 25th International Conference on Very Large Data Bases. (Edinburgh, Scotland) UK. 302--314."},{"volume-title":"Proceedings of the 18th International Conference on Data Engineering.","author":"Srivastava D.","key":"e_1_2_1_31_1","unstructured":"Srivastava , D. , Al-Khalifa , S. , Jagadish , H. V. , Koudas , N. , Patel , J. M. , and Yuqing , W. U . 2002. Structural joins: A primitive for efficient XML query pattern matching . In Proceedings of the 18th International Conference on Data Engineering. ( San Jose, CA). 141--152. Srivastava, D., Al-Khalifa, S., Jagadish, H. V., Koudas, N., Patel, J. M., and Yuqing, W. U. 2002. Structural joins: A primitive for efficient XML query pattern matching. In Proceedings of the 18th International Conference on Data Engineering. (San Jose, CA). 141--152."},{"key":"e_1_2_1_32_1","unstructured":"Tian F. DeWitt D. J. Chen J. and Zhang C. 2000. The design and performance evaluation of altervative XML storage strategies. Tech. rep. Computer Science Department University of Wisconsin Madison WI.  Tian F. DeWitt D. J. Chen J. and Zhang C. 2000. The design and performance evaluation of altervative XML storage strategies. Tech. rep. Computer Science Department University of Wisconsin Madison WI."},{"key":"e_1_2_1_33_1","unstructured":"W3C. Document object model (DOM). http:\/\/www.w3.org\/DOM\/.  W3C. Document object model (DOM). http:\/\/www.w3.org\/DOM\/."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science","volume":"163","author":"Yao B. B.","unstructured":"Yao , B. B. , \u00d6zsu , M. T. , and Keenleyside , J . 2002. XBench---A family of benchmarks for XML DBMSs . In Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science Vol. 2590). 162-- 163 . Yao, B. B., \u00d6zsu, M. T., and Keenleyside, J. 2002. XBench---A family of benchmarks for XML DBMSs. In Proceedings of Very Large Data Bases 2002 Workshop EEXTT. (Lecture Notes in Computer Science Vol. 2590). 162--163."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/383034.383038"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/375663.375722"},{"volume-title":"Proceedings of the 27th International Conference on Very Large Data Bases.","author":"Zhou A.","key":"e_1_2_1_37_1","unstructured":"Zhou , A. , Lu , H. , Zheng , S. , Liang , Y. , Zhang , L. , Ji , W. , and Tian , Z . 2001. VXMLR: A visual XML-relational database system . In Proceedings of the 27th International Conference on Very Large Data Bases. ( Rome, Italy). 719--720. Zhou, A., Lu, H., Zheng, S., Liang, Y., Zhang, L., Ji, W., and Tian, Z. 2001. VXMLR: A visual XML-relational database system. In Proceedings of the 27th International Conference on Very Large Data Bases. (Rome, Italy). 719--720."}],"container-title":["ACM Transactions on Internet Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1052934.1052940","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1052934.1052940","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:43:28Z","timestamp":1750286608000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1052934.1052940"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,2]]},"references-count":37,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2005,2]]}},"alternative-id":["10.1145\/1052934.1052940"],"URL":"https:\/\/doi.org\/10.1145\/1052934.1052940","relation":{},"ISSN":["1533-5399","1557-6051"],"issn-type":[{"type":"print","value":"1533-5399"},{"type":"electronic","value":"1557-6051"}],"subject":[],"published":{"date-parts":[[2005,2]]},"assertion":[{"value":"2005-02-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}