{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T18:05:18Z","timestamp":1754157918216,"version":"3.41.2"},"reference-count":27,"publisher":"Emerald","issue":"4","license":[{"start":{"date-parts":[[2012,11,16]],"date-time":"2012-11-16T00:00:00Z","timestamp":1353024000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,11,16]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-heading\">Purpose<\/jats:title><jats:p>Designing efficient XML schemas is essential for XML applications which manage semi\u2010structured data. When generating XML schemas, there are two opposing goals: to avoid redundancy and to provide connected structures in order to achieve good performance on queries. In general, highly connected XML structures allow data redundancy, and redundancy\u2010free schemas generate disconnected XML structures. The purpose of this paper is to describe and experimentally evaluate an approach which balances this trade\u2010off through a workload analysis. 
Additionally, it aims to identify the most accessed data based on the workload and suggest indexes to improve access performance.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Design\/methodology\/approach<\/jats:title><jats:p>The paper applies and evaluates a workload\u2010aware methodology to provide indexing and highly connected structures for data which are intensively accessed through paths traversed by the workload.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Findings<\/jats:title><jats:p>The paper presents benchmarking results on a set of design approaches for XML schemas and demonstrates that the XML schemas generated by the approach provide high query performance and a low cost of data redundancy when balancing the trade\u2010off in XML schema design.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Research limitations\/implications<\/jats:title><jats:p>Although an XML benchmark is applied in these experiments, further experiments are expected in a real\u2010world application.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Practical implications<\/jats:title><jats:p>The proposed approach may be applied in a real\u2010world process for designing new XML databases as well as in a reverse engineering process to improve XML schemas from legacy databases.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Originality\/value<\/jats:title><jats:p>Unlike related work, the reported approach integrates the two opposite goals of XML schema design, and generates suitable schemas according to a workload. 
An experimental evaluation shows that the proposed methodology is promising.<\/jats:p><\/jats:sec>","DOI":"10.1108\/17440081211282874","type":"journal-article","created":{"date-parts":[[2012,11,22]],"date-time":"2012-11-22T05:03:57Z","timestamp":1353560637000},"page":"371-389","source":"Crossref","is-referenced-by-count":0,"title":["On evaluating an approach for balancing the trade\u2010off on XML schema design"],"prefix":"10.1108","volume":"8","author":[{"given":"Rebeca","family":"Schroeder","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Denio","family":"Duarte","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ronaldo","family":"dos Santos Mello","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","reference":[{"key":"key2022012820281139700_b1","doi-asserted-by":"crossref","unstructured":"Arenas, M. and Libkin, L. (2002), \u201cA normal form for XML documents\u201d, Proceedings of the Twenty\u2010first ACM SIGMOD\u2010SIGACT\u2010SIGART (PODS '02), ACM, New York, NY, pp. 85\u201096.","DOI":"10.1145\/543613.543625"},{"key":"key2022012820281139700_b2","doi-asserted-by":"crossref","unstructured":"Barbosa, D., Mendelzon, A., Keenleyside, J. and Lyons, K. (2002), \u201cToXgene: a template\u2010based data generator for XML\u201d, Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data (SIGMOD '02), ACM, New York, NY, pp. 616\u201021.","DOI":"10.1145\/564691.564769"},{"key":"key2022012820281139700_b3","unstructured":"Batini, C., Ceri, S. and Navathe, S. (1992), Conceptual Database Design: An Entity\u2010relationship Approach, Benjamin Cummings Publishing Company, Redwood City, CA."},{"key":"key2022012820281139700_b4","unstructured":"Bradford, T., Gritsenko, V. and O'Neill, K. 
(2011), \u201cApache xindice\u201d, available at: http:\/\/xml.apache.org\/xindice\/."},{"key":"key2022012820281139700_b5","doi-asserted-by":"crossref","unstructured":"Brantner, M. (2009), \u201cSausalito: an application servers for RESTful services in the cloud\u201d, Proceedings of the 13th East European Conference on Advances in Databases and Information Systems, Springer, Berlin.","DOI":"10.1007\/978-3-642-03973-7_1"},{"key":"key2022012820281139700_b6","doi-asserted-by":"crossref","unstructured":"Bird, L., Goodchild, A. and Halpin, T.A. (2000), \u201cObject role modeling and XML\u2010schema\u201d, International Conference on Conceptual Modeling, Springer, Salt Lake City, UT, pp. 309\u201022.","DOI":"10.1007\/3-540-45393-8_23"},{"key":"key2022012820281139700_b7","doi-asserted-by":"crossref","unstructured":"Curino, C., Jones, E., Zhang, Y. and Madden, S. (2010), \u201cSchism: a workload\u2010driven approach to database replication and partitioning\u201d, Proceedings of the VLDB Endowment, Vol. 3 No. 1, pp. 57\u201084 (Singapore, Malaysia).","DOI":"10.14778\/1920841.1920853"},{"key":"key2022012820281139700_b8","doi-asserted-by":"crossref","unstructured":"Elmasri, R., Weeldreyer, J. and Hevner, A. (1985), \u201cThe category concept: an extension to the entity\u2010relationship model\u201d, Data Knowl. Eng., Vol. 1 No. 1, pp. 75\u2010116.","DOI":"10.1016\/0169-023X(85)90027-8"},{"key":"key2022012820281139700_b9","doi-asserted-by":"crossref","unstructured":"Embley, D., Liddle, S. and Kamha, S. (2004), \u201cEnterprise modeling with conceptual XML\u201d, International Conference on Conceptual Modeling, Shanghai, China, Lecture Notes in Computer Science, Springer, Berlin, pp. 150\u201065.","DOI":"10.1007\/978-3-540-30464-7_13"},{"key":"key2022012820281139700_b10","doi-asserted-by":"crossref","unstructured":"Fong, J., Fong, A., Wong, H.K. and Yu, P. 
(2006), \u201cTranslating relational schema with constraints into XML schema\u201d, International Journal of Software Engineering and Knowledge Engineering IJSEKE, Vol. 16 No. 2, pp. 201\u201043.","DOI":"10.1142\/S0218194006002744"},{"key":"key2022012820281139700_b11","doi-asserted-by":"crossref","unstructured":"Kudrass, T. and Krumbein, T. (2003), Rule\u2010based Generation of XML Schemas from UML Class Diagrams, Berliner XML Tage, XML\u2010Clearinghouse, Berlin, pp. 213\u201027.","DOI":"10.1007\/978-3-540-39403-7_26"},{"key":"key2022012820281139700_b12","unstructured":"Ley, M. (2011), \u201cThe DBLP computer science bibliography\u201d, availabe at: www.informatik.uni\u2010trier.de\/\u223cley\/db\/."},{"key":"key2022012820281139700_b13","doi-asserted-by":"crossref","unstructured":"Mani, M. (2004), \u201cEReX: a conceptual model for XML\u201d, Database and XML Technologies, Lecture Notes in Computer Science, Springer, Berlin, pp. 128\u201042.","DOI":"10.1007\/978-3-540-30081-6_10"},{"key":"key2022012820281139700_b14","unstructured":"Miklau, G. (2012), \u201cXMLData repository\u201d, availabe at: www.cs.washington.edu\/research\/xmldatasets\/."},{"key":"key2022012820281139700_b15","doi-asserted-by":"crossref","unstructured":"Mok, W.Y. and Embley, D.W. (2006), \u201cGenerating compact redundancy\u2010free XML documents from conceptual\u2010model hypergraphs\u201d, IEEE Trans. on Knowl. Data Eng., Vol. 18 No. 8, pp. 1082\u201096.","DOI":"10.1109\/TKDE.2006.125"},{"key":"key2022012820281139700_b17","doi-asserted-by":"crossref","unstructured":"Moro, M.M., Lipyeow, L. and Chang, Y.\u2010C. (2007), \u201cSchema advisor for hybrid relational\u2010XML DBMS\u201d, Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, ACM, Beijing, pp. 959\u201070.","DOI":"10.1145\/1247480.1247592"},{"key":"key2022012820281139700_b16","doi-asserted-by":"crossref","unstructured":"Moro, M.M., Braganholo, V., Dorneles, C.F., Duarte, D., Galante, R. and Mello, R.S. 
(2009), \u201cXML: some papers in a haystack\u201d, SIGMOD Rec., Vol. 38 No. 2, pp. 29\u201034.","DOI":"10.1145\/1815918.1815924"},{"key":"key2022012820281139700_b18","doi-asserted-by":"crossref","unstructured":"Nicola, M., Kogan, I. and Schiefer, B. (2007), \u201cAn XML transaction processing benchmark\u201d, Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data (SIGMOD '07), ACM, New York, NY, pp. 937\u201048.","DOI":"10.1145\/1247480.1247590"},{"key":"key2022012820281139700_b19","unstructured":"Pigozzo, P. and Quintarelli, E. (2005), \u201cAn algorithm for generating XML schemas from ER schemas\u201d, Proceedings of the Thirteenth Italian Symposium on Advanced Database Systems, SEBD 2005, Brixen\u2010Bressanone (near Bozen\u2010Bolzano), Italy, pp. 192\u20109."},{"key":"key2022012820281139700_b20","unstructured":"Routledge, N., Bird, L. and Goodchild, A. (2002), \u201cUML and XML schema\u201d, Proceedings of the 13th Australasian Database Conference (ADC '02), Vol. 5, Australian Computer Society, Darlinghurst, pp. 157\u201066."},{"key":"key2022012820281139700_b24","unstructured":"Sch\u00f6ning, H. (2001), \u201cTamino \u2013 a DBMS designed for XML\u2010schema\u201d, Proceedings of the 17th International Conference on Data Engineering, IEEE Computer Society, Washington, DC, pp. 149\u201054."},{"key":"key2022012820281139700_b21","doi-asserted-by":"crossref","unstructured":"Schroeder, R. and Mello, R.S. (2008), \u201cImproving query performance on XML documents: a workload\u2010driven design approach\u201d, Proceedings of the Eighth ACM Symposium on Document Engineering, ACM, New York, NY, pp. 177\u201086.","DOI":"10.1145\/1410140.1410176"},{"key":"key2022012820281139700_b22","doi-asserted-by":"crossref","unstructured":"Schroeder, R. and Mello, R.S. (2009), \u201cDesigning XML documents from conceptual schemas and workload information\u201d, Multimedia Tools Appl., Vol. 43 No. 3, pp. 
303\u201026.","DOI":"10.1007\/s11042-009-0272-1"},{"key":"key2022012820281139700_b23","doi-asserted-by":"crossref","unstructured":"Schroeder, R., Duarte, D. and Mello, R.S. (2011), \u201cA workload\u2010aware approach for optimizing the XML schema design trade\u2010off\u201d, Proceedings of the 13th International Conference on Information Integration and Web\u2010based Applications and Services (iiWAS '11), ACM, New York, NY.","DOI":"10.1145\/2095536.2095542"},{"key":"key2022012820281139700_b25","doi-asserted-by":"crossref","unstructured":"Wiwatwattana, N., Jagadish, H., Laks, A. and Lakshmanan, V.S. (2006), \u201cMaking designer schemas with colors\u201d, Proceedings of the 22nd International Conference on Data Engineering, IEEE Computer Society, Washington, DC.","DOI":"10.1109\/ICDE.2006.88"},{"key":"key2022012820281139700_b26","doi-asserted-by":"crossref","unstructured":"Wu, E. and Madden, S. (2011), \u201cPartitioning techniques for fine\u2010grained indexing\u201d, Proceedings of the IEEE International Conference on Data Engineering, Hannover, Germany.","DOI":"10.1109\/ICDE.2011.5767830"},{"key":"key2022012820281139700_b27","doi-asserted-by":"crossref","unstructured":"Yu, C. and Jagadish, H.V. (2008), \u201cXML schema refinement through redundancy detection and normalization\u201d, The VLDB Journal, Vol. 17 No. 2, pp. 
203\u201023.","DOI":"10.1007\/s00778-007-0063-0"}],"container-title":["International Journal of Web Information Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/17440081211282874","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/17440081211282874\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/17440081211282874\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T00:25:06Z","timestamp":1753403106000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/ijwis\/article\/8\/4\/371-389\/165546"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,11,16]]},"references-count":27,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,11,16]]}},"alternative-id":["10.1108\/17440081211282874"],"URL":"https:\/\/doi.org\/10.1108\/17440081211282874","relation":{},"ISSN":["1744-0084"],"issn-type":[{"type":"print","value":"1744-0084"}],"subject":[],"published":{"date-parts":[[2012,11,16]]}}}