{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T08:55:53Z","timestamp":1762505753805,"version":"3.38.0"},"reference-count":99,"publisher":"Walter de Gruyter GmbH","issue":"1","license":[{"start":{"date-parts":[[2011,1,1]],"date-time":"2011-01-01T00:00:00Z","timestamp":1293840000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>XML document comparison is becoming an ever more popular research issue due to the increasingly abundant use of XML. Likewise, a growing interest fosters the development of XML grammar matching and comparison, due to the proliferation of heterogeneous XML data sources, particularly on the Web. Nonetheless, the process of comparing XML documents with XML grammars, i.e., XML document and grammar similarity evaluation, has not yet received the attention it deserves. In this paper, we provide an overview on existing research related to XML document\/grammar comparison, presenting the background and discussing the various techniques related to the problem. We also discuss some prominent application domains, ranging over document classification and clustering, document transformation, grammar evolution, selective dissemination of XML information, XML querying, as well as alert filtering in intrusion detection systems and Web Services matching and communications.<\/jats:p>","DOI":"10.2478\/s13537-011-0005-1","type":"journal-article","created":{"date-parts":[[2011,3,24]],"date-time":"2011-03-24T15:25:25Z","timestamp":1300980325000},"source":"Crossref","is-referenced-by-count":5,"title":["XML document-grammar comparison: related problems and applications"],"prefix":"10.2478","volume":"1","author":[{"given":"Joe","family":"Tekli","sequence":"first","affiliation":[]},{"given":"Richard","family":"Chbeir","sequence":"additional","affiliation":[]},{"given":"Agma","family":"Traina","sequence":"additional","affiliation":[]},{"given":"Caetano","family":"Traina","sequence":"additional","affiliation":[]}],"member":"374","reference":[{"key":"5_CR1","doi-asserted-by":"crossref","unstructured":"Abu-Ghazaleh N., Lewis M.J., Differential Deserialization for Optimized SOAP Performance, Proceedings of the ACM\/IEEE Conference on Supercomputing (Seattle), 2005, 21\u201331","DOI":"10.1109\/SC.2005.24"},{"key":"5_CR2","doi-asserted-by":"crossref","unstructured":"Abu-Ghazaleh N., Lewis M.J., Govindaraju M., Differential Serialization for Optimized SOAP Performance, Proceedings of the 13th International Symposium on High Performance Distributed Computing (HPDC\u201904), 2004, 55\u201364","DOI":"10.1109\/HPDC.2004.1323489"},{"key":"5_CR3","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1016\/0020-0190(95)00111-O","volume":"55","author":"T. Akatsu","year":"1995","unstructured":"Akatsu T., Approximate String Matching with Don\u2019t Care Characters, INFORM PROCESS LETT, 1995, 55, 235\u2013239","journal-title":"INFORM PROCESS LETT"},{"key":"5_CR4","doi-asserted-by":"crossref","first-page":"724","DOI":"10.1016\/j.datak.2009.01.001","volume":"68","author":"A. Algergawy","year":"2009","unstructured":"Algergawy A., Schallehn E., Saake G., Improving XML schema matching using Prufer sequences, DATA KNOWL ENG, 2009, 68, 724\u2013747","journal-title":"DATA KNOWL ENG"},{"key":"5_CR5","unstructured":"Altinel M., Franklin M.J., Efficient Filtering of XML Documents for Selective Dissemination of Information, Procedings of the 28th International Conference on Very Large Data Bases (VLDB\u201900), 2000, 53\u201364"},{"key":"5_CR6","unstructured":"Amer-Yahia S., Shanmugasundaram J., XML Full-Text Search: Challenges and Opportunities, Proceedings of the International Conference on Very Large Data Bases, 2005. Tutorial Slides: http:\/\/www.vldb2005.org\/program\/slides\/fri\/s1368-amer-yahia.ppt"},{"key":"5_CR7","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1145\/1107499.1107514","volume":"34","author":"S. Amer-Yahia","year":"2005","unstructured":"Amer-Yahia S., Case P., Rolleke T., Shanmugasundaram J., Weikum G., Report on the DB\/IR Panel at SIGMOD 2005, SIGMOD RECORD, 2005, 34, 71\u201374","journal-title":"SIGMOD RECORD"},{"key":"5_CR8","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1145\/357830.357849","volume":"3","author":"S. Axelsson","year":"2000","unstructured":"Axelsson S., The Base-Rate Fallacy and the Difficulty of Intrusion Detection, ACM T DATABASE SYST, 2000, 3, 186\u2013205","journal-title":"ACM T DATABASE SYST"},{"key":"5_CR9","doi-asserted-by":"crossref","first-page":"710","DOI":"10.1145\/1042046.1042050","volume":"29","author":"A. Balmin","year":"2004","unstructured":"Balmin A., Papakonstantinou Y., Vianu V., Incremental validation of XML documents, ACM T DATABASE SYST, 2004, 29, 710\u2013751","journal-title":"ACM T DATABASE SYST"},{"key":"5_CR10","doi-asserted-by":"crossref","unstructured":"Barbosa D., Mendelzon A.O., Libkin L., Mignet L., Arenas M., Efficient Incremental Validation of XML Documents, Proceedings of the international Conference on Data Engineering (ICDE), IEEE Computer Society, 2004, 671\u2013682","DOI":"10.1109\/ICDE.2004.1320036"},{"key":"5_CR11","unstructured":"Berglund et al., XML Path Language (XPath) 2.0, W3C Recommendation, January 2007, http:\/\/www.w3.org\/TR\/xpath20\/"},{"key":"5_CR12","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/S0306-4379(03)00031-0","volume":"29","author":"E. Bertino","year":"2004","unstructured":"Bertino E., Guerrini G., Mesiti, M., A Matching Algorithm for Measuring the Structural Similarity between an XML Documents and a DTD and its Applications, ELSEVIER INFORMATION SYSTEMS, 2004, 29, 23\u201346","journal-title":"ELSEVIER INFORMATION SYSTEMS"},{"key":"5_CR13","doi-asserted-by":"crossref","unstructured":"Bouchou B., Cheriat A., Halfeld Ferrari M., Savary A., XML Document Correction: Incremental Approach Activated by Schema Validation, Proceedings of the International Database Engineering and Applications Symposium (IDEAS), 2006, 228\u2013238","DOI":"10.1109\/IDEAS.2006.54"},{"key":"5_CR14","first-page":"285","volume":"31","author":"B. Bouchou","year":"2007","unstructured":"Bouchou B., Cheriat A., Halfeld Ferrari M., Laurent D., Lima M.A., Musicante M., Efficient Constraint Validation for XML Database, INFORMATICA, 2007, 31, 285\u2013309","journal-title":"INFORMATICA"},{"key":"5_CR15","unstructured":"Bray T., Paoli J., Sperberg-McQueen C., Mailer Y., Yergeau F., Extensible Markup Language (XML) 1.0 \u2014 5th Edition, W3C Recommendation, 2008, http:\/\/www.w3.org\/TR\/REC-xml\/"},{"key":"5_CR16","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1162\/coli.2006.32.1.13","volume":"32","author":"A. Budanitsky","year":"2006","unstructured":"Budanitsky A., Hirst G., Evaluating WordNet-based Measures of Lexical Semantic Relatedness, COMPUT LINGUIST, 2006, 32, 13\u201347","journal-title":"COMPUT LINGUIST"},{"key":"5_CR17","unstructured":"Buttler D., A Short Survey of Document Structure Similarity Algorithms, Proceedings of the International Conference on Internet Computing (ICOMP), 2004, 3\u20139"},{"key":"5_CR18","unstructured":"Chamberlin D., Florescu D., Robie J., Simeon J., Stefanescu M., XQuery: A Query Language for XML, 2001, http:\/\/www.w3.org\/TR\/2001\/WD-xquery-20010215"},{"key":"5_CR19","doi-asserted-by":"crossref","first-page":"354","DOI":"10.1007\/s00778-002-0077-6","volume":"11","author":"C.Y. Chan","year":"2002","unstructured":"Chan C.Y., Felber P., Garofalakis M., Rastogi R., Efficient Filtering of XML Documents with XPath Expressions, VLDB J, 2002, 11, 354\u2013379","journal-title":"VLDB J"},{"key":"5_CR20","doi-asserted-by":"crossref","unstructured":"Chawathe S., Rajaraman A., Garcia-Molina H., Widom J., Change Detection in Hierarchically Structured Information, Proceedings of the ACM International Conference on Management of Data (SIGMOD), Montreal, 1996, 26\u201337","DOI":"10.1145\/233269.233366"},{"key":"5_CR21","unstructured":"Cheriat A., Savary A., Bouchou B., Halfeld Ferrari M., Incremental String Correction: Towards Correction of XML Documents, Proceedings of the Prague Stringology Conference(PSC), 2005, 201\u2013215"},{"key":"5_CR22","doi-asserted-by":"crossref","unstructured":"Chidlovskii B., Using Regular Tree Automata as XML Schemas, Proceedings of the IEEE Advances in Digital Libraries (ADL\u201900), 2000, 89\u201398","DOI":"10.1109\/ADL.2000.848373"},{"key":"5_CR23","unstructured":"Chinnici R., Moreau J.J., Ryman A., Weerawarana S., Web Services Description Language (WSDL) Version 2.0 Part 1: Core Language, W3C Recommendation 26 June 2007, http:\/\/www.w3.org\/TR\/wsdl20\/"},{"key":"5_CR24","first-page":"85","volume-title":"Proceedings of the 7th International Workshop on the Web and Databases (WebDB\u2019 04)","author":"C. Chitic","year":"2004","unstructured":"Chitic C., Rosu D., On Validation of XML Streams using Finite State Machines, Proceedings of the 7th International Workshop on the Web and Databases (WebDB\u2019 04), ACM Press, New York, NY, USA, 2004, 85\u201390"},{"key":"5_CR25","doi-asserted-by":"crossref","unstructured":"Cob\u00e9na G., Abiteboul S., Marian A., Detecting Changes in XML Documents, Proceedings of the IEEE International Conference on Data Engineering (ICDE), 2002, 41\u201352","DOI":"10.1109\/ICDE.2002.994696"},{"key":"5_CR26","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1016\/j.jalgor.2007.04.004","volume":"62","author":"R. Luz Da","year":"2007","unstructured":"Da Luz R., Halfeld Ferrari Alves M., Musicante M. A., Regular expression transformations to extend regular languages (with application to a Datalog XML schema validator), J ALGORITHM, 2007, 62, 148\u2013167","journal-title":"J ALGORITHM"},{"key":"5_CR27","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1016\/j.is.2004.11.009","volume":"31","author":"T. Dalamagas","year":"2006","unstructured":"Dalamagas T., Cheng T., Winkel K., Sellis T., A Methodology for Clustering XML Documents by Structure, INFORM SYST, 2006, 31, 187\u2013228","journal-title":"INFORM SYST"},{"key":"5_CR28","unstructured":"Debar H., Curry D., Feinstein B., The Intrusion Detection Message Exchange Format (IDMEF), 2005, http:\/\/www.ietf.org\/rfc\/rfc4765.txt"},{"key":"5_CR29","unstructured":"Diao Y., Fischer P., Franklin M.J., To R., YFilter: Efficient and Scalable Filtering of XML Documents, Proceedings of the International Conference on Data Engineering (ICDE\u201902), 2002"},{"key":"5_CR30","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1016\/j.is.2006.09.002","volume":"32","author":"H. Do","year":"2007","unstructured":"Do H., Rahm E., Matching Large Schemas: Approaches and Evaluation, INFORM SYST, 2007, 32, 857\u2013885","journal-title":"INFORM SYST"},{"key":"5_CR31","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1023\/A:1021765902788","volume":"50","author":"A. Doan","year":"2003","unstructured":"Doan A., Domingos P., Halevy A., Learning to Match the Schemas of Data Sources: A Multistrategy Approach, MACH LEARN, 2003, 50, 279\u2013301","journal-title":"MACH LEARN"},{"key":"5_CR32","doi-asserted-by":"crossref","unstructured":"DuChateau F., Bellahsene Z., Hunt E., Roantree M., An Indexing Structure for Automatic Schema Matching, The 23rd International Conference on Data Engineering (ICDE) \u2014 Workshops, 2007, 485\u2013491","DOI":"10.1109\/ICDEW.2007.4401032"},{"key":"5_CR33","doi-asserted-by":"crossref","unstructured":"Fernau H., Extracting Minimum Length Document Type Definitions Is NP-Hard, Grammatical Inference: Algorithms and Applications (ICGI\u201904) 2004, 277\u2013278","DOI":"10.1007\/978-3-540-30195-0_26"},{"key":"5_CR34","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1093\/comjnl\/bxm051","volume":"51","author":"A. Formica","year":"2008","unstructured":"Formica A., Similarity of XML-Schema Elements: A Structural and Information content Approach, COMPUT J, 2008, 51, 240\u2013254","journal-title":"COMPUT J"},{"key":"5_CR35","doi-asserted-by":"crossref","unstructured":"Garofalakis M., Gionis A., Rastogi R., Seshadri S., Shim K., Xtract: A system for extracting document type descriptors from XML documents, Proceedings of the ACM International Conference on Management of Data (SIGMOD), Dallas, Texas, USA, 2000, 165\u2013176","DOI":"10.1145\/335191.335409"},{"key":"5_CR36","first-page":"1","volume":"9","author":"F. Giunchiglia","year":"2007","unstructured":"Giunchiglia F., Yatskevich M., Shvaiko P., Semantic matching: Algorithms and implementation, JODS, 2007, 9, 1\u201338","journal-title":"JODS"},{"key":"5_CR37","unstructured":"Grahne G., Thomo A., Approximate Reasoning in Semi-structured Databases, Proceedings of the International Workshop on Knowledge Representation meets Databases (KRDB), Rome, 2001. Vol. 45"},{"key":"5_CR38","unstructured":"Guerrini G., Mesiti M., Sanz I., An overview of similarity measures for clustering XML documents, In: Vakali A., Pallis G. (Eds.), Web Data Management Practices: Emerging Techniques and Technologies, IDEA Group, 2006"},{"key":"5_CR39","doi-asserted-by":"crossref","unstructured":"Guha S., Jagadish H.V., Koudas N., Srivastava D., Yu T., Approximate XML Joins, Proceedings of ACM International Conference on Managemenet of Data (SIGMOD), 2002, 287\u2013298","DOI":"10.1145\/564724.564725"},{"key":"5_CR40","doi-asserted-by":"crossref","unstructured":"Murata M., Hosoya H., Validation Algorithm for Attribute-Element Constraints of RELAX NG, Extreme Markup Languages, Montreal, Canada, 2003","DOI":"10.1007\/3-540-45089-0_19"},{"key":"5_CR41","unstructured":"Halfeld Ferrari Alves M., Aspects Dynamiques de XML et Sp\u00e9cification des Interfaces de Services Web avec PEWS, Rapport de HDR, Universit\u00e9 Fran\u00e7ois Rabelais de Tours, 2007"},{"key":"5_CR42","unstructured":"Helmer S., Measuring the Structural Similarity of Semistructured Documents Using Entropy, Proceedings of the International Conference on Very Large Databases (VLDB), 2007, 1022\u20131032"},{"key":"5_CR43","doi-asserted-by":"crossref","unstructured":"Hopcroft J.E., Motwani R., Ullman J.D., Introduction to Automata Theory, Languages and Computation, Addison Wesley, 2nd edition, 2001","DOI":"10.1145\/568438.568455"},{"key":"5_CR44","doi-asserted-by":"crossref","unstructured":"Kayacik H.G., Zincir-Heywood A.N., A Case Study of Three Open Source Security Management Tools, Proceedings of 8th IFIP\/IEEE International Symposium on Integrated Network Management, 2003, 101\u2013104","DOI":"10.1007\/978-0-387-35674-7_10"},{"key":"5_CR45","doi-asserted-by":"crossref","unstructured":"Kim S.K., Lee M., Lee K.C., Validation of XML Document Updates Based on XML Schema in XML Databases, International Conference on Database and Expert Systems Applications (DEXA\u201903), 2003, LNCS 2736, 98\u2013108","DOI":"10.1007\/978-3-540-45227-0_11"},{"key":"5_CR46","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/0196-6774(89)90010-2","volume":"10","author":"G.M. Landau","year":"1989","unstructured":"Landau G.M., Vishkin U., Fast Parallel and Serial Approximate String Matching, J ALGORITHM, 1989, 10, 157\u2013169","journal-title":"J ALGORITHM"},{"key":"5_CR47","doi-asserted-by":"crossref","first-page":"338","DOI":"10.1007\/PL00011672","volume":"3","author":"G. Lee","year":"2001","unstructured":"Lee G., Lee K., Chen A., Efficient Graph-based Algorithms for Discovering and Maintaining Association Rules in Large Databases, KNOWL INF SYST, 2001, 3, 338\u2013355","journal-title":"KNOWL INF SYST"},{"key":"5_CR48","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1108\/eb026913","volume":"49","author":"J. Lee","year":"1993","unstructured":"Lee J., Kim M., Lee Y., Information Retrieval Based on Conceptual Distance in IS-A Hierarchies, J DOC, 1993, 49, 188\u2013207","journal-title":"J DOC"},{"key":"5_CR49","doi-asserted-by":"crossref","unstructured":"Lee M., Yang L., Hsu W., Yang X., XClust: Clustering XML Schemas for Effective Integration, Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2002, 292\u2013299","DOI":"10.1145\/584838.584841"},{"key":"5_CR50","doi-asserted-by":"crossref","unstructured":"Leonardi E., Hoai T.T., Bhowmick S.S., Madria S., DTD-Diff: A Change Detection Algorithm for DTDs, Proceedings of the Database Systems for Advanced Applications conference (DASFAA), 2006, 384\u2013402","DOI":"10.1016\/j.datak.2006.06.003"},{"key":"5_CR51","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1109\/TKDE.2004.1264824","volume":"16","author":"W. Lian","year":"2004","unstructured":"Lian W., Cheung D., Mamoulis N., Yiu S., An Efficient and Scalable Algorithm for Clustering XML Documents by Structure, IEEE T KNOWL DATA EN, 2004, 16, 82\u201396","journal-title":"IEEE T KNOWL DATA EN"},{"key":"5_CR52","doi-asserted-by":"crossref","unstructured":"Liang W., Yokota H., LAX: An Efficient Approximate XML Join Based on Clustered Leaf Nodes for XML Data Integration, Proceedings of the British National Conference on Databases (BNCOD), 2005, 82\u201397","DOI":"10.1007\/11511854_7"},{"key":"5_CR53","unstructured":"Lin D., An Information-Theoretic Definition of Similarity, Proceedings of the International Conference on Machine Learning (ICML), 1998, 296\u2013304"},{"key":"5_CR54","doi-asserted-by":"crossref","unstructured":"Long J., Shwartz D., Stoecklin S., Distinguishing False from True Alerts in Snort by Data Mining Patterns of Alerts, Proceedings of SPIE\u201906, the International Society for Optical Engineering, 2006","DOI":"10.1117\/12.665211"},{"key":"5_CR55","doi-asserted-by":"crossref","unstructured":"Maguitman A., Menczer F., Roinestad H., Vespignani A., Algorithmic Detection of Semantic Similarity, Proceedings of the International Conference on the World Wide Web (www), 2005, 107\u2013116","DOI":"10.1145\/1060745.1060765"},{"key":"5_CR56","unstructured":"Marian A., Abiteboul S., Mignet L., Change-Centric Management of Versions in an XML Warehouse, Proceedings of the International Conference on Very Large Data Bases (VLDB), 2001, 581\u2013590"},{"key":"5_CR57","unstructured":"Megginson D. et al., The Simple API for XML, http:\/\/www.megginson.com\/SAX\/"},{"key":"5_CR58","doi-asserted-by":"crossref","unstructured":"Miller G., WordNet: An On-Line Lexical Database, INT J LEXICOGR, 1990, 3","DOI":"10.1093\/ijl\/3.4.235"},{"key":"5_CR59","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1145\/1111627.1111631","volume":"5","author":"M. Murata","year":"2005","unstructured":"Murata M., Lee D., Mani M., Kawaguchi K., Taxonomy of XML Schema Languages Using Formal Language Theory, ACM TOIT, 2005, 5, 660\u2013704","journal-title":"ACM TOIT"},{"key":"5_CR60","doi-asserted-by":"crossref","unstructured":"Nakano K., Nishimura S., Deriving Event-Based Document Transformers from Tree-Based Specifications, In: van den Brand M., Parigot D. (Eds.), Electronic Notes in Theoretical Computer Science, Elsevier Science Publishers, 2001, Volume 44","DOI":"10.1016\/S1571-0661(04)80927-7"},{"key":"5_CR61","volume-title":"Parsing and Querying XML Documents in SML","author":"A. Neumann","year":"2000","unstructured":"Neumann A., Parsing and Querying XML Documents in SML, PhD thesis, University of Trier, Trier, Germany, 2000"},{"key":"5_CR62","doi-asserted-by":"crossref","unstructured":"Neumann A., Seidl H., Locating Matches of Tree Patterns in Forests, In: Arvind V., Ramamujan R. (Eds.), Foundations of Software Technology and Theoretical Computer Science (18th FST&TCS), Volume 1530 of Lecture Notes in Computer Science, Heidelberg, 1998, 134\u2013145","DOI":"10.1007\/978-3-540-49382-2_12"},{"key":"5_CR63","unstructured":"Nierman A., Jagadish H.V., Evaluating structural similarity in XML documents, Proceedings of the ACM SIGMOD International Workshop on the Web and Databases (WebDB), 2002, 61\u201366"},{"key":"5_CR64","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1016\/j.scico.2004.07.001","volume":"54","author":"S. Nishimura","year":"2005","unstructured":"Nishimura S., Nakano K., XML Stream Transformer Generation through Program Composition and Dependency Analysis, SCI COMPUT PROGRAM, 2005, 54, 257\u2013290","journal-title":"SCI COMPUT PROGRAM"},{"key":"5_CR65","unstructured":"Peterson D., Gao S., Malhotra A., Sperberg-McQueen C., Thompson H., W3C XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes, http:\/\/www.w3.org\/TR\/xmlschema11-2\/"},{"key":"5_CR66","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1109\/21.24528","volume":"19","author":"R. Rada","year":"1989","unstructured":"Rada R., Mili H., Bicknell E., Blettner M., Development and Application of a Metric on Semantic Nets, IEEE T SYST MAN CYB, 1989, 19, 17\u201330","journal-title":"IEEE T SYST MAN CYB"},{"key":"5_CR67","volume-title":"Introduction to XML","author":"E.T. Ray","year":"2001","unstructured":"Ray E.T., Introduction to XML, O\u2019Reilly, Paris, 2001"},{"key":"5_CR68","doi-asserted-by":"crossref","first-page":"502","DOI":"10.1145\/988672.988740","volume-title":"Proceedings of the 13th International Conference on the World Wide Web (www\u2019 04)","author":"D.C. Reis","year":"2004","unstructured":"Reis D.C., Golgher P.B., Silva A.S., Laender A.F., Automatic Web News Extraction using Tree Edit Distance, Proceedings of the 13th International Conference on the World Wide Web (www\u2019 04), ACM, New York, NY, USA, 2004, 502\u2013511"},{"key":"5_CR69","first-page":"448","volume":"1","author":"P. Resnik","year":"1995","unstructured":"Resnik P., Using Information Content to Evaluate Semantic Similarity in a Taxonomy, Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), 1995, 1, 448\u2013453","journal-title":"Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI)"},{"key":"5_CR70","volume-title":"Information Retrieval","author":"C.J. Rijsbergen van","year":"1979","unstructured":"Rijsbergen van C.J., Information Retrieval, Butterworths, London, 1979"},{"key":"5_CR71","doi-asserted-by":"crossref","unstructured":"Sahai A., Machiraju V., Enabling fo the Ubiquitous e-services Vision on the Internet, Hewlett-Packard Laboratories, HPL-2001-5, 2001","DOI":"10.2979\/esj.2001.1.1.5"},{"key":"5_CR72","volume-title":"Introduction to Modern Information Retrieval","author":"G. Salton","year":"1983","unstructured":"Salton G., Mcgill M.J., Introduction to Modern Information Retrieval, McGraw-Hill, Tokio, 1983"},{"key":"5_CR73","doi-asserted-by":"crossref","unstructured":"Sanz I., Mesiti M., Guerrini G., Berlanga La R., Berlanga Lavori R., Approximate Subtree Identification in Heterogeneous XML Documents Collections, XML SYMPOSIUM, 2005, 192\u2013206","DOI":"10.1007\/11547273_14"},{"key":"5_CR74","doi-asserted-by":"crossref","unstructured":"Saruladha K., Aghila G., Raj S., A Survey of Semantic Similarity Methods for Ontology Based Information Retrieval, Proceedings of the International Conference on Machine Learning and Computing (ICMLC\u201910), 2010, 297\u2013301","DOI":"10.1109\/ICMLC.2010.63"},{"key":"5_CR75","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1007\/s10791-005-0746-3","volume":"8","author":"R. Schenkel","year":"2005","unstructured":"Schenkel R., Theobald A., Weikum G., Semantic Similarity Search on Semistructured Data with the XXL Search Engine, INFORM RETRIEVAL, 2005, 8, 521\u2013545","journal-title":"INFORM RETRIEVAL"},{"key":"5_CR76","unstructured":"Schlieder T., Similarity Search in XML Data Using Cost-based Query Transformations, Proceedings of the ACM SIGMOD International Workshop on the Web and Databases (WebDB), 2001, 19\u201324"},{"key":"5_CR77","doi-asserted-by":"crossref","first-page":"489","DOI":"10.1002\/asi.10060","volume":"53","author":"T. Schlieder","year":"2002","unstructured":"Schlieder T., Meuss H., Querying and Ranking XML Documents, J AM SOC INFORM SCI, 2002, 53, 489\u2013503","journal-title":"J AM SOC INFORM SCI"},{"key":"5_CR78","doi-asserted-by":"crossref","unstructured":"Sch\u00f6ning H., Tamoni \u2014 A DBMS Designed for XML, Proceedings of the IEEE International Conference on Data Engineering (ICDE), 2001, 149\u2013154","DOI":"10.1109\/ICDE.2001.914823"},{"key":"5_CR79","doi-asserted-by":"crossref","unstructured":"Segoufin L., Vianu V., Validating Streaming XML Documents, Proceedings of the ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS), 2002, 53\u201364","DOI":"10.1145\/543621.543622"},{"key":"5_CR80","doi-asserted-by":"crossref","first-page":"184","DOI":"10.1016\/0020-0190(77)90064-3","volume":"6","author":"S.M. Selkow","year":"1977","unstructured":"Selkow S.M., The Tree-to-Tree Editing Problem, INFORM PROCESS LETT, 1977, 6, 184\u2013186","journal-title":"INFORM PROCESS LETT"},{"key":"5_CR81","first-page":"146","volume":"IV","author":"P. Shvaiko","year":"2005","unstructured":"Shvaiko P., Euzenat J., A Survey of Schema-Based Matching Approaches, JOURNAL OF DATA SEMANTICS, 2005, IV, 146\u2013171","journal-title":"JOURNAL OF DATA SEMANTICS"},{"key":"5_CR82","doi-asserted-by":"crossref","unstructured":"Stanoi I., Mihaila G., Padmanabhan S., A framework for the selective dissemination of XML documents based on inferred user profiles, Proceedings of the International Conference on Data Engineering, 2003, 531\u2013542","DOI":"10.1109\/ICDE.2003.1260819"},{"key":"5_CR83","doi-asserted-by":"crossref","unstructured":"Su H., Padmanabhan S., Lo M.L., Identification of Syntactically Similar DTD Elements for Schema Matching, Proceedings of the International Conference on Advances in Web-Age Information Management (WAIM), 2001, 145\u2013159","DOI":"10.1007\/3-540-47714-4_14"},{"key":"5_CR84","doi-asserted-by":"crossref","unstructured":"Suzuki N., Finding an Optimum Edit Script between an XML Document and a DTD, Proceedings of the ACM Symposium on Applied Computing (ACM SAC), 2005, 647\u2013653","DOI":"10.1145\/1066677.1066825"},{"key":"5_CR85","doi-asserted-by":"crossref","unstructured":"Tekli J., Chbeir R., Y\u00e9tongnon K., Structural Similarity Evaluation between XML Documents and DTDs, Proceedings of the International Conference on Web Information Systems Engineering (WISE), 2007, 196\u2013211","DOI":"10.1007\/978-3-540-76993-4_17"},{"key":"5_CR86","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1016\/j.cosrev.2009.03.001","volume":"3","author":"J. Tekli","year":"2009","unstructured":"Tekli J., Chbeir R., Y\u00e9tongnon K., An Overview of XML Similarity: Background, Current Trends and Future Directions, ELSEVIER COMPUTER SCIENCE REVIEW, 2009, 3, 151\u2013173","journal-title":"ELSEVIER COMPUTER SCIENCE REVIEW"},{"key":"5_CR87","doi-asserted-by":"crossref","first-page":"140","DOI":"10.4018\/978-1-60566-014-1.ch020","volume":"2","author":"J. Tekli","year":"2009","unstructured":"Tekli J., Chbeir R., Y\u00e9tongnon K., XML Grammar Similarity: Breakthroughs and Limitations, Second Edition of the Encyclopedia of Multimedia Technology and Networking, Information Science Reference, Hershey \u2014 New York, 2009, 2, 140\u2013148","journal-title":"Second Edition of the Encyclopedia of Multimedia Technology and Networking, Information Science Reference, Hershey \u2014 New York"},{"key":"5_CR88","unstructured":"Tekli J., Damiani E., Chbeir R., Gianini G., SOAP Processing Performance and Enhacement, IEEE TSC, (in press)"},{"key":"5_CR89","doi-asserted-by":"crossref","unstructured":"Teraguchi M., Makino S., Ueno K., Chung H.V., Optimized Web Services Security Performance with Differential Parsing, Proceedings of the 4th International Conference on Service-Oriented Computing (ICSOC\u201906), 2006, 277\u2013288","DOI":"10.1007\/11948148_23"},{"key":"5_CR90","doi-asserted-by":"crossref","unstructured":"Theobald A., Weikum G., Adding Relevance to XML, Proceedings of the 3rd International Workshop on the Web Databases (WebDB), Dallas, USA, 2000, 105\u2013124","DOI":"10.1007\/3-540-45271-0_7"},{"key":"5_CR91","doi-asserted-by":"crossref","first-page":"18","DOI":"10.4018\/jwsr.2005010102","volume":"2","author":"C. Werner","year":"2005","unstructured":"Werner C., Buschmann C., Fischer S., WSDL-Driven SOAP Compression, INT J WEB SERV RES, 2005, 2, 18\u201335","journal-title":"INT J WEB SERV RES"},{"key":"5_CR92","unstructured":"Word Wide Web Consortium. SOAP Version 1.2. W3C Recommendation (Second Edition) 2007, http:\/\/www.w3.org\/TR\/soap\/ [cited February 2010]"},{"key":"5_CR93","unstructured":"World Wide Web Consortium, The Document Object Model, http:\/\/www.w3.org\/DOM [cited 28 May 2009]"},{"key":"5_CR94","doi-asserted-by":"crossref","unstructured":"Xing G., Fast Approximate Matching Between XML Documents and Schemata, The Asia Pacific Web Conference (APWeb\u201906), 2006, 425\u2013436","DOI":"10.1007\/11610113_38"},{"key":"5_CR95","doi-asserted-by":"crossref","unstructured":"Xing G., Xia X., Guo J., Clustering XML Documents Based on Structural Similarity, International Conference of Database Systems for Advanced Applications (DASFAA\u201907), 2007, 905\u2013911","DOI":"10.1007\/978-3-540-71703-4_77"},{"key":"5_CR96","doi-asserted-by":"crossref","first-page":"1245","DOI":"10.1137\/0218082","volume":"18","author":"K. Zhang","year":"1989","unstructured":"Zhang K. Shasha D., Simple Fast Algorithms for the Editing Distance between Trees and Related Problems, SIAM J COMPUT, 1989, 18, 1245\u20131262","journal-title":"SIAM J COMPUT"},{"key":"5_CR97","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1006\/jagm.1994.1003","volume":"16","author":"K. Zhang","year":"1994","unstructured":"Zhang K., Shasha D., Wang J., Approximate Tree Matching in the Presence of Variable Length Don\u2019t Cares, J ALGORITHM, 1994, 16, 33\u201366","journal-title":"J ALGORITHM"},{"key":"5_CR98","doi-asserted-by":"crossref","unstructured":"Zhang X., Jing L., Hu X., Ng M., Zhou X., A Comparative Study of Ontology Based Term Similarity Measures on PubMed Document Clustering, Proceedings of the International Conference on Database Systems for Advanced Applications (DASFAA\u2019 07), 2007, 115\u2013126","DOI":"10.1007\/978-3-540-71703-4_12"},{"key":"5_CR99","unstructured":"Zhang Z., Li R., Cao S., Zhu Y., Similarity Metric in XML Documents, Knowledge Management and Experience Management Workshop, 2003"}],"container-title":["Open Computer Science"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.2478\/s13537-011-0005-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.2478\/s13537-011-0005-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.2478\/s13537-011-0005-1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,4]],"date-time":"2025-03-04T17:29:43Z","timestamp":1741109383000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.2478\/s13537-011-0005-1\/html"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,1,1]]},"references-count":99,"journal-issue":{"issue":"1"},"URL":"https:\/\/doi.org\/10.2478\/s13537-011-0005-1","relation":{},"ISSN":["2299-1093"],"issn-type":[{"type":"electronic","value":"2299-1093"}],"subject":[],"published":{"date-parts":[[2011,1,1]]}}}