{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T15:06:31Z","timestamp":1763651191834,"version":"3.45.0"},"reference-count":29,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T00:00:00Z","timestamp":1763596800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Future Internet"],"abstract":"<jats:p>The digital age and the rise of Internet of Things technology have led to an explosion of data, including vast amounts of semantic data. In the context of large-scale semantic data graphs, centralized storage struggles to meet the efficiency requirements of the queries. This has led to a shift towards distributed semantic data systems. In federated semantic data systems, ensuring both query efficiency and comprehensive results is challenging because of data independence and privacy constraints. To address this, we propose a query processing framework featuring a block-level star decomposition method for generating efficient query plans, augmented by auxiliary indexes to guarantee the completeness of the results. A specialized FEDERATEDAND BY keyword is introduced for federated environments, and a partition-based parallel assembly method accelerates the result integration. Our approach demonstrably improves query efficiency and is analyzed for its potential application in energy systems.<\/jats:p>","DOI":"10.3390\/fi17110531","type":"journal-article","created":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T15:00:17Z","timestamp":1763650817000},"page":"531","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Semantic Data Federated Query Optimization Based on Decomposition of Block-Level Subqueries"],"prefix":"10.3390","volume":"17","author":[{"given":"Yuan","family":"Yao","sequence":"first","affiliation":[{"name":"School of Engineering, University of Glasgow, Glasgow G12 8QQ, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yang","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Computer Science (National Pilot Software Engineering School), Beijing University of Posts and Telecommunications, Beijing 100876, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,11,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1646","DOI":"10.11591\/eei.v9i4.2359","article-title":"An overview of big data analysis","volume":"9","author":"Arena","year":"2020","journal-title":"Bull. Electr. Eng. Inform."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"76","DOI":"10.1145\/3397512","article-title":"A review of the semantic web field","volume":"64","author":"Hitzler","year":"2021","journal-title":"Commun. ACM"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Tomaszuk, D., and Hyland-Wood, D. (2020). RDF 1.1: Knowledge representation and data integration language for the Web. Symmetry, 12.","DOI":"10.3390\/sym12010084"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"DuCharme, B. (2013). Learning SPARQL: Querying and Updating with SPARQL 1.1, O\u2019Reilly Media, Inc.","DOI":"10.1089\/big.2012.0004"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1695","DOI":"10.2174\/2666255813666191211114635","article-title":"A comparative review for question answering frameworks on the linked data","volume":"14","author":"Tasar","year":"2021","journal-title":"Recent Adv. Comput. Sci. Commun."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"55","DOI":"10.3390\/analytics2010004","article-title":"A brief survey of methods for analytics over RDF knowledge graphs","volume":"2","author":"Papadaki","year":"2023","journal-title":"Analytics"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Schwarte, A., Haase, P., Hose, K., Schenkel, R., and Schmidt, M. (2011, January 23\u201327). Fedx: Optimization techniques for federated query processing on linked data. Proceedings of the International Semantic Web Conference, Bonn, Germany.","DOI":"10.1007\/978-3-642-25073-6_38"},{"key":"ref_8","first-page":"13","article-title":"SPLENDID: SPARQL Endpoint Federation Exploiting VOID Descriptions","volume":"782","author":"Staab","year":"2011","journal-title":"COLD"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1123","DOI":"10.14778\/3402707.3402747","article-title":"Scalable SPARQL querying of large RDF graphs","volume":"4","author":"Huang","year":"2011","journal-title":"Proc. VLDB Endow."},{"key":"ref_10","unstructured":"Lee, K., Liu, L., Tang, Y., Zhang, Q., and Zhou, Y. (July, January 28). Efficient and customizable data partitioning framework for distributed big RDF data processing in the cloud. Proceedings of the 2013 IEEE Sixth International Conference on Cloud Computing, Santa Clara, CA, USA."},{"key":"ref_11","unstructured":"Gurajada, S., Seufert, S., Miliaraki, I., and Theobald, M. (2014, January 22\u201327). TriAD: A Distributed Shared-nothing RDF Engine Based on Asynchronous Message Passing. Proceedings of the 14th International Conference on ACM Special Interest Group on Management of Data, Snowbird, UT, USA."},{"key":"ref_12","unstructured":"Peng, P., Zou, L., Chen, L., and Zhao, D. (2016, January 15\u201318). Query Workload-based RDF graph fragmentation and allocation. Proceedings of the 19th International Conference on Extending Database Technology, Bordeaux, France."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"46","DOI":"10.9734\/ajrcos\/2021\/v11i230260","article-title":"A comprehensive survey for hadoop distributed file system","volume":"11","author":"Merceedi","year":"2021","journal-title":"Asian J. Res. Comput. Sci."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Gupta, S.K., Yadav, S.K., and Soni, S.K. (2023, January 23\u201324). Exploring the Power of Big Data for IoT: A Comprehensive Review. Proceedings of the 2023 International Conference on IoT, Communication and Automation Technology (ICICAT), Gorakhpur, India.","DOI":"10.1109\/ICICAT57735.2023.10263722"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Thakur, S., and Jha, S.K. (2023, January 6\u20138). Cloud Computing and its Emerging Trends on Big Data Analytics. Proceedings of the 2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India.","DOI":"10.1109\/ICESC57686.2023.10193144"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Sch\u00e4tzle, A., Przyjaciel-Zablocki, M., and Lausen, G. (2011, January 12\u201316). PigSPARQL: Mapping SPARQL to pig latin. Proceedings of the International Workshop on Semantic Web Information Management, Athens, Greece.","DOI":"10.1145\/1999299.1999303"},{"key":"ref_17","unstructured":"Papailiou, N., Tsoumakos, D., Konstantinou, I., and Koziris, N. (2014, January 22\u201327). H2RDF+: An efficient data management system for big RDF graphs. Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, Snowbird, UT, USA."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Shao, B., Wang, H., and Li, Y. (2013, January 22\u201327). Trinity: A distributed graph engine on a memory cloud. Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, New York, NY, USA.","DOI":"10.1145\/2463676.2467799"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1007\/s00778-015-0415-0","article-title":"Processing SPARQL queries over distributed RDF graphs","volume":"25","author":"Peng","year":"2016","journal-title":"VLDB J."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Cheng, S., and Hartig, O. (2022, January 23\u201327). Source Selection for SPARQL Endpoints: Fit for Heterogeneous Federations of RDF Data Sources?. Proceedings of the QuWeDa@ ISWC, Hangzhou, China.","DOI":"10.1007\/978-3-031-11609-4_11"},{"key":"ref_21","unstructured":"Quilitz, B., and Leser, U. (2008, January 1\u20135). Querying distributed RDF data sources with SPARQL. Proceedings of the Semantic Web: Research and Applications: 5th European Semantic Web Conference, ESWC 2008, Tenerife, Canary Islands, Spain. Proceedings 5."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Prasser, F., Kemper, A., and Kuhn, K.A. (2012, January 27\u201330). Efficient distributed query processing for autonomous RDF databases. Proceedings of the 15th International Conference on Extending Database Technology, Berlin, Germany.","DOI":"10.1145\/2247596.2247640"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Cimiano, P., Chiarcos, C., McCrae, J.P., and Gracia, J. (2020). Modelling metadata of language resources. Linguistic Linked Data: Representation, Generation and Applications, Springer.","DOI":"10.1007\/978-3-030-30225-2"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Charalambidis, A., Troumpoukis, A., and Konstantopoulos, S. (2015, January 15\u201317). SemaGrow: Optimizing federated SPARQL queries. Proceedings of the 11th International Conference on Semantic Systems, Vienna, Austria.","DOI":"10.1145\/2814864.2814886"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Saleem, M., and Ngomo, A.C.N. (2014, January 25\u201329). HiBISCuS: Hypergraph-based source selection for SPARQL endpoint federation. Proceedings of the European Semantic Web Conference, Crete, Greece.","DOI":"10.1007\/978-3-319-07443-6_13"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Endris, K.M., Galkin, M., Lytra, I., Mami, M.N., Vidal, M.-E., and Auer, S. (2017, January 28\u201331). MULDER: Querying the linked data web by bridging RDF molecule templates. Proceedings of the Database and Expert Systems Applications: 28th International Conference, DEXA 2017, Lyon, France. Proceedings, Part I 28.","DOI":"10.1007\/978-3-319-64468-4_1"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1145\/3186728.3164144","article-title":"Lusail: A system for querying linked data at scale","volume":"11","author":"Abdelaziz","year":"2017","journal-title":"Proc. VLDB Endow."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Azevedo, L.G., de Souza Soares, E.F., Souza, R., and Moreno, M. (2020, January 5\u20137). Modern Federated Database Systems: An Overview. Proceedings of the 22nd International Conference on Enterprise Information Systems: ICEIS 2020, Virtual.","DOI":"10.5220\/0009795402760283"},{"key":"ref_29","first-page":"107","article-title":"A systematic overview of data federation systems","volume":"15","author":"Gu","year":"2024","journal-title":"Semant. Web"}],"container-title":["Future Internet"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-5903\/17\/11\/531\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T15:01:41Z","timestamp":1763650901000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-5903\/17\/11\/531"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,20]]},"references-count":29,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2025,11]]}},"alternative-id":["fi17110531"],"URL":"https:\/\/doi.org\/10.3390\/fi17110531","relation":{},"ISSN":["1999-5903"],"issn-type":[{"value":"1999-5903","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,20]]}}}