{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T07:23:45Z","timestamp":1777879425102,"version":"3.51.4"},"reference-count":39,"publisher":"SAGE Publications","issue":"3","license":[{"start":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T00:00:00Z","timestamp":1775174400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"},{"start":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T00:00:00Z","timestamp":1775174400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Semantic Web: \u2013 Interoperability, Usability, Applicability"],"published-print":{"date-parts":[[2026,5]]},"abstract":"<jats:p>The proliferation of large and ever-growing resource description framework (RDF) datasets has sparked a need for robust and performant RDF archiving systems. In order to tackle this challenge, several solutions have been proposed throughout the years, including archiving systems based on independent copies, time-based indexes, and change-based approaches. In recent years, modern solutions combine several of the above mentioned paradigms. In particular, aggregated changesets of time-annotated triples have showcased a noteworthy ability to handle and query relatively large RDF archives. However, such approaches still suffer from scalability issues, notably at ingestion time. This makes the use of these solutions prohibitive for large revision histories. Furthermore, applications for such systems remain often constrained by their limited querying abilities, where SPARQL is often left out in favor of single triple-pattern queries. In this article, we propose a hybrid storage approach based on aggregated changesets, snapshots, and multiple delta chains that additionally provides full querying SPARQL on RDF archives. This is done by interfacing our system with a modified SPARQL query engine. We evaluate our system with different snapshot creation strategies on the BEAR benchmark for RDF archives and showcase improvements of up to one order of magnitude in ingestion speed compared to state-of-the-art approaches, while keeping competitive querying performance. Furthermore, we demonstrate our SPARQL query processing capabilities on the BEAR-C variant of BEAR. This is, to the best of our knowledge, the first openly available endeavor that provides full SPARQL querying on RDF archives.<\/jats:p>","DOI":"10.1177\/22104968261431405","type":"journal-article","created":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T14:15:11Z","timestamp":1775225711000},"update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":0,"title":["Expressive Querying and Scalable Management of Large RDF Archives"],"prefix":"10.1177","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1025-9687","authenticated-orcid":false,"given":"Olivier","family":"Pelgrin","sequence":"first","affiliation":[{"name":"Computer Science Department, Aalborg University, Aalborg, Denmark"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruben","family":"Taelman","sequence":"additional","affiliation":[{"name":"IDLab, Ghent University, Ghent, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0241-5379","authenticated-orcid":false,"given":"Luis","family":"Gal\u00e1rraga","sequence":"additional","affiliation":[{"name":"Inria, Rennes,\u00a0France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7025-8099","authenticated-orcid":false,"given":"Katja","family":"Hose","sequence":"additional","affiliation":[{"name":"Computer Science Department, Aalborg University, Aalborg, Denmark"},{"name":"DBAI Research Unit, TU Wien, Vienna, Austria"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2026,4,3]]},"reference":[{"key":"e_1_3_4_2_1","unstructured":"Abuoda G. Aebeloe C. Dell\u2019Aglio D. Keen A. Hose K. (2023). StarBench: Benchmarking RDF-star triplestores. In QuWeDa\/MEPDaW@ISWC. CEUR Workshop Proceedings Vol. 3565 (pp. 34\u201349). CEUR-WS.org."},{"key":"e_1_3_4_3_1","doi-asserted-by":"crossref","unstructured":"Aebeloe C. Montoya G. Hose K. (2021). ColChain: Collaborative linked data networks. In The web conference (WWW) (pp.\u00a01385\u20131396). https:\/\/doi.org\/10.1145\/3442381.3450037","DOI":"10.1145\/3442381.3450037"},{"key":"e_1_3_4_4_1","unstructured":"Anderson J. Bendiken A. (2016). Transaction-time queries in Dydra. In MEPDaW\/LDQ@ESWC. CEUR Workshop Proceedings Vol. 1585 (pp. 11\u201319). CEUR-WS.org."},{"key":"e_1_3_4_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2018.08.002"},{"key":"e_1_3_4_6_1","doi-asserted-by":"crossref","unstructured":"Bereta K. Smeros P. Koubarakis M. (2013). Representation and querying of valid time of triples in linked geospatial data. In ESWC (Vol. 7882 pp. 259\u2013274).","DOI":"10.1007\/978-3-642-38288-8_18"},{"key":"e_1_3_4_7_1","doi-asserted-by":"crossref","unstructured":"Brunsmann J. (2010). Archiving pushed inferences from sensor data streams. In International workshop on semantic sensor web (pp. 38\u201346). https:\/\/doi.org\/10.5220\/0003116000380046","DOI":"10.5220\/0003116000380046"},{"key":"e_1_3_4_8_1","first-page":"1","article-title":"Compressed and queryable self-indexes for RDF archives","volume":"66","author":"Cerdeira-Pena A.","year":"2023","unstructured":"Cerdeira-Pena A., de Bernardo G., Fari\u00f1a A., Fern\u00e1ndez J. D., Mart\u00ednez-Prieto M. A. (2023). Compressed and queryable self-indexes for RDF archives. Knowledge and Information Systems, 66, 1\u201337.","journal-title":"Knowledge and Information Systems"},{"key":"e_1_3_4_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2013.01.002"},{"issue":"2","key":"e_1_3_4_10_1","first-page":"247","article-title":"Evaluating query and storage strategies for RDF archives","volume":"10","author":"Fern\u00e1ndez J. D.","year":"2019","unstructured":"Fern\u00e1ndez J. D., Umbrich J., Polleres A., Knuth M. (2019). Evaluating query and storage strategies for RDF archives. The Journal of Web Semantics, 10(2), 247\u2013291. https:\/\/doi.org\/10.3233\/SW-180309","journal-title":"The Journal of Web Semantics"},{"key":"e_1_3_4_11_1","unstructured":"Fionda V. Chekol M. W. Pirr\u00f2 G. (2016). Gize: A time warp in the web of data. In International semantic web conference (ISWC) (Vol. 1690)."},{"key":"e_1_3_4_12_1","unstructured":"Grandi F. (2010). T-SPARQL: A TSQL2-like temporal query language for RDF. In Local proceedings of European conference on Advances in Databases and Information Systems (ADBIS) (pp. 21\u201330)."},{"key":"e_1_3_4_13_1","unstructured":"Graube M. Hensel S. Urbas L. (2014). R43ples: Revisions for triples \u2013 An approach for version control in the semantic web. In LDQ@SEMANTICS."},{"issue":"5","key":"e_1_3_4_14_1","first-page":"557","article-title":"A foundation for spatial data warehouses on the semantic web","volume":"9","author":"G\u00fcr N.","year":"2018","unstructured":"G\u00fcr N., Pedersen T. B., Zim\u00e1nyi E., Hose K. (2018). A foundation for spatial data warehouses on the semantic web. Semantic Web, 9(5), 557\u2013587.","journal-title":"Semantic Web"},{"key":"e_1_3_4_15_1","unstructured":"Hartig O. (2017). Foundations of RDF\u22c6 and SPARQL\u22c6 (an alternative approach to statement-level metadata in RDF). In AMW. CEUR Workshop Proceedings Vol. 1912. CEUR-WS.org."},{"key":"e_1_3_4_16_1","unstructured":"Hose K. (2021). Knowledge graph (r)evolution and the web of data. In MEPDaW@ISWC. CEUR Workshop Proceedings Vol. 3225 (pp. 1\u20137). CEUR-WS.org."},{"key":"e_1_3_4_17_1","doi-asserted-by":"crossref","unstructured":"Huet T. Biega J. Suchanek F. M. (2013). Mining history with Le Monde. In Workshop on automated knowledge base construction (pp. 49\u201354). https:\/\/doi.org\/10.1145\/2509558.2509567","DOI":"10.1145\/2509558.2509567"},{"issue":"1","key":"e_1_3_4_18_1","first-page":"2:1","article-title":"Automated quality assessment of metadata across open data portals","volume":"8","author":"Neumaier S.","year":"2016","unstructured":"Neumaier S., Umbrich J., Polleres A. (2016). Automated quality assessment of metadata across open data portals. ACM Journal of Data and Information Quality, 8(1), 2:1\u20132:29.","journal-title":"ACM Journal of Data and Information Quality"},{"key":"e_1_3_4_19_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920877"},{"key":"e_1_3_4_20_1","unstructured":"Papakonstantinou V. Flouris G. Fundulaki I. Stefanidis K. Roussakis Y. (2017). SPBv: Benchmarking linked data archiving systems. In BLINK\/NLIWoD3@ISWC. CEUR Workshop Proceedings Vol. 1932."},{"issue":"6","key":"e_1_3_4_21_1","first-page":"903","article-title":"Towards fully-fledged archiving for RDF datasets","volume":"12","author":"Pelgrin O.","year":"2021","unstructured":"Pelgrin O., Gal\u00e1rraga L., Hose K. (2021). Towards fully-fledged archiving for RDF datasets. Semantic Web Journal, 12(6), 903\u2013925. https:\/\/doi.org\/10.3233\/SW-210434","journal-title":"Semantic Web Journal"},{"key":"e_1_3_4_22_1","doi-asserted-by":"crossref","unstructured":"Pelgrin O. Taelman R. Gal\u00e1rraga L. Hose K. (2023). Glenda: Querying RDF archives with full SPARQL. In C. Pesquita H. Skaf-Molli V. Efthymiou S. Kirrane A. Ngonga D. Collarana R. Cerqueira M. Alam C. Trojahn & S. Hertling (Eds.) The semantic web: ESWC 2023 satellite events (pp. 75\u201380). Cham: Springer Nature Switzerland. ISBN 978-3-031-43458-7.","DOI":"10.1007\/978-3-031-43458-7_14"},{"key":"e_1_3_4_23_1","unstructured":"Pelgrin O. Taelman R. Gal\u00e1rraga L. Hose K. (2023). The need for better RDF archiving benchmarks. In MEPDaW@ISWC. CEUR Workshop Proceedings. CEUR-WS.org."},{"key":"e_1_3_4_24_1","doi-asserted-by":"crossref","unstructured":"Pelgrin O. Taelman R. Gal\u00e1rraga L. Hose K. (2023). Scaling large RDF archives to very long histories. In International conference in semantic computing (ICSC) (pp. 41\u201348).","DOI":"10.1109\/ICSC56153.2023.00013"},{"key":"e_1_3_4_25_1","doi-asserted-by":"crossref","unstructured":"Pellissier Tanon T. Bourgaux C. Suchanek F. (2019). Learning how to correct a knowledge base from the edit history. In The web conference (WWW) (pp. 1465\u20131475). https:\/\/doi.org\/10.1145\/3308558.3313584","DOI":"10.1145\/3308558.3313584"},{"key":"e_1_3_4_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2020.2966609"},{"key":"e_1_3_4_27_1","doi-asserted-by":"crossref","unstructured":"Perry M. Jain P. Sheth A. P. (2011). SPARQL-ST: Extending SPARQL to support spatiotemporal queries. In Geospatial semantics and the semantic web (Vol. 12 pp. 61\u201386).","DOI":"10.1007\/978-1-4419-9446-2_3"},{"issue":"1","key":"e_1_3_4_28_1","first-page":"11:1","article-title":"How does knowledge evolve in open knowledge graphs?","volume":"1","author":"Polleres A.","year":"2023","unstructured":"Polleres A., Pernisch R., Bonifati A., Dell\u2019Aglio D., Dobriy D., Dumbrava S., Etcheverry L., Ferranti N., Hose K., Jim\u00e9nez-Ruiz E., Lissandrini M., Scherp A., Tommasini R., Wachs J. (2023). How does knowledge evolve in open knowledge graphs? Transactions on Graph Data and Knowledge, 1(1), 11:1\u201311:59.","journal-title":"Transactions on Graph Data and Knowledge"},{"key":"e_1_3_4_29_1","unstructured":"Raimond Y. Schreiber G. (2014). RDF 1.1 primer. W3C recommendation. http:\/\/www.w3.org\/TR\/2014\/NOTE-rdf11-primer-20140624\/"},{"key":"e_1_3_4_30_1","doi-asserted-by":"crossref","unstructured":"Roussakis Y. Chrysakis I. Stefanidis K. Flouris G. Stavrakas Y. (2015). A flexible framework for understanding the dynamics of evolving RDF datasets. In International semantic web conference (ISWC) (Vol. 9366 pp. 495\u2013512). https:\/\/doi.org\/10.1007\/978-3-319-25007-6_29","DOI":"10.1007\/978-3-319-25007-6_29"},{"key":"e_1_3_4_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-021-00725-x"},{"key":"e_1_3_4_32_1","unstructured":"Seaborne A. Harris S. (2013). SPARQL 1.1 query language. W3C recommendation W3C. http:\/\/www.w3.org\/TR\/2013\/REC-sparql11-query-20130321\/"},{"key":"e_1_3_4_33_1","first-page":"705","article-title":"Optimizing storage of RDF archives using bidirectional delta chains","volume":"13","author":"Taelman R.","year":"2022","unstructured":"Taelman R., Mahieu T., Vanbrabant M., Verborgh R. (2022). Optimizing storage of RDF archives using bidirectional delta chains. The Journal of Web Semantics, 13, 705\u2013734.","journal-title":"The Journal of Web Semantics"},{"key":"e_1_3_4_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2018.08.001"},{"key":"e_1_3_4_35_1","doi-asserted-by":"crossref","unstructured":"Taelman R. Sande M. V. Verborgh R. (2018). Versioned querying with OSTRICH and comunica in MOCHA 2018. In SemWebEval@ESWC (Vol. 927. pp. 17\u201323).","DOI":"10.1007\/978-3-030-00072-1_2"},{"key":"e_1_3_4_36_1","unstructured":"Taelman R. Van Herwegen J. Vander Sande M. Verborgh R. Comunica: A modular SPARQL query engine for the web. In International semantic web conference (ISWC)."},{"key":"e_1_3_4_37_1","doi-asserted-by":"crossref","unstructured":"Tanon T. P. Suchanek F. M. (2019). Querying the edit history of Wikidata. In ESWC (Vol. 11762. pp. 161\u2013166). https:\/\/doi.org\/10.1007\/978-3-030-32327-1_32","DOI":"10.1007\/978-3-030-32327-1_32"},{"key":"e_1_3_4_38_1","unstructured":"Volkel M. Winkler W. Sure Y. Kruk S. R. Synak M. (2005). SemVersion: A versioning system for RDF and ontologies. Extended semantic web conference (ESWC)."},{"issue":"1","key":"e_1_3_4_39_1","first-page":"1008","article-title":"Hexastore: Sextuple indexing for semantic web data management","volume":"1","author":"Weiss C.","year":"2008","unstructured":"Weiss C., Karras P., Bernstein A. (2008). Hexastore: Sextuple indexing for semantic web data management. PVLDB, 1(1), 1008\u20131019. https:\/\/doi.org\/10.14778\/1453856.1453965","journal-title":"PVLDB"},{"key":"e_1_3_4_40_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2011.08.006"}],"container-title":["Semantic Web: \u2013 Interoperability, Usability, Applicability"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/22104968261431405","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/22104968261431405","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/22104968261431405","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T23:43:31Z","timestamp":1777592611000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/22104968261431405"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,4,3]]},"references-count":39,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2026,5]]}},"alternative-id":["10.1177\/22104968261431405"],"URL":"https:\/\/doi.org\/10.1177\/22104968261431405","relation":{},"ISSN":["1570-0844","2210-4968"],"issn-type":[{"value":"1570-0844","type":"print"},{"value":"2210-4968","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,4,3]]},"article-number":"22104968261431405"}}