{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:26:34Z","timestamp":1760243194075,"version":"build-2065373602"},"reference-count":66,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2014,11,4]],"date-time":"2014-11-04T00:00:00Z","timestamp":1415059200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100000780","name":"European Commission","doi-asserted-by":"publisher","award":["ICT 270239"],"award-info":[{"award-number":["ICT 270239"]}],"id":[{"id":"10.13039\/501100000780","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Future Internet"],"abstract":"<jats:p>The constantly growing amount of Web content and the success of the Social Web lead to increasing needs for Web archiving. These needs go beyond the pure preservation of Web pages. Web archives are turning into \u201ccommunity memories\u201d that aim at building a better understanding of the public view on, e.g., celebrities, court decisions and other events. Due to the size of the Web, the traditional \u201ccollect-all\u201d strategy is in many cases not the best method to build Web archives. In this paper, we present the ARCOMEM (From Collect-All Archives to Community Memories) architecture and implementation that uses semantic information, such as entities, topics and events, complemented with information from the Social Web to guide a novel Web crawler. 
The resulting archives are automatically enriched with semantic meta-information to ease the access and allow retrieval based on conditions that involve high-level concepts.<\/jats:p>","DOI":"10.3390\/fi6040688","type":"journal-article","created":{"date-parts":[[2014,11,4]],"date-time":"2014-11-04T09:41:15Z","timestamp":1415094075000},"page":"688-716","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":17,"title":["The ARCOMEM Architecture for Social- and Semantic-Driven Web Archiving"],"prefix":"10.3390","volume":"6","author":[{"given":"Thomas","family":"Risse","sequence":"first","affiliation":[{"name":"L3S Research Center, Leibniz Universit\u00e4t Hannover, Hannover 30167, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elena","family":"Demidova","sequence":"additional","affiliation":[{"name":"L3S Research Center, Leibniz Universit\u00e4t Hannover, Hannover 30167, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stefan","family":"Dietze","sequence":"additional","affiliation":[{"name":"L3S Research Center, Leibniz Universit\u00e4t Hannover, Hannover 30167, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wim","family":"Peters","sequence":"additional","affiliation":[{"name":"NLP Group, Department of Computer Science, University of Sheffield, S1 4DP Sheffield, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nikolaos","family":"Papailiou","sequence":"additional","affiliation":[{"name":"ATHENA - Research and Innovation Center in Information, Communication and Knowledge Technologies, 15125 Maroussi, Athens, Greece"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Katerina","family":"Doka","sequence":"additional","affiliation":[{"name":"ATHENA - Research and Innovation Center in Information, Communication and Knowledge Technologies, 15125 Maroussi, Athens, 
Greece"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yannis","family":"Stavrakas","sequence":"additional","affiliation":[{"name":"ATHENA - Research and Innovation Center in Information, Communication and Knowledge Technologies, 15125 Maroussi, Athens, Greece"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vassilis","family":"Plachouras","sequence":"additional","affiliation":[{"name":"ATHENA - Research and Innovation Center in Information, Communication and Knowledge Technologies, 15125 Maroussi, Athens, Greece"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pierre","family":"Senellart","sequence":"additional","affiliation":[{"name":"CNRS LTCIT, Institut Mines-T\u00e9l\u00e9com, T\u00e9l\u00e9com ParisTech, 75634 Paris Cedex 13, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Florent","family":"Carpentier","sequence":"additional","affiliation":[{"name":"Internet Memory Foundation, 45 ter rue de la R\u00e9volution, 93100 Montreuil, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Amin","family":"Mantrach","sequence":"additional","affiliation":[{"name":"Yahoo Research, 08018 Barcelona, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bogdan","family":"Cautis","sequence":"additional","affiliation":[{"name":"CNRS LTCIT, Institut Mines-T\u00e9l\u00e9com, T\u00e9l\u00e9com ParisTech, 75634 Paris Cedex 13, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Patrick","family":"Siehndel","sequence":"additional","affiliation":[{"name":"L3S Research Center, Leibniz Universit\u00e4t Hannover, Hannover 30167, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dimitris","family":"Spiliotopoulos","sequence":"additional","affiliation":[{"name":"Athens Technology Center (ATC), 15233 Halandri Athens, 
Greece"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2014,11,4]]},"reference":[{"key":"ref_1","unstructured":"Blue Ribbon Task Force on Sustainable Digital Preservation and Access Sustainable Economics for a Digital Planet, ensuring Long-Term Access to Digital Information. Available online: http:\/\/brtf.sdsc.edu\/biblio\/BRTF_Final_Report.pdf."},{"key":"ref_2","unstructured":"ARCOMEM: Archiving Communities Memories. Available online: http:\/\/www.arcomem.eu\/."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Ntoulas, A., Cho, J., and Olston, C. (2004, January 17\u201320). What\u2019s New on the Web? The Evolution of the Web from a Search Engine Perspective. Proceedings of the 13th International Conference on World Wide Web, New York, NY, USA.","DOI":"10.1145\/988672.988674"},{"key":"ref_4","unstructured":"Gomes, D., Miranda, J., and Costa, M. (2011, January 26\u201328). A Survey on Web Archiving Initiatives. Proceedings of the 15th International Conference on Theory and Practice of Digital Libraries (TPDL), Berlin, Germany."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Risse, T., Dietze, S., Peters, W., Doka, K., Stavrakas, Y., and Senellart, P. (2012, January 23\u201327). Exploiting the Social and Semantic Web for Guided Web Archiving. Proceedings of the 16th International Conference on Theory and Practice of Digital Libraries (TPDL), Paphos, Cyprus.","DOI":"10.1007\/978-3-642-33290-6_47"},{"key":"ref_6","unstructured":"The ARCOMEM Consortium ARCOMEM system release. Available online: http:\/\/sourceforge.net\/projects\/arcomem\/."},{"key":"ref_7","unstructured":"ISO ISO 28500:2009 Information and Documentation\u2014WARC File Format. 
Available online: http:\/\/www.iso.org\/iso\/iso_catalogue\/catalogue_tc\/catalogue_detail.htm?csnumber=44717."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"433","DOI":"10.3390\/fi6030433","article-title":"Analysing and Enriching Focused Semantic Web Archives for Parliament Applications","volume":"6","author":"Demidova","year":"2014","journal-title":"Futur. Internet"},{"key":"ref_9","unstructured":"McGuinness, D.L., and van Harmelen, F. OWL Web Ontology Language. Available online: http:\/\/www.w3.org\/TR\/owl-features\/."},{"key":"ref_10","unstructured":"Lee, C. Open Archival Information System (OAIS) Reference Model. Available online: http:\/\/www.tandfonline.com\/doi\/abs\/10.1081\/E-ELIS3-120044377."},{"key":"ref_11","unstructured":"Resource Description Framework (RDF). Available online: http:\/\/www.w3.org\/RDF\/."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Scherp, A., Franz, T., Saathoff, C., and Staab, S. (2009, January 1\u20134). F-A Model of Events based on the Foundational Ontology DOLCE + DnS Ultralight. Proceedings of the International Conference on Knowledge Capturing (K-CAP), Redondo Beach, CA, USA.","DOI":"10.1145\/1597735.1597760"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Shaw, R., Troncy, R., and Hardman, L. (2009, January 6\u20139). LODE: Linking Open Descriptions of Events. Proceedings of the 4th Asian Semantic Web Conference (ASWC), Shanghai, China.","DOI":"10.1007\/978-3-642-10871-6_11"},{"key":"ref_14","unstructured":"The ARCOMEM Consortium ARCOMEM Data Model. Available online: http:\/\/www.gate.ac.uk\/ns\/ontologies\/arcomem-data-model.owl."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1145\/1327452.1327492","article-title":"MapReduce: Simplified data processing on large clusters","volume":"51","author":"Dean","year":"2008","journal-title":"Commun. ACM"},{"key":"ref_16","unstructured":"Apache Foundation The Apache HBase Project. 
Available online: http:\/\/hbase.apache.org\/."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., and Gruber, R.E. (2008). Bigtable: A Distributed Storage System for Structured Data. ACM Trans. Comput. Syst.","DOI":"10.1145\/1365815.1365816"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Shvachko, K., Kuang, H., Radia, S., and Chansler, R. (2010, January 3\u20137). The Hadoop Distributed File System. Proceedings of the IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), Lake Tahoe, NV, USA.","DOI":"10.1109\/MSST.2010.5496972"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Papailiou, N., Konstantinou, I., Tsoumakos, D., and Koziris, N. (2012, January 16\u201320). H2RDF: Adaptive Query Processing on RDF Data in the Cloud. Proceedings of the 21st International Conference Companion on World Wide Web (Companion Volume), Lyon, France.","DOI":"10.1145\/2187980.2188058"},{"key":"ref_20","unstructured":"Papailiou, N., Konstantinou, I., Tsoumakos, D., Karras, P., and Koziris, N. (2013, January 6\u20139). H2RDF+: High-performance distributed joins over large-scale RDF graphs. Proceedings of the IEEE International Conference on Big Data, Santa Clara, CA, USA."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1008","DOI":"10.14778\/1453856.1453965","article-title":"Hexastore: Sextuple indexing for semantic Web data management","volume":"1","author":"Weiss","year":"2008","journal-title":"Proc. VLDB Endow."},{"key":"ref_22","unstructured":"Cunningham, H., Maynard, D., Bontcheva, K., and Tablan, V. (2002, January 6\u201312). GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL), Philadelphia, PA, USA."},{"key":"ref_23","unstructured":"TermRaider term extraction tools. 
Available online: https:\/\/gate.ac.uk\/sale\/tao\/splitch23.html#sec:creole:termraider."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., and Bizer, C. (2014). DBpedia\u2014A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia. Semantic Web Journal.","DOI":"10.3233\/SW-140134"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Bollacker, K., Evans, C., Paritosh, P., Sturge, T., and Taylor, J. (2008, January 9\u201312). Freebase: A Collaboratively Created Graph Database for Structuring Human Knowledge. Proceedings of the ACM SIGMOD Conference (SIGMOD\u201908), Vancouver, Canada.","DOI":"10.1145\/1376616.1376746"},{"key":"ref_26","unstructured":"Nunes, B.P., Dietze, S., Casanova, M., Kawase, R., Fetahu, B., and Nejdl, W. (2013, January 26\u201330). Combining a co-occurrence-based and a semantic measure for entity linking. Proceedings of the 10th Extended Semantic Web Conference (ESWC), Montpellier, France."},{"key":"ref_27","unstructured":"Demidova, E., Oelze, I., and Nejdl, W. (November, January 27). Aligning Freebase with the YAGO Ontology. Proceedings of the 22nd ACM International Conference on Conference on Information and Knowledge Management (CIKM), San Francisco, CA, USA."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Suchanek, F.M., Kasneci, G., and Weikum, G. (2007, January 8\u201312). Yago: A Core of Semantic Knowledge. Proceedings of the 16th International World Wide Web Conference, Banff, Alberta, Canada.","DOI":"10.1145\/1242572.1242667"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Poblete, B., Gavilanes, R.O.G., Mendoza, M., and Jaimes, A. (2011, January 24\u201328). Do all birds tweet the same?: Characterizing Twitter around the world. 
Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM), Glasgow, UK.","DOI":"10.1145\/2063576.2063724"},{"key":"ref_30","unstructured":"Siehndel, P., and Kawase, R. (2012, January 11\u201315). TwikiMe!\u2014User Profiles That Make Sense. Proceedings of the ISWC 2012 Posters & Demonstrations Track, Boston, MA, USA."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Kawase, R., Siehndel, P., Pereira Nunes, B., Herder, E., and Nejdl, W. (2014, January 1\u20134). Exploiting the wisdom of the crowds for characterizing and connecting heterogeneous resources. Proceedings of the 25th ACM Conference on Hypertext and Social Media, Santiago, Chile.","DOI":"10.1145\/2631775.2631797"},{"key":"ref_32","unstructured":"Wikipedia Miner Toolkit. Available online: http:\/\/wikipedia-miner.cms.waikato.ac.nz\/."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1016\/j.artint.2012.06.007","article-title":"An open-source toolkit for mining Wikipedia","volume":"194","author":"Milne","year":"2013","journal-title":"Artif. Intell."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Kwak, H., Lee, C., Park, H., and Moon, S. (2010, January 26\u201330). What is Twitter, a Social Network or a News Media?. Proceedings of the 19th International Conference on World Wide Web, Raleigh, NC, USA.","DOI":"10.1145\/1772690.1772751"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Stajner, T., Thomee, B., Popescu, A.M., Pennacchiotti, M., and Jaimes, A. (2013, January 11\u201314). Automatic selection of social media responses to news. Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Chicago, IL, USA.","DOI":"10.1145\/2487575.2487659"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Maniu, S., and Cautis, B. (2012, January 20\u201324). Taagle: Efficient, personalized search in collaborative tagging networks. 
Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), Scottsdale, AZ, USA.","DOI":"10.1145\/2213836.2213926"},{"key":"ref_37","unstructured":"Maniu, S., and Cautis, B. Efficient Top-K Retrieval in Online Social Tagging Networks. Available online: http:\/\/arxiv.org\/abs\/1104.1605."},{"key":"ref_38","unstructured":"Tahmasebi, N., Gossen, G., Kanhabua, N., Holzmann, H., and Risse, T. (2012, January 8\u201315). NEER: An Unsupervised Method for Named Entity Evolution Recognition. Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Tahmasebi, N., Niklas, K., Theuerkauf, T., and Risse, T. (2010, January 21\u201325). Using Word Sense Discrimination on Historic Document Collections. Proceedings of the 10th ACM\/IEEE Joint Conference on Digital Libraries (JCDL), Gold Coast, Queensland, Australia.","DOI":"10.1145\/1816123.1816137"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"O\u2019Connor, B., Balasubramanyan, R., Routledge, B.R., and Smith, N.A. (2010, January 23\u201326). From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. Proceedings of the 4th International Conference on Weblogs and Social Media (ICWSM), Washington, DC, USA.","DOI":"10.1609\/icwsm.v4i1.14031"},{"key":"ref_41","unstructured":"Jiang, L., Yu, M., Zhou, M., Liu, X., and Zhao, T. (2011, January 19\u201324). Target-dependent Twitter Sentiment Classification. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, OR, USA."},{"key":"ref_42","unstructured":"Plachouras, V., and Stavrakas, Y. (2012, January 31). Querying Term Associations and their Temporal Evolution in Social Data. VLDB Workshop on Online Social Systems (WOSS), Istanbul, Turkey."},{"key":"ref_43","unstructured":"Mohr, G., Stack, M., Ranitovic, I., Avery, D., and Kimpton, M. 
(2004, January 16). Introduction to heritrix, an archival quality Web crawler. Proceedings of the 4th International Web Archiving Workshop, Bath, UK."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"518","DOI":"10.3390\/fi6030518","article-title":"ARCOMEM Crawling Architecture","volume":"6","author":"Plachouras","year":"2014","journal-title":"Future Internet"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1108\/eb045623","article-title":"The Kulturarw Project\u2014The Swedish Royal Web Archive","volume":"16","author":"Arvidson","year":"1998","journal-title":"Electron. Libr."},{"key":"ref_46","unstructured":"International Internet Preservation Consortium (IIPC). Available online: http:\/\/netpreserve.org\/."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Masan\u00e8s, J. (2006). Web Archiving, Springer.","DOI":"10.1007\/978-3-540-46332-0"},{"key":"ref_48","unstructured":"Living Web Archives Project. Available online: http:\/\/www.liwa-project.eu\/."},{"key":"ref_49","unstructured":"Cho, J., Garcia-Molina, H., and Page, L. (1998, January 14\u201318). Efficient crawling through URL ordering. Proceedings of the 7th International Conference on World Wide Web, Brisbane, Australia."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Baeza-Yates, R., Castillo, C., Marin, M., and Rodriguez, A. (2005, January 10\u201314). Crawling a Country: Better Strategies Than Breadth-first for Web Page Ordering. Proceedings of Special Interest Tracks and Posters of the 14th International Conference on World Wide Web, Chiba, Japan.","DOI":"10.1145\/1062745.1062768"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"1623","DOI":"10.1016\/S1389-1286(99)00052-3","article-title":"Focused Crawling: A New Approach to Topic-specific Web Resource Discovery","volume":"31","author":"Chakrabarti","year":"1999","journal-title":"Comput. 
Netw."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1145\/1031114.1031117","article-title":"Topical Web crawlers: Evaluating adaptive algorithms","volume":"4","author":"Menczer","year":"2004","journal-title":"ACM Trans. Internet Technol."},{"key":"ref_53","unstructured":"Laranjeira, B., Moreira, V., Villavicencio, A., Ramisch, C., and Finatto, M.J. (2014, January 26\u201331). Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them. Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC), Reykjavik, Iceland."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Pandey, S., and Olston, C. (2005, January 10\u201314). User-centric Web Crawling. Proceedings of the 14th International Conference on World Wide Web, Chiba, Japan.","DOI":"10.1145\/1060745.1060805"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Barford, P., Canadi, I., Krushevskaja, D., Ma, Q., and Muthukrishnan, S. (2014, January 7\u201311). Adscape: Harvesting and Analyzing Online Display Ads. Proceedings of the 23rd International Conference on World Wide Web, Seoul, Korea.","DOI":"10.1145\/2566486.2567992"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Boanjak, M., Oliveira, E., Martins, J., Mendes Rodrigues, E., and Sarmento, L. (2012, January 16). TwitterEcho: A Distributed Focused Crawler to Support Open Research with Twitter Data. Proceedings of the Workshop on Social Media Applications in News and Entertainment (SMANE 2012), at the ACM 2012 International World Wide Web Conference, Lyon, France.","DOI":"10.1145\/2187980.2188266"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Psallidas, F., Ntoulas, A., and Delis, A. (2013, January 13\u201315). Soc Web: Efficient Monitoring of Social Network Activities. 
Proceedings of the 14th International Web Information Systems Engineering Conference, Nanjing, China.","DOI":"10.1007\/978-3-642-41154-0_9"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Blackburn, J., and Iamnitchi, A. (2013, January 9). An architecture for collecting longitudinal social data. IEEE ICC Workshop on Beyond Social Networks: Collective Awareness, Budapest, Hungary.","DOI":"10.1109\/ICCW.2013.6649225"},{"key":"ref_59","unstructured":"Isele, R., Umbrich, J., Bizer, C., and Harth, A. (2010, January 9). LDSpider: An open-source crawling framework for the Web of Linked Data. Proceedings of 9th International Semantic Web Conference (ISWC) Posters and Demos, Shanghai, China."},{"key":"ref_60","unstructured":"Slug: A Semantic Web Crawler. Available online: http:\/\/www.ldodds.com\/projects\/slug\/."},{"key":"ref_61","unstructured":"Internet Archive NutchWAX. Available online: http:\/\/archive-access.sourceforge.net\/projects\/nutch\/."},{"key":"ref_62","unstructured":"Apache Nutch\u2014Highly extensible, highly scalable Web crawler. Available online: http:\/\/nutch.apache.org\/."},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Gomes, D., Cruz, D., Miranda, J.A., Costa, M., and Fontes, S.A. (2013, January 13\u201317). Search the Past with the Portuguese Web Archive. Proceedings of the 22nd International Conference on World Wide Web (Companion Volume), Rio de Janeiro, Brazil.","DOI":"10.1145\/2487788.2487934"},{"key":"ref_64","unstructured":"Internet Archive Wayback. Available online: http:\/\/archive-access.sourceforge.net\/projects\/wayback\/."},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Spaniol, M., and Weikum, G. (2012, January 16\u201320). Tracking Entities in Web Archives: The LAWA Project. 
Proceedings of the 21st International Conference Companion on World Wide Web (Companion Volume), Lyon, France.","DOI":"10.1145\/2187980.2188030"},{"key":"ref_66","unstructured":"Big UK Domain Data for the Arts and Humanities. Available online: http:\/\/buddah.projects.history.ac.uk\/."}],"container-title":["Future Internet"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-5903\/6\/4\/688\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T21:08:56Z","timestamp":1760216936000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-5903\/6\/4\/688"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,11,4]]},"references-count":66,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2014,12]]}},"alternative-id":["fi6040688"],"URL":"https:\/\/doi.org\/10.3390\/fi6040688","relation":{},"ISSN":["1999-5903"],"issn-type":[{"type":"electronic","value":"1999-5903"}],"subject":[],"published":{"date-parts":[[2014,11,4]]}}}