{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T05:21:50Z","timestamp":1767849710335,"version":"3.49.0"},"reference-count":63,"publisher":"SAGE Publications","issue":"2","license":[{"start":{"date-parts":[[2020,8,24]],"date-time":"2020-08-24T00:00:00Z","timestamp":1598227200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2022,4]]},"abstract":"<jats:p> For some decades now, Galleries, Libraries, Archives and Museums (GLAM) institutions have published and provided access to information resources in digital format. Recently, innovative approaches have appeared such as the concept of Labs within GLAM institutions that facilitates the adoption of innovative and creative tools for content delivery and user engagement. In addition, new methods have been proposed to address the publication of digital collections as data sets amenable to computational use. In this article, we propose a methodology to create machine actionable collections following a set of steps. This methodology is then applied to several use cases based on data sets published by relevant GLAM institutions. It intends to encourage institutions to adopt the publication of data sets that support computationally driven research as a core activity. <\/jats:p>","DOI":"10.1177\/0165551520950246","type":"journal-article","created":{"date-parts":[[2020,8,25]],"date-time":"2020-08-25T06:13:46Z","timestamp":1598336026000},"page":"251-267","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":25,"title":["Reusing digital collections from GLAM institutions"],"prefix":"10.1177","volume":"48","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6122-0777","authenticated-orcid":false,"given":"Gustavo","family":"Candela","sequence":"first","affiliation":[{"name":"Universitat d\u2019Alacant, Spain"}]},{"given":"Mar\u00eda Dolores","family":"S\u00e1ez","sequence":"additional","affiliation":[{"name":"Universitat d\u2019Alacant, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7705-5224","authenticated-orcid":false,"given":"MPilar","family":"Escobar Esteban","sequence":"additional","affiliation":[{"name":"Universitat d\u2019Alacant, Spain"}]},{"given":"Manuel","family":"Marco-Such","sequence":"additional","affiliation":[{"name":"Universitat d\u2019Alacant, Spain"}]}],"member":"179","published-online":{"date-parts":[[2020,8,24]]},"reference":[{"key":"bibr1-0165551520950246","unstructured":"Baillieul JB, Hall LO, Moura JMF et al. The first IEEE workshop on the future of research curation and research reproducibility, 2017, https:\/\/open.bu.edu\/handle\/2144\/39028"},{"key":"bibr2-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2018.2816618"},{"key":"bibr3-0165551520950246","unstructured":"Research Libraries UK. A manifesto for the digital shift in research libraries, 2020, https:\/\/www.rluk.ac.uk\/digital-shift-manifesto\/ (accessed 20 April 2020)."},{"key":"bibr4-0165551520950246","unstructured":"Mahey M, Al-Abdulla A, Ames S et al. Open a GLAM lab, 2019, https:\/\/glamlabs.io\/books\/open-a-glam-lab\/"},{"key":"bibr5-0165551520950246","unstructured":"Padilla T, Allen L, Frost H et al. Final report \u2013 always already computational: collections as data, 2019, https:\/\/doi.org\/10.5281\/zenodo.3152935"},{"key":"bibr6-0165551520950246","unstructured":"International Image Interoperability Framework, https:\/\/iiif.io\/"},{"key":"bibr7-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-76298-0_52"},{"key":"bibr8-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1145\/2872427.2874809"},{"key":"bibr9-0165551520950246","unstructured":"Rule A, Birmingham A, Zuniga C et al. Ten simple rules for reproducible research in Jupyter Notebooks. CoRR 2018, http:\/\/arxiv.org\/abs\/1810.08055.1810.08055"},{"key":"bibr10-0165551520950246","doi-asserted-by":"publisher","DOI":"10.3233\/SW-170274"},{"key":"bibr11-0165551520950246","unstructured":"IFLA Information Technology Section; IFLA Semantic Web Special Interest Group; Biblioth\u00e8que nationale de France. We grew up together: data.bnf.fr from the BnF and Logilab perspectives. Paris, Biblioth\u00e8que nationale de France, Petit auditorium: IFLA Information Technology Section; IFLA Semantic Web Special Interest Group; Biblioth\u00e8que nationale de France, 2014, http:\/\/ifla2014-satdata.bnf.fr\/program.html"},{"key":"bibr12-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2016.18"},{"key":"bibr13-0165551520950246","unstructured":"Harris G, Potter A, Zwaard K et al. Digital Scholarship at the Library of Congress. User demand, current practices, and options for expanded services, 2020, https:\/\/labs.loc.gov\/static\/labs\/meta\/DHWorkingGroupPaper-v1.0.pdf (accessed 20 April 2020)."},{"key":"bibr14-0165551520950246","unstructured":"Association of European Research Libraries. Implementing FAIR data principles: the role of libraries, 2017, https:\/\/libereurope.eu\/wp-content\/uploads\/2017\/12\/LIBER-FAIR-Data.pdf (accessed 21 April 2020)."},{"key":"bibr15-0165551520950246","unstructured":"National Library of Scotland. Data foundry. Data collections from the National Library of Scotland, https:\/\/data.nls.uk\/"},{"key":"bibr16-0165551520950246","unstructured":"British Library. A collection of datasets released by the British Library, https:\/\/data.bl.uk\/"},{"key":"bibr17-0165551520950246","unstructured":"Library of Congress. LC for robots, https:\/\/labs.loc.gov\/lc-for-robots\/"},{"key":"bibr18-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1007\/s00799-018-0259-5"},{"key":"bibr19-0165551520950246","unstructured":"Smithsonian. International image interoperability framework at the Smithsonian Institution, https:\/\/iiif.si.edu"},{"key":"bibr20-0165551520950246","unstructured":"Europeana. Europeana IIIF APIs, https:\/\/pro.europeana.eu\/page\/iiif"},{"key":"bibr21-0165551520950246","unstructured":"Staatsbibliothek zu Berlin. Staatsbibliothek zu Berlin Labs, https:\/\/lab.sbb.berlin\/dc\/?lang=en"},{"key":"bibr22-0165551520950246","unstructured":"Padilla T, Allen L, Frost H et al. 50 things \u2013 always already computational: collections as data, 2019, https:\/\/doi.org\/10.5281\/zenodo.3066237"},{"key":"bibr23-0165551520950246","unstructured":"Det Kgl Bibliotek. Det Kgl Bibliotek Labs, https:\/\/labs.kb.dk\/"},{"key":"bibr24-0165551520950246","doi-asserted-by":"publisher","DOI":"10.6017\/ital.v38i4.11101"},{"key":"bibr25-0165551520950246","unstructured":"Project Jupyter, https:\/\/jupyter.org\/"},{"key":"bibr26-0165551520950246","unstructured":"Sherratt T. Glam-workbench\/getting-started, 2019, https:\/\/doi.org\/10.5281\/zenodo.3549636"},{"key":"bibr27-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1109\/JCDL.2019.00059"},{"key":"bibr28-0165551520950246","unstructured":"Library of Congress. Chronicling America data visualizations, 2019, https:\/\/www.loc.gov\/ndnp\/data-visualizations\/"},{"key":"bibr29-0165551520950246","unstructured":"Library of Congress. By the people, https:\/\/crowd.loc.gov\/"},{"key":"bibr30-0165551520950246","unstructured":"CoreTrustSeal Standards and Certification Board. CoreTrustSeal trustworthy data repositories requirements: extended guidance 2020\u20132022, 2019, https:\/\/doi.org\/10.5281\/zenodo.3632533"},{"key":"bibr31-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1108\/JD-06-2019-0112"},{"key":"bibr32-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-1767-5_2"},{"key":"bibr33-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1177\/0165551518812658"},{"key":"bibr34-0165551520950246","unstructured":"World Wide Web Consortium. RDF 1.1 Primer, 2014, https:\/\/www.w3.org\/TR\/rdf11-primer\/"},{"key":"bibr35-0165551520950246","unstructured":"OpenRefine, https:\/\/openrefine.org\/"},{"key":"bibr36-0165551520950246","volume":"68","author":"Esteban MPE","year":"2020","journal-title":"Comput Stand Interfaces"},{"key":"bibr37-0165551520950246","unstructured":"Jevon G. Clean. Migrate. Validate. Enhance. Processing archival metadata with open refine, 2020, https:\/\/blogs.bl.uk\/digital-scholarship\/2020\/04\/clean-migrate-validate-enhance-processing-archival-metadata-with-open-refine.html"},{"key":"bibr38-0165551520950246","unstructured":"McPhillips T, Li L, Parulian N et al. Modeling provenance and understanding reproducibility for openrefine data cleaning workflows. In: 11th international workshop on theory and practice of provenance (TaPP 2019). Philadelphia, PA: USENIX Association, https:\/\/www.usenix.org\/conference\/tapp2019\/presentation\/mcphillips"},{"key":"bibr39-0165551520950246","unstructured":"Ferriter M. Introducing the computing cultural heritage in the cloud project, 2019, https:\/\/blogs.loc.gov\/thesignal\/2019\/11\/introducing-the-computing-cultural-heritage-in-the-cloud-project\/?loclr=blogsig"},{"key":"bibr40-0165551520950246","unstructured":"European Commission. Modernisation of the EU copyright rules, 2019, https:\/\/ec.europa.eu\/digital-single-market\/en\/modernisation-eu-copyright-rules#:~:text=The%20objective%20of%20the%20Directive, digital%20and%20cross%2Dborder%20uses"},{"key":"bibr41-0165551520950246","unstructured":"Open Archives Initiative. Open archives initiative protocol for metadata harvesting, https:\/\/www.openarchives.org\/pmh\/"},{"key":"bibr42-0165551520950246","unstructured":"World Wide Web Consortium. SPARQL 1.1 query language, 2013, https:\/\/www.w3.org\/TR\/sparql11-query\/"},{"key":"bibr43-0165551520950246","unstructured":"Pandas \u2212 Python Data Analysis Library, https:\/\/pandas.pydata.org\/"},{"key":"bibr44-0165551520950246","unstructured":"Hagedorn S. When sweet and cute isn\u2019t enough anymore: solving scalability issues in python pandas with grizzly. In: 10th conference on innovative data systems research (CIDR 2020), Amsterdam, 12\u221215 January 2020, http:\/\/cidrdb.org\/cidr2020\/gongshow2020\/gongshow\/abstracts\/cidr2020_abstract76.pdf"},{"key":"bibr45-0165551520950246","unstructured":"World Wide Web Consortium. Data on the web best practices, 2017, https:\/\/www.w3.org\/TR\/dwbp\/"},{"key":"bibr46-0165551520950246","unstructured":"Walsh P, Pollock R. Data package, 2007, https:\/\/specs.frictionlessdata.io\/data-package"},{"key":"bibr47-0165551520950246","unstructured":"Open Knowledge Foundation, http:\/\/goodtables.io\/"},{"key":"bibr48-0165551520950246","unstructured":"Raschka S, Patterson J, Nolet C. Machine learning in python: main developments and technology trends in data science, machine learning, and artificial intelligence. CoRR 2020, https:\/\/arxiv.org\/abs\/2002.04803.2002.04803"},{"key":"bibr49-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2019.2914711"},{"key":"bibr50-0165551520950246","unstructured":"NumPy, https:\/\/numpy.org\/"},{"key":"bibr51-0165551520950246","unstructured":"Matplotlib: visualization with python, https:\/\/matplotlib.org\/"},{"key":"bibr52-0165551520950246","unstructured":"British Library. Basic RDF\/XML, 2014, http:\/\/www.bl.uk\/bibliographic\/datafree.html#basicrdfxml (accessed 26 April 2020)."},{"key":"bibr53-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1045\/january2015-bontcheva"},{"key":"bibr54-0165551520950246","unstructured":"Folium, https:\/\/python-visualization.github.io\/folium\/"},{"key":"bibr55-0165551520950246","unstructured":"Pillow, https:\/\/pillow.readthedocs.io\/en\/stable\/"},{"key":"bibr56-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1145\/3243907.3243914"},{"key":"bibr57-0165551520950246","unstructured":"McCallum AK. Mallet: a machine learning for language toolkit, 2002, http:\/\/mallet.cs.umass.edu"},{"key":"bibr58-0165551520950246","unstructured":"Enderle JS, Balagopalan A, Li X et al. senderle\/topic-modeling-tool: first stable release2017, https:\/\/doi.org\/10.5281\/zenodo.496150"},{"key":"bibr59-0165551520950246","unstructured":"British Library. Theatrical playbills from Britain and Ireland (OCR text only), 2015, https:\/\/doi.org\/10.21250\/pb2"},{"key":"bibr60-0165551520950246","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2973071"},{"key":"bibr61-0165551520950246","unstructured":"OpenCV, https:\/\/opencv.org\/"},{"key":"bibr62-0165551520950246","unstructured":"Library of Congress. Marc standards, http:\/\/www.loc.gov\/marc\/"},{"key":"bibr63-0165551520950246","unstructured":"Pymarc, https:\/\/pymarc.readthedocs.io\/en\/latest\/"}],"container-title":["Journal of Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551520950246","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/0165551520950246","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551520950246","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,2]],"date-time":"2025-03-02T08:39:09Z","timestamp":1740904749000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/0165551520950246"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,8,24]]},"references-count":63,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,4]]}},"alternative-id":["10.1177\/0165551520950246"],"URL":"https:\/\/doi.org\/10.1177\/0165551520950246","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"value":"0165-5515","type":"print"},{"value":"1741-6485","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,8,24]]}}}