{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T23:57:25Z","timestamp":1776729445539,"version":"3.51.2"},"reference-count":99,"publisher":"Emerald","issue":"1","license":[{"start":{"date-parts":[[2024,12,24]],"date-time":"2024-12-24T00:00:00Z","timestamp":1734998400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["DLP"],"published-print":{"date-parts":[[2025,1,28]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title>\n<jats:p>This paper aims to address the pressing challenges in research data management within institutional repositories, focusing on the escalating volume, heterogeneity and multi-source nature of research data. The aim is to enhance the data services provided by institutional repositories and modernise their role in the research ecosystem.<\/jats:p>\n<\/jats:sec>\n<jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title>\n<jats:p>The authors analyse the evolution of data management architectures through literature review, emphasising the advantages of data lakehouses. Using the design science research methodology, the authors develop an end-to-end data lakehouse architecture tailored to the needs of institutional repositories. This design is refined through interviews with data management professionals, institutional repository administrators and researchers.<\/jats:p>\n<\/jats:sec>\n<jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title>\n<jats:p>The authors present a comprehensive framework for data lakehouse architecture, comprising five fundamental layers: data collection, data storage, data processing, data management and data services. 
Each layer articulates the implementation steps, delineates the dependencies between them and identifies potential obstacles with corresponding mitigation strategies.<\/jats:p>\n<\/jats:sec>\n<jats:sec><jats:title content-type=\"abstract-subheading\">Practical implications<\/jats:title>\n<jats:p>The proposed data lakehouse architecture provides a practical and scalable solution for institutional repositories to manage research data. It offers a range of benefits, including enhanced data management capabilities, expanded data services, improved researcher experience and a modernised institutional repository ecosystem. The paper also identifies and addresses potential implementation obstacles and provides valuable guidance for institutions embarking on the adoption of this architecture. The implementation in a university library showcases how the architecture enhances data sharing among researchers and empowers institutional repository administrators with comprehensive oversight and control of the university\u2019s research data landscape.<\/jats:p>\n<\/jats:sec>\n<jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title>\n<jats:p>This paper enriches the theoretical knowledge and provides a comprehensive research framework and paradigm for scholars in research data management. 
It details a pioneering application of the data lakehouse architecture in an academic setting, highlighting its practical benefits and adaptability to meet the specific needs of institutional repositories.<\/jats:p>\n<\/jats:sec>","DOI":"10.1108\/dlp-02-2024-0022","type":"journal-article","created":{"date-parts":[[2024,12,19]],"date-time":"2024-12-19T23:04:50Z","timestamp":1734649490000},"page":"145-178","source":"Crossref","is-referenced-by-count":10,"title":["Research data management in institutional repositories: an architectural approach using data lakehouses"],"prefix":"10.1108","volume":"41","author":[{"given":"Zilong","family":"He","sequence":"first","affiliation":[]},{"given":"Wei","family":"Fang","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2024,12,24]]},"reference":[{"key":"key2025012506554756800_ref001","doi-asserted-by":"publisher","first-page":"70","DOI":"10.1016\/j.jpdc.2023.02.007","article-title":"Spatial big data architecture: from data warehouses and data Lakes to the lake house","volume":"176","year":"2023","journal-title":"Journal of Parallel and Distributed Computing"},{"issue":"1","key":"key2025012506554756800_ref002","first-page":"153","article-title":"Data warehouse as a backbone for business intelligence: issues and challenges","volume":"33","year":"2011","journal-title":"European Journal of Economics, Finance and Administrative Sciences"},{"issue":"4","key":"key2025012506554756800_ref003","doi-asserted-by":"publisher","first-page":"851","DOI":"10.1007\/s10209-016-0475-y","article-title":"A comparison of research data management platforms: architecture, flexible metadata and interoperability","volume":"16","year":"2017","journal-title":"Universal Access in the Information Society"},{"issue":"4","key":"key2025012506554756800_ref004","doi-asserted-by":"publisher","first-page":"349","DOI":"10.1080\/13614533.2021.1964549","article-title":"Research data management (RDM) and the evolving identity of 
academic libraries and librarians: a literature review","volume":"28","year":"2022","journal-title":"New Review of Academic Librarianship"},{"issue":"2","key":"key2025012506554756800_ref005","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1007\/s40031-019-00388-x","article-title":"Data categorization using Hadoop map reduce-based parallel K-Means clustering","volume":"100","year":"2019","journal-title":"Journal of The Institution of Engineers (India): Series B"},{"key":"key2025012506554756800_ref006","article-title":"Lakehouse: a new generation of open platforms that unify data warehousing and advanced analytics","volume":"8","year":"2021","journal-title":"Proceedings of CIDR"},{"key":"key2025012506554756800_ref007","doi-asserted-by":"publisher","first-page":"35242","DOI":"10.1109\/ACCESS.2019.2897729","article-title":"Understanding institutional repository in higher learning institutions: a systematic literature review and directions for future research","volume":"7","year":"2019","journal-title":"IEEE Access"},{"key":"key2025012506554756800_ref008","doi-asserted-by":"publisher","DOI":"10.1108\/GKMC-07-2020-0103","article-title":"A systematic literature review on research data management practices and services","year":"2020","journal-title":"Global Knowledge, Memory and Communication"},{"key":"key2025012506554756800_ref009","doi-asserted-by":"crossref","first-page":"44","DOI":"10.5334\/dsj-2019-044","article-title":"The Australian research data commons","volume":"18","year":"2019","journal-title":"Data Science Journal"},{"issue":"3","key":"key2025012506554756800_ref010","doi-asserted-by":"publisher","first-page":"518","DOI":"10.1177\/09610006211009592","article-title":"Evolution of institutional repositories: managing institutional research output to remove the gap of academic elitism","volume":"54","year":"2022","journal-title":"Journal of Librarianship and Information 
Science"},{"key":"key2025012506554756800_ref011","doi-asserted-by":"publisher","first-page":"4643","DOI":"10.1109\/BigData52589.2021.9671534","article-title":"A lakehouse architecture for the management and analysis of heterogeneous data for biomedical research and mega-biobanks","year":"2021"},{"issue":"3","key":"key2025012506554756800_ref012","doi-asserted-by":"publisher","DOI":"10.47989\/irpaper942","article-title":"Fitness for use of data: Scientists\u2019 heuristics of discovery and reuse behaviour framed by the FAIR data principles","volume":"27","year":"2022","journal-title":"Information Research: An International Electronic Journal"},{"key":"key2025012506554756800_ref013","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1109\/SSDM.2002.1029701","article-title":"A conceptual framework for composing and managing scientific data lineage","year":"2002"},{"key":"key2025012506554756800_ref014","unstructured":"Carson, M.B. (2024), \u201cSupporting transparency and compliance with invenio RDM\u201d, Open Repositories 2024 (OR2024), G\u00f6teborg, Sweden, doi: 10.5281\/zenodo.12587111."},{"issue":"2","key":"key2025012506554756800_ref015","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1007\/s41870-017-0067-y","article-title":"Comprehensive survey on data warehousing research","volume":"10","year":"2018","journal-title":"International Journal of Information Technology"},{"key":"key2025012506554756800_ref016","doi-asserted-by":"crossref","first-page":"67","DOI":"10.56294\/mw202467","article-title":"Data lakehouse: next generation information system","volume":"3","year":"2024","journal-title":"Seminars in Medical Writing and Education"},{"key":"key2025012506554756800_ref017","doi-asserted-by":"publisher","first-page":"2201","DOI":"10.1145\/2882903.2912574","article-title":"Data cleaning: overview and emerging challenges","year":"2016"},{"key":"key2025012506554756800_ref018","volume-title":"Managing and Sharing Research Data: A Guide to Good 
Practice","year":"2019"},{"issue":"5","key":"key2025012506554756800_ref019","doi-asserted-by":"publisher","first-page":"1","DOI":"10.5334\/dsj-2019-005","article-title":"Supporting the interdisciplinary, long-term research project \u2018patterns in soil-vegetation-atmosphere-systems\u2019 by data management services","volume":"18","year":"2019","journal-title":"Data Science Journal"},{"key":"key2025012506554756800_ref020","volume-title":"DAMA-DMBOK: Data Management Body of Knowledge","author":"DAMA International","year":"2017"},{"key":"key2025012506554756800_ref021","unstructured":"Dappert, A., Mayer, R., Pr\u00f6ll, S., Rauber, A., Page, K.R., Palma, R. and Garijo, D. (2014), \u201cPreserving data to preserving research: curation of process and context\u201d, iPRES, available at: https:\/\/phaidra.univie.ac.at\/detail\/o:378139.pdf"},{"issue":"1","key":"key2025012506554756800_ref022","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1147\/sj.271.0060","article-title":"An architecture for a business and information system","volume":"27","year":"1988","journal-title":"IBM Systems Journal"},{"issue":"2","key":"key2025012506554756800_ref023","first-page":"270","article-title":"NVivo","volume":"110","year":"2022","journal-title":"Journal of the Medical Library Association"},{"key":"key2025012506554756800_ref024","doi-asserted-by":"publisher","first-page":"91265","DOI":"10.1109\/ACCESS.2019.2927491","article-title":"In search of big medical data integration solutions-a comprehensive survey","volume":"7","year":"2019","journal-title":"IEEE Access"},{"key":"key2025012506554756800_ref025","unstructured":"Dixon, J. 
(2010), \u201cPentaho, Hadoop, and data Lakes\u201d, available at: https:\/\/jamesdixon.wordpress.com\/2010\/10\/14\/"},{"issue":"2","key":"key2025012506554756800_ref026","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1177\/09610006211070282","article-title":"Research data management systems and the organization of universities and research institutes: a systematic literature review","volume":"55","year":"2023","journal-title":"Journal of Librarianship and Information Science"},{"key":"key2025012506554756800_ref027","doi-asserted-by":"publisher","first-page":"148","DOI":"10.1007\/978-3-030-87101-7_15","article-title":"Data catalogs: a systematic literature review and guidelines to implementation","volume-title":"Communications in Computer and Information Science","year":"2021"},{"key":"key2025012506554756800_ref028","unstructured":"Esser, A., Frenzel, J., Josenhans, V., Otto, T., Pacharra, M., Walk, P. and Winter, N.O.C. (2024), \u201cCreating trust in research data repositories via a user-driven implementation: case study of a hyrax-based, institutional data repository\u201d, Open Repositories 2024 (OR2024), G\u00f6teborg, Sweden, doi: 10.5281\/zenodo.12579346."},{"key":"key2025012506554756800_ref029","doi-asserted-by":"publisher","article-title":"Metrics to increase data usage understanding and transparency","year":"2024","DOI":"10.5281\/zenodo.11152524"},{"issue":"1","key":"key2025012506554756800_ref030","doi-asserted-by":"publisher","first-page":"1","DOI":"10.54501\/jots.v1i1.24","article-title":"An overview of perceptual hashing","volume":"1","year":"2021","journal-title":"Journal of Online Trust and Safety"},{"key":"key2025012506554756800_ref031","doi-asserted-by":"publisher","first-page":"2089","DOI":"10.1145\/2882903.2899391","article-title":"CLAMS: bringing quality to data Lakes","year":"2016"},{"key":"key2025012506554756800_ref032","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1002\/9781118680605.ch16","article-title":"Data 
modeling","volume-title":"A New Companion to Digital Humanities","year":"2015","edition":"1st ed"},{"issue":"2","key":"key2025012506554756800_ref033","article-title":"Institutional repositories as infrastructures for long-term preservation","volume":"22","year":"2017","journal-title":"Information Research"},{"key":"key2025012506554756800_ref034","doi-asserted-by":"publisher","DOI":"10.18420\/btw2021-19","article-title":"The data lake architecture framework: a foundation for building a comprehensive data lake architecture","volume":"2021","year":"2021","journal-title":"BTW"},{"key":"key2025012506554756800_ref035","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1007\/978-3-030-27520-4_13","article-title":"Leveraging the data lake: Current state and challenges","year":"2019"},{"issue":"02n03","key":"key2025012506554756800_ref036","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1142\/S0218843098000118","article-title":"The dimensional fact model: a conceptual model for data warehouses","volume":"07","year":"1998","journal-title":"International Journal of Cooperative Information Systems"},{"key":"key2025012506554756800_ref037","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1109\/BigData55660.2022.10020719","article-title":"From data warehouse to lakehouse: a comparative review","year":"2022"},{"issue":"6","key":"key2025012506554756800_ref038","article-title":"Storage structures in the era of big data: from data warehouse to lakehouse","volume":"102","year":"2024","journal-title":"Journal of Theoretical and Applied Information Technology"},{"issue":"1","key":"key2025012506554756800_ref039","doi-asserted-by":"publisher","first-page":"134","DOI":"10.2218\/ijdc.v3i1.48","article-title":"The DCC curation lifecycle model","volume":"3","year":"2008","journal-title":"International Journal of Digital Curation"},{"key":"key2025012506554756800_ref040","volume-title":"Data Lake Architecture: Designing the Data Lake and Avoiding the Garbage 
Dump","year":"2016"},{"issue":"1","key":"key2025012506554756800_ref041","first-page":"1","article-title":"What is a data warehouse","volume":"1","year":"1995","journal-title":"Prism Tech Topic"},{"issue":"11","key":"key2025012506554756800_ref042","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1145\/240455.240470","article-title":"The data warehouse and data mining","volume":"39","year":"1996","journal-title":"Communications of the ACM"},{"key":"key2025012506554756800_ref043","article-title":"Analyzing and comparing lakehouse storage systems","year":"2023"},{"key":"key2025012506554756800_ref044","volume-title":"Fundamentals of Data Warehouses","year":"2002"},{"issue":"7","key":"key2025012506554756800_ref045","doi-asserted-by":"publisher","first-page":"1275","DOI":"10.1108\/OIR-03-2020-0079","article-title":"Surveying research data-sharing practices in US social sciences: a knowledge infrastructure-inspired conceptual framework","volume":"46","year":"2022","journal-title":"Online Information Review"},{"issue":"3","key":"key2025012506554756800_ref046","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1108\/LHT-12-2017-0266","article-title":"Investigation of challenges in academic institutional repositories: a survey of academic librarians","volume":"37","year":"2019","journal-title":"Library Hi Tech"},{"issue":"3","key":"key2025012506554756800_ref047","doi-asserted-by":"publisher","first-page":"242","DOI":"10.1108\/DLP-10-2020-0106","article-title":"Research data services from the perspective of academic librarians","volume":"37","year":"2021","journal-title":"Digital Library Perspectives"},{"issue":"5","key":"key2025012506554756800_ref048","doi-asserted-by":"publisher","first-page":"2035","DOI":"10.1007\/s11227-017-2210-8","article-title":"Data deduplication techniques for efficient cloud storage management: a systematic review","volume":"74","year":"2018","journal-title":"The Journal of 
Supercomputing"},{"issue":"6","key":"key2025012506554756800_ref049","doi-asserted-by":"publisher","DOI":"10.1108\/OIR-08-2021-0423","article-title":"Data sharing and reuse practices: disciplinary differences and improvements needed","volume":"47","year":"2023","journal-title":"Online Information Review"},{"key":"key2025012506554756800_ref050","doi-asserted-by":"publisher","first-page":"3025","DOI":"10.1051\/itmconf\/20181703025","article-title":"Data lake: a new ideology in big data era","volume":"17","year":"2018","journal-title":"ITM Web of Conferences"},{"key":"key2025012506554756800_ref051","unstructured":"Kiefer, C. (2016), \u201cAssessing the quality of unstructured data: an initial overview\u201d, LWDA, pp. 62-73, available at: https:\/\/ceur-ws.org\/Vol-1670\/paper-25.pdf"},{"issue":"3","key":"key2025012506554756800_ref052","doi-asserted-by":"publisher","first-page":"709","DOI":"10.1108\/jd-02-2021-0044","article-title":"A sequential route of data and document qualities, satisfaction and motivations on researchers\u2019 data reuse intentions","volume":"78","year":"2022","journal-title":"Journal of Documentation"},{"key":"key2025012506554756800_ref053","volume-title":"The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses","year":"1996"},{"key":"key2025012506554756800_ref054","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1007\/978-1-4842-8233-5_1","article-title":"The data lakehouse paradigm","volume-title":"The Azure Data Lakehouse Toolkit: Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snowflake","year":"2022"},{"key":"key2025012506554756800_ref055","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1145\/3626246.3653388","article-title":"BigLake: big query\u2019s evolution toward a Multi-Cloud 
lakehouse","year":"2024"},{"issue":"1","key":"key2025012506554756800_ref056","doi-asserted-by":"crossref","first-page":"163","DOI":"10.2218\/ijdc.v10i1.354","article-title":"Service integration to enhance research data management: RSpace electronic laboratory notebook case study","volume":"10","year":"2015","journal-title":"International Journal of Digital Curation"},{"key":"key2025012506554756800_ref057","volume-title":"Decision Support and Data Warehouse Systems","year":"2000"},{"key":"key2025012506554756800_ref058","volume-title":"The Security Data Lake","year":"2015"},{"issue":"1","key":"key2025012506554756800_ref059","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-019-0178-3","article-title":"A survey on data storage and placement methodologies for cloud-big data ecosystem","volume":"6","year":"2019","journal-title":"Journal of Big Data"},{"issue":"1","key":"key2025012506554756800_ref060","doi-asserted-by":"crossref","first-page":"102","DOI":"10.2218\/ijdc.v14i1.594","article-title":"Putting the trust into trusted data repositories: a federated solution for the Australian national imaging facility","volume":"14","year":"2019","journal-title":"International Journal of Digital Curation"},{"issue":"1","key":"key2025012506554756800_ref061","doi-asserted-by":"publisher","first-page":"160018","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR guiding principles for scientific data management and stewardship","volume":"3","year":"2016","journal-title":"Scientific Data"},{"issue":"4","key":"key2025012506554756800_ref062","doi-asserted-by":"publisher","first-page":"132","DOI":"10.3390\/bdcc6040132","article-title":"An overview of data warehouse and data lake in modern enterprise data management","volume":"6","year":"2022","journal-title":"Big Data and Cognitive Computing"},{"issue":"12","key":"key2025012506554756800_ref063","doi-asserted-by":"publisher","first-page":"1986","DOI":"10.14778\/3352063.3352116","article-title":"Data lake management: challenges 
and opportunities","volume":"12","year":"2019","journal-title":"Proceedings of the VLDB Endowment"},{"key":"key2025012506554756800_ref064","doi-asserted-by":"publisher","first-page":"1242","DOI":"10.23919\/MIPRO52101.2021.9597091","article-title":"Data lakehouse-a novel step in analytics architecture","year":"2021"},{"key":"key2025012506554756800_ref065","unstructured":"Osswald, A. and Strathmann, S. (2012), \u201cThe role of libraries in curation and preservation of research data in Germany: findings of a survey\u201d, available at: http:\/\/eprints.rclis.org\/27911\/"},{"issue":"3","key":"key2025012506554756800_ref066","doi-asserted-by":"publisher","first-page":"45","DOI":"10.2753\/MIS0742-1222240302","article-title":"A design science research methodology for information systems research","volume":"24","year":"2007","journal-title":"Journal of Management Information Systems"},{"key":"key2025012506554756800_ref067","volume-title":"Data Warehousing Fundamentals: A Comprehensive Guide for IT Professionals","year":"2004"},{"key":"key2025012506554756800_ref068","unstructured":"Poto\u010dekov\u00e1, N. (2023), \u201cData lineage analysis for databricks platform\u201d, available at: https:\/\/dspace.cuni.cz\/handle\/20.500.11956\/184092"},{"issue":"4","key":"key2025012506554756800_ref069","first-page":"3","article-title":"Data cleaning: problems and current approaches","volume":"23","year":"2000","journal-title":"IEEE Data Eng. 
Bull"},{"key":"key2025012506554756800_ref070","article-title":"Transforming dimension of IPR: challenges for new age libraries","year":"2015"},{"key":"key2025012506554756800_ref071","doi-asserted-by":"publisher","first-page":"304","DOI":"10.1007\/978-3-030-27615-7_23","article-title":"Data Lakes: trends and perspectives","year":"2019"},{"issue":"9","key":"key2025012506554756800_ref072","first-page":"2865","article-title":"Data warehousing, data mining, OLAP and OLTP technologies are essential elements to support decision-making process in industries","volume":"2","year":"2010","journal-title":"International Journal on Computer Science and Engineering"},{"key":"key2025012506554756800_ref073","unstructured":"Sadowski, C. and Levin, G. (2007), \u201cSimhash: hash-based similarity detection (https:\/\/bit.ly\/3NIxBfF)\u201d, Technical report, Google, available at: https:\/\/web.archive.org\/web\/20230710083725\/; www.webrankinfo.com\/dossiers\/wp-content\/uploads\/simhash.pdf"},{"key":"key2025012506554756800_ref074","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1145\/3569951.3593597","article-title":"Active research data management with the django globus portal framework","volume-title":"Practice and Experience in Advanced Research Computing","year":"2023"},{"issue":"1","key":"key2025012506554756800_ref075","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1007\/s10844-020-00608-7","article-title":"On data lake architectures and metadata management","volume":"56","year":"2021","journal-title":"Journal of Intelligent Information Systems"},{"issue":"4","key":"key2025012506554756800_ref076","doi-asserted-by":"publisher","first-page":"349","DOI":"10.5860\/crl-255","article-title":"A study of faculty data curation behaviors and attitudes at a Teaching-Centered university","volume":"73","year":"2012","journal-title":"College and Research 
Libraries"},{"key":"key2025012506554756800_ref077","doi-asserted-by":"publisher","DOI":"10.1177\/02666669231157405","article-title":"Evolution of research data management in academic libraries: a review of the literature","year":"2023","journal-title":"Information Development"},{"key":"key2025012506554756800_ref078","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1007\/978-1-4842-7061-5_2","article-title":"Modern data warehouses and data lakehouses","volume-title":"Beginning Azure Synapse Analytics: Transition from Data Warehouse to Data Lakehouse","year":"2021"},{"issue":"2","key":"key2025012506554756800_ref079","doi-asserted-by":"publisher","first-page":"642","DOI":"10.1108\/LHT-01-2021-0007","article-title":"Providing a framework for the reuse of research data based on the development dynamic framework of united nations development program (UNDP)","volume":"41","year":"2021","journal-title":"Library Hi Tech"},{"key":"key2025012506554756800_ref080","unstructured":"Springer Nature (2024), \u201cData repository guidance | scientific data\u201d, available at: www.nature.com\/sdata\/policies\/repositories"},{"key":"key2025012506554756800_ref081","unstructured":"Stall, S., Martone, M.E., Chandramouliswaran, I., Federer, L., Gautier, J., Gibson, J., Hahnel, M., Larkin, J., Pfeiffer, N., Sedora, B., Sim, I., Smith, T., Van Gulick, A.E., Walker, E., Wood, J., Zaringhalam, M. and Zigoni, A. (2023), \u201cGeneralist repository comparison chart\u201d, doi: 10.5281\/zenodo.7946938."},{"issue":"2","key":"key2025012506554756800_ref082","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1515\/libri-2018-0090","article-title":"Data curator\u2019s roles and responsibilities: an international perspective","volume":"69","year":"2019","journal-title":"Libri"},{"key":"key2025012506554756800_ref083","unstructured":"Tekiner, F. and Pierce, S. 
(2021), \u201cOpen data lakehouse on google cloud\u201d, Google Cloud Blog, available at: https:\/\/cloud.google.com\/blog\/products\/data-analytics\/open-data-lakehouse-on-google-cloud"},{"key":"key2025012506554756800_ref084","article-title":"Data wrangling: the challenging yourney from the wild to the lake","year":"2015"},{"key":"key2025012506554756800_ref085","first-page":"126","article-title":"Data warehouse configuration","volume":"97","year":"1997","journal-title":"VLDB"},{"key":"key2025012506554756800_ref086","doi-asserted-by":"publisher","article-title":"An exploration of the functionality and usability of open research platforms to support open science","year":"2024","DOI":"10.5281\/zenodo.11165613"},{"key":"key2025012506554756800_ref087","unstructured":"US. Office of Government Information Services (2021), \u201cData standards\u201d, available at: https:\/\/resources.data.gov\/standards\/concepts\/#data-standard"},{"key":"key2025012506554756800_ref088","doi-asserted-by":"publisher","article-title":"Theme development in qualitative content analysis and thematic analysis","year":"2016","DOI":"10.5430\/jnep.v6n5p100"},{"key":"key2025012506554756800_ref089","doi-asserted-by":"crossref","unstructured":"White House Office of Science and Technology Policy (OSTP) (2022), \u201cDesirable characteristics of data repositories for federally funded research\u201d, Executive Office of the President of the United States, doi: 10.5479\/10088\/113528.","DOI":"10.5479\/10088\/113528"},{"key":"key2025012506554756800_ref090","unstructured":"Wikipedia (2024), \u201cDisciplinary repository\u201d, Wikipedia, available at: https:\/\/en.wikipedia.org\/w\/index.php?title=Disciplinary_repository&oldid=1245805938"},{"issue":"3","key":"key2025012506554756800_ref091","doi-asserted-by":"publisher","first-page":"102508","DOI":"10.1016\/j.acalib.2022.102508","article-title":"A scoping review: synthesizing evidence on data management instruction in academic 
libraries","volume":"48","year":"2022","journal-title":"The Journal of Academic Librarianship"},{"issue":"1","key":"key2025012506554756800_ref092","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1109\/21.87068","article-title":"On ordered weighted averaging aggregation operators in multicriteria decision making","volume":"18","year":"1988","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics"},{"key":"key2025012506554756800_ref093","doi-asserted-by":"publisher","DOI":"10.1108\/LHT-11-2020-0285","article-title":"Knowledge mapping of research data in China: a bibliometric study using visual analysis","year":"2022","journal-title":"Library Hi Tech"},{"issue":"1","key":"key2025012506554756800_ref094","doi-asserted-by":"publisher","first-page":"1","DOI":"10.7710\/2162-3309.1210","article-title":"University faculty awareness and attitudes towards open access publishing and the institutional repository: a case study","volume":"3","year":"2015","journal-title":"Journal of Librarianship and Scholarly Communication"},{"key":"key2025012506554756800_ref095","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1109\/ICSRS.2016.7815845","article-title":"Comparative study of data warehouses modeling approaches: Inmon, kimball and data vault","year":"2016"},{"issue":"3","key":"key2025012506554756800_ref096","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1016\/j.lisr.2017.07.008","article-title":"Social scientists\u2019 data reuse behaviors: exploring the roles of attitudinal beliefs, attitudes, norms, and data repositories","volume":"39","year":"2017","journal-title":"Library and Information Science Research"},{"issue":"1","key":"key2025012506554756800_ref097","first-page":"1","article-title":"Quality assessment methodologies for linked open data","volume":"1","year":"2013","journal-title":"Submitted to Semantic Web 
Journal"},{"key":"key2025012506554756800_ref098","doi-asserted-by":"publisher","first-page":"1951","DOI":"10.1145\/3318464.3389726","article-title":"Finding related tables in data Lakes for interactive data science","year":"2020"},{"key":"key2025012506554756800_ref099","article-title":"Advanced data warehouse design: from conventional to spatial and temporal applications","volume-title":"Data-Centric Systems and Applications","year":"2008"}],"container-title":["Digital Library Perspectives"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/DLP-02-2024-0022\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/DLP-02-2024-0022\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:09:51Z","timestamp":1753398591000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/dlp\/article\/41\/1\/145-178\/1239471"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,24]]},"references-count":99,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12,24]]},"published-print":{"date-parts":[[2025,1,28]]}},"alternative-id":["10.1108\/DLP-02-2024-0022"],"URL":"https:\/\/doi.org\/10.1108\/dlp-02-2024-0022","relation":{},"ISSN":["2059-5816","2059-5824"],"issn-type":[{"value":"2059-5816","type":"print"},{"value":"2059-5824","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,24]]}}}