{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:29:03Z","timestamp":1760146143364,"version":"build-2065373602"},"reference-count":75,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2024,9,26]],"date-time":"2024-09-26T00:00:00Z","timestamp":1727308800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Software"],"abstract":"<jats:p>Open Science describes the movement of making any research artifact available to the public, fostering sharing and collaboration. While sharing the source code is a popular Open Science practice in software research and development, there is still a lot of work to be done to achieve the openness of the whole research and development cycle from the conception to the preservation phase. In this direction, the software engineering community faces significant challenges in adopting open science practices due to the complexity of the data, the heterogeneity of the development environments and the diversity of the application domains. In this paper, through the discussion of the 5Ws+1H (Why, Who, What, When, Where, and How) questions that are referred to as the Kipling\u2019s framework, we aim to provide a structured guideline to motivate and assist the software engineering community on the journey to data openness. Also, we demonstrate the practical application of these guidelines through a use case on opening research data.<\/jats:p>","DOI":"10.3390\/software3040021","type":"journal-article","created":{"date-parts":[[2024,9,26]],"date-time":"2024-09-26T11:40:27Z","timestamp":1727350827000},"page":"411-441","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Opening Software Research Data 5Ws+1H"],"prefix":"10.3390","volume":"3","author":[{"ORCID":"https:\/\/orcid.org\/0009-0008-0985-6688","authenticated-orcid":false,"given":"Anastasia","family":"Terzi","sequence":"first","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of Western Macedonia, 50100 Kozani, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4248-3752","authenticated-orcid":false,"given":"Stamatia","family":"Bibi","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of Western Macedonia, 50100 Kozani, Greece"}]}],"member":"1968","published-online":{"date-parts":[[2024,9,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Russell, A.L. (2014). Open Standards and the Digital Age: History, Ideology, and Networks, Cambridge University Press.","DOI":"10.1017\/CBO9781139856553"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Critchlow, T., and Kleese, K. (2016). Data-Intensive Science, CRC Press.","DOI":"10.1201\/b14935"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"745","DOI":"10.1038\/nchem.1149","article-title":"Open science is a research accelerator","volume":"3","author":"Woelfle","year":"2011","journal-title":"Nat. Chem."},{"key":"ref_4","unstructured":"(2024, September 15). Open Science Definition | FOSTER. Available online: https:\/\/www.fosteropenscience.eu\/foster-taxonomy\/open-science-definition."},{"key":"ref_5","unstructured":"(2024, September 15). Budapest Open Access Initiative. Available online: https:\/\/www.budapestopenaccessinitiative.org\/read\/."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1423","DOI":"10.1162\/qss_a_00167","article-title":"A framework for creating knowledge graphs of scientific software metadata","volume":"2","author":"Kelley","year":"2021","journal-title":"Quant. Sci. Stud."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"101278","DOI":"10.1016\/j.jocs.2020.101278","article-title":"Crediting pull requests to open source research software as an academic contribution","volume":"49","author":"Anzt","year":"2021","journal-title":"J. Comput. Sci."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1309","DOI":"10.1109\/TSE.2020.3018726","article-title":"Redundancy, context, and preference: An empirical study of duplicate pull requests in OSS projects","volume":"48","author":"Li","year":"2020","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/j.jss.2018.03.047","article-title":"Software search is not a science, even among scientists: A survey of how scientists and engineers find software","volume":"141","author":"Hucka","year":"2018","journal-title":"J. Syst. Softw."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"100222","DOI":"10.1016\/j.patter.2021.100222","article-title":"Taking a fresh look at FAIR for research software","volume":"2","author":"Katz","year":"2021","journal-title":"Patterns"},{"key":"ref_11","first-page":"165","article-title":"Software Maintenance as Materialization of Common Knowledge","volume":"9","author":"Ojala","year":"2023","journal-title":"Engag. Sci. Technol. Soc."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zaragoz\u00ed, B.M., Trilles, S., and Navarro-Carri\u00f3n, J.T. (2020). Leveraging Container Technologies in a GIScience Project: A Perspective from Open Reproducible Research. ISPRS Int. J. Geo-Inf., 9.","DOI":"10.3390\/ijgi9030138"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"91","DOI":"10.4067\/S0718-18762018000100106","article-title":"Views on Open Data Business from Software Development Companies","volume":"13","author":"Herala","year":"2018","journal-title":"J. Theor. Appl. Electron. Commer. Res."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/S0007-6813(03)00071-5","article-title":"A managerial overview of open source software","volume":"45","author":"Krishnamurthy","year":"2003","journal-title":"Bus. Horizons"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1145\/3449249","article-title":"The Labor of Maintaining and Scaling Free and Open-Source Software Projects","volume":"5","author":"Geiger","year":"2021","journal-title":"Proc. ACM Hum.-Comput. Interact."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Terzi, A., Christou, O., Bibi, S., and Angelidis, P. (September, January 31). Software Reuse and Evolution in JavaScript Applications. Proceedings of the 2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), Gran Canaria, Spain.","DOI":"10.1109\/SEAA56994.2022.00048"},{"key":"ref_17","unstructured":"Jackson, M. (2024, September 15). Software Deposit: What to Deposit. Available online: https:\/\/doi.org\/10.5281\/zenodo.1327325."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Pashchenko, I., Plate, H., Ponta, S.E., Sabetta, A., and Massacci, F. (2018, January 11\u201312). Vulnerable open source dependencies: Counting those that matter. Proceedings of the 12th ACM\/IEEE International Symposium on Empirical Software Engineering and Measurement, New York, NY, USA.","DOI":"10.1145\/3239235.3268920"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Alarcon, G.M., Gibson, A.M., Walter, C., Gamble, R.F., Ryan, T.J., Jessup, S.A., Boyd, B.E., and Capiola, A. (2020). Trust Perceptions of Metadata in Open-Source Software: The Role of Performance and Reputation. Systems, 8.","DOI":"10.3390\/systems8030028"},{"key":"ref_20","first-page":"e2630v1","article-title":"Software, vs. data in the context of citation","volume":"4","author":"Katz","year":"2016","journal-title":"PeerJ Prepr."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Tenopir, C., Allard, S., Douglass, K., Aydinoglu, A.U., Wu, L., Read, E., Manoff, M., and Frame, M. (2011). Data Sharing by Scientists: Practices and Perceptions. PLoS ONE, 6.","DOI":"10.1371\/journal.pone.0021101"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wnuk, K., Pfahl, D., Callele, D., and Karlsson, E.A. (2012, January 19\u201320). How can open source software development help requirements management gain the potential of open innovation: An exploratory study. Proceedings of the ACM-IEEE International Symposium on Empirical Software Engineering and Measurement, New York, NY, USA.","DOI":"10.1145\/2372251.2372301"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Ho-Quang, T., Hebig, R., Robles, G., Chaudron, M.R., and Fernandez, M.A. (2017, January 20\u201328). Practices and Perceptions of UML Use in Open Source Projects. Proceedings of the 2017 IEEE\/ACM 39th International Conference on Software Engineering: Software Engineering in Practice Track (ICSE-SEIP), Buenos Aires, Argentina.","DOI":"10.1109\/ICSE-SEIP.2017.28"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Hebig, R., Quang, T.H., Chaudron, M.R.V., Robles, G., and Fernandez, M.A. (2016, January 2\u20137). The quest for open source projects that use UML: Mining GitHub. Proceedings of the ACM\/IEEE 19th International Conference on Model Driven Engineering Languages and Systems, New York, NY, USA.","DOI":"10.1145\/2976767.2976778"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1016\/j.dss.2009.04.004","article-title":"Steal my idea! Organizational adoption of user innovations from a user innovation community: A case study of Dell IdeaStorm","volume":"48","author":"Wasko","year":"2009","journal-title":"Decis. Support Syst."},{"key":"ref_26","unstructured":"(2024, September 15). Open Data Report | Elsevier. Available online: https:\/\/www.elsevier.com\/about\/open-science\/research-data\/open-data-report."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Runeson, P., Soderberg, E., and Host, M. (2024, January 16). A conceptual framework and recommendations for open data and artifacts in empirical software engineering. Proceedings of the 1st IEEE\/ACM International Workshop on Methodological Issues with Empirical Studies in Software Engineering, Lisbon, Portugal.","DOI":"10.1145\/3643664.3648206"},{"key":"ref_28","unstructured":"Kipling, R. (1902). Just So Stories, Macmillan & Co."},{"key":"ref_29","first-page":"319","article-title":"ABC Analysis, Forecasting And Economic Order Quantity (Eoq) Implementation to Improve Smooth Operation Process","volume":"1","author":"Imarah","year":"2020","journal-title":"Dinasti Int. J. Educ. Manag. Soc. Sci."},{"key":"ref_30","first-page":"139","article-title":"\u2018The five W\u2019s: An old tool for the new task of task analysis","volume":"43","author":"Hart","year":"1996","journal-title":"Tech. Commun."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Abdulkadir, S., and Aliyu, H.O. (2018). ReQueclass: A Framework for Classifying Requirement Elicitation Questions Based on Kipling\u2019s Technique and Zachman\u2019s Enterprise Framework\u2014A Guide for Software Requirement Engineers, i-manager Publications.","DOI":"10.26634\/jse.13.2.15625"},{"key":"ref_32","unstructured":"Terzi, A., Bibi, S., Tsitsimiklis, N., and Angelidis, P. Using Code from ChatGPT: Finding Patterns in the Developers\u2019 Interaction with ChatGPT. Proceedings of the International Conference on Software and Software Reuse."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Schmidt, B., Gemeinholzer, B., and Treloar, A. (2016). Open Data in Global Environmental Research: The Belmont Forum\u2019s Open Data Survey. PLoS ONE, 11.","DOI":"10.1371\/journal.pone.0146695"},{"key":"ref_34","unstructured":"Data, S., and Astell, M. (2024, September 15). Benefits of Open Research Data Infographic. Available online: https:\/\/doi.org\/10.6084\/m9.figshare.5179006.v3."},{"key":"ref_35","unstructured":"Jackson, M. (2024, September 15). Software Deposit: Why Deposit Software. Available online: https:\/\/doi.org\/10.5281\/zenodo.1327333."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1002\/pra2.2015.1450520100141","article-title":"Exploring openness in data and science: What is \u201copen\u201d, to whom, when, and why?","volume":"52","author":"Pasquetto","year":"2015","journal-title":"Proc. Proc. Assoc. Inf. Sci. Technol."},{"key":"ref_37","unstructured":"Reilly, S., Schallier, W., Schrimpf, S., Smit, E., and Wilkinson, M. (2024, September 15). Report on Integration of Data and Publications. Available online: https:\/\/doi.org\/10.5281\/zenodo.8307."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"8","DOI":"10.5334\/dsj-2017-008","article-title":"On the Reuse of Scientific Data","volume":"16","author":"Pasquetto","year":"2017","journal-title":"Data Sci. J."},{"key":"ref_39","unstructured":"Bartling, S., and Friesike, S. (2014). Open Access: A State of the Art. Opening Science: The Evolving Guide on How the Internet Is Changing Research, Collaboration and Scholarly Publishing, Springer International Publishing."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1109\/TSE.2015.2465387","article-title":"Evaluating the Effects of Architectural Documentation: A Case Study of a Large Scale Open Source Project","volume":"42","author":"Kazman","year":"2016","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Ding, W., Liang, P., Tang, A., Van Vliet, H., and Shahin, M. (2014, January 4\u20137). How Do Open Source Communities Document Software Architecture: An Exploratory Survey. Proceedings of the 2014 19th International Conference on Engineering of Complex Computer Systems, Tianjin, China.","DOI":"10.1109\/ICECCS.2014.26"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Gandhi, R., Germonprez, M., and Link, G.J. (2018, January 7\u201310). Open Data Standards for Open Source Software Risk Management Routines: An Examination of SPDX. Proceedings of the 2018 ACM International Conference on Supporting Group Work, New York, NY, USA.","DOI":"10.1145\/3148330.3148333"},{"key":"ref_43","unstructured":"(2024, September 15). Open Science: Purpose, Benefits, and What It Means for You. Available online: https:\/\/blog.theopenscholar.com\/en\/open-science-purposebenefits."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"e16800","DOI":"10.7554\/eLife.16800","article-title":"How open science helps researchers succeed","volume":"5","author":"McKiernan","year":"2016","journal-title":"eLife"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"418","DOI":"10.1525\/bio.2009.59.5.9","article-title":"Motivating Online Publication of Data","volume":"59","author":"Costello","year":"2009","journal-title":"BioScience"},{"key":"ref_46","unstructured":"Enders, T., Satzger, G., Fassnacht, M., and Wolff, C. (2022, January 5\u20139). Why should I share? Exploring benefits of open data for private sector organizations. Proceedings of the Pacific Asia Conference on Information Systems, Taibei, Taiwan."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"622","DOI":"10.1038\/s41597-022-01710-x","article-title":"Introducing the FAIR Principles for research software","volume":"9","author":"Barker","year":"2022","journal-title":"Sci. Data"},{"key":"ref_48","first-page":"39","article-title":"From FAIR research data toward FAIR and open research software","volume":"62","author":"Hasselbring","year":"2020","journal-title":"Inf. Technol."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Gil, Y., Ratnakar, V., and Garijo, D. (2015, January 7\u201310). OntoSoft: Capturing Scientific Software Metadata. Proceedings of the 8th International Conference on Knowledge Capture, New York, NY, USA.","DOI":"10.1145\/2815833.2816955"},{"key":"ref_50","unstructured":"Martinez-Ortiz, C., Martinez Lavanchy, P., Sesink, L., Olivier, B.G., Meakin, J., de Jong, M., and Cruz, M. (2024, September 15). Practical Guide to Software Management Plans. Available online: https:\/\/doi.org\/10.5281\/zenodo.7589725."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"e86","DOI":"10.7717\/peerj-cs.86","article-title":"Software citation principles","volume":"2","author":"Smith","year":"2016","journal-title":"PeerJ Comput. Sci."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"37","DOI":"10.3233\/DS-190026","article-title":"Towards FAIR principles for research software","volume":"3","author":"Lamprecht","year":"2020","journal-title":"Data Sci."},{"key":"ref_53","unstructured":"Druskat, S., Bertuch, O., Juckeland, G., Knodel, O., and Schlauch, T. (2022). Software publications with rich metadata: State of the art, automated workflows and HERMES concept. arXiv."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1016\/j.jss.2014.12.050","article-title":"An insight into license tools for open source software systems","volume":"102","author":"Kapitsaki","year":"2015","journal-title":"J. Syst. Softw."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"2209","DOI":"10.1007\/s11192-021-03869-3","article-title":"To what extent is researchers\u2019 data-sharing motivated by formal mechanisms of recognition and credit?","volume":"126","year":"2021","journal-title":"Scientometrics"},{"key":"ref_56","first-page":"5628","article-title":"Problems and Challenges in the Preservation of Digital Contents: An Analytical Study","volume":"2021","author":"Shah","year":"2021","journal-title":"Libr. Philos. Pract."},{"key":"ref_57","unstructured":"Strecker, D. (2021). Quantitative Assessment of Metadata Collections of Research Data Repositories. [Master\u2019s Thesis, Humboldt-Universit\u00e4t zu Berlin, Philosophische Fakult\u00e4t]."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1016\/j.envsoft.2014.06.022","article-title":"A computational model library for publishing model documentation and code","volume":"61","author":"Rollins","year":"2014","journal-title":"Environ. Model. Softw."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"1226","DOI":"10.1126\/science.1213847","article-title":"Reproducible research in computational science","volume":"334","author":"Peng","year":"2011","journal-title":"Science"},{"key":"ref_60","unstructured":"Gousios, G., Vasilescu, B., Serebrenik, A., and Zaidman, A. (June, January 31). Lean GHTorrent: GitHub data on demand. Proceedings of the 11th Working Conference on Mining Software Repositories, Hyderabad, India."},{"key":"ref_61","unstructured":"Hypp\u00f6l\u00e4, J., Essen von, J., and Keskitalo, E.P. (2015, January 22\u201324). Beyond Open Access\u2014Tools and methods for open research. Proceedings of the AcademicMindTrek\u201915, Tampere, Finland."},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1109\/MC.2020.2998235","article-title":"Open source research software","volume":"53","author":"Hasselbring","year":"2020","journal-title":"Computer"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Di Cosmo, R. (2020, January 13\u201316). Archiving and referencing source code with Software Heritage. Proceedings of the Mathematical Software\u2013ICMS 2020: 7th International Conference, Braunschweig, Germany.","DOI":"10.1007\/978-3-030-52200-1_36"},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1629\/uksg.510","article-title":"Data journals: Incentivizing data access and documentation within the scholarly communication system","volume":"33","author":"Walters","year":"2020","journal-title":"Insights"},{"key":"ref_65","first-page":"1","article-title":"Sicherstellung der Reproduzierbarkeit von Forschungsergebnissen durch Bewahrung des Zugriffs auf Forschungssoftware","volume":"5","author":"Brettschneider","year":"2023","journal-title":"Bausteine Forschungsdatenmanagement"},{"key":"ref_66","unstructured":"Chue Hong, N.P., and Crouch, S. (2024, September 15). What Is a Software Management Plan and How Can It Help Your Project?. Available online: https:\/\/doi.org\/10.5281\/zenodo.5648418."},{"key":"ref_67","unstructured":"Gomez-Diaz, T., and Romier, G. (2024, September 15). Research Software Management Plan Template, V3.2. Bilingual Document (FR\/EN). Available online: https:\/\/hal.science\/hal-01802565\/document."},{"key":"ref_68","unstructured":"Chair, N.C.C., Choukri, K., Declerck, T., Goggi, S., Grobelnik, M., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., and Odijk, J. (2016). The Public License Selector: Making Open Licensing Easier. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), European Language Resources Association (ELRA)."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Xiao, T., Treude, C., Hata, H., and Matsumoto, K. (2024). Devgpt: Studying developer-chatgpt conversations. Proceedings of the 2024 IEEE\/ACM 21st International Conference on Mining Software Repositories (MSR), IEEE.","DOI":"10.1145\/3643991.3648400"},{"key":"ref_70","unstructured":"(2024, September 15). NAIST. Available online: https:\/\/github.com\/NAIST."},{"key":"ref_71","unstructured":"(2024, September 15). ChatGPT Shared Links FAQ | OpenAI Help Center. Available online: https:\/\/help.openai.com\/en\/articles\/7925741-chatgpt-shared-links-faq."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"White, J., Hays, S., Fu, Q., Spencer-Smith, J., and Schmidt, D.C. (2024). Chatgpt prompt patterns for improving code quality, refactoring, requirements elicitation, and software design. Generative AI for Effective Software Development, Springer.","DOI":"10.1007\/978-3-031-55642-5_4"},{"key":"ref_73","unstructured":"(2024, September 15). GitHub-Anasterzia\/gptchallenge. Available online: https:\/\/github.com\/Anasterzia\/gptchallenge."},{"key":"ref_74","unstructured":"(2024, September 15). B2SHARE. Available online: https:\/\/b2share.eudat.eu\/records\/db2ef5890fa44c7a85af366a50de73b9."},{"key":"ref_75","unstructured":"Meijer, I., Costas, R., Zahedi, Z., and Wouters, P. (2016). The Value of Research Data\u2014Metrics for Datasets from a Cultural and Technical Point of View. A Knowledge Exchange Report, Knowledge Exchange."}],"container-title":["Software"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2674-113X\/3\/4\/21\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:04:15Z","timestamp":1760112255000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2674-113X\/3\/4\/21"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,26]]},"references-count":75,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["software3040021"],"URL":"https:\/\/doi.org\/10.3390\/software3040021","relation":{},"ISSN":["2674-113X"],"issn-type":[{"type":"electronic","value":"2674-113X"}],"subject":[],"published":{"date-parts":[[2024,9,26]]}}}