{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,21]],"date-time":"2025-04-21T11:08:21Z","timestamp":1745233701651,"version":"3.38.0"},"reference-count":17,"publisher":"China Science Publishing & Media Ltd.","issue":"2","license":[{"start":{"date-parts":[[2022,4,27]],"date-time":"2022-04-27T00:00:00Z","timestamp":1651017600000},"content-version":"vor","delay-in-days":116,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>In Canonical Workflow Framework for Research (CWFR) \u201cpackages\u201d are relevant in two different directions. In data science, workflows are in general being executed on a set of files which have been aggregated for specific purposes, such as for training a model in deep learning. We call this type of \u201cpackage\u201d a data collection and its aggregation and metadata description is motivated by research interests. The other type of \u201cpackages\u201d relevant for CWFR are supposed to represent workflows in a self-describing and self-contained way for later execution. In this paper, we will review different packaging technologies and investigate their usability in the context of CWFR. For this purpose, we draw on an exemplary use case and show how packaging technologies can support its realization. We conclude that packaging technologies of different flavors help on providing inputs and outputs for workflow steps in a machine-readable way, as well as on representing a workflow and all its artifacts in a self-describing and self-contained way.<\/jats:p>","DOI":"10.1162\/dint_a_00137","type":"journal-article","created":{"date-parts":[[2022,4,27]],"date-time":"2022-04-27T14:38:27Z","timestamp":1651070307000},"page":"372-385","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":1,"title":["Evaluation of Application Possibilities for Packaging Technologies in Canonical Workflows"],"prefix":"10.3724","volume":"4","author":[{"given":"Thomas","family":"Jejkal","sequence":"first","affiliation":[{"name":"Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sabrine","family":"Chelbi","sequence":"additional","affiliation":[{"name":"Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andreas","family":"Pfeil","sequence":"additional","affiliation":[{"name":"Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter","family":"Wittenburg","sequence":"additional","affiliation":[{"name":"Max Planck Computing and Data Facility, Gie\u00dfenbachstra\u00dfe 2, 85748 Garching, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"2026","published-online":{"date-parts":[[2022,4,1]]},"reference":[{"volume-title":"Canonical Workflow Framework for Research (CWFR)\u2014position paper\u2014 version 2, December 2020","author":"Hardisty","key":"2022042714321579700_ref1"},{"volume-title":"Frictionless data and data packages","author":"Barratt","key":"2022042714321579700_ref2"},{"volume-title":"The BagIt file packaging format (V1.0)","author":"Kunze","key":"2022042714321579700_ref3"},{"volume-title":"Kaggle: Your machine learning and data science community","author":"Kaggle Inc.","key":"2022042714321579700_ref4"},{"volume-title":"Research data repository interoperability WG final recommendations","author":"RDA Research Data Repository Interoperability WG","key":"2022042714321579700_ref5"},{"volume-title":"A lightweight approach to research object data packaging","author":"Carrag\u00e1in","key":"2022042714321579700_ref6"},{"volume-title":"Community group","author":"W3C schema.org","key":"2022042714321579700_ref7"},{"volume-title":"Bioschemas profiles","key":"2022042714321579700_ref8"},{"volume-title":"Use of the Hydra\/Sufia repository and Portland Common Data Model for research data description, organization, and access","author":"Tuyl","key":"2022042714321579700_ref9"},{"key":"2022042714321579700_ref10","doi-asserted-by":"crossref","DOI":"10.1145\/1255175.1255190","volume-title":"The OAI-ORE effort: Progress, challenges, synergies","author":"Lynch","year":"2007"},{"volume-title":"Digital repository","author":"The University of Hull","key":"2022042714321579700_ref11"},{"volume-title":"RDA Research Data Collections WG recommendations","author":"Weigel","key":"2022042714321579700_ref12"},{"key":"2022042714321579700_ref13","doi-asserted-by":"crossref","DOI":"10.1007\/978-94-011-0325-1","volume-title":"Text encoding initiative: Background and contexts","author":"Ide","year":"1995"},{"key":"2022042714321579700_ref14","first-page":"257","volume-title":"The PAGE (Page Analysis and Ground-Truth Elements) format framework","author":"Pletschacher","year":"2010"},{"issue":"6","key":"2022042714321579700_ref15","first-page":"71","article-title":"Web annotation as a first-class object","volume":"17","author":"Ciccarese","year":"2013","journal-title":"In: IEEE Internet Computing"},{"volume-title":"RDA collections API","author":"Weigel","key":"2022042714321579700_ref16"},{"volume-title":"RDA data type registries working group output","author":"Lannom","key":"2022042714321579700_ref17"}],"container-title":["Data Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/dint\/article-pdf\/4\/2\/372\/2012369\/dint_a_00137.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/dint\/article-pdf\/4\/2\/372\/2012369\/dint_a_00137.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,14]],"date-time":"2025-03-14T07:44:07Z","timestamp":1741938247000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.sciengine.com\/doi\/10.1162\/dint_a_00137"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022]]},"references-count":17,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,4,1]]}},"URL":"https:\/\/doi.org\/10.1162\/dint_a_00137","relation":{},"ISSN":["2641-435X"],"issn-type":[{"type":"electronic","value":"2641-435X"}],"subject":[],"published-other":{"date-parts":[[2022]]},"published":{"date-parts":[[2022]]}}}