{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,25]],"date-time":"2025-11-25T08:57:58Z","timestamp":1764061078550,"version":"3.38.0"},"reference-count":29,"publisher":"China Science Publishing & Media Ltd.","issue":"2","license":[{"start":{"date-parts":[[2022,3,7]],"date-time":"2022-03-07T00:00:00Z","timestamp":1646611200000},"content-version":"vor","delay-in-days":65,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>The FAIR principles have been accepted globally as guidelines for improving data-driven science and data management practices, yet the incentives for researchers to change their practices are presently weak. In addition, data-driven science has been slow to embrace workflow technology despite clear evidence of recurring practices. To overcome these challenges, the Canonical Workflow Frameworks for Research (CWFR) initiative suggests a large-scale introduction of self-documenting workflow scripts to automate recurring processes or fragments thereof. This standardised approach, with FAIR Digital Objects as anchors, will be a significant milestone in the transition to FAIR data without adding additional load onto the researchers who stand to benefit most from it. This paper describes the CWFR approach and the activities of the CWFR initiative over the course of the last year or so, highlights several projects that hold promise for the CWFR approaches, including Galaxy, Jupyter Notebook, and RO Crate, and concludes with an assessment of the state of the field and the challenges ahead.<\/jats:p>","DOI":"10.1162\/dint_a_00132","type":"journal-article","created":{"date-parts":[[2022,3,7]],"date-time":"2022-03-07T18:06:53Z","timestamp":1646676413000},"page":"286-305","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":4,"title":["Canonical Workflows to Make Data FAIR"],"prefix":"10.3724","volume":"4","author":[{"given":"Peter","family":"Wittenburg","sequence":"first","affiliation":[{"name":"FDO Forum, Gemeindweg 55, 47533 Kleve, Germany"}]},{"given":"Alex","family":"Hardisty","sequence":"additional","affiliation":[{"name":"Cardiff University, Cardiff, Wales CF10 3AT, UK"}]},{"given":"Yann","family":"Le Franc","sequence":"additional","affiliation":[{"name":"eScienceFactory, 75570 Paris Cedex 12, France"}]},{"given":"Amirpasha","family":"Mozaffari","sequence":"additional","affiliation":[{"name":"Forschungszentrum J\u00fclich GmbH, 52425 J\u00fclich, Germany"}]},{"given":"Limor","family":"Peer","sequence":"additional","affiliation":[{"name":"Yale University, New Haven, CT 06520, USA"}]},{"given":"Nikolay A.","family":"Skvortsov","sequence":"additional","affiliation":[{"name":"Russian Academy of Sciences, 121351 Moscow, Russia"}]},{"given":"Zhiming","family":"Zhao","sequence":"additional","affiliation":[{"name":"University of Amsterdam, PO-Box 94323, 1090 GH Amsterdam, The Netherlands"}]},{"given":"Alessandro","family":"Spinuso","sequence":"additional","affiliation":[{"name":"Royal Netherlands Meteorological Institute (KNMI), Utrechtseweg 297, 3731 GA De Bilt, The Netherlands"}]}],"member":"2026","published-online":{"date-parts":[[2022,4,1]]},"reference":[{"issue":"1","key":"2022042714423133700_ref1","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1162\/dint_a_00084","article-title":"Not ready for convergence in data\n                        infrastructures","volume":"3","author":"Jeffery","year":"2021","journal-title":"Data Intelligence"},{"key":"2022042714423133700_ref2","doi-asserted-by":"crossref","first-page":"160018","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR guiding principles for scientific data management\n                        and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Scientific Data"},{"issue":"1","key":"2022042714423133700_ref3","doi-asserted-by":"crossref","first-page":"49","DOI":"10.3233\/ISU-170824","article-title":"Cloudy, increasingly FAIR; revisiting the FAIR data guiding\n                        principles for the European Open Science Cloud","volume":"37","author":"Mons","year":"2017","journal-title":"Information Services & Use"},{"issue":"1-2","key":"2022042714423133700_ref4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1162\/dint_e_00023","article-title":"The FAIR principles: First generation implementation choices\n                        and challenges","volume":"2","author":"Mons","year":"2020","journal-title":"Data Intelligence"},{"issue":"1-2","key":"2022042714423133700_ref5","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1162\/dint_r_00024","article-title":"FAIR principles: Interpretations and implementation\n                        considerations","volume":"2","author":"Jacobsen","year":"2020","journal-title":"Data Intelligence"},{"key":"2022042714423133700_ref6","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1007\/s00799-005-0128-x","article-title":"A framework for distributed digital object\n                        services","volume":"6","author":"Kahn","year":"2006","journal-title":"International Journal on Digital\n                        Libraries"},{"issue":"2","key":"2022042714423133700_ref7","doi-asserted-by":"crossref","first-page":"357","DOI":"10.25300\/MISQ\/2013\/37.2.02","article-title":"The ambivalent ontology of digital artifacts","volume":"37","author":"Kallinikos","year":"2013","journal-title":"MIS Quarterly"},{"key":"2022042714423133700_ref8","doi-asserted-by":"crossref","DOI":"10.5749\/minnesota\/9780816698905.001.0001","volume-title":"On the existence of digital objects","author":"Hui","year":"2016"},{"issue":"2","key":"2022042714423133700_ref9","doi-asserted-by":"crossref","DOI":"10.3390\/publications8020021","article-title":"FAIR digital objects for science: From data pieces to\n                        actionable knowledge units","volume":"8","author":"De\n                                Smedt","year":"2020","journal-title":"Publications"},{"issue":"3-4","key":"2022042714423133700_ref10","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1017\/S1351324904003523","article-title":"UIMA: An architectural approach to unstructured information\n                        processing in the corporate research environment","volume":"10","author":"Ferrucci","year":"2004","journal-title":"Natural Language Engineering"},{"key":"2022042714423133700_ref11","doi-asserted-by":"crossref","first-page":"W537","DOI":"10.1093\/nar\/gky379","article-title":"The galaxy platform for accessible, reproducible and\n                        collaborative biomedical analyses: 2018 update","volume":"46","author":"Afgan","year":"2018","journal-title":"Nucleic Acids Research"},{"key":"2022042714423133700_ref12","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1186\/s12898-016-0103-y","article-title":"BioVeL: A virtual laboratory for data analysis and modelling\n                        in biodiversity science and ecology","volume":"16","author":"Hardisty","year":"2016","journal-title":"BMC\n                        Ecology"},{"key":"2022042714423133700_ref13","first-page":"87","article-title":"Jupyter Notebooks\u2014a publishing format for reproducible\n                        computational workflows.","volume-title":"Positioning and Power in Academic Publishing: Players, Agents and\n                        Agendas","author":"Kluyver","year":"2016"},{"volume-title":"Understanding reproducibility and replicability, reproducibility\n                        and replicability in science","year":"2019","author":"National Academies of Sciences, Engineering, and Medicine","key":"2022042714423133700_ref14"},{"volume-title":"FAIR digital object framework version\n                        1.02","year":"2019","author":"Research Data Alliance Group of European Experts (RDA-GEDE)","key":"2022042714423133700_ref15"},{"volume-title":"FAIR Digital Objects Forum","key":"2022042714423133700_ref16"},{"volume-title":"CWFR workshop","key":"2022042714423133700_ref17"},{"volume-title":"WebLichtWiki","key":"2022042714423133700_ref18"},{"volume-title":"Apache UIMA","key":"2022042714423133700_ref19"},{"volume-title":"OPC Unified Architecture","key":"2022042714423133700_ref20"},{"volume-title":"Research objects: Towards exchange and reuse of digital\n                        knowledge","author":"Bechhofer","key":"2022042714423133700_ref21"},{"volume-title":"Packaging research artefacts with ro-crate","year":"2021","author":"Soiland-Reyes","key":"2022042714423133700_ref22"},{"issue":"10","key":"2022042714423133700_ref23","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1093\/bioinformatics\/btt113","article-title":"EDAM: An ontology of bioinformatics operations, types of data\n                        and identifiers, topics and formats","volume":"29","author":"Ison","year":"2013","journal-title":"Bioinformatics"},{"volume-title":"InteroperAble descriptions of observable property terminology WG\n                        (I-ADOPT WG)","author":"Magagna","key":"2022042714423133700_ref24"},{"key":"2022042714423133700_ref25","first-page":"14","volume-title":"Assisting scientists with complex data analysis tasks through\n                        semantic workflows","author":"Gil","year":"2010"},{"volume-title":"SEMAF: A proposal for a flexible semantic mapping\n                    framework","author":"Broeder","key":"2022042714423133700_ref26"},{"volume-title":"Markdown","author":"Gruber","key":"2022042714423133700_ref27"},{"issue":"7","key":"2022042714423133700_ref28","doi-asserted-by":"crossref","first-page":"e1007007","DOI":"10.1371\/journal.pcbi.1007007","article-title":"Ten simple rules for writing and sharing computational\n                        analyses in Jupyter Notebooks","volume":"15","author":"Rule","year":"2019","journal-title":"PLOS Computational\n                        Biology"},{"issue":"1","key":"2022042714423133700_ref29","doi-asserted-by":"crossref","first-page":"28","DOI":"10.5334\/dsj-2020-028","article-title":"YARD: A tool for curating research outputs","volume":"19","author":"Peer","year":"2020","journal-title":"Data Science Journal"}],"container-title":["Data Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/dint\/article-pdf\/4\/2\/286\/2012355\/dint_a_00132.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/dint\/article-pdf\/4\/2\/286\/2012355\/dint_a_00132.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,14]],"date-time":"2025-03-14T07:43:34Z","timestamp":1741938214000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.sciengine.com\/doi\/10.1162\/dint_a_00132"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022]]},"references-count":29,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,4,1]]}},"URL":"https:\/\/doi.org\/10.1162\/dint_a_00132","relation":{},"ISSN":["2641-435X"],"issn-type":[{"type":"electronic","value":"2641-435X"}],"subject":[],"published-other":{"date-parts":[[2022]]},"published":{"date-parts":[[2022]]}}}