{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T14:45:25Z","timestamp":1740149125542,"version":"3.37.3"},"reference-count":19,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2023,7,10]],"date-time":"2023-07-10T00:00:00Z","timestamp":1688947200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,7,10]],"date-time":"2023-07-10T00:00:00Z","timestamp":1688947200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["2019609","2019609"],"award-info":[{"award-number":["2019609","2019609"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000104","name":"National Aeronautics and Space Administration","doi-asserted-by":"publisher","award":["80NSSC21M0028","80NSSC21M0028","80NSSC21M0028"],"award-info":[{"award-number":["80NSSC21M0028","80NSSC21M0028","80NSSC21M0028"]}],"id":[{"id":"10.13039\/100000104","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Earth Sci Inform"],"published-print":{"date-parts":[[2023,9]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Computational workflows are widely used in data analysis, enabling automated tracking of steps and storage of provenance information, leading to innovation and decision-making in the scientific community. However, the growing popularity of workflows has raised concerns about reproducibility and reusability which can hinder collaboration between institutions and users. In order to address these concerns, it is important to standardize workflows or provide tools that offer a framework for describing workflows and enabling computational reusability. One such set of standards that has recently emerged is the Common Workflow Language (CWL), which offers a robust and flexible framework for data analysis tools and workflows. To promote portability, reproducibility, and interoperability of AI\/ML workflows, we developed <jats:bold><jats:italic>geoweaver_cwl<\/jats:italic><\/jats:bold>, a Python package that automatically describes AI\/ML workflows from a workflow management system (WfMS) named Geoweaver into CWL. In this paper, we test our Python package on multiple use cases from different domains. Our objective is to demonstrate and verify the utility of this package. We make all the code and dataset open online and briefly describe the experimental implementation of the package in this paper, confirming that <jats:bold><jats:italic>geoweaver_cwl<\/jats:italic><\/jats:bold> can lead to a well-versed AI process while disclosing opportunities for further extensions. The <jats:bold><jats:italic>geoweaver_cwl<\/jats:italic><\/jats:bold> package is publicly released online at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/pypi.org\/project\/geoweaver-cwl\/0.0.1\/\">https:\/\/pypi.org\/project\/geoweaver-cwl\/0.0.1\/<\/jats:ext-link> and exemplar results are accessible at: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/amrutakale08\/geoweaver_cwl-usecases\">https:\/\/github.com\/amrutakale08\/geoweaver_cwl-usecases<\/jats:ext-link>.<\/jats:p>","DOI":"10.1007\/s12145-023-01045-0","type":"journal-article","created":{"date-parts":[[2023,7,10]],"date-time":"2023-07-10T09:02:22Z","timestamp":1688979742000},"page":"2955-2961","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Utility of the Python package Geoweaver_cwl for improving workflow reusability: an illustration with multidisciplinary use cases"],"prefix":"10.1007","volume":"16","author":[{"given":"Amruta","family":"Kale","sequence":"first","affiliation":[]},{"given":"Ziheng","family":"Sun","sequence":"additional","affiliation":[]},{"given":"Xiaogang","family":"Ma","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,7,10]]},"reference":[{"key":"1045_CR1","doi-asserted-by":"crossref","unstructured":"Akram A, Meredith D, Allan R (2006) May. Evaluation of BPEL to scientific workflows. In Sixth IEEE International Symposium on Cluster Computing and the Grid,Vol.\u00a01, pp.\u00a0269\u2013274","DOI":"10.1109\/CCGRID.2006.44"},{"key":"1045_CR2","unstructured":"Altintas I, Berkley C, Jaeger E, Jones M, Ludascher B, Mock S (2004) June. Kepler: an extensible system for design and execution of scientific workflows. In: Proceedings of the 16th International Conference on Scientific and Statistical Database Management. Santorini, Greece. pp.\u00a0423\u2013424"},{"key":"1045_CR5","unstructured":"Apache Software Foundation (2012) Apache Oozie. Available on: https:\/\/oozie.apache.org. Accessed on: January 9, 2023"},{"key":"1045_CR4","unstructured":"Apache Software Foundation (2014) Apache Taverna. Available on: https:\/\/taverna.apache.org. Accessed on: January 9, 2023"},{"key":"1045_CR3","unstructured":"Apache Software Foundation (2015) Apache Airflow. Available on: https:\/\/airflow.apache.org. Accessed on: January 9, 2023"},{"key":"1045_CR6","doi-asserted-by":"crossref","unstructured":"Callahan SP, Freire J, Santos E, Scheidegger CE, Silva CT, Vo HT (2006) June. VisTrails: visualization meets data management. In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pp.\u00a0745\u2013747","DOI":"10.1145\/1142473.1142574"},{"issue":"1","key":"1045_CR7","doi-asserted-by":"publisher","first-page":"124","DOI":"10.1016\/j.csi.2011.06.002","volume":"34","author":"M Chinosi","year":"2012","unstructured":"Chinosi M, Trombetta A (2012) BPMN: an introduction to the standard. Comput Stand Interfaces 34(1):124\u2013134","journal-title":"Comput Stand Interfaces"},{"key":"1045_CR8","unstructured":"Crusoe MR, Abeln S, Iosup A, Amstutz P, Chilton J, Tijanic N, M\u00e9nager H, Soiland-Reyes S, Goble CA (2021) Methods included: Standardizing computational reuse and portability with the Common Workflow Language. CoRR abs\/2105.07028. Available at: https:\/\/arxiv.org\/abs\/2105.07028"},{"issue":"12","key":"1045_CR9","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1109\/MC.2007.421","volume":"40","author":"Y Gil","year":"2007","unstructured":"Gil Y, Deelman E, Ellisman M, Fahringer T, Fox G, Gannon D, Goble C, Livny M, Moreau L, Myers J (2007) Examining the challenges of scientific workflows. Computer 40(12):24\u201332","journal-title":"Computer"},{"issue":"8","key":"1045_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/gb-2010-11-8-r86","volume":"11","author":"J Goecks","year":"2010","unstructured":"Goecks J, Nekrutenko A, Taylor J (2010) Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol 11(8):1\u201313","journal-title":"Genome Biol"},{"key":"1045_CR12","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1016\/B978-0-323-91737-7.00015-3","volume-title":"Artificial Intelligence in Earth Science","author":"A Kale","year":"2023","unstructured":"Kale A, Ma X (2023) Provenance in Earth AI. In: Sun Z, Cristea N, Rivas P (eds) Artificial Intelligence in Earth Science. Elsevier, Amsterdam, pp 357\u2013378"},{"issue":"1","key":"1045_CR11","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1162\/dint_a_00119","volume":"5","author":"A Kale","year":"2023","unstructured":"Kale A, Nguyen T, Harris Jr F, Li C, Zhang J, Ma X (2023a) Provenance documentation to enable explainable and trustworthy AI: a literature review. Data Intell 5(1):139\u2013162","journal-title":"Data Intell"},{"key":"1045_CR13","doi-asserted-by":"crossref","unstructured":"Kale A, Sun Z, Fan C, Ma X (2023b) Geoweaver_cwl: transforming geoweaver AI workflows to a common workflow language to extend interoperability. Applied Computing and Geosciences, p 100126","DOI":"10.1016\/j.acags.2023.100126"},{"key":"1045_CR14","doi-asserted-by":"crossref","unstructured":"Kaushik G, Ivkovic S, Simonovic J, Tijanic N, Davis-Dusenbery B, Kural D (2017) Rabix: an open-source workflow executor supporting recomputability and interoperability of workflow descriptions. In Pacific Symposium on Biocomputing 2017 (pp.\u00a0154\u2013165)","DOI":"10.1142\/9789813207813_0016"},{"key":"1045_CR15","doi-asserted-by":"crossref","unstructured":"Missier P, Belhajjame K, Cheney J (2013) March. The W3C PROV family of specifications for modeling provenance metadata. In Proceedings of the 16th International Conference on Extending Database Technology, pp.\u00a0773\u2013776","DOI":"10.1145\/2452376.2452478"},{"key":"1045_CR16","unstructured":"Organization for the Advancement of Structured Information Standard (OASIS) (1998) Available on: https:\/\/www.oasis-open.org\/standards. Accessed on: January 9, 2023"},{"issue":"6060","key":"1045_CR17","doi-asserted-by":"publisher","first-page":"1226","DOI":"10.1126\/science.1213847","volume":"334","author":"RD Peng","year":"2011","unstructured":"Peng RD (2011) Reproducible research in computational science. Science 334(6060):1226\u20131227","journal-title":"Science"},{"key":"1045_CR18","doi-asserted-by":"crossref","unstructured":"Sun Z, Di L, Burgess A, Tullis JA, Magill AB (2020) Geoweaver: Advanced cyberinfrastructure for managing hybrid geoscientific AI workflows. ISPRS International Journal of Geo-Information, 9(2), pp.119","DOI":"10.3390\/ijgi9020119"},{"key":"1045_CR19","unstructured":"Workflow Management Coalition (WfMC) (W.P.D.I.X., 2001) Process Definition Language (XPDL), WfMC Standards. WFMC-TC-1025"}],"container-title":["Earth Science Informatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s12145-023-01045-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s12145-023-01045-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s12145-023-01045-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,24]],"date-time":"2023-08-24T15:16:16Z","timestamp":1692890176000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s12145-023-01045-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,10]]},"references-count":19,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,9]]}},"alternative-id":["1045"],"URL":"https:\/\/doi.org\/10.1007\/s12145-023-01045-0","relation":{},"ISSN":["1865-0473","1865-0481"],"issn-type":[{"type":"print","value":"1865-0473"},{"type":"electronic","value":"1865-0481"}],"subject":[],"published":{"date-parts":[[2023,7,10]]},"assertion":[{"value":"12 May 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 July 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 July 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}]}}