{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,19]],"date-time":"2026-05-19T23:15:26Z","timestamp":1779232526145,"version":"3.51.4"},"reference-count":47,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,3,12]],"date-time":"2016-03-12T00:00:00Z","timestamp":1457740800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,3,12]],"date-time":"2016-03-12T00:00:00Z","timestamp":1457740800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Reproducibility is one of the tenets of the scientific method. Scientific experiments often comprise complex data flows, selection of adequate parameters, and analysis and visualization of intermediate and end results. Breaking down the complexity of such experiments into the joint collaboration of small, repeatable, well defined tasks, each with well defined inputs, parameters, and outputs, offers the immediate benefit of identifying bottlenecks, pinpoint sections which could benefit from parallelization, among others. Workflows rest upon the notion of splitting complex work into the joint effort of several manageable tasks.<\/jats:p>\n                <jats:p>There are several engines that give users the ability to design and execute workflows. Each engine was created to address certain problems of a specific community, therefore each one has its advantages and shortcomings. Furthermore, not all features of all workflow engines are royalty-free \u2014an aspect that could potentially drive away members of the scientific community.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We have developed a set of tools that enables the scientific community to benefit from workflow interoperability. We developed a platform-free structured representation of parameters, inputs, outputs of command-line tools in so-called <jats:italic>Common Tool Descriptor<\/jats:italic> documents. We have also overcome the shortcomings and combined the features of two royalty-free workflow engines with a substantial user community: the <jats:italic>Konstanz Information Miner<\/jats:italic>, an engine which we see as a formidable workflow editor, and the <jats:italic>Grid and User Support Environment<\/jats:italic>, a web-based framework able to interact with several high-performance computing resources. We have thus created a free and highly accessible way to design workflows on a desktop computer and execute them on high-performance computing resources.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>Our work will not only reduce time spent on designing scientific workflows, but also make executing workflows on remote high-performance computing resources more accessible to technically inexperienced users. We strongly believe that our efforts not only decrease the turnaround time to obtain scientific results but also have a positive impact on reproducibility, thus elevating the quality of obtained scientific results.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-016-0978-9","type":"journal-article","created":{"date-parts":[[2016,3,12]],"date-time":"2016-03-12T01:29:33Z","timestamp":1457746173000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["From the desktop to the grid: scalable bioinformatics via workflow conversion"],"prefix":"10.1186","volume":"17","author":[{"given":"Luis","family":"de la Garza","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Johannes","family":"Veit","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andras","family":"Szolek","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marc","family":"R\u00f6ttig","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stephan","family":"Aiche","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sandra","family":"Gesing","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Knut","family":"Reinert","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Oliver","family":"Kohlbacher","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2016,3,12]]},"reference":[{"issue":"5911","key":"978_CR1","first-page":"774","volume":"302","author":"W Gratzer","year":"2013","unstructured":"Gratzer W. Trouble at the lab. Economist. 2013; 302(5911):774\u20135. doi:http:\/\/dx.doi.org\/10.1038\/302774a0.","journal-title":"Economist"},{"issue":"6168","key":"978_CR2","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1126\/science.1250475","volume":"343","author":"M McNutt","year":"2014","unstructured":"McNutt M. Reproducibility. Science (New York, N.Y.) 2014; 343(6168):229. doi:http:\/\/dx.doi.org\/10.1126\/science.1250475.","journal-title":"Science (New York, N.Y.)"},{"issue":"12","key":"978_CR3","doi-asserted-by":"publisher","first-page":"1896","DOI":"10.1002\/jcp.24662","volume":"229","author":"CS Greene","year":"2014","unstructured":"Greene CS, Tan J, Ung M, Moore JH, Cheng C. Big data bioinformatics. J Cell Physiol. 2014; 229(12):1896\u2013900. doi:http:\/\/dx.doi.org\/10.1002\/jcp.24662.","journal-title":"J Cell Physiol"},{"key":"978_CR4","doi-asserted-by":"crossref","unstructured":"Berthold MR, Cebron N, Dill F, Gabriel TR, Kotter T, Meinl T, Ohl P, Sieb C, Thiel K, Wiswedel B. Knime. Web. 2007:1\u20138. doi:http:\/\/dx.doi.org\/10.1007\/978-3-540-78246-9.","DOI":"10.1007\/978-3-540-78246-9"},{"issue":"4","key":"978_CR5","doi-asserted-by":"publisher","first-page":"601","DOI":"10.1007\/s10723-012-9240-5","volume":"10","author":"P Kacsuk","year":"2012","unstructured":"Kacsuk P, Farkas Z, Kozlovszky M, Hermann G, Balasko A, Karoczkai K, Marton I. WS-PGRADE\/gUSE generic DCI gateway framework for a large variety of user communities. J Grid Comput. 2012; 10(4):601\u201330. doi:http:\/\/dx.doi.org\/10.1007\/s10723-012-9240-5.","journal-title":"J Grid Comput"},{"key":"978_CR6","doi-asserted-by":"crossref","unstructured":"Blankenberg D, Kuster GV, Coraor N, Ananda G, Lazarus R, Mangan M, Nekrutenko A, Taylor J. Galaxy: A web-based genome analysis tool for experimentalists. 2010. http:\/\/arxiv.org\/abs\/NIHMS150003 doi:http:\/\/dx.doi.org\/10.1002\/0471142727.mb1910s89","DOI":"10.1002\/0471142727.mb1910s89"},{"key":"978_CR7","doi-asserted-by":"crossref","unstructured":"Missier P, Soiland-Reyes S, Owen S, Tan W, Nenadic A, Dunlop I, Williams A, Oinn T, Goble C. Taverna, reloaded. In: Lecture Notes in Computer Science (including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6187 LNCS: 2010. p. 471\u201381, doi:http:\/\/dx.doi.org\/10.1007\/978-3-642-13818-8_33.","DOI":"10.1007\/978-3-642-13818-8_33"},{"key":"978_CR8","doi-asserted-by":"crossref","unstructured":"Abouelhoda M, Issa S, Ghanem M. Tavaxy: Integrating Taverna and Galaxy workflows with cloud computing support. 2012. doi:http:\/\/dx.doi.org\/10.1186\/1471-2105-13-77.","DOI":"10.1186\/1471-2105-13-77"},{"key":"978_CR9","unstructured":"Galaxy Tool XML File. https:\/\/wiki.galaxyproject.org\/Admin\/Tools\/ToolConfigSyntax. Accessed 28 July 2015."},{"key":"978_CR10","unstructured":"Galaxy Tool Shed. https:\/\/toolshed.g2.bx.psu.edu\/. Accessed 07 July 2015."},{"key":"978_CR11","doi-asserted-by":"crossref","unstructured":"Moreau L, Clifford B, Freire J, Futrelle J, Gil Y, Groth P, Kwasnikowska N, Miles S, Missier P, Myers J, Plale B, Simmhan Y, Stephan E, Den Bussche JV. The Open Provenance Model core specification (v1.1). In: Future Generation Computer Systems, vol. 27: 2011. p. 743\u201356, doi:http:\/\/dx.doi.org\/10.1016\/j.future.2010.07.005.","DOI":"10.1016\/j.future.2010.07.005"},{"key":"978_CR12","doi-asserted-by":"crossref","unstructured":"Goble CA, Bhagat J, Aleksejevs S, Cruickshank D, Michaelides D, Newman D, Borkum M, Bechhofer S, Roos M, Li P, de Roure D. myExperiment: A repository and social network for the sharing of bioinformatics workflows. Nucleic Acids Res. 2010;38(SUPPL. 2). doi:http:\/\/dx.doi.org\/10.1093\/nar\/gkq429.","DOI":"10.1093\/nar\/gkq429"},{"key":"978_CR13","unstructured":"KNIME | Open for Innovation. http:\/\/www.knime.org\/. Accessed 29 June 2015."},{"key":"978_CR14","unstructured":"KNIME | New Node Wizard. https:\/\/tech.knime.org\/new-node-wizard. Accessed 06 July 2015."},{"key":"978_CR15","unstructured":"KNIME | Community Contributions. https:\/\/tech.knime.org\/community. Accessed 07 July 2015."},{"key":"978_CR16","unstructured":"KNIME | KNIME Cluster Execution. https:\/\/www.knime.org\/cluster-execution. Accessed 06 July 2015."},{"key":"978_CR17","unstructured":"KNIME | KNIME Server - The Heart of a Collaborative KNIME Setup. https:\/\/www.knime.org\/knime-server. Accessed 06 July 2015."},{"key":"978_CR18","unstructured":"Web Service Definition Language (WSDL). http:\/\/www.w3.org\/TR\/wsdl. Accessed 06 July 2015."},{"key":"978_CR19","unstructured":"DCI Administration Manual, Version 3.7.1. http:\/\/sourceforge.net\/projects\/guse\/files\/3.7.1\/Documentation\/DCI_BRIDGE_MANUAL_v3.7.1.pdf\/download."},{"key":"978_CR20","unstructured":"Anjomshoaa A, Brisard F, Drescher M, Fellows D, Ly A, McGough S, Pulsipher D, Savva A. Job Submission Description Language (JSDL) Specification, Version 1.0. 2005:1\u201372. Open Grid Forum."},{"key":"978_CR21","first-page":"149","volume":"10","author":"M Romberg","year":"2002","unstructured":"Romberg M. The UNICORE Grid Infrastructure. Spec Issue Grid Comput Scientifc Program J. 2002; 10:149\u201357.","journal-title":"Spec Issue Grid Comput Scientifc Program J"},{"key":"978_CR22","unstructured":"IBM Platform Computing Products: Workload Management Platform - Platform LSF. IBM Corporation. 2012. http:\/\/www-03.ibm.com\/systems\/platformcomputing\/products\/lsf\/."},{"key":"978_CR23","unstructured":"HPC Products - Adaptive Computing. http:\/\/www.adaptivecomputing.com\/products\/hpc-products\/. Accessed 06 July 2015."},{"key":"978_CR24","unstructured":"Java SE Desktop Technologies - Java Web Start Technology. http:\/\/www.oracle.com\/technetwork\/java\/javase\/javawebstart\/index.html. Accessed 03 July 2015."},{"key":"978_CR25","doi-asserted-by":"publisher","first-page":"46","DOI":"10.1016\/j.future.2014.02.016","volume":"37","author":"G Terstyanszky","year":"2014","unstructured":"Terstyanszky G, Kukla T, Kiss T, Kacsuk P, Balasko A, Farkas Z. Enabling scientific workflow sharing through coarse-grained interoperability. Futur Gener Comput Syst. 2014; 37:46\u201359. doi:http:\/\/dx.doi.org\/10.1016\/j.future.2014.02.016.","journal-title":"Futur Gener Comput Syst"},{"key":"978_CR26","doi-asserted-by":"crossref","unstructured":"van der Aalst WMP. The application of petri nets to workflow management. 1998. doi:http:\/\/dx.doi.org\/10.1142\/S0218126698000043.","DOI":"10.1142\/S0218126698000043"},{"key":"978_CR27","unstructured":"Peterson JL, Vol. 24. Petri Net Theory and the Modeling of Systems; 1981, p. 290."},{"issue":"4","key":"978_CR28","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1016\/j.is.2004.02.002","volume":"30","author":"Aalst van der","year":"2005","unstructured":"van der Aalst WMP, ter Hofstede AHM. YAWL: yet another workflow language. Inf Syst. 2005; 30(4):245\u201375. doi:http:\/\/dx.doi.org\/10.1016\/j.is.2004.02.002.","journal-title":"Inf Syst"},{"key":"978_CR29","doi-asserted-by":"crossref","unstructured":"Plankensteiner K, Montagnat J, Prodan R. IWIR: A Language Enabling Portability Across Grid Workflow Systems. In: SIGMOD Rec: 2011. p. 97\u2013106, doi:http:\/\/dx.doi.org\/10.1145\/2110497.2110509. http:\/\/doi.acm.org\/10.1145\/2110497.2110509.","DOI":"10.1145\/2110497.2110509"},{"key":"978_CR30","unstructured":"Common Workflow Language. http:\/\/www.oracle.com\/technetwork\/java\/javase\/javawebstart\/index.html. Accessed 03 July 2015."},{"key":"978_CR31","doi-asserted-by":"crossref","unstructured":"Salimifard K, Wright M. Petri net-based modelling of workflow systems: An overview. 2001. doi:http:\/\/dx.doi.org\/10.1016\/S0377-2217(00)00292-7.","DOI":"10.1016\/S0377-2217(00)00292-7"},{"issue":"1","key":"978_CR32","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1023\/A:1024000426962","volume":"1","author":"E Deelman","year":"2003","unstructured":"Deelman E, Blythe J, Gil Y, Kesselman C, Mehta G, Vahi K, Blackburn K, Lazzarini A, Arbree A, Cavanaugh R, Koranda S. Mapping abstract complex workflows onto grid environments. J Grid Comput. 2003; 1(1):25\u201339. doi:http:\/\/dx.doi.org\/10.1023\/A:1024000426962.","journal-title":"J Grid Comput"},{"key":"978_CR33","doi-asserted-by":"crossref","unstructured":"Yu J, Buyya R. A taxonomy of scientific workflow systems for grid computing. 2005. doi:http:\/\/dx.doi.org\/10.1145\/1084805.1084814.","DOI":"10.1145\/1084805.1084814"},{"key":"978_CR34","unstructured":"KNIME | Flow Variables. https:\/\/tech.knime.org\/wiki\/flow-variables. Accessed 26 Oct 2015."},{"issue":"1","key":"978_CR35","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1186\/1471-2105-9-11","volume":"9","author":"A D\u00f6ring","year":"2008","unstructured":"D\u00f6ring A, Weese D, Rausch T, Reinert K. SeqAn an efficient, generic C++ library for sequence analysis. BMC Bioinforma. 2008; 9(1):11. doi:http:\/\/dx.doi.org\/10.1186\/1471-2105-9-11.","journal-title":"BMC Bioinforma"},{"key":"978_CR36","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1186\/1471-2105-9-163","volume":"9","author":"M Sturm","year":"2008","unstructured":"Sturm M, Bertsch A, Gr\u00f6pl C, Hildebrandt A, Hussong R, Lange E, Pfeifer N, Schulz-Trieglaff O, Zerck A, Reinert K, Kohlbacher O. Openms - an open-source software framework for mass spectrometry. BMC Bioinforma. 2008; 9:163.","journal-title":"BMC Bioinforma"},{"key":"978_CR37","doi-asserted-by":"publisher","first-page":"531","DOI":"10.1186\/1471-2105-11-531","volume":"11","author":"A Hildebrandt","year":"2010","unstructured":"Hildebrandt A, Dehof AK, Rurainski A, Bertsch A, Schumann M, Toussaint NC, Moll A, St\u00f6ckel D, Nickels S, Mueller SC, Lenhof HP, Kohlbacher O. BALL\u2013biochemical algorithms library 1.3. BMC Bioinforma. 2010; 11:531. doi:http:\/\/dx.doi.org\/10.1186\/1471-2105-11-531.","journal-title":"BMC Bioinforma"},{"key":"978_CR38","unstructured":"Steinberg D, Budinsky F, Paternostro M, Merks E. EMF: Eclipse Modeling Framework; 2008, p. 744."},{"issue":"4","key":"978_CR39","doi-asserted-by":"publisher","first-page":"1017","DOI":"10.1007\/s00216-007-1486-6","volume":"389","author":"M Bantscheff","year":"2007","unstructured":"Bantscheff M, Schirle M, Sweetman G, Rick J, Kuster B. Quantitative mass spectrometry in proteomics: a critical review. Anal Bioanal Chem. 2007; 389(4):1017\u20131031.","journal-title":"Anal Bioanal Chem"},{"issue":"4","key":"978_CR40","doi-asserted-by":"publisher","first-page":"1628","DOI":"10.1021\/pr300992u","volume":"12","author":"H Weisser","year":"2013","unstructured":"Weisser H, Nahnsen S, Grosman J, Nilse L, Quandt A, Brauer H, Sturm M, Kenar E, Kohlbacher O, Aebersold R, Malmstrom L. An automated pipeline for high-throughput label-free quantitative proteomics. J Proteome Res. 2013; 12(4):1628\u20131644.","journal-title":"J Proteome Res"},{"issue":"2","key":"978_CR41","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1093\/bioinformatics\/btl299","volume":"23","author":"O Kohlbacher","year":"2007","unstructured":"Kohlbacher O, Reinert K, Gr\u00f6pl C, Lange E, Pfeifer N, Schulz-Trieglaff O, Sturm M. Topp\u2013the openms proteomics pipeline. Bioinformatics. 2007; 23(2):191\u20137.","journal-title":"Bioinformatics"},{"issue":"7","key":"978_CR42","doi-asserted-by":"publisher","first-page":"3914","DOI":"10.1021\/pr300187f","volume":"11","author":"J Junker","year":"2012","unstructured":"Junker J, Bielow C, Bertsch A, Sturm M, Reinert K, Kohlbacher O. Toppas: a graphical workflow editor for the analysis of high-throughput proteomics data. J Proteome Res. 2012; 11(7):3914\u2013920.","journal-title":"J Proteome Res"},{"key":"978_CR43","unstructured":"OpenMS | An Open-source Framework for Mass Spectrometry and TOPP \u2013 The OpenMS Proteomics Pipeline. http:\/\/open-ms.sourceforge.net\/. Accessed 26 June 2015."},{"issue":"8","key":"978_CR44","doi-asserted-by":"publisher","first-page":"3332","DOI":"10.1021\/pr2002879","volume":"10","author":"S Nahnsen","year":"2011","unstructured":"Nahnsen S, Bertsch A, Rahnenf\u00fchrer J, Nordheim A, Kohlbacher O. Probabilistic consensus scoring improves tandem mass spectrometry peptide identification. J Proteome Res. 2011; 10(8):3332\u2013343.","journal-title":"J Proteome Res"},{"issue":"5","key":"978_CR45","doi-asserted-by":"publisher","first-page":"958","DOI":"10.1021\/pr0499491","volume":"3","author":"LY Geer","year":"2004","unstructured":"Geer LY, Markey SP, Kowalak JA, Wagner L, Xu M, Maynard DM, Yang X, Shi W, Bryant SH. Open mass spectrometry search algorithm. J Proteome Res. 2004; 3(5):958\u201364.","journal-title":"J Proteome Res"},{"issue":"9","key":"978_CR46","doi-asserted-by":"publisher","first-page":"1466","DOI":"10.1093\/bioinformatics\/bth092","volume":"20","author":"R Craig","year":"2004","unstructured":"Craig R, Beavis RC. Tandem: matching proteins with tandem mass spectra. Bioinformatics. 2004; 20(9):1466\u20131467.","journal-title":"Bioinformatics"},{"key":"978_CR47","unstructured":"gUSE in a Nutshell. http:\/\/sourceforge.net\/projects\/guse\/files\/gUSE_in_a_Nutshell.pdf\/download."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-0978-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-0978-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-0978-9","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-0978-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T18:14:28Z","timestamp":1706811268000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-0978-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,3,12]]},"references-count":47,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["978"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-0978-9","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,3,12]]},"assertion":[{"value":"12 November 2015","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 March 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 March 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"127"}}