{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T04:39:26Z","timestamp":1776227966115,"version":"3.50.1"},"reference-count":51,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2018,7,28]],"date-time":"2018-07-28T00:00:00Z","timestamp":1532736000000},"content-version":"vor","delay-in-days":4,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001602","name":"Science Foundation Ireland","doi-asserted-by":"publisher","award":["13\/RC\/2094"],"award-info":[{"award-number":["13\/RC\/2094"]}],"id":[{"id":"10.13039\/501100001602","id-type":"DOI","asserted-by":"publisher"}]},{"name":"ELIXIR"},{"name":"Danish Research Council"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,2,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Numerous software utilities operating on mass spectrometry (MS) data are described in the literature and provide specific operations as building blocks for the assembly of on-purpose workflows. Working out which tools and combinations are applicable or optimal in practice is often hard. Thus researchers face difficulties in selecting practical and effective data analysis pipelines for a specific experimental design.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We provide a toolkit to support researchers in identifying, comparing and benchmarking multiple workflows from individual bioinformatics tools. Automated workflow composition is enabled by the tools\u2019 semantic annotation in terms of the EDAM ontology. To demonstrate the practical use of our framework, we created and evaluated a number of logically and semantically equivalent workflows for four use cases representing frequent tasks in MS-based proteomics. Indeed we found that the results computed by the workflows could vary considerably, emphasizing the benefits of a framework that facilitates their systematic exploration.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The project files and workflows are available from https:\/\/github.com\/bio-tools\/biotoolsCompose\/tree\/master\/Automatic-Workflow-Composition.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty646","type":"journal-article","created":{"date-parts":[[2018,7,26]],"date-time":"2018-07-26T19:32:08Z","timestamp":1532633528000},"page":"656-664","source":"Crossref","is-referenced-by-count":40,"title":["Automated workflow composition in mass spectrometry-based proteomics"],"prefix":"10.1093","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5865-8994","authenticated-orcid":false,"given":"Magnus","family":"Palmblad","sequence":"first","affiliation":[{"name":"Center for Proteomics and Metabolomics, Leiden University Medical Center, RC Leiden, The Netherlands"}]},{"given":"Anna-Lena","family":"Lamprecht","sequence":"additional","affiliation":[{"name":"Department of Information and Computing Sciences, Utrecht University, CC Utrecht, The Netherlands"}]},{"given":"Jon","family":"Ison","sequence":"additional","affiliation":[{"name":"National Life Science Supercomputing Center, Technical University of Denmark, Kongens Lyngby, Denmark"}]},{"given":"Veit","family":"Schw\u00e4mmle","sequence":"additional","affiliation":[{"name":"Department of Biochemistry and Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Odense, Denmark"}]}],"member":"286","published-online":{"date-parts":[[2018,7,24]]},"reference":[{"key":"2023051511005352700_bty646-B1","doi-asserted-by":"crossref","first-page":"W3","DOI":"10.1093\/nar\/gkw343","article-title":"The galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update","volume":"44","author":"Afgan","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023051511005352700_bty646-B2","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1109\/SCC.2004.1357986","article-title":"Constraint driven web service composition in meteor-s","author":"Aggarwal","year":"2004","journal-title":"IEEE International Conference onServices Computing, 2004. (SCC 2004). Proceedings. 2004"},{"key":"2023051511005352700_bty646-B3","author":"Amstutz","year":"2016"},{"key":"2023051511005352700_bty646-B4","doi-asserted-by":"crossref","first-page":"70.","DOI":"10.1186\/1471-2105-12-70","article-title":"compomics-utilities: an open-source java library for computational proteomics","volume":"12","author":"Barsnes","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023051511005352700_bty646-B5","doi-asserted-by":"crossref","first-page":"26.","DOI":"10.1145\/1656274.1656280","article-title":"KNIME - the konstanz information miner","volume":"11","author":"Berthold","year":"2009","journal-title":"ACM SIGKDD Explorations Newsletter"},{"key":"2023051511005352700_bty646-B6","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1007\/s10009-013-0287-9","article-title":"Algorithmic program synthesis: introduction","volume":"15","author":"Bodik","year":"2013","journal-title":"Int. J. Softw. Tools Technol. Transf"},{"key":"2023051511005352700_bty646-B7","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1038\/nbt.3134","article-title":"Multi-omic data analysis using galaxy","volume":"33","author":"Boekel","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023051511005352700_bty646-B8","doi-asserted-by":"crossref","first-page":"360","DOI":"10.1074\/mcp.M113.032813","article-title":"Proteome informatics research group (iPRG)_2012: a study on detecting modified peptides in a complex mixture","volume":"13","author":"Chalkley","year":"2014","journal-title":"Mol. Cell. Proteomics"},{"key":"2023051511005352700_bty646-B9","first-page":"319","volume-title":"Lecture Notes in Computer Science","author":"Chen","year":"2003"},{"key":"2023051511005352700_bty646-B10","doi-asserted-by":"crossref","first-page":"619","DOI":"10.1021\/pr0504236","article-title":"InSilicoSpectro: an open-source proteomics library","volume":"5","author":"Colinge","year":"2006","journal-title":"J. Proteome Res"},{"key":"2023051511005352700_bty646-B11","doi-asserted-by":"crossref","first-page":"1367","DOI":"10.1038\/nbt.1511","article-title":"MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification","volume":"26","author":"Cox","year":"2008","journal-title":"Nat. Biotechnol"},{"key":"2023051511005352700_bty646-B12","doi-asserted-by":"crossref","first-page":"837","DOI":"10.1016\/j.jbi.2008.02.005","article-title":"Semi-automatic web service composition for the life sciences using the BioMoby semantic web framework","volume":"41","author":"DiBernardo","year":"2008","journal-title":"J. Biomed. Informatics"},{"key":"2023051511005352700_bty646-B13","doi-asserted-by":"crossref","first-page":"e0157989.","DOI":"10.1371\/journal.pone.0157989","article-title":"A survey of bioinformatics database and software usage through mining the literature","volume":"11","author":"Duck","year":"2016","journal-title":"PLoS One"},{"key":"2023051511005352700_bty646-B14","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1109\/MIS.2004.1265882","article-title":"Artificial intelligence and grids: workflow planning and beyond","volume":"19","author":"Gil","year":"2004","journal-title":"IEEE Intell. Syst"},{"key":"2023051511005352700_bty646-B15","first-page":"1767","volume-title":"Proceedings of the 19th National Conference on Innovative Applications of Artificial Intelligence","author":"Gil","year":"2007"},{"key":"2023051511005352700_bty646-B16","doi-asserted-by":"crossref","first-page":"i451.","DOI":"10.1093\/bioinformatics\/bts389","article-title":"Enrichnet: network-based gene set enrichment analysis","volume":"28","author":"Glaab","year":"2012","journal-title":"Bioinformatics"},{"key":"2023051511005352700_bty646-B17","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-015-0714-x","article-title":"Scientific workflow optimization for improved peptide and protein identification","volume":"16","author":"Holl","year":"2015","journal-title":"BMC Bioinformatics"},{"key":"2023051511005352700_bty646-B18","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1093\/bioinformatics\/btt113","article-title":"EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats","volume":"29","author":"Ison","year":"2013","journal-title":"Bioinformatics"},{"key":"2023051511005352700_bty646-B19","doi-asserted-by":"crossref","first-page":"D38","DOI":"10.1093\/nar\/gkv1116","article-title":"Tools and data services registry: a community effort to document bioinformatics resources","volume":"44","author":"Ison","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023051511005352700_bty646-B20","doi-asserted-by":"crossref","first-page":"5898","DOI":"10.1021\/pr500812t","article-title":"Flexible and accessible workflows for improved proteogenomic analysis using the galaxy framework","volume":"13","author":"Jagtap","year":"2014","journal-title":"J. Proteome Res"},{"key":"2023051511005352700_bty646-B21","doi-asserted-by":"crossref","first-page":"5277.","DOI":"10.1038\/ncomms6277","article-title":"MS-GF+ makes progress towards a universal database search tool for proteomics","volume":"5","author":"Kim","year":"2014","journal-title":"Nat. Commun"},{"key":"2023051511005352700_bty646-B22","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-45389-2","volume-title":"User-Level Workflow Design. A Bioinformatics Perspective","author":"Lamprecht","year":"2013"},{"key":"2023051511005352700_bty646-B23","doi-asserted-by":"crossref","first-page":"S8.","DOI":"10.1186\/1471-2105-10-S10-S8","article-title":"Bio-jETI: a framework for semantics-based service composition","volume":"10","author":"Lamprecht","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023051511005352700_bty646-B24","volume-title":"2010 Seventh International Conference on the Quality of Information and Communications Technology","author":"Lamprecht","year":"2010"},{"key":"2023051511005352700_bty646-B25","doi-asserted-by":"crossref","first-page":"S5","DOI":"10.1186\/2041-1480-2-S1-S5","article-title":"Semantics-based composition of EMBOSS services","volume":"2","author":"Lamprecht","year":"2011","journal-title":"J. Biomed. Semantics"},{"key":"2023051511005352700_bty646-B26","doi-asserted-by":"crossref","first-page":"e0137048.","DOI":"10.1371\/journal.pone.0137048","article-title":"Comparative analysis of label-free and 8-plex iTRAQ approach for quantitative tissue proteomic analysis","volume":"10","author":"Latosinska","year":"2015","journal-title":"PLoS One"},{"key":"2023051511005352700_bty646-B27","first-page":"530","article-title":"A review of bioinformatic pipeline frameworks","volume":"18","author":"Leipzig","year":"2017","journal-title":"Brief. Bioinformatics"},{"key":"2023051511005352700_bty646-B28","doi-asserted-by":"crossref","first-page":"350","DOI":"10.1007\/978-3-540-30475-3_25","volume-title":"The Semantic Web \u2013 ISWC 2004","author":"Lord","year":"2004"},{"key":"2023051511005352700_bty646-B29","doi-asserted-by":"crossref","first-page":"1039","DOI":"10.1002\/cpe.994","article-title":"Scientific workflow management and the kepler system","volume":"18","author":"Lud\u00e4scher","year":"2006","journal-title":"Concurrency Comput. Pract. Experience"},{"key":"2023051511005352700_bty646-B30","doi-asserted-by":"crossref","first-page":"966","DOI":"10.1093\/bioinformatics\/btq054","article-title":"Skyline: an open source document editor for creating and analyzing targeted proteomics experiments","volume":"26","author":"MacLean","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051511005352700_bty646-B31","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1007\/BFb0053591","volume-title":"Fundamental Approaches to Software Engineering","author":"Margaria","year":"1998"},{"key":"2023051511005352700_bty646-B32","doi-asserted-by":"crossref","first-page":"1632","DOI":"10.1073\/pnas.77.3.1632","article-title":"Prediction of peptide retention times in high-pressure liquid chromatography on the basis of amino acid composition","volume":"77","author":"Meek","year":"1980","journal-title":"Proc. Natl. Acad. Sci. U.S.A"},{"key":"2023051511005352700_bty646-B33","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1093\/bib\/bbl014","article-title":"Agents in bioinformatics, computational and systems biology","volume":"8","author":"Merelli","year":"2007","journal-title":"Briefings in Bioinformatics"},{"key":"2023051511005352700_bty646-B34","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1007\/s41019-017-0050-4","article-title":"Robust cross-platform workflows: how technical and scientific communities collaborate to develop, test and share best practices for data analysis","volume":"2","author":"M\u00f6ller","year":"2017","journal-title":"Data Sci. Eng"},{"key":"2023051511005352700_bty646-B35","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1016\/S0950-7051(01)00167-8","article-title":"Integrating AI planning techniques with workflow management system","volume":"15","author":"Moreno","year":"2002","journal-title":"Knowl. Based Syst"},{"key":"2023051511005352700_bty646-B36","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1007\/978-3-642-28872-2_7","volume-title":"Fundamental Approaches to Software Engineering","author":"Naujokat","year":"2012"},{"key":"2023051511005352700_bty646-B37","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-30715-7","volume-title":"Scientific Workflows","author":"Qin","year":"2012"},{"key":"2023051511005352700_bty646-B38","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1007\/978-3-540-30581-1_5","volume-title":"Semantic Web Services and Web Process Composition","author":"Rao","year":"2005"},{"key":"2023051511005352700_bty646-B39","doi-asserted-by":"crossref","first-page":"W83","DOI":"10.1093\/nar\/gkw199","article-title":"g:profiler-a web server for functional interpretation of gene lists (2016 update)","volume":"44","author":"Reimand","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023051511005352700_bty646-B40","doi-asserted-by":"crossref","first-page":"741","DOI":"10.1038\/nmeth.3959","article-title":"OpenMS: a flexible open-source software platform for mass spectrometry data analysis","volume":"13","author":"R\u00f6st","year":"2016","journal-title":"Nat. Methods"},{"key":"2023051511005352700_bty646-B41","doi-asserted-by":"crossref","first-page":"1265","DOI":"10.1002\/pmic.200900437","article-title":"Scaffold: a bioinformatic tool for validating MS\/MS-based proteomic studies","volume":"10","author":"Searle","year":"2010","journal-title":"Proteomics"},{"key":"2023051511005352700_bty646-B42","doi-asserted-by":"crossref","first-page":"703.","DOI":"10.1186\/1471-2164-15-703","article-title":"Using galaxy-p to leverage rna-seq for the discovery of novel protein variations","volume":"15","author":"Sheynkman","year":"2014","journal-title":"BMC Genomics"},{"key":"2023051511005352700_bty646-B43","doi-asserted-by":"crossref","first-page":"1183","DOI":"10.1093\/bioinformatics\/btr081","article-title":"Proteomics to go: proteomatic enables the user-friendly creation of versatile MS\/MS data evaluation workflows","volume":"27","author":"Specht","year":"2011","journal-title":"Bioinformatics"},{"key":"2023051511005352700_bty646-B44","author":"Steffen","year":"1993"},{"key":"2023051511005352700_bty646-B45","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btv770","article-title":"Multi-omics enrichment analysis using the genetrail2 web service","volume":"32","author":"St\u00f6ckel","year":"2016","journal-title":"Bioinformatics"},{"key":"2023051511005352700_bty646-B46","doi-asserted-by":"crossref","first-page":"5354","DOI":"10.1021\/pr200611n","article-title":"Universal and confident phosphorylation site localization using phosphoRS","volume":"10","author":"Taus","year":"2011","journal-title":"J. Proteome Res"},{"key":"2023051511005352700_bty646-B47","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1007\/978-1-84628-757-2_20","volume-title":"Workflows for e-Science","author":"Taylor","year":"2007"},{"key":"2023051511005352700_bty646-B48","doi-asserted-by":"crossref","first-page":"1036","DOI":"10.1002\/pmic.201200408","article-title":"D-score: a search engine independent MD-score","volume":"13","author":"Vaudel","year":"2013","journal-title":"Proteomics"},{"key":"2023051511005352700_bty646-B49","doi-asserted-by":"crossref","first-page":"S7.","DOI":"10.1186\/1471-2105-11-S12-S7","article-title":"SADI, SHARE, and the in silico scientific method","volume":"11","author":"Wilkinson","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023051511005352700_bty646-B50","doi-asserted-by":"crossref","first-page":"W557","DOI":"10.1093\/nar\/gkt328","article-title":"The Taverna workflow suite: designing and executing workflows of web services on the desktop, web or in the cloud","volume":"41","author":"Wolstencroft","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023051511005352700_bty646-B51","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1016\/j.jprot.2014.02.008","article-title":"Characterization of the platelet granule proteome: evidence of the presence of MHC1 in alpha-granules","volume":"101","author":"Zufferey","year":"2014","journal-title":"J. Proteomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/4\/656\/50320510\/bioinformatics_35_4_656.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/4\/656\/50320510\/bioinformatics_35_4_656.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,15]],"date-time":"2023-05-15T11:02:37Z","timestamp":1684148557000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/4\/656\/5060940"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,7,24]]},"references-count":51,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2019,2,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty646","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,2,15]]},"published":{"date-parts":[[2018,7,24]]}}}