{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T07:53:28Z","timestamp":1778313208477,"version":"3.51.4"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,6,2]],"date-time":"2016-06-02T00:00:00Z","timestamp":1464825600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,6,2]],"date-time":"2016-06-02T00:00:00Z","timestamp":1464825600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Next generation sequencing (NGS) produces massive datasets consisting of billions of reads and up to thousands of samples. Subsequent bioinformatic analysis is typically done with the help of open source tools, where each application performs a single step towards the final result. This situation leaves the bioinformaticians with the tasks to combine the tools, manage the data files and meta-information, document the analysis, and ensure reproducibility.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We present SUSHI, an agile data analysis framework that relieves bioinformaticians from the administrative challenges of their data analysis. SUSHI lets users build reproducible data analysis workflows from individual applications and manages the input data, the parameters, meta-information with user-driven semantics, and the job scripts. As distinguishing features, SUSHI provides an expert command line interface as well as a convenient web interface to run bioinformatics tools. SUSHI datasets are self-contained and self-documented on the file system. This makes them fully reproducible and ready to be shared. With the associated meta-information being formatted as plain text tables, the datasets can be readily further analyzed and interpreted outside SUSHI.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>SUSHI provides an exquisite recipe for analysing NGS data. By following the SUSHI recipe, SUSHI makes data analysis straightforward and takes care of documentation and administration tasks. Thus, the user can fully dedicate his time to the analysis itself. SUSHI is suitable for use by bioinformaticians as well as life science researchers. It is targeted for, but by no means constrained to, NGS data analysis. Our SUSHI instance is in productive use and has served as data analysis interface for more than 1000 data analysis projects. SUSHI source code as well as a demo server are freely available.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-016-1104-8","type":"journal-article","created":{"date-parts":[[2016,6,2]],"date-time":"2016-06-02T09:30:35Z","timestamp":1464859835000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":130,"title":["SUSHI: an exquisite recipe for fully documented, reproducible and reusable NGS data analysis"],"prefix":"10.1186","volume":"17","author":[{"given":"Masaomi","family":"Hatakeyama","sequence":"first","affiliation":[]},{"given":"Lennart","family":"Opitz","sequence":"additional","affiliation":[]},{"given":"Giancarlo","family":"Russo","sequence":"additional","affiliation":[]},{"given":"Weihong","family":"Qi","sequence":"additional","affiliation":[]},{"given":"Ralph","family":"Schlapbach","sequence":"additional","affiliation":[]},{"given":"Hubert","family":"Rehrauer","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2016,6,2]]},"reference":[{"key":"1104_CR1","doi-asserted-by":"publisher","first-page":"e1000424","DOI":"10.1371\/journal.pcbi.1000424","volume":"5","author":"WS Noble","year":"2009","unstructured":"Noble WS. A quick guide to organizing computational biology projects. PLoS Computational Biology. 2009;5:e1000424.","journal-title":"PLoS Computational Biology"},{"key":"1104_CR2","doi-asserted-by":"publisher","first-page":"R86","DOI":"10.1186\/gb-2010-11-8-r86","volume":"11","author":"J Goecks","year":"2010","unstructured":"Goecks J, Nekrutenko A, Taylor J, Galaxy Team. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biology. 2010;11:R86. doi:10.1186\/gb-2010-11-8-r86.","journal-title":"Genome Biology"},{"key":"1104_CR3","doi-asserted-by":"publisher","first-page":"1724","DOI":"10.1093\/bioinformatics\/btv061","volume":"31","author":"KM Fisch","year":"2015","unstructured":"Fisch KM, Mei\u00dfner T, Gioia L, Ducom J-C, Carland TM, Loguercio S, Su AI. Omics Pipe: a community-based framework for reproducible multi-omics data analysis. Bioinformatics (Oxford, England). 2015;31:1724-8. doi:10.1093\/bioinformatics\/btv061.","journal-title":"Bioinformatics (Oxford, England)"},{"key":"1104_CR4","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1038\/nmeth.1809","volume":"9","author":"F Halbritter","year":"2012","unstructured":"Halbritter F, Vaidya HJ, Tomlinson SR. GeneProf: analysis of high-throughput sequencing experiments. Nature Methods. 2012;9:7-8. doi:10.1038\/nmeth.1809.","journal-title":"Nature Methods"},{"key":"1104_CR5","doi-asserted-by":"publisher","first-page":"500","DOI":"10.1038\/ng0506-500","volume":"38","author":"M Reich","year":"2006","unstructured":"Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, Mesirov JP. GenePattern 2.0. Nature Genetics. 2006;38:500-1. doi:10.1038\/ng0506-500.","journal-title":"Nature Genetics"},{"key":"1104_CR6","doi-asserted-by":"publisher","first-page":"1525","DOI":"10.1093\/bioinformatics\/bts167","volume":"28","author":"SP Sadedin","year":"2012","unstructured":"Sadedin SP, Pope B, Oshlack A. Bpipe: a tool for running and managing bioinformatics pipelines. Bioinformatics (Oxford, England). 2012;28:1525-6. doi:10.1093\/bioinformatics\/bts167.","journal-title":"Bioinformatics (Oxford, England)"},{"key":"1104_CR7","doi-asserted-by":"publisher","first-page":"2778","DOI":"10.1093\/bioinformatics\/btq524","volume":"26","author":"L Goodstadt","year":"2010","unstructured":"Goodstadt L. Ruffus: a lightweight Python library for computational pipelines. Bioinformatics (Oxford, England). 2010;26:2778-9. doi:10.1093\/bioinformatics\/btq524.","journal-title":"Bioinformatics (Oxford, England)"},{"key":"1104_CR8","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1093\/bioinformatics\/bts696","volume":"29","author":"CO McCoy","year":"2013","unstructured":"McCoy CO, Gallagher A, Hoffman NG. nestly\u2014a framework for running software with nested parameter choices and aggregating results. Bioinformatics (Oxford, England). 2013;29:387-8. doi:10.1093\/bioinformatics\/bts696.","journal-title":"Bioinformatics (Oxford, England)"},{"key":"1104_CR9","doi-asserted-by":"publisher","first-page":"1471","DOI":"10.1093\/bioinformatics\/btu036","volume":"30","author":"FA Buske","year":"2014","unstructured":"Buske FA, French HJ, Smith MA, Clark SJ, Bauer DC. NGSANE: a lightweight production informatics framework for high-throughput data analysis. Bioinformatics (Oxford, England). 2014;30:1471-2. doi:10.1093\/bioinformatics\/btu036.","journal-title":"Bioinformatics (Oxford, England)"},{"key":"1104_CR10","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1007\/s10586-010-0134-7","volume":"13","author":"L Yu","year":"2010","unstructured":"Yu L, Moretti C, Thrasher A, Emrich S, Judd K, Thain D. Harnessing Parallelism in Multicore Clusters with the All-Pairs, Wavefront, and Makeflow Abstractions. Journal of Cluster Computing. 2010;13:243-56.","journal-title":"Journal of Cluster Computing"},{"key":"1104_CR11","doi-asserted-by":"publisher","first-page":"2520","DOI":"10.1093\/bioinformatics\/bts480","volume":"28","author":"J K\u00f6ster","year":"2012","unstructured":"K\u00f6ster J, Rahmann S. Snakemake-a scalable bioinformatics workflow engine. Bioinformatics (Oxford, England). 2012;28:2520-2. doi:10.1093\/bioinformatics\/bts480.","journal-title":"Bioinformatics (Oxford, England)"},{"key":"1104_CR12","volume-title":"FastQC: a quality control tool for high throughput sequence data","author":"S Andrews","year":"2010","unstructured":"Andrews S. FastQC: a quality control tool for high throughput sequence data. 2010."},{"key":"1104_CR13","doi-asserted-by":"publisher","first-page":"R36","DOI":"10.1186\/gb-2013-14-4-r36","volume":"14","author":"D Kim","year":"2013","unstructured":"Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biology. 2013;14:R36.","journal-title":"Genome Biology"},{"key":"1104_CR14","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1093\/bioinformatics\/bts635","volume":"29","author":"A Dobin","year":"2013","unstructured":"Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics (Oxford, England). 2013;29:15-21.","journal-title":"Bioinformatics (Oxford, England)"},{"key":"1104_CR15","doi-asserted-by":"publisher","unstructured":"Liao Y, Smyth GK, Shi W. The Subread aligner: Fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Research. 2013;41(10). doi:10.1093\/nar\/gkt214.","DOI":"10.1093\/nar\/gkt214"},{"key":"1104_CR16","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","volume":"26","author":"MD Robinson","year":"2009","unstructured":"Robinson MD, McCarthy DJ, Smyth GK. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2009;26:139-40.","journal-title":"Bioinformatics"},{"key":"1104_CR17","doi-asserted-by":"publisher","unstructured":"Kallio MA, Tuimala JT, Hupponen T, Klemel\u00e4 P. Chipster: user-friendly analysis software for microarray and other high-throughput data. BMC Genomics. 2011;12. doi:10.1186\/1471-2164-12-507.","DOI":"10.1186\/1471-2164-12-507"},{"key":"1104_CR18","doi-asserted-by":"publisher","unstructured":"Wolstencroft K, Haines R, Fellows D, Williams A, Withers D, Owen S, et al. The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud. Nucleic Acids Research. 2013:W557-61. doi:10.1093\/nar\/gkt328.","DOI":"10.1093\/nar\/gkt328"},{"key":"1104_CR19","doi-asserted-by":"publisher","first-page":"1471","DOI":"10.1038\/nbt1206-1471","volume":"24","author":"R Edgar","year":"2006","unstructured":"Edgar R, Barrett T. NCBI GEO standards and services for microarray data. Nature Biotechnology. 2006;24:1471-2.","journal-title":"Nature Biotechnology"},{"issue":"Database issue","key":"1104_CR20","doi-asserted-by":"publisher","first-page":"D991","DOI":"10.1093\/nar\/gks1193","volume":"41","author":"T Barrett","year":"2013","unstructured":"Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A. NCBI GEO: archive for functional genomics data sets-update. Nucleic Acids Research. 2013;41(Database issue):D991-5.","journal-title":"Nucleic Acids Research"},{"key":"1104_CR21","doi-asserted-by":"publisher","first-page":"D7","DOI":"10.1093\/nar\/gkv1290","volume":"44","author":"NCBI Resource Coordinators","year":"2016","unstructured":"NCBI Resource Coordinators. Database resources of the National Center for Biotechnology Information. Nucleic Acids Research. 2016;44:D7-D19.","journal-title":"Nucleic Acids Research"},{"key":"1104_CR22","doi-asserted-by":"publisher","unstructured":"Hong EL, Sloan CA, Chan ET, Davidson JM, Malladi VS, Strattan JS, et al. Principles of metadata organization at the ENCODE data coordination center. Database: The Journal of Biological Databases and Curation. 2016;2016:baw001. doi:10.1093\/database\/baw001.","DOI":"10.1093\/database\/baw001"},{"key":"1104_CR23","doi-asserted-by":"publisher","first-page":"bar023","DOI":"10.1093\/database\/bar023","volume":"2011","author":"NL Washington","year":"2011","unstructured":"Washington NL, Stinson EO, Perry MD, Ruzanov P, Contrino S, Smith R, Zha Z, Lyne R, Carr A, Lloyd P, Kephart E, McKay SJ, Micklem G, Stein LD, Lewis SE. The modENCODE Data Coordination Center: lessons in harvesting comprehensive experimental details. Database. 2011;2011:bar023.","journal-title":"Database"},{"key":"1104_CR24","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1038\/ng.1054","volume":"44","author":"S-A Sansone","year":"2012","unstructured":"Sansone S-A, Rocca-Serra P, Field D, Maguire E, Taylor C, Hofmann O, Fang H, Neumann S, Tong W, Amaral-Zettler L, Begley K, Booth T, Bougueleret L, Burns G, Chapman B, Clark T, Coleman L-A, Copeland J, Das S, de Daruvar A, de Matos P, Dix I, Edmunds S, Evelo CT, Forster MJ, Gaudet P, Gilbert J, Goble C, Griffin JL, Jacob D, et al. Toward interoperable bioscience data. Nature genetics. 2012;44:121-6.","journal-title":"Nature genetics"},{"key":"1104_CR25","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1007\/978-3-540-73255-6_6","volume-title":"Data integration in the life sciences","author":"C T\u00fcrker","year":"2007","unstructured":"T\u00fcrker C, Stolte E, Joho D, Schlapbach R. B-fabric: A data and application integration framework for life sciences research. In: Sarah Cohen-Boulakia VT, editor. Data integration in the life sciences, vol. 4544. Berlin Heidelberg: Springer; 2007. p. 37-47. doi:10.1007\/978-3-540-73255-6_6 [Lecture Notes in Computer Science]."},{"key":"1104_CR26","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1007\/978-3-642-02279-1_13","volume-title":"B-Fabric: An Open Source Life Sciences Data Management System","author":"C T\u00fcrker","year":"2009","unstructured":"T\u00fcrker C, Akal F, Joho D, Schlapbach R. B-Fabric: An Open Source Life Sciences Data Management System. Berlin Heidelberg: Springer; 2009. p. 185-90. doi:10.1007\/978-3-642-02279-1_13."},{"key":"1104_CR27","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1007\/978-1-4612-5695-3","volume-title":"Selected Writings on Computing: A Personal Perspective","author":"EW Dijkstra","year":"1982","unstructured":"Dijkstra EW. Selected Writings on Computing: A Personal Perspective. New York: Springer; 1982. p. 60-6."},{"key":"1104_CR28","unstructured":"Buschmann F, Meunier R, Rohnert H, Sommerlad P, Stal M. Pattern-Oriented Software Architecture, a System of Patterns. Chichester, UK: Wiley Publishing; 1996."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1104-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-1104-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1104-8","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1104-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T18:01:53Z","timestamp":1706810513000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-1104-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,6,2]]},"references-count":28,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["1104"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-1104-8","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,6,2]]},"assertion":[{"value":"20 February 2016","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 May 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 June 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"228"}}