{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T11:15:40Z","timestamp":1770722140773,"version":"3.49.0"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Source Code Biol Med"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>There is a significant demand for creating pipelines or workflows in the life science discipline that chain a number of discrete compute and data intensive analysis tasks into sophisticated analysis procedures. This need has led to the development of general as well as domain-specific workflow environments that are either complex desktop applications or Internet-based applications. Complexities can arise when configuring these applications in heterogeneous compute and storage environments if the execution and data access models are not designed appropriately. These complexities manifest themselves through limited access to available HPC resources, significant overhead required to configure tools and inability for users to simply manage files across heterogenous HPC storage infrastructure.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>In this paper, we describe the architecture of a software system that is adaptable to a range of both pluggable execution and data backends in an open source implementation called Yabi. Enabling seamless and transparent access to heterogenous HPC environments at its core, Yabi then provides an analysis workflow environment that can create and reuse workflows as well as manage large amounts of both raw and processed data in a secure and flexible way across geographically distributed compute resources. Yabi can be used via a web-based environment to drag-and-drop tools to create sophisticated workflows. Yabi can also be accessed through the Yabi command line which is designed for users that are more comfortable with writing scripts or for enabling external workflow environments to leverage the features in Yabi. Configuring tools can be a significant overhead in workflow environments. Yabi greatly simplifies this task by enabling system administrators to configure as well as manage running tools via a web-based environment and without the need to write or edit software programs or scripts. In this paper, we highlight Yabi's capabilities through a range of bioinformatics use cases that arise from large-scale biomedical data analysis.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>The Yabi system encapsulates considered design of both execution and data models, while abstracting technical details away from users who are not skilled in HPC and providing an intuitive drag-and-drop scalable web-based workflow environment where the same tools can also be accessed via a command line. Yabi is currently in use and deployed at multiple institutions and is available at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/ccg.murdoch.edu.au\/yabi\" ext-link-type=\"uri\">http:\/\/ccg.murdoch.edu.au\/yabi<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1751-0473-7-1","type":"journal-article","created":{"date-parts":[[2012,2,15]],"date-time":"2012-02-15T07:14:14Z","timestamp":1329290054000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":98,"title":["Yabi: An online research environment for grid, high performance and cloud computing"],"prefix":"10.1186","volume":"7","author":[{"given":"Adam A","family":"Hunter","sequence":"first","affiliation":[]},{"given":"Andrew B","family":"Macgregor","sequence":"additional","affiliation":[]},{"given":"Tamas O","family":"Szabo","sequence":"additional","affiliation":[]},{"given":"Crispin A","family":"Wellington","sequence":"additional","affiliation":[]},{"given":"Matthew I","family":"Bellgard","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,2,15]]},"reference":[{"issue":"5","key":"69_CR1","doi-asserted-by":"publisher","first-page":"687","DOI":"10.1016\/j.jbi.2008.01.008","volume":"41","author":"C Goble","year":"2008","unstructured":"Goble C, Stevens R: State of the nation in the data integration for bioinformatics. Journal of Biomedical Informatics. 2008, 41 (5): 687-693. 10.1016\/j.jbi.2008.01.008.","journal-title":"Journal of Biomedical Informatics"},{"key":"69_CR2","volume-title":"Highlights of Astronomy, XXVIth IAU General Assembly","author":"M Louys","year":"2006","unstructured":"Louys M, Bonnarel F, Schaaff A, Claudon J-J, Pestel C: Implementing astronomical image analysis pipelines using VO standards. Highlights of Astronomy, XXVIth IAU General Assembly. Edited by: van der Hucht KA. 2006, 14:"},{"key":"69_CR3","doi-asserted-by":"publisher","first-page":"3937","DOI":"10.1098\/rsta.2010.0158","volume":"368","author":"NA Walton","year":"2010","unstructured":"Walton NA, Brenton JD, Caldas C, Irwin MJ, Akram A, Gonzalez-Solares E, Lewis JR, Maccallum PH, Morris LJ, Rixon GT: PathGrid: a service-orientated architecture for microscopy image analysis. Philos Transact A Math Phys Eng Sci. 2010, 368: 3937-3952. 10.1098\/rsta.2010.0158.","journal-title":"Philos Transact A Math Phys Eng Sci"},{"issue":"10","key":"69_CR4","doi-asserted-by":"publisher","first-page":"1611","DOI":"10.1101\/gr.361602","volume":"12","author":"JE Stajich","year":"2002","unstructured":"Stajich JE, Block D, Boulez K, Brenner SE, Chervitz SA, Dagdigian C, Fuellen G, Gilbert JG, Korf I, Lapp H, Lehv\u00e4slaiho H, Matsalla C, Mungall CJ, Osborne BI, Pocock MR, Schattner P, Senger M, Stein LD, Stupka E, Wilkinson MD, Birney E: The Bioperl toolkit: Perl modules for the life sciences. Genome Res. 2002, 12 (10): 1611-8. 10.1101\/gr.361602.","journal-title":"Genome Res"},{"issue":"2","key":"69_CR5","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1145\/360262.360266","volume":"20","author":"M Pocock","year":"2000","unstructured":"Pocock M, Down T, Hubbard T: BioJava: open source components for bioinformatics. ACM SIGBIO Newsletter. 2000, 20 (2): 10-12. 10.1145\/360262.360266.","journal-title":"ACM SIGBIO Newsletter"},{"key":"69_CR6","volume-title":"Springer","author":"IJ Taylor","year":"2005","unstructured":"Taylor IJ: From P2P to Web Services and Grids - Peers in a Client\/Server World. Springer. 2005"},{"issue":"4","key":"69_CR7","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1093\/bib\/3.4.331","volume":"3","author":"MD Wilkinson","year":"2002","unstructured":"Wilkinson MD, Links M: BioMOBY: an open source biological web services proposal. Brief Bioinform. 2002, 3 (4): 331-41. 10.1093\/bib\/3.4.331.","journal-title":"Brief Bioinform"},{"key":"69_CR8","volume-title":"Proceedings of the 2005 Australasian workshop on Grid computing and e-research","author":"A Hunter","year":"2005","unstructured":"Hunter A, Schibeci D, Hiew HL, Bellgard M: Grendel: A bioinformatics Web Service-based architecture for accessing HPC resources. Proceedings of the 2005 Australasian workshop on Grid computing and e-research. 2005, 44:"},{"key":"69_CR9","volume-title":"Bioinformatics","author":"M Bellgard","year":"2005","unstructured":"Bellgard M, Hiew HL, Hunter A, Wiebrands M: ORBIT: and integrated environment for user-customised bioinformatics tools. Bioinformatics. 2005, 1:"},{"key":"69_CR10","doi-asserted-by":"publisher","first-page":"393","DOI":"10.1079\/9780851999104.0393","volume-title":"Mammalian Genomics","author":"M Bellgard","year":"2005","unstructured":"Bellgard M: Bioinformatics from comparative genomic analysis through to integrated systems. Mammalian Genomics. 2005, 393-409."},{"issue":"2","key":"69_CR11","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1177\/109434209701100205","volume":"11","author":"I Foster","year":"1997","unstructured":"Foster I, Kesselman C: Globus: A Metacomputing Infrastructure Toolkit. Intl J Supercomputer Applications. 1997, 11 (2): 115-128. 10.1177\/109434209701100205.","journal-title":"Intl J Supercomputer Applications"},{"key":"69_CR12","doi-asserted-by":"crossref","unstructured":"Hull D, Wolstencroft K, Stevens R, Goble C, Pocock M, Li P, Oinn T: Taverna: a tool for building and running workflows of services. Nucleic Acids Research. 2006, 729-732. 34 Web Server","DOI":"10.1093\/nar\/gkl320"},{"issue":"10","key":"69_CR13","doi-asserted-by":"publisher","first-page":"1067","DOI":"10.1002\/cpe.993","volume":"18","author":"T Oinn","year":"2006","unstructured":"Oinn T, Greenwood M, Addis M, Alpdemir N, Ferris J, Glover K, Goble C, Goderis A, Hull D, Marvin D, Li P, Lord P, Pocock M, Senger M, Stevens R, Wipat A, Wroe C: Taverna: lessons in creating a workflow environment for the life sciences. Concurrency and Computation: Practice and Experience. 2006, 18 (10): 1067-1100. 10.1002\/cpe.993.","journal-title":"Concurrency and Computation: Practice and Experience"},{"issue":"10","key":"69_CR14","doi-asserted-by":"publisher","first-page":"1451","DOI":"10.1101\/gr.4086505","volume":"15","author":"B Giardine","year":"2005","unstructured":"Giardine B, Riemer C, Hardison RC, Burhans R, Elnitski L, Shah P, Zhang Y, Blankenberg D, Albert I, Taylor J, Miller W, Kent WJ, Nekrutenko A: Galaxy: a platform for interactive large-scale genome analysis. Genome Research. 2005, 15 (10): 1451-5. 10.1101\/gr.4086505.","journal-title":"Genome Research"},{"key":"69_CR15","first-page":"423","volume-title":"Proceedings. 16th International Conference on Scientific and Statistical Database Management","author":"I Altintas","year":"2004","unstructured":"Altintas I, Berkley C, Jaeger E, Jones M, Ludascher B, Mock S: Kepler: an extensible system for design and execution of scientific workflows. Proceedings. 16th International Conference on Scientific and Statistical Database Management. 2004, 423-424."},{"issue":"3","key":"69_CR16","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1155\/2005\/128026","volume":"13","author":"E Deelman","year":"2005","unstructured":"Deelman E, Singh G, Su M, Blythe J, Gil Y, Kesselman C, Mehta G, Vahi K, Berriman GB, Good J, Laity A, Jacob JC, Katz DS: Pegasus: a Framework for Mapping Complex Scientific Workflows onto Distributed Systems. Scientific Programming Journal. 2005, 13 (3): 219-237.","journal-title":"Scientific Programming Journal"},{"key":"69_CR17","unstructured":"[http:\/\/wiki.g2.bx.psu.edu\/Admin\/Tools\/Add%20Tool%20Tutorial]"},{"key":"69_CR18","unstructured":"[http:\/\/hg.notalon.org\/iracooke\/galaxy-proteomics]"},{"issue":"20","key":"69_CR19","first-page":"10","volume":"3","author":"WW Eckerson","year":"1995","unstructured":"Eckerson WW: Three Tier Client\/Server Architecture: Achieving Scalability, Performance, and Efficiency in Client Server Applications. Open Information Systems. 1995, 3 (20): 10.","journal-title":"Open Information Systems"},{"key":"69_CR20","doi-asserted-by":"publisher","first-page":"1083","DOI":"10.1016\/j.crvi.2003.09.005","volume":"326","author":"M Bellgard","year":"2003","unstructured":"Bellgard M, Kenworthy W, Hunter A: Microarray Analysis Using Bioinformatics Analysis Audit Trails (BAATs). C R Biol. 2003, 326: 1083-1087. 10.1016\/j.crvi.2003.09.005.","journal-title":"C R Biol"},{"key":"69_CR21","unstructured":"[http:\/\/www.adaptivecomputing.com\/products\/torque.php]"},{"key":"69_CR22","unstructured":"[http:\/\/www.pbsworks.com\/Product.aspx?id=1]"},{"issue":"2","key":"69_CR23","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1145\/514183.514185","volume":"2","author":"RT Fielding","year":"2002","unstructured":"Fielding RT, Taylor RN: Principled Design of the Modern Web Architecture. ACM Transactions on Internet Technology. 2002, 2 (2): 115-150. 10.1145\/514183.514185.","journal-title":"ACM Transactions on Internet Technology"},{"issue":"4","key":"69_CR24","first-page":"845","volume":"163","author":"\u0412\u0418 \u041b\u0435\u0432\u0435\u043d\u0448\u0442\u0435\u0439\u043d","year":"1965","unstructured":"\u041b\u0435\u0432\u0435\u043d\u0448\u0442\u0435\u0439\u043d \u0412\u0418: \u0414\u0432\u043e\u0438\u0447\u043d\u044b\u0435 \u043a\u043e\u0434\u044b \u0441 \u0438\u0441\u043f\u0440\u0430\u0432\u043b\u0435\u043d\u0438\u0435\u043c \u0432\u044b\u043f\u0430\u0434\u0435\u043d\u0438\u0439, \u0432\u0441\u0442\u0430\u0432\u043e\u043a \u0438 \u0437\u0430\u043c\u0435\u0449\u0435\u043d\u0438\u0439 \u0441\u0438\u043c\u0432\u043e\u043b\u043e\u0432. \u0414\u043e\u043a\u043b\u0430\u0434\u044b \u0410\u043a\u0430\u0434\u0435\u043c\u0438\u0439 \u041d\u0430\u0443\u043a CCCP. 1965, 163 (4): 845-8. Appeared in English as: Levenshtein VI (1966). Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady 1966, 10: 707-10","journal-title":"\u0414\u043e\u043a\u043b\u0430\u0434\u044b \u0410\u043a\u0430\u0434\u0435\u043c\u0438\u0439 \u041d\u0430\u0443\u043a CCCP"},{"issue":"7","key":"69_CR25","doi-asserted-by":"publisher","first-page":"396","DOI":"10.1145\/366663.366704","volume":"6","author":"ME Conway","year":"1963","unstructured":"Conway ME: Design of a Separable Transition-Diagram Compiler. Communications of the ACM. 1963, 6 (7): 396-408. 10.1145\/366663.366704.","journal-title":"Communications of the ACM"},{"key":"69_CR26","doi-asserted-by":"crossref","unstructured":"Bellgard MI, Moolhuijzen P, Guerrero F, Schibeci D, Rodriguez-Valle M, Peterson D, Dowd S, Barrero R, Hunter A, Miller R, Lew-Tabor A: CattleTickBase: An integrated Internet-based bioinformatics resource for Rhipicephalus (Boophilus) microplus. International Journal for Parasitology. 42 (2): 161-169.","DOI":"10.1016\/j.ijpara.2011.11.006"},{"issue":"1","key":"69_CR27","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1006\/jmbi.1997.0951","volume":"268","author":"C Burge","year":"1997","unstructured":"Burge C, Karlin S: Prediction of complete gene structures in human genomic DNA. J Mol Biol. 1997, 268 (1): 78-94. 10.1006\/jmbi.1997.0951.","journal-title":"J Mol Biol"},{"issue":"6","key":"69_CR28","doi-asserted-by":"publisher","first-page":"276","DOI":"10.1016\/S0168-9525(00)02024-2","volume":"16","author":"P Rice","year":"2000","unstructured":"Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16 (6): 276-277. 10.1016\/S0168-9525(00)02024-2.","journal-title":"Trends Genet"},{"issue":"3","key":"69_CR29","doi-asserted-by":"publisher","first-page":"509","DOI":"10.1110\/ps.051745906","volume":"15","author":"M Wistrand","year":"2006","unstructured":"Wistrand M, Kall L, Sonnhammer EL: A general model of G protein-coupled receptor sequences and its application to detect remote homologs. Protein Sci. 2006, 15 (3): 509-521. 10.1110\/ps.051745906.","journal-title":"Protein Sci"},{"key":"69_CR30","doi-asserted-by":"publisher","first-page":"2005.0017","DOI":"10.1038\/msb4100024","volume":"1","author":"A Keller","year":"2005","unstructured":"Keller A, Eng J, Zhang N, Li X, Aebersold R: A uniform proteomics MS\/MS analysis platform utilizing open XML file formats. Molecular Systems Biology. 2005, 1: 2005.0017.","journal-title":"Molecular Systems Biology"},{"issue":"17","key":"69_CR31","doi-asserted-by":"publisher","first-page":"4646","DOI":"10.1021\/ac0341261","volume":"75","author":"AI Nesvizhskii","year":"2003","unstructured":"Nesvizhskii AI, Keller A, Kolker E, Aebersold R: A statistical model for identifying proteins by tandem mass spectrometry. Analytical Chemistry. 2003, 75 (17): 4646-4658. 10.1021\/ac0341261.","journal-title":"Analytical Chemistry"},{"key":"69_CR32","doi-asserted-by":"publisher","first-page":"3551","DOI":"10.1002\/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2","volume":"20","author":"DN Perkins","year":"1999","unstructured":"Perkins DN, Pappin DJ, Creasy DM, Cottrell JS: Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis. 1999, 20: 3551-3567. 10.1002\/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2.","journal-title":"Electrophoresis"},{"issue":"10","key":"69_CR33","doi-asserted-by":"publisher","first-page":"e25776","DOI":"10.1371\/journal.pone.0025776","volume":"6","author":"DC Murray","year":"2011","unstructured":"Murray DC, Bunce M, Cannell BL, Oliver R, Houston J, White NE, Barrero RA, Bellgard MI, Haile J: DNA-Based Faecal Dietary Analysis: A Comparison of qPCR and High Throughput Sequencing Approaches. PLoS One. 2011, 6 (10): e25776-10.1371\/journal.pone.0025776. Epub 2011 Oct 6. PubMed PMID: 21998697; PubMed Central PMCID: PMC3188572","journal-title":"PLoS One"}],"container-title":["Source Code for Biology and Medicine"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1751-0473-7-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T18:27:44Z","timestamp":1630520864000},"score":1,"resource":{"primary":{"URL":"https:\/\/scfbm.biomedcentral.com\/articles\/10.1186\/1751-0473-7-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,2,15]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["69"],"URL":"https:\/\/doi.org\/10.1186\/1751-0473-7-1","relation":{},"ISSN":["1751-0473"],"issn-type":[{"value":"1751-0473","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,2,15]]},"assertion":[{"value":"12 December 2011","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 February 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 February 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"1"}}