{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T10:05:36Z","timestamp":1769853936173,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":54,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,9,21]],"date-time":"2020-09-21T00:00:00Z","timestamp":1600646400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,9,21]]},"DOI":"10.1145\/3388440.3412460","type":"proceedings-article","created":{"date-parts":[[2020,11,10]],"date-time":"2020-11-10T12:43:43Z","timestamp":1605012223000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Collaborative Cloud Computing Framework for Health Data with Open Source Technologies"],"prefix":"10.1145","author":[{"given":"Fatemeh","family":"Rouzbeh","sequence":"first","affiliation":[{"name":"Purdue University, West Lafayette, USA"}]},{"given":"Ananth","family":"Grama","sequence":"additional","affiliation":[{"name":"Purdue University, West Lafayette, USA"}]},{"given":"Paul","family":"Griffin","sequence":"additional","affiliation":[{"name":"Purdue University, West Lafayette, USA"}]},{"given":"Mohammad","family":"Adibuzzaman","sequence":"additional","affiliation":[{"name":"Purdue University, West Lafayette, USA"}]}],"member":"320","published-online":{"date-parts":[[2020,11,10]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"[n.d.]. Apache Hadoop. https:\/\/hadoop.apache.org\/  [n.d.]. Apache Hadoop. https:\/\/hadoop.apache.org\/"},{"key":"e_1_3_2_1_2_1","unstructured":"[n.d.]. BD2K PIC-SURE RESTful API. http:\/\/bd2k-picsure.hms.harvard.edu\/more.html. [Online; accessed 3-March-2018].  [n.d.]. BD2K PIC-SURE RESTful API. http:\/\/bd2k-picsure.hms.harvard.edu\/more.html. [Online; accessed 3-March-2018]."},{"key":"e_1_3_2_1_3_1","unstructured":"[n.d.]. Empowering App Development for Developers. https:\/\/www.docker.com\/  [n.d.]. Empowering App Development for Developers. https:\/\/www.docker.com\/"},{"key":"e_1_3_2_1_4_1","unstructured":"[n.d.]. MetalLB: bare metal load-balancer for kubernetes. https:\/\/metallb.universe.tf\/  [n.d.]. MetalLB: bare metal load-balancer for kubernetes. https:\/\/metallb.universe.tf\/"},{"key":"e_1_3_2_1_5_1","unstructured":"Accessed on 2019. CILogon: An Integrated Identity and Access Management Platform for Science. https:\/\/www.cilogon.org\/.  Accessed on 2019. CILogon: An Integrated Identity and Access Management Platform for Science. https:\/\/www.cilogon.org\/."},{"key":"e_1_3_2_1_6_1","unstructured":"Accessed on 2019. HIPI - Hadoop Image Processing Framework. hipi.cs.virginia.edu\/.  Accessed on 2019. HIPI - Hadoop Image Processing Framework. hipi.cs.virginia.edu\/."},{"key":"e_1_3_2_1_7_1","unstructured":"Accessed on 2019. JupyterHub - Project Jupyter. https:\/\/jupyter.org\/hub.  Accessed on 2019. JupyterHub - Project Jupyter. https:\/\/jupyter.org\/hub."},{"key":"e_1_3_2_1_8_1","unstructured":"Accessed on 2019. Kubernetes - Production-Grade Container Orchestration.\u00e2\u0102\u0130 Kubernetes. https:\/\/kubernetes.io\/.  Accessed on 2019. Kubernetes - Production-Grade Container Orchestration.\u00e2\u0102\u0130 Kubernetes. https:\/\/kubernetes.io\/."},{"key":"e_1_3_2_1_9_1","unstructured":"Accessed on 2019. Rook - open-source cloud-native storage for Kubernetes. https:\/\/rook.io\/.  Accessed on 2019. Rook - open-source cloud-native storage for Kubernetes. https:\/\/rook.io\/."},{"key":"e_1_3_2_1_10_1","volume-title":"Building Machine Learning and Deep Learning Models on Google Cloud Platform","author":"Bisong Ekaba","unstructured":"Ekaba Bisong . 2019. Kubeflow and Kubeflow Pipelines . In Building Machine Learning and Deep Learning Models on Google Cloud Platform . Springer , 671--685. Ekaba Bisong. 2019. Kubeflow and Kubeflow Pipelines. In Building Machine Learning and Deep Learning Models on Google Cloud Platform. Springer, 671--685."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2017.03.017"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367519"},{"key":"e_1_3_2_1_13_1","unstructured":"Maria Odea Ching. [n.d.]. Introduction. https:\/\/ranger.apache.org\/  Maria Odea Ching. [n.d.]. Introduction. https:\/\/ranger.apache.org\/"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2017.01.012"},{"key":"e_1_3_2_1_15_1","volume-title":"Estimating the reproducibility of psychological science. Science 349, 6251","author":"Collaboration Open Science","year":"2015","unstructured":"Open Science Collaboration . 2015. Estimating the reproducibility of psychological science. Science 349, 6251 ( 2015 ), aac4716. Open Science Collaboration. 2015. Estimating the reproducibility of psychological science. Science 349, 6251 (2015), aac4716."},{"key":"e_1_3_2_1_16_1","unstructured":"Breda Corish. 2018. Medical knowledge doubles every few months; how can clinicians keep up? https:\/\/www.elsevier.com\/connect\/medical-knowledge-doubles-every-few-months-how-can-clinicians-keep-up  Breda Corish. 2018. Medical knowledge doubles every few months; how can clinicians keep up? https:\/\/www.elsevier.com\/connect\/medical-knowledge-doubles-every-few-months-how-can-clinicians-keep-up"},{"key":"e_1_3_2_1_17_1","volume-title":"Marco Buongiorno Nardelli, Natalio Mingo, Stefano Sanvito, and Ohad Levy.","author":"Curtarolo Stefano","year":"2013","unstructured":"Stefano Curtarolo , Gus LW Hart , Marco Buongiorno Nardelli, Natalio Mingo, Stefano Sanvito, and Ohad Levy. 2013 . The high-throughput highway to computational materials design. Nature materials 12, 3 (2013), 191. Stefano Curtarolo, Gus LW Hart, Marco Buongiorno Nardelli, Natalio Mingo, Stefano Sanvito, and Ohad Levy. 2013. The high-throughput highway to computational materials design. Nature materials 12, 3 (2013), 191."},{"key":"e_1_3_2_1_18_1","unstructured":"F Daniel Davis Marc S Williams and Rebecca A Stametz. [n.d.]. Geisinger's effort to realize its potential as a learning health system: A progress report. Learning Health Systems ([n. d.]) e10221.  F Daniel Davis Marc S Williams and Rebecca A Stametz. [n.d.]. Geisinger's effort to realize its potential as a learning health system: A progress report. Learning Health Systems ([n. d.]) e10221."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CloudTech.2017.8284718"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.14778\/3192965.3192970"},{"key":"e_1_3_2_1_21_1","volume-title":"Easton-Marks and Paul Avillach","author":"Jeremy","year":"2016","unstructured":"Jeremy R. Easton-Marks and Paul Avillach . 2016 . BD2K PIC-SURE RESTFULL API PROTOCOL, Version 1.0. (2016). Jeremy R. Easton-Marks and Paul Avillach. 2016. BD2K PIC-SURE RESTFULL API PROTOCOL, Version 1.0. (2016)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Louis Ehwerhemuepha Gary Gasperino Nathaniel Bischoff Sharief Taraman Anthony Chang and William Feaster. 2020. HealtheDataLab--a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions. BMC medical informatics and decision making 20 1 (2020) 1--12.  Louis Ehwerhemuepha Gary Gasperino Nathaniel Bischoff Sharief Taraman Anthony Chang and William Feaster. 2020. HealtheDataLab--a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions. BMC medical informatics and decision making 20 1 (2020) 1--12.","DOI":"10.1186\/s12911-020-01153-7"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2213836.2213908"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2288-6-25"},{"key":"e_1_3_2_1_25_1","volume-title":"What does research reproducibility mean? Science translational medicine 8, 341","author":"Goodman Steven N","year":"2016","unstructured":"Steven N Goodman , Daniele Fanelli , and John PA Ioannidis . 2016. What does research reproducibility mean? Science translational medicine 8, 341 ( 2016 ), 341ps12--341ps12. Steven N Goodman, Daniele Fanelli, and John PA Ioannidis. 2016. What does research reproducibility mean? Science translational medicine 8, 341 (2016), 341ps12--341ps12."},{"key":"e_1_3_2_1_26_1","volume-title":"Rcupcake: an R package for querying and analyzing biomedical data through the BD2K PIC-SURE RESTful API. Bioinformatics","author":"Guti\u00c3l'rrez-Sacrist\u00c3 Alba","year":"2017","unstructured":"Alba Guti\u00c3l'rrez-Sacrist\u00c3 &alpha;n, Romain Guedj , Gabor Korodi , Jason Stedman , Laura I Furlong , Chirag J Patel , Isaac S Kohane , and Paul Avillach . 2017. Rcupcake: an R package for querying and analyzing biomedical data through the BD2K PIC-SURE RESTful API. Bioinformatics ( 2017 ), btx788. https:\/\/doi.org\/10.1093\/bioinformatics\/btx788 Alba Guti\u00c3l'rrez-Sacrist\u00c3&alpha;n, Romain Guedj, Gabor Korodi, Jason Stedman, Laura I Furlong, Chirag J Patel, Isaac S Kohane, and Paul Avillach. 2017. Rcupcake: an R package for querying and analyzing biomedical data through the BD2K PIC-SURE RESTful API. Bioinformatics (2017), btx788. https:\/\/doi.org\/10.1093\/bioinformatics\/btx788"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0202447"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2014.07.006"},{"key":"e_1_3_2_1_29_1","volume-title":"Ian Chi Kei Wong, Peter R Rijnbeek, et al.","author":"Hripcsak George","year":"2015","unstructured":"George Hripcsak , Jon D Duke , Nigam H Shah , Christian G Reich , Vojtech Huser , Martijn J Schuemie , Marc A Suchard , Rae Woong Park , Ian Chi Kei Wong, Peter R Rijnbeek, et al. 2015 . Observational Health Data Sciences and Informatics (OHDSI): opportunities for observational researchers. Studies in health technology and informatics 216 (2015), 574. George Hripcsak, Jon D Duke, Nigam H Shah, Christian G Reich, Vojtech Huser, Martijn J Schuemie, Marc A Suchard, Rae Woong Park, Ian Chi Kei Wong, Peter R Rijnbeek, et al. 2015. Observational Health Data Sciences and Informatics (OHDSI): opportunities for observational researchers. Studies in health technology and informatics 216 (2015), 574."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1001\/jama.294.2.218"},{"key":"e_1_3_2_1_31_1","volume-title":"Why most published research findings are false. PLoS medicine 2, 8","author":"Ioannidis John PA","year":"2005","unstructured":"John PA Ioannidis . 2005. Why most published research findings are false. PLoS medicine 2, 8 ( 2005 ), e124. John PA Ioannidis. 2005. Why most published research findings are false. PLoS medicine 2, 8 (2005), e124."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2016.35"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocx084"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2014.01.003"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2011-000492"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"crossref","unstructured":"Bo Li Joshua Gould Yiming Yang Siranush Sarkizova Marcin Tabaka Orr Ashenberg Yanay Rosen Michal Slyper Monika S Kowalczyk Alexandra-Chlo\u00e9 Villani etal 2019. Cumulus: a cloud-based data analysis framework for large-scale single-cell and single-nucleus RNA-seq. bioRxiv (2019) 823682.  Bo Li Joshua Gould Yiming Yang Siranush Sarkizova Marcin Tabaka Orr Ashenberg Yanay Rosen Michal Slyper Monika S Kowalczyk Alexandra-Chlo\u00e9 Villani et al. 2019. Cumulus: a cloud-based data analysis framework for large-scale single-cell and single-nucleus RNA-seq. bioRxiv (2019) 823682.","DOI":"10.1101\/823682"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2014-002974"},{"key":"e_1_3_2_1_38_1","volume-title":"2018 Institute of Industrial and Systems Engineers Annual Conference and Expo, IISE","author":"Miao Zhuqi","year":"2018","unstructured":"Zhuqi Miao , Shrieraam Sathyanarayanan , Elvena Fong , William Paiva , and Dursun Delen . 2018 . An assessment and cleaning framework for electronic health records data . In 2018 Institute of Industrial and Systems Engineers Annual Conference and Expo, IISE 2018. Zhuqi Miao, Shrieraam Sathyanarayanan, Elvena Fong, William Paiva, and Dursun Delen. 2018. An assessment and cleaning framework for electronic health records data. In 2018 Institute of Industrial and Systems Engineers Annual Conference and Expo, IISE 2018."},{"key":"e_1_3_2_1_39_1","volume-title":"Grappling with the Future Use of Big Data for Translational Medicine and Clinical Care. Yearbook of medical informatics 26, 01","author":"Murphy S","year":"2017","unstructured":"S Murphy , V Castro , and K Mandl . 2017. Grappling with the Future Use of Big Data for Translational Medicine and Clinical Care. Yearbook of medical informatics 26, 01 ( 2017 ), 96--102. S Murphy, V Castro, and K Mandl. 2017. Grappling with the Future Use of Big Data for Translational Medicine and Clinical Care. Yearbook of medical informatics 26, 01 (2017), 96--102."},{"key":"e_1_3_2_1_40_1","unstructured":"Shawn N Murphy Michael Mendis Kristel Hackett Rajesh Kuttan Wensong Pan Lori C Phillips Vivian Gainer David Berkowicz John P Glaser Isaac Kohane etal 2007. Architecture of the open-source clinical research chart from Informatics for Integrating Biology and the Bedside. In AMIA annual symposium proceedings Vol. 2007. American Medical Informatics Association 548.  Shawn N Murphy Michael Mendis Kristel Hackett Rajesh Kuttan Wensong Pan Lori C Phillips Vivian Gainer David Berkowicz John P Glaser Isaac Kohane et al. 2007. Architecture of the open-source clinical research chart from Informatics for Integrating Biology and the Bedside. In AMIA annual symposium proceedings Vol. 2007. American Medical Informatics Association 548."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1136\/jamia.2009.000893"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1001\/jama.298.18.2164"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2011-000538"},{"key":"e_1_3_2_1_44_1","volume-title":"A database of human exposomes and phenomes from the US National Health and Nutrition Examination Survey. Scientific data 3","author":"Patel Chirag J","year":"2016","unstructured":"Chirag J Patel , Nam Pho , Michael McDuffie , Jeremy Easton-Marks , Cartik Kothari , Isaac S Kohane , and Paul Avillach . 2016. A database of human exposomes and phenomes from the US National Health and Nutrition Examination Survey. Scientific data 3 ( 2016 ), 160096. Chirag J Patel, Nam Pho, Michael McDuffie, Jeremy Easton-Marks, Cartik Kothari, Isaac S Kohane, and Paul Avillach. 2016. A database of human exposomes and phenomes from the US National Health and Nutrition Examination Survey. Scientific data 3 (2016), 160096."},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.annepidem.2015.11.004"},{"key":"e_1_3_2_1_46_1","volume-title":"The digitization of the world: from edge to core","author":"Reinsel David","year":"2018","unstructured":"David Reinsel , John Gantz , and John Rydning . 2018. The digitization of the world: from edge to core . Framingham : International Data Corporation ( 2018 ). David Reinsel, John Gantz, and John Rydning. 2018. The digitization of the world: from edge to core. Framingham: International Data Corporation (2018)."},{"key":"e_1_3_2_1_47_1","volume-title":"Towards reproducibility in scientific workflows: An infrastructure-based approach. Scientific Programming 2015","author":"Santana-Perez Idafen","year":"2015","unstructured":"Idafen Santana-Perez and Mar\u00eda S P\u00e9rez-Hern\u00e1ndez . 2015. Towards reproducibility in scientific workflows: An infrastructure-based approach. Scientific Programming 2015 ( 2015 ). Idafen Santana-Perez and Mar\u00eda S P\u00e9rez-Hern\u00e1ndez. 2015. Towards reproducibility in scientific workflows: An infrastructure-based approach. Scientific Programming 2015 (2015)."},{"key":"e_1_3_2_1_48_1","first-page":"96","article-title":"tranSMART: an open source knowledge management and high content data analytics platform","volume":"2014","author":"Scheufele Elisabeth","year":"2014","unstructured":"Elisabeth Scheufele , Dina Aronzon , Robert Coopersmith , Michael T McDuffie , Manish Kapoor , Christopher A Uhrich , Jean E Avitabile , Jinlei Liu , Dan Housman , and Matvey B Palchuk . 2014 . tranSMART: an open source knowledge management and high content data analytics platform . AMIA Summits on Translational Science Proceedings 2014 (2014), 96 . Elisabeth Scheufele, Dina Aronzon, Robert Coopersmith, Michael T McDuffie, Manish Kapoor, Christopher A Uhrich, Jean E Avitabile, Jinlei Liu, Dan Housman, and Matvey B Palchuk. 2014. tranSMART: an open source knowledge management and high content data analytics platform. AMIA Summits on Translational Science Proceedings 2014 (2014), 96.","journal-title":"AMIA Summits on Translational Science Proceedings"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.compenvurbsys.2013.12.003"},{"key":"e_1_3_2_1_50_1","volume-title":"Opentsdb scalable time series database (tsdb). Stumble Upon","author":"Sigoure B","year":"2012","unstructured":"B Sigoure . 2012. Opentsdb scalable time series database (tsdb). Stumble Upon ( 2012 ). B Sigoure. 2012. Opentsdb scalable time series database (tsdb). Stumble Upon (2012)."},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1097\/MLR.0b013e318259c02b"},{"key":"e_1_3_2_1_52_1","volume-title":"Proceedings of the 7th symposium on Operating systems design and implementation. USENIX Association, 307--320","author":"Weil Sage A","year":"2006","unstructured":"Sage A Weil , Scott A Brandt , Ethan L Miller , Darrell DE Long , and Carlos Maltzahn . 2006 . Ceph: A scalable, high-performance distributed file system . In Proceedings of the 7th symposium on Operating systems design and implementation. USENIX Association, 307--320 . Sage A Weil, Scott A Brandt, Ethan L Miller, Darrell DE Long, and Carlos Maltzahn. 2006. Ceph: A scalable, high-performance distributed file system. In Proceedings of the 7th symposium on Operating systems design and implementation. USENIX Association, 307--320."},{"key":"e_1_3_2_1_53_1","first-page":"617","article-title":"Confidentiality in cyberspace: the HIPAA privacy rules and the common law","volume":"33","author":"Winn Peter A","year":"2001","unstructured":"Peter A Winn . 2001 . Confidentiality in cyberspace: the HIPAA privacy rules and the common law . Rutgers LJ 33 (2001), 617 . Peter A Winn. 2001. Confidentiality in cyberspace: the HIPAA privacy rules and the common law. Rutgers LJ 33 (2001), 617.","journal-title":"Rutgers LJ"},{"key":"e_1_3_2_1_54_1","volume-title":"WaveformECG: A Platform for Visualizing, Annotating, and Analyzing ECG Data. Computing in science & engineering 18, 5","author":"Winslow Raimond L","year":"2016","unstructured":"Raimond L Winslow , Stephen Granite , and Christian Jurado . 2016. WaveformECG: A Platform for Visualizing, Annotating, and Analyzing ECG Data. Computing in science & engineering 18, 5 ( 2016 ), 36. Raimond L Winslow, Stephen Granite, and Christian Jurado. 2016. WaveformECG: A Platform for Visualizing, Annotating, and Analyzing ECG Data. Computing in science & engineering 18, 5 (2016), 36."}],"event":{"name":"BCB '20: 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics","location":"Virtual Event USA","acronym":"BCB '20","sponsor":["SIGBio ACM Special Interest Group on Bioinformatics"]},"container-title":["Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3388440.3412460","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3388440.3412460","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:33:30Z","timestamp":1750199610000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3388440.3412460"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,21]]},"references-count":54,"alternative-id":["10.1145\/3388440.3412460","10.1145\/3388440"],"URL":"https:\/\/doi.org\/10.1145\/3388440.3412460","relation":{},"subject":[],"published":{"date-parts":[[2020,9,21]]},"assertion":[{"value":"2020-11-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}