{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:27:12Z","timestamp":1750220832027,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,12,9]],"date-time":"2019-12-09T00:00:00Z","timestamp":1575849600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,12,9]]},"DOI":"10.1145\/3366623.3368140","type":"proceedings-article","created":{"date-parts":[[2019,11,18]],"date-time":"2019-11-18T13:26:41Z","timestamp":1574083601000},"page":"43-48","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Serverless Workflows for Indexing Large Scientific Data"],"prefix":"10.1145","author":[{"given":"Tyler J.","family":"Skluzacek","sequence":"first","affiliation":[{"name":"University of Chicago"}]},{"given":"Ryan","family":"Chard","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory"}]},{"given":"Ryan","family":"Wong","sequence":"additional","affiliation":[{"name":"University of Chicago"}]},{"given":"Zhuozhao","family":"Li","sequence":"additional","affiliation":[{"name":"University of Chicago"}]},{"given":"Yadu N.","family":"Babuji","sequence":"additional","affiliation":[{"name":"University of Chicago"}]},{"given":"Logan","family":"Ward","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory"}]},{"given":"Ben","family":"Blaiszik","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory"}]},{"given":"Kyle","family":"Chard","sequence":"additional","affiliation":[{"name":"University of Chicago"}]},{"given":"Ian","family":"Foster","sequence":"additional","affiliation":[{"name":"Argonne &amp; University of Chicago"}]}],"member":"320","published-online":{"date-parts":[[2019,12,9]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219104.3219127"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307681.3325400"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11837-016-2001-3"},{"key":"e_1_3_2_1_4_1","volume-title":"A Data Ecosystem to Support Machine Learning in Materials Science. (apr","author":"Blaiszik Ben","year":"2019","unstructured":"Ben Blaiszik , Logan Ward , Marcus Schwarting , Jonathon Gaff , Ryan Chard , Daniel Pike , Kyle Chard , and Ian Foster . 2019. A Data Ecosystem to Support Machine Learning in Materials Science. (apr 2019 ). arXiv:1904.10423 http:\/\/arxiv.org\/abs\/1904.10423 Ben Blaiszik, Logan Ward, Marcus Schwarting, Jonathon Gaff, Ryan Chard, Daniel Pike, Kyle Chard, and Ian Foster. 2019. A Data Ecosystem to Support Machine Learning in Materials Science. (apr 2019). arXiv:1904.10423 http:\/\/arxiv.org\/abs\/1904.10423"},{"key":"e_1_3_2_1_5_1","volume-title":"Serverless Supercomputing: High Performance Function as a Service for Science. arXiv preprint arXiv:1908.04907","author":"Chard Ryan","year":"2019","unstructured":"Ryan Chard , Tyler J Skluzacek , Zhuozhao Li , Yadu Babuji , Anna Woodard , Ben Blaiszik , Steven Tuecke , Ian Foster , and Kyle Chard . 2019 . Serverless Supercomputing: High Performance Function as a Service for Science. arXiv preprint arXiv:1908.04907 (2019). Ryan Chard, Tyler J Skluzacek, Zhuozhao Li, Yadu Babuji, Anna Woodard, Ben Blaiszik, Steven Tuecke, Ian Foster, and Kyle Chard. 2019. Serverless Supercomputing: High Performance Function as a Service for Science. arXiv preprint arXiv:1908.04907 (2019)."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Eric Deutsch Roger Kramer Joseph Ames Andrew Bauman David S Campbell Kyle Chard Kristi Clark Mike D'Arcy Ivo Dinov Rory Donovan etal 2018. BDQC: a general-purpose analytics tool for domain-blind validation of Big Data. bioRxiv (2018) 258822.  Eric Deutsch Roger Kramer Joseph Ames Andrew Bauman David S Campbell Kyle Chard Kristi Clark Mike D'Arcy Ivo Dinov Rory Donovan et al. 2018. BDQC: a general-purpose analytics tool for domain-blind validation of Big Data. bioRxiv (2018) 258822.","DOI":"10.1101\/258822"},{"key":"e_1_3_2_1_7_1","volume-title":"VizieR Online Data Catalog: MSX6C Infrared Point Source Catalog. The Midcourse Space Experiment Point Source Catalog Version 2.3 (October","author":"Egan MP","year":"2003","unstructured":"MP Egan , SD Price , KE Kraemer , DR Mizuno , SJ Carey , CO Wright , CW Engelke , M Cohen , and MG Gugliotti . 2003. VizieR Online Data Catalog: MSX6C Infrared Point Source Catalog. The Midcourse Space Experiment Point Source Catalog Version 2.3 (October 2003 ). VizieR Online Data Catalog 5114 (2003). MP Egan, SD Price, KE Kraemer, DR Mizuno, SJ Carey, CO Wright, CW Engelke, M Cohen, and MG Gugliotti. 2003. VizieR Online Data Catalog: MSX6C Infrared Point Source Catalog. The Midcourse Space Experiment Point Source Catalog Version 2.3 (October 2003). VizieR Online Data Catalog 5114 (2003)."},{"key":"e_1_3_2_1_8_1","unstructured":"Materials Data Facility. 2019. MaterialsIO. https:\/\/github.com\/materials-data-facility\/MaterialsIO.  Materials Data Facility. 2019. MaterialsIO. https:\/\/github.com\/materials-data-facility\/MaterialsIO."},{"key":"e_1_3_2_1_9_1","unstructured":"Environmental Systems Science Data Infrastructure for a Virtual Ecosystem. 2019. ESS-DIVE. https:\/\/ess- dive.lbl.gov\/.  Environmental Systems Science Data Infrastructure for a Virtual Ecosystem. 2019. ESS-DIVE. https:\/\/ess- dive.lbl.gov\/."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Gary King. 2007. An introduction to the dataverse network as an infrastructure for data sharing.  Gary King. 2007. An introduction to the dataverse network as an infrastructure for data sharing.","DOI":"10.1177\/0049124107306660"},{"volume-title":"Tika in action","author":"Mattmann Chris","key":"e_1_3_2_1_11_1","unstructured":"Chris Mattmann and Jukka Zitting . 2011. Tika in action . Manning Publications Co. Chris Mattmann and Jukka Zitting. 2011. Tika in action. Manning Publications Co."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1045\/january2011-michener"},{"key":"e_1_3_2_1_13_1","volume-title":"Brown Dog: Leveraging everything towards autocuration. In 2015 IEEE Int'l Conference on Big Data (Big Data)","author":"Padhy Smruti","year":"2015","unstructured":"Smruti Padhy , Greg Jansen , Jay Alameda , Edgar Black , Liana Diesendruck , Mike Dietze , Praveen Kumar , Rob Kooper , Jong Lee , Rui Liu , 2015 . Brown Dog: Leveraging everything towards autocuration. In 2015 IEEE Int'l Conference on Big Data (Big Data) . IEEE , 493--500. Smruti Padhy, Greg Jansen, Jay Alameda, Edgar Black, Liana Diesendruck, Mike Dietze, Praveen Kumar, Rob Kooper, Jong Lee, Rui Liu, et al. 2015. Brown Dog: Leveraging everything towards autocuration. In 2015 IEEE Int'l Conference on Big Data (Big Data). IEEE, 493--500."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.5555\/1855046"},{"volume-title":"ScienceSearch: Enabling search through automatic metadata generation. In 2018 IEEE 14th Int'l Conference on e-Science (e-Science)","author":"Rodrigo Gonzalo P","key":"e_1_3_2_1_15_1","unstructured":"Gonzalo P Rodrigo , Matt Henderson , Gunther H Weber , Colin Ophus , Katie Antypas , and Lavanya Ramakrishnan . 2018. ScienceSearch: Enabling search through automatic metadata generation. In 2018 IEEE 14th Int'l Conference on e-Science (e-Science) . IEEE , 93--104. Gonzalo P Rodrigo, Matt Henderson, Gunther H Weber, Colin Ophus, Katie Antypas, and Lavanya Ramakrishnan. 2018. ScienceSearch: Enabling search through automatic metadata generation. In 2018 IEEE 14th Int'l Conference on e-Science (e-Science). IEEE, 93--104."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366624.3368170"},{"volume-title":"Klimatic: a virtual data lake for harvesting and distribution of geospatial data. In 2016 1st Joint Int'l Workshop on Parallel Data Storage and data Intensive Scalable Computing Systems (PDSW-DISCS)","author":"Skluzacek Tyler J","key":"e_1_3_2_1_17_1","unstructured":"Tyler J Skluzacek , Kyle Chard , and Ian Foster . 2016. Klimatic: a virtual data lake for harvesting and distribution of geospatial data. In 2016 1st Joint Int'l Workshop on Parallel Data Storage and data Intensive Scalable Computing Systems (PDSW-DISCS) . IEEE , 31--36. Tyler J Skluzacek, Kyle Chard, and Ian Foster. 2016. Klimatic: a virtual data lake for harvesting and distribution of geospatial data. In 2016 1st Joint Int'l Workshop on Parallel Data Storage and data Intensive Scalable Computing Systems (PDSW-DISCS). IEEE, 31--36."},{"key":"e_1_3_2_1_18_1","volume-title":"Skluma: An extensible metadata extraction pipeline for disorganized data. In 2018 IEEE 14th Int'l Conference on e-Science (e-Science)","author":"Skluzacek Tyler J","year":"2018","unstructured":"Tyler J Skluzacek , Rohan Kumar , Ryan Chard , Galen Harrison , Paul Beckman , Kyle Chard , and Ian Foster . 2018 . Skluma: An extensible metadata extraction pipeline for disorganized data. In 2018 IEEE 14th Int'l Conference on e-Science (e-Science) . IEEE , 256--266. Tyler J Skluzacek, Rohan Kumar, Ryan Chard, Galen Harrison, Paul Beckman, Kyle Chard, and Ian Foster. 2018. Skluma: An extensible metadata extraction pipeline for disorganized data. In 2018 IEEE 14th Int'l Conference on e-Science (e-Science). IEEE, 256--266."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2792745.2792774"},{"key":"e_1_3_2_1_20_1","volume-title":"Data Wrangling: The Challenging Yourney from the Wild to the Lake.. In CIDR.","author":"Terrizzano Ignacio G","year":"2015","unstructured":"Ignacio G Terrizzano , Peter M Schwarz , Mary Roth , and John E Colino . 2015 . Data Wrangling: The Challenging Yourney from the Wild to the Lake.. In CIDR. Ignacio G Terrizzano, Peter M Schwarz, Mary Roth, and John E Colino. 2015. Data Wrangling: The Challenging Yourney from the Wild to the Lake.. In CIDR."},{"key":"e_1_3_2_1_21_1","volume-title":"Globus Auth: A research identity and access management platform. In 2016 IEEE 12th Int'l Conference on e-Science (e-Science)","author":"Tuecke Steven","year":"2016","unstructured":"Steven Tuecke , Rachana Ananthakrishnan , Kyle Chard , Mattias Lidman , Brendan McCollam , Stephen Rosen , and Ian Foster . 2016 . Globus Auth: A research identity and access management platform. In 2016 IEEE 12th Int'l Conference on e-Science (e-Science) . IEEE , 203--212. Steven Tuecke, Rachana Ananthakrishnan, Kyle Chard, Mattias Lidman, Brendan McCollam, Stephen Rosen, and Ian Foster. 2016. Globus Auth: A research identity and access management platform. In 2016 IEEE 12th Int'l Conference on e-Science (e-Science). IEEE, 203--212."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkt1229"},{"volume-title":"Big Data Remote Access Interfaces for Light Source Science. In 2nd IEEE\/ACM International Symposium on Big Data Computing(BDC). 51--60","author":"Wozniak J. M.","key":"e_1_3_2_1_23_1","unstructured":"J. M. Wozniak , K. Chard , B. Blaiszik , R. Osborn , M. Wilde , and I. Foster . 2015 . Big Data Remote Access Interfaces for Light Source Science. In 2nd IEEE\/ACM International Symposium on Big Data Computing(BDC). 51--60 . J. M. Wozniak, K. Chard, B. Blaiszik, R. Osborn, M. Wilde, and I. Foster. 2015. Big Data Remote Access Interfaces for Light Source Science. In 2nd IEEE\/ACM International Symposium on Big Data Computing(BDC). 51--60."}],"event":{"name":"Middleware '19: 20th International Middleware Conference","sponsor":["ACM Association for Computing Machinery","IFIP"],"location":"Davis CA USA","acronym":"Middleware '19"},"container-title":["Proceedings of the 5th International Workshop on Serverless Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366623.3368140","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366623.3368140","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:13:33Z","timestamp":1750202013000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366623.3368140"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,9]]},"references-count":23,"alternative-id":["10.1145\/3366623.3368140","10.1145\/3366623"],"URL":"https:\/\/doi.org\/10.1145\/3366623.3368140","relation":{},"subject":[],"published":{"date-parts":[[2019,12,9]]},"assertion":[{"value":"2019-12-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}