{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,12]],"date-time":"2026-04-12T01:33:16Z","timestamp":1775957596765,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":13,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,12,9]],"date-time":"2019-12-09T00:00:00Z","timestamp":1575849600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,12,9]]},"DOI":"10.1145\/3366624.3368170","type":"proceedings-article","created":{"date-parts":[[2019,11,27]],"date-time":"2019-11-27T13:23:09Z","timestamp":1574860989000},"page":"51-53","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Dredging a data lake"],"prefix":"10.1145","author":[{"given":"Tyler J.","family":"Skluzacek","sequence":"first","affiliation":[{"name":"University of Chicago"}]}],"member":"320","published-online":{"date-parts":[[2019,12,9]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307681.3325400"},{"key":"e_1_3_2_1_2_1","volume-title":"A Data Ecosystem to Support Machine Learning in Materials Science. (apr","author":"Blaiszik Ben","year":"2019","unstructured":"Ben Blaiszik , Logan Ward , Marcus Schwarting , Jonathon Gaff , Ryan Chard , Daniel Pike , Kyle Chard , and Ian Foster . 2019. A Data Ecosystem to Support Machine Learning in Materials Science. (apr 2019 ). arXiv:1904.10423 Ben Blaiszik, Logan Ward, Marcus Schwarting, Jonathon Gaff, Ryan Chard, Daniel Pike, Kyle Chard, and Ian Foster. 2019. A Data Ecosystem to Support Machine Learning in Materials Science. (apr 2019). arXiv:1904.10423"},{"key":"e_1_3_2_1_3_1","volume-title":"Serverless Super-computing: High Performance Function as a Service for Science. arXiv preprint arXiv:1908.04907","author":"Chard Ryan","year":"2019","unstructured":"Ryan Chard , Tyler J Skluzacek , Zhuozhao Li , Yadu Babuji , Anna Woodard , Ben Blaiszik , Steven Tuecke , Ian Foster , and Kyle Chard . 2019 . Serverless Super-computing: High Performance Function as a Service for Science. arXiv preprint arXiv:1908.04907 (2019). Ryan Chard, Tyler J Skluzacek, Zhuozhao Li, Yadu Babuji, Anna Woodard, Ben Blaiszik, Steven Tuecke, Ian Foster, and Kyle Chard. 2019. Serverless Super-computing: High Performance Function as a Service for Science. arXiv preprint arXiv:1908.04907 (2019)."},{"key":"e_1_3_2_1_4_1","volume-title":"VizieR Online Data Catalog: MSX6C Infrared Point Source Catalog. The Midcourse Space Experiment Point Source Catalog Version 2.3 (October","author":"Egan MP","year":"2003","unstructured":"MP Egan , SD Price , KE Kraemer , DR Mizuno , SJ Carey , CO Wright , CW Engelke , M Cohen , and MG Gugliotti . 2003. VizieR Online Data Catalog: MSX6C Infrared Point Source Catalog. The Midcourse Space Experiment Point Source Catalog Version 2.3 (October 2003 ). VizieR Online Data Catalog 5114 (2003). MP Egan, SD Price, KE Kraemer, DR Mizuno, SJ Carey, CO Wright, CW Engelke, M Cohen, and MG Gugliotti. 2003. VizieR Online Data Catalog: MSX6C Infrared Point Source Catalog. The Midcourse Space Experiment Point Source Catalog Version 2.3 (October 2003). VizieR Online Data Catalog 5114 (2003)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Gary King. 2007. An introduction to the dataverse network as an infrastructure for data sharing.  Gary King. 2007. An introduction to the dataverse network as an infrastructure for data sharing.","DOI":"10.1177\/0049124107306660"},{"key":"e_1_3_2_1_6_1","volume-title":"Tika in action","author":"Mattmann Chris","unstructured":"Chris Mattmann and Jukka Zitting . 2011. Tika in action . Manning Publications . Chris Mattmann and Jukka Zitting. 2011. Tika in action. Manning Publications."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigData.2015.7363791"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/eScience.2018.00025"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/PDSW-DISCS.2016.010"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366623.3368140"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/eScience.2018.00040"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2792745.2792774"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/eScience.2016.7870901"}],"event":{"name":"Middleware '19: 20th International Middleware Conference","location":"Davis California","acronym":"Middleware '19","sponsor":["ACM Association for Computing Machinery","USENIX Assoc USENIX Assoc","IFIP"]},"container-title":["Proceedings of the 20th International Middleware Conference Doctoral Symposium"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366624.3368170","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366624.3368170","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:13:33Z","timestamp":1750202013000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366624.3368170"}},"subtitle":["decentralized metadata extraction"],"short-title":[],"issued":{"date-parts":[[2019,12,9]]},"references-count":13,"alternative-id":["10.1145\/3366624.3368170","10.1145\/3366624"],"URL":"https:\/\/doi.org\/10.1145\/3366624.3368170","relation":{},"subject":[],"published":{"date-parts":[[2019,12,9]]},"assertion":[{"value":"2019-12-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}