{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T07:36:57Z","timestamp":1758267417141},"reference-count":12,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2017,8]]},"abstract":"<jats:p>Tree-structured data formats, such as JSON and Protocol Buffers, are capable of expressing sophisticated data types, including nested, repeated, and missing values. While such expressing power contributes to their popularity in real-world applications, it presents a significant challenge for systems supporting tree-structured data. Existing systems have focused on general-purpose solutions either extending RDBMSs or designing native systems. However, the general-purpose approach often results in sophisticated data structures and algorithms, which may not reflect and optimize for the actual structure patterns in the real world.<\/jats:p>\n          <jats:p>In this demonstration, we showcase Steed, an analytical database &lt;u&gt;Sy&lt;\/u&gt;stem for &lt;u&gt;t&lt;\/u&gt;r&lt;u&gt;ee&lt;\/u&gt;-structured &lt;u&gt;d&lt;\/u&gt;ata. We use the insights gained by analyzing representative real-world tree structured data as guidelines in the design of Steed. Steed learns and extracts a schema tree for a data set and uses the schema tree to reduce the storage space and improve the efficiency of data field accesses. We observe that sub-structures in real world data are often simple, while the tree-structured data types can support very sophisticated structures. We optimize the storage structure, the column assembling algorithm, and the in-memory layout for the simple sub-structures (a.k.a. simple paths). Compared to representative state-of-the-art systems (i.e. PostgreSQL\/JSON, MongoDB, and Hive+Parquet), Steed achieves orders of magnitude better performance for data analysis queries.<\/jats:p>","DOI":"10.14778\/3137765.3137803","type":"journal-article","created":{"date-parts":[[2017,9,7]],"date-time":"2017-09-07T13:35:53Z","timestamp":1504791353000},"page":"1897-1900","source":"Crossref","is-referenced-by-count":3,"title":["STEED"],"prefix":"10.14778","volume":"10","author":[{"given":"Zhiyi","family":"Wang","sequence":"first","affiliation":[{"name":"Chinese Academy of Sciences"}]},{"given":"Dongyan","family":"Zhou","sequence":"additional","affiliation":[{"name":"Chinese Academy of Sciences"}]},{"given":"Shimin","family":"Chen","sequence":"additional","affiliation":[{"name":"Chinese Academy of Sciences"}]}],"member":"320","published-online":{"date-parts":[[2017,8]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"PostgreSQL's JSON Support. https:\/\/www.postgresql.org\/docs\/9.4\/static\/datatype-json.html.  PostgreSQL's JSON Support. https:\/\/www.postgresql.org\/docs\/9.4\/static\/datatype-json.html."},{"key":"e_1_2_1_2_1","volume-title":"Morgan Kaufmann","author":"Abiteboul S.","year":"1999","unstructured":"S. Abiteboul , P. Buneman , and D. Suciu . Data on the Web: From Relations to Semistructured Data and XML . Morgan Kaufmann , 1999 . S. Abiteboul, P. Buneman, and D. Suciu. Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann, 1999."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10619-011-7082-y"},{"key":"e_1_2_1_4_1","volume-title":"WebDB","author":"Chasseur C.","year":"2013","unstructured":"C. Chasseur , Y. Li , and J. M. Patel . Enabling JSON document stores in relational systems . In WebDB , 2013 . Extented version: http:\/\/pages.cs.wisc.edu\/~chasseur\/argo-long.pdf. C. Chasseur, Y. Li, and J. M. Patel. Enabling JSON document stores in relational systems. In WebDB, 2013. Extented version: http:\/\/pages.cs.wisc.edu\/~chasseur\/argo-long.pdf."},{"key":"e_1_2_1_5_1","first-page":"43","volume-title":"WebDB","author":"Choi B.","year":"2002","unstructured":"B. Choi . What are real dtds like ? In WebDB , pages 43 -- 48 , 2002 . B. Choi. What are real dtds like? In WebDB, pages 43--48, 2002."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1031570.1031588"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/69.273032"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2595628"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2903731"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920886"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2612183"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3035956"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3137765.3137803","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T10:04:42Z","timestamp":1672221882000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3137765.3137803"}},"subtitle":["an analytical database &lt;u&gt;s&lt;\/u&gt;ystem for &lt;u&gt;t&lt;\/u&gt;r&lt;u&gt;ee&lt;\/u&gt;-structured &lt;u&gt;d&lt;\/u&gt;ata"],"short-title":[],"issued":{"date-parts":[[2017,8]]},"references-count":12,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2017,8]]}},"alternative-id":["10.14778\/3137765.3137803"],"URL":"https:\/\/doi.org\/10.14778\/3137765.3137803","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2017,8]]}}}