{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T10:45:13Z","timestamp":1771584313324,"version":"3.50.1"},"reference-count":26,"publisher":"Association for Computing Machinery (ACM)","issue":"1-2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2010,9]]},"abstract":"<jats:p>A Cloud may be seen as a type of flexible computing infrastructure consisting of many compute nodes, where resizable computing capacities can be provided to different customers. To fully harness the power of the Cloud, efficient data management is needed to handle huge volumes of data and support a large number of concurrent end users. To achieve that, a scalable and high-throughput indexing scheme is generally required. Such an indexing scheme must not only incur a low maintenance cost but also support parallel search to improve scalability. In this paper, we present a novel, scalable B<jats:sup>+<\/jats:sup>-tree based indexing scheme for efficient data processing in the Cloud. Our approach can be summarized as follows. First, we build a local B<jats:sup>+<\/jats:sup>-tree index for each compute node which only indexes data residing on the node. Second, we organize the compute nodes as a structured overlay and publish a portion of the local B<jats:sup>+<\/jats:sup>-tree nodes to the overlay for efficient query processing. Finally, we propose an adaptive algorithm to select the published B<jats:sup>+<\/jats:sup>-tree nodes according to query patterns. 
We conduct extensive experiments on Amazon's EC2, and the results demonstrate that our indexing scheme is dynamic, efficient and scalable.<\/jats:p>","DOI":"10.14778\/1920841.1920991","type":"journal-article","created":{"date-parts":[[2014,6,24]],"date-time":"2014-06-24T12:17:57Z","timestamp":1403612277000},"page":"1207-1218","source":"Crossref","is-referenced-by-count":113,"title":["Efficient B-tree based indexing for cloud data processing"],"prefix":"10.14778","volume":"3","author":[{"given":"Sai","family":"Wu","sequence":"first","affiliation":[{"name":"National University of Singapore, Singapore"}]},{"given":"Dawei","family":"Jiang","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore"}]},{"given":"Beng Chin","family":"Ooi","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore"}]},{"given":"Kun-Lung","family":"Wu","sequence":"additional","affiliation":[{"name":"IBM T. J. Watson Research Center"}]}],"member":"320","published-online":{"date-parts":[[2010,9]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"http:\/\/aws.amazon.com\/ec2\/."},{"key":"e_1_2_1_2_1","unstructured":"http:\/\/aws.amazon.com\/s3\/."},{"key":"e_1_2_1_3_1","unstructured":"http:\/\/hadoop.apache.org."},{"key":"e_1_2_1_4_1","unstructured":"http:\/\/hypertable.org."},{"key":"e_1_2_1_5_1","unstructured":"http:\/\/incubator.apache.org\/cassandra\/."},{"key":"e_1_2_1_6_1","unstructured":"http:\/\/project-voldemort.com\/."},{"key":"e_1_2_1_7_1","unstructured":"
http:\/\/www.comp.nus.edu.sg\/~epic."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/945721.945729"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/1060289.1060291"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/1453856.1453922"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1294261.1294278"},{"key":"e_1_2_1_12_1","volume-title":"Rackspace cloud servers versus amazon ec2: Performance analysis","year":"2010","unstructured":"bitsource.com. Rackspace cloud servers versus amazon ec2: Performance analysis. 2010."},{"key":"e_1_2_1_13_1","volume-title":"OSDI","author":"Chang F.","year":"2006","unstructured":"F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: A distributed storage system for structured data. OSDI, 2006."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1247480.1247507"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1294261.1294281"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/129888.129894"},{"key":"e_1_2_1_17_1","first-page":"151","volume-title":"USENIX","author":"Gabber E.","year":"2003","unstructured":"E. Gabber, J. Fellin, M. Flaster, F. Gu, B. Hillyer, W. T. Ng, B. \u00d6zden, and E. A. M. Shriver. Starfish: highly-available block storage. 
In USENIX, pages 151--163, 2003."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/945445.945450"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1142473.1142475"},{"key":"e_1_2_1_20_1","volume-title":"VLDB","author":"Jagadish H. V.","year":"2005","unstructured":"H. V. Jagadish, B. C. Ooi, and Q. H. Vu. Baton: A balanced tree structure for peer-to-peer networks. In VLDB, 2005."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/378993.379239"},{"key":"e_1_2_1_22_1","volume-title":"USA","author":"Parzen E.","year":"1999","unstructured":"E. Parzen. Stochastic processes. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 1999."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/383059.383072"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/1833515.1833691"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807232"},{"key":"e_1_2_1_26_1","volume-title":"Ceph: A scalable, high-performance distributed file system","author":"Weil S.","year":"2006","unstructured":"S. Weil, S. A. Brandt, E. L. Miller, D. D. E. Long, and C. Maltzahn. Ceph: A scalable, high-performance distributed file system. 
2006."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/1920841.1920991","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T11:51:54Z","timestamp":1672228314000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/1920841.1920991"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,9]]},"references-count":26,"journal-issue":{"issue":"1-2","published-print":{"date-parts":[[2010,9]]}},"alternative-id":["10.14778\/1920841.1920991"],"URL":"https:\/\/doi.org\/10.14778\/1920841.1920991","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2010,9]]}}}