{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,11,30]],"date-time":"2023-11-30T02:52:33Z","timestamp":1701312753290},"reference-count":11,"publisher":"Association for Computing Machinery (ACM)","issue":"13","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2014,8]]},"abstract":"<jats:p>There is a growing interest in making relational DBMSs work synergistically with MapReduce systems. However, there are interesting technical challenges associated with figuring out the right balance between the use and co-deployment of these systems. This paper focuses on one specific aspect of this balance, namely how to leverage the superior indexing and query processing power of a relational DBMS for data that is often more cost-effectively stored in Hadoop\/HDFS. We present a method to use conventional B+-tree indices in an RDBMS for data stored in HDFS and demonstrate that our approach is especially effective for highly selective queries.<\/jats:p>","DOI":"10.14778\/2733004.2733023","type":"journal-article","created":{"date-parts":[[2015,5,12]],"date-time":"2015-05-12T15:37:52Z","timestamp":1431445072000},"page":"1520-1528","source":"Crossref","is-referenced-by-count":16,"title":["Indexing HDFS data in PDW"],"prefix":"10.14778","volume":"7","author":[{"given":"Vinitha Reddy","family":"Gankidi","sequence":"first","affiliation":[{"name":"University of Wisconsin-Madison"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nikhil","family":"Teletia","sequence":"additional","affiliation":[{"name":"Microsoft Jim Gray Systems Lab"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jignesh M.","family":"Patel","sequence":"additional","affiliation":[{"name":"University of Wisconsin-Madison"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alan","family":"Halverson","sequence":"additional","affiliation":[{"name":"Microsoft Jim Gray Systems Lab"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David J.","family":"DeWitt","sequence":"additional","affiliation":[{"name":"Microsoft Jim Gray Systems Lab"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2014,8]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2463709"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989447"},{"key":"e_1_2_1_3_1","volume-title":"Sanjay Ghemawat: MapReduce: Simplified Data Processing on Large Clusters. OSDI 2004: 137--150","author":"Dean Jeffrey","unstructured":"Jeffrey Dean , Sanjay Ghemawat: MapReduce: Simplified Data Processing on Large Clusters. OSDI 2004: 137--150 Jeffrey Dean, Sanjay Ghemawat: MapReduce: Simplified Data Processing on Large Clusters. OSDI 2004: 137--150"},{"key":"e_1_2_1_4_1","volume-title":"June","author":"Oracle White","year":"2012","unstructured":"Oracle White paper. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database , June 2012 . http:\/\/www.oracle.com\/technetwork\/bdc\/hadoop-loader\/connectors-hdfs-wp-1674035.pdf Oracle White paper. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database, June 2012. http:\/\/www.oracle.com\/technetwork\/bdc\/hadoop-loader\/connectors-hdfs-wp-1674035.pdf"},{"key":"e_1_2_1_5_1","unstructured":"http:\/\/www.greenplum.com\/sites\/default\/files\/EMC_Greenplum_Hadoop_DB_TB_0.pdf  http:\/\/www.greenplum.com\/sites\/default\/files\/EMC_Greenplum_Hadoop_DB_TB_0.pdf"},{"key":"e_1_2_1_6_1","unstructured":"Aster SQL-H: http:\/\/www.asterdata.com\/sqlh\/  Aster SQL-H: http:\/\/www.asterdata.com\/sqlh\/"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2452376.2452388"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920908"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.14778\/2350229.2350272"},{"key":"e_1_2_1_10_1","unstructured":"http:\/\/www.greenplum.com\/sites\/default\/files\/EMC_Greenplum_Hadoop_DB_TB_0.pdf  http:\/\/www.greenplum.com\/sites\/default\/files\/EMC_Greenplum_Hadoop_DB_TB_0.pdf"},{"key":"e_1_2_1_11_1","unstructured":"http:\/\/www.asterdata.com\/sqlh\/  http:\/\/www.asterdata.com\/sqlh\/"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/2733004.2733023","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T09:41:35Z","timestamp":1672220495000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/2733004.2733023"}},"subtitle":["splitting the data from the index"],"short-title":[],"issued":{"date-parts":[[2014,8]]},"references-count":11,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2014,8]]}},"alternative-id":["10.14778\/2733004.2733023"],"URL":"https:\/\/doi.org\/10.14778\/2733004.2733023","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2014,8]]}}}