{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T15:16:36Z","timestamp":1778166996614,"version":"3.51.4"},"reference-count":22,"publisher":"Association for Computing Machinery (ACM)","issue":"11","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2013,8,27]]},"abstract":"<jats:p>Facebook takes performance monitoring seriously. Performance issues can impact over one billion users so we track thousands of servers, hundreds of PB of daily network traffic, hundreds of daily code changes, and many other metrics. We require latencies of under a minute from events occuring (a client request on a phone, a bug report filed, a code change checked in) to graphs showing those events on developers' monitors.<\/jats:p>\n          <jats:p>Scuba is the data management system Facebook uses for most real-time analysis. Scuba is a fast, scalable, distributed, in-memory database built at Facebook. It currently ingests millions of rows (events) per second and expires data at the same rate. Scuba stores data completely in memory on hundreds of servers each with 144 GB RAM. To process each query, Scuba aggregates data from all servers. Scuba processes almost a million queries per day. Scuba is used extensively for interactive, ad hoc, analysis queries that run in under a second over live data. In addition, Scuba is the workhorse behind Facebook's code regression analysis, bug report monitoring, ads revenue monitoring, and performance debugging.<\/jats:p>","DOI":"10.14778\/2536222.2536231","type":"journal-article","created":{"date-parts":[[2014,6,24]],"date-time":"2014-06-24T12:17:57Z","timestamp":1403612277000},"page":"1057-1067","source":"Crossref","is-referenced-by-count":60,"title":["Scuba"],"prefix":"10.14778","volume":"6","author":[{"given":"Lior","family":"Abraham","sequence":"first","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"John","family":"Allen","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Oleksandr","family":"Barykin","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vinayak","family":"Borkar","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bhuwan","family":"Chopra","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ciprian","family":"Gerea","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Merl","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Josh","family":"Metzler","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Reiss","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Subbu","family":"Subramanian","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Janet L.","family":"Wiener","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Okay","family":"Zed","sequence":"additional","affiliation":[{"name":"Facebook, Inc. Menlo Park, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2013,8]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Cloudera Impala: Real-time queries in Apache Hadoop for real. http:\/\/blog.cloudera.com\/blog\/2012\/10\/cloudera-impala-real-time-queries-in-apache-hadoop-for-real\/."},{"key":"e_1_2_1_2_1","unstructured":"Druid. https:\/\/github.com\/metamx\/druid\/wiki."},{"key":"e_1_2_1_3_1","unstructured":"MRTG: Multi-router traffic grapher. http:\/\/oss.oetiker.ch\/mrtg\/."},{"key":"e_1_2_1_4_1","unstructured":"RRDTool. http:\/\/oss.oetiker.ch\/rrdtool\/."},{"key":"e_1_2_1_5_1","unstructured":"Scribe. https:\/\/github.com\/facebook\/scribe."},{"key":"e_1_2_1_6_1","unstructured":"Splunk. http:\/\/www.splunk.com."},{"key":"e_1_2_1_7_1","volume-title":"Facebook, 2007","author":"Agarwal Aditya","year":"2007","unstructured":"Aditya Agarwal, Mark Slee, and Marc Kwiatkowski. Thrift: Scalable cross-language services implementation. Technical report, Facebook, 2007. http:\/\/thrift.apache.org\/static\/files\/thrift-20070401.pdf."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1409360.1409380"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2213836.2213934"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/2350229.2350259"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2011.5767867"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920886"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2331042.2331056"},{"key":"e_1_2_1_14_1","volume-title":"Iterative MapReduce for Large Scale Machine Learning. Technical report, 03","author":"Rosen Joshua","year":"2013","unstructured":"Joshua Rosen, Neoklis Polyzotis, Vinayak Borkar, Yingyi Bu, Michael J. Carey, Markus Weimer, Tyson Condie, and Raghu Ramakrishnan. Iterative MapReduce for Large Scale Machine Learning. Technical report, 03 2013. http:\/\/arxiv.org\/abs\/1303.3517."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2213836.2213946"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/22952.22956"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/42186.42323"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/1083592.1083658"},{"key":"e_1_2_1_19_1","volume-title":"Disaggregation and next-generation systems design","author":"Taylor Jason","year":"2013","unstructured":"Jason Taylor. Disaggregation and next-generation systems design, 2013. http:\/\/www.opencompute.org\/ocp-summit-iv-agenda\/#keynote."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687553.1687609"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/362084.362137"},{"key":"e_1_2_1_22_1","volume-title":"UC Berkeley, 2012","author":"Xin Reynold","year":"2012","unstructured":"Reynold Xin, Josh Rosen, Matei Zaharia, Michael J. Franklin, Scott Shenker, and Ion Stoica. Shark: Sql and rich analytics at scale. Technical report, UC Berkeley, 2012. http:\/\/shark.cs.berkeley.edu\/presentations\/2012-11-26-shark-tech-report.pdf."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/2536222.2536231","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,23]],"date-time":"2024-10-23T22:35:26Z","timestamp":1729722926000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/2536222.2536231"}},"subtitle":["diving into data at facebook"],"short-title":[],"issued":{"date-parts":[[2013,8]]},"references-count":22,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2013,8,27]]}},"alternative-id":["10.14778\/2536222.2536231"],"URL":"https:\/\/doi.org\/10.14778\/2536222.2536231","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2013,8]]}}}