{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,2]],"date-time":"2025-07-02T20:11:41Z","timestamp":1751487101221,"version":"3.41.0"},"reference-count":26,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2009,1,1]],"date-time":"2009-01-01T00:00:00Z","timestamp":1230768000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGOPS Oper. Syst. Rev."],"published-print":{"date-parts":[[2009,1]]},"abstract":"<jats:p>Structured serial data is used in many scientific fields; such data sets consist of a series of records, and are typically written once, read many times, chronologically ordered, and read sequentially. In this paper we introduce DataSeries, an on-disk format, run-time library and set of tools for storing and analyzing structured serial data. We identify six key properties of a system to store and analyze this type of data, and describe how DataSeries was designed to provide these properties. We quantify the benefits of DataSeries through several experiments. In particular, we demonstrate that DataSeries exceeds the performance of common trace formats by at least a factor of two.<\/jats:p>","DOI":"10.1145\/1496909.1496923","type":"journal-article","created":{"date-parts":[[2009,1,29]],"date-time":"2009-01-29T13:48:36Z","timestamp":1233236916000},"page":"70-75","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["DataSeries"],"prefix":"10.1145","volume":"43","author":[{"given":"Eric","family":"Anderson","sequence":"first","affiliation":[{"name":"HP Labs, Palo Alto, CA"}]},{"given":"Martin","family":"Arlitt","sequence":"additional","affiliation":[{"name":"HP Labs, Palo Alto, CA"}]},{"suffix":"III","given":"Charles B.","family":"Morrey","sequence":"additional","affiliation":[{"name":"HP Labs, Palo Alto, CA"}]},{"given":"Alistair","family":"Veitch","sequence":"additional","affiliation":[{"name":"HP Labs, Palo Alto, CA"}]}],"member":"320","published-online":{"date-parts":[[2009,1]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/301816.301823"},{"volume-title":"http:\/\/www.bzip.org\/, accessed","year":"2007","key":"e_1_2_1_2_1","unstructured":"bzip2 compression library , http:\/\/www.bzip.org\/, accessed September 2007 . bzip2 compression library, http:\/\/www.bzip.org\/, accessed September 2007."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1064212.1064230"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1006\/jnca.2000.0110"},{"key":"e_1_2_1_5_1","unstructured":"http:\/\/tesla.hpl.hp.com\/opensource\/DataSeries-tr-snapshot.pdf.  http:\/\/tesla.hpl.hp.com\/opensource\/DataSeries-tr-snapshot.pdf."},{"key":"e_1_2_1_6_1","first-page":"203","volume-title":"Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST 2003","author":"Ellard D.","year":"2003","unstructured":"D. Ellard , J. Ledlie , P. Malkani , and M. Seltzer . Passive NFS tracing of email and research workloads . In Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST 2003 ), pages 203 -- 216 , San Francisco, CA , 2003 . USENIX. D. Ellard, J. Ledlie, P. Malkani, and M. Seltzer. Passive NFS tracing of email and research workloads. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST 2003), pages 203--216, San Francisco, CA, 2003. USENIX."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/DSN.2005.32"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/152610.152611"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009726021843"},{"volume-title":"http:\/\/www.gzip.org\/, accessed","year":"2007","key":"e_1_2_1_10_1","unstructured":"gzip compression library , http:\/\/www.gzip.org\/, accessed September 2007 . gzip compression library, http:\/\/www.gzip.org\/, accessed September 2007."},{"key":"e_1_2_1_11_1","first-page":"1239","volume-title":"VLDB","author":"Hoke E.","year":"2006","unstructured":"E. Hoke , J. Sun , and C. Faloutsos . Intemon: Intelligent system monitoring on large clusters . In VLDB , pages 1239 -- 1242 , September 2006 . E. Hoke, J. Sun, and C. Faloutsos. Intemon: Intelligent system monitoring on large clusters. In VLDB, pages 1239--1242, September 2006."},{"volume-title":"accesed","year":"2008","key":"e_1_2_1_12_1","unstructured":"http:\/\/iotta.snia.org\/ , accesed July 2008 . http:\/\/iotta.snia.org\/, accesed July 2008."},{"volume-title":"accessed","year":"2008","key":"e_1_2_1_13_1","unstructured":"http:\/\/ita.ee.lbl.gov\/html\/contrib\/WorldCup.html , accessed July 2008 . http:\/\/ita.ee.lbl.gov\/html\/contrib\/WorldCup.html, accessed July 2008."},{"key":"e_1_2_1_14_1","first-page":"253","volume-title":"USENIX Technical Conference","author":"Ji M.","year":"2003","unstructured":"M. Ji , A. Veitch , and J. Wilkes . Seneca: remote mirroring done write . In USENIX Technical Conference , pages 253 -- 268 , June 2003 . M. Ji, A. Veitch, and J. Wilkes. Seneca: remote mirroring done write. In USENIX Technical Conference, pages 253--268, June 2003."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/90.282603"},{"volume-title":"accessed","year":"2007","key":"e_1_2_1_16_1","unstructured":"http:\/\/www.tcpdump.org\/ , accessed September 2007 . http:\/\/www.tcpdump.org\/, accessed September 2007."},{"volume-title":"http:\/\/www.goof.com\/pcg\/marc\/liblzf.html, accessed","year":"2007","key":"e_1_2_1_17_1","unstructured":"lzf compression library , http:\/\/www.goof.com\/pcg\/marc\/liblzf.html, accessed September 2007 . lzf compression library, http:\/\/www.goof.com\/pcg\/marc\/liblzf.html, accessed September 2007."},{"volume-title":"http:\/\/www.oberhumer.com\/opensource\/lzo\/, accessed","year":"2007","key":"e_1_2_1_18_1","unstructured":"lzo compression library , http:\/\/www.oberhumer.com\/opensource\/lzo\/, accessed September 2007 . lzo compression library, http:\/\/www.oberhumer.com\/opensource\/lzo\/, accessed September 2007."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1140277.1140303"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/174267.174268"},{"key":"e_1_2_1_21_1","first-page":"17","volume-title":"FAST","author":"Pinheiro E.","year":"2007","unstructured":"E. Pinheiro , W.-D. Weber , and L. A. Barroso . Failure trends in a large disk drive population . In FAST , pages 17 -- 28 , February 2007 . E. Pinheiro, W.-D. Weber, and L. A. Barroso. Failure trends in a large disk drive population. In FAST, pages 17--28, February 2007."},{"key":"e_1_2_1_22_1","volume-title":"http:\/\/www.platform.com\/Products\/Platform. LSF.Family\/, accessed","author":"Sharing Facility Platform Load","year":"2007","unstructured":"Platform Load Sharing Facility , http:\/\/www.platform.com\/Products\/Platform. LSF.Family\/, accessed September 2007 . Platform Load Sharing Facility, http:\/\/www.platform.com\/Products\/Platform. LSF.Family\/, accessed September 2007."},{"key":"e_1_2_1_23_1","first-page":"1","volume-title":"FAST","author":"Schroeder B.","year":"2007","unstructured":"B. Schroeder and G. A. Gibson . Disk failures in the real world: what does an MTTF of 1,000,000 hours mean to you . In FAST , pages 1 -- 16 , February 2007 . B. Schroeder and G. A. Gibson. Disk failures in the real world: what does an MTTF of 1,000,000 hours mean to you. In FAST, pages 1--16, February 2007."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/378420.378784"},{"key":"e_1_2_1_25_1","unstructured":"http:\/\/tesla.hpl.hp.com\/public software\/; SRT and trace data sections; accessed September 2007.  http:\/\/tesla.hpl.hp.com\/public software\/; SRT and trace data sections; accessed September 2007."},{"key":"e_1_2_1_26_1","first-page":"89","volume-title":"Conference on File and Storage Technologies (FAST)","author":"Uysal M.","year":"2003","unstructured":"M. Uysal , A. Merchant , and G. Alvarez . Using MEMS-based storage in disk arrays . In Conference on File and Storage Technologies (FAST) , pages 89 -- 102 , April 2003 . M. Uysal, A. Merchant, and G. Alvarez. Using MEMS-based storage in disk arrays. In Conference on File and Storage Technologies (FAST), pages 89--102, April 2003."}],"container-title":["ACM SIGOPS Operating Systems Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1496909.1496923","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1496909.1496923","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T14:47:31Z","timestamp":1750258051000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1496909.1496923"}},"subtitle":["an efficient, flexible data format for structured serial data"],"short-title":[],"issued":{"date-parts":[[2009,1]]},"references-count":26,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2009,1]]}},"alternative-id":["10.1145\/1496909.1496923"],"URL":"https:\/\/doi.org\/10.1145\/1496909.1496923","relation":{},"ISSN":["0163-5980"],"issn-type":[{"type":"print","value":"0163-5980"}],"subject":[],"published":{"date-parts":[[2009,1]]},"assertion":[{"value":"2009-01-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}