{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,22]],"date-time":"2026-03-22T15:56:05Z","timestamp":1774194965673,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2014,4,7]],"date-time":"2014-04-07T00:00:00Z","timestamp":1396828800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000145","name":"Division of Information and Intelligent Systems","doi-asserted-by":"publisher","award":["IIS-1144034 and IIS-1218043"],"award-info":[{"award-number":["IIS-1144034 and IIS-1218043"]}],"id":[{"id":"10.13039\/100000145","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2014,4,7]]},"DOI":"10.1145\/2567948.2579045","type":"proceedings-article","created":{"date-parts":[[2016,2,5]],"date-time":"2016-02-05T19:44:31Z","timestamp":1454701471000},"page":"851-856","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":17,"title":["Infrastructure for supporting exploration and discovery in web archives"],"prefix":"10.1145","author":[{"given":"Jimmy","family":"Lin","sequence":"first","affiliation":[{"name":"University of Maryland, College Park, MD, USA"}]},{"given":"Milad","family":"Gholami","sequence":"additional","affiliation":[{"name":"University of Maryland, College Park, MD, USA"}]},{"given":"Jinfeng","family":"Rao","sequence":"additional","affiliation":[{"name":"University of Maryland, College Park, MD, USA"}]}],"member":"320","published-online":{"date-parts":[[2014,4,7]]},"reference":[{"issue":"2","key":"e_1_3_2_1_1_1","first-page":"4","article-title":"Storage infrastructure behind Facebook Messages: Using HBase at 
scale","volume":"35","author":"Aiyer A.","year":"2012","unstructured":"A. Aiyer, M. Bautin, G. Chen, P. Khemani, K. Muthukkaruppan, K. Spiegelberg, L. Tang, and M. Vaidya. Storage infrastructure behind Facebook Messages: Using HBase at scale. IEEE Data Engineering Bulletin, 35(2):4--13, 2012.","journal-title":"IEEE Data Engineering Bulletin"},{"key":"e_1_3_2_1_2_1","volume-title":"HBaseCon","author":"Barton S.","year":"2012","unstructured":"S. Barton. Mignify: A big data refinery built on HBase. HBaseCon, 2012."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277831"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944937"},{"key":"e_1_3_2_1_5_1","volume-title":"OSDI","author":"Chang F.","year":"2006","unstructured":"F. Chang, J. Dean, S. Ghemawat, W. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. Gruber. Bigtable: A distributed storage system for structured data. OSDI, 2006."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2254556.2254572"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487788.2488116"},{"key":"e_1_3_2_1_8_1","volume-title":"OSDI","author":"Dean J.","year":"2004","unstructured":"J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. 
OSDI, 2004."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487788.2487934"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/2042536.2042590"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871594"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/1763653.1763666"},{"key":"e_1_3_2_1_13_1","volume-title":"Scholarly use of web archives","author":"Hockx-Yu H.","year":"2013","unstructured":"H. Hockx-Yu. Scholarly use of web archives, 2013."},{"key":"e_1_3_2_1_14_1","volume-title":"USENIX","author":"Hunt P.","year":"2010","unstructured":"P. Hunt, M. Konar, F. Junqueira, and B. Reed. ZooKeeper: Wait-free coordination for Internet-scale systems. USENIX, 2010."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/1855013"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807184"},{"key":"e_1_3_2_1_17_1","volume-title":"Hadoop Summit Europe","author":"Neudecker C.","year":"2013","unstructured":"C. Neudecker and S. Schlarb. The elephant in the library: Integrating Hadoop. Hadoop Summit Europe, 2013."},{"key":"e_1_3_2_1_18_1","volume-title":"ECDL","author":"N\u00f8rv\u00e5g K.","year":"2003","unstructured":"K. N\u00f8rv\u00e5g. Space-efficient support for temporal text indexing in a document archive context. 
ECDL, 2003."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/988672.988674"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1376616.1376726"},{"key":"e_1_3_2_1_21_1","volume-title":"Fedora Commons with Apache Hadoop: A research study. code4lib Journal, 22","author":"Rasheed M.","year":"2013","unstructured":"M. Rasheed. Fedora Commons with Apache Hadoop: A research study. code4lib Journal, 22, 2013."},{"key":"e_1_3_2_1_23_1","volume-title":"NSDI","author":"Zaharia M.","year":"2012","unstructured":"M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. Franklin, S. Shenker, and I. Stoica. Resilient Distributed Datasets: A fault-tolerant abstraction for in-memory cluster computing. 
NSDI, 2012."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2187836.2187955"}],"event":{"name":"WWW '14: 23rd International World Wide Web Conference","location":"Seoul Korea","acronym":"WWW '14","sponsor":["IW3C2 International World Wide Web Conference Committee","SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Proceedings of the 23rd International Conference on World Wide Web"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2567948.2579045","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2567948.2579045","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:34:46Z","timestamp":1750232086000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2567948.2579045"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,4,7]]},"references-count":23,"alternative-id":["10.1145\/2567948.2579045","10.1145\/2567948"],"URL":"https:\/\/doi.org\/10.1145\/2567948.2579045","relation":{},"subject":[],"published":{"date-parts":[[2014,4,7]]},"assertion":[{"value":"2014-04-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}