{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T04:41:14Z","timestamp":1773895274746,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":48,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,5,27]],"date-time":"2018-05-27T00:00:00Z","timestamp":1527379200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61572039, 61702016, U1536201"],"award-info":[{"award-number":["61572039, 61702016, U1536201"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002858","name":"China Postdoctoral Science Foundation","doi-asserted-by":"publisher","award":["2017M610019"],"award-info":[{"award-number":["2017M610019"]}],"id":[{"id":"10.13039\/501100002858","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Basic Research Program of China (973 Program)","award":["2014CB340405"],"award-info":[{"award-number":["2014CB340405"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,5,27]]},"DOI":"10.1145\/3183713.3196894","type":"proceedings-article","created":{"date-parts":[[2018,5,25]],"date-time":"2018-05-25T12:39:28Z","timestamp":1527251968000},"page":"1269-1284","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":88,"title":["SketchML"],"prefix":"10.1145","author":[{"given":"Jiawei","family":"Jiang","sequence":"first","affiliation":[{"name":"Peking University &amp;Tencent Inc., Beijing, China"}]},{"given":"Fangcheng","family":"Fu","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, China"}]},{"given":"Tong","family":"Yang","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, China"}]},{"given":"Bin","family":"Cui","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2018,5,27]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"QSGD: Randomized Quantization for Communication-Optimal Stochastic Gradient Descent. arXiv preprint arXiv:1610.02132","author":"Alistarh Dan","year":"2016","unstructured":"Dan Alistarh , Jerry Li , Ryota Tomioka , and Milan Vojnovic . 2016 . QSGD: Randomized Quantization for Communication-Optimal Stochastic Gradient Descent. arXiv preprint arXiv:1610.02132 (2016). Dan Alistarh, Jerry Li, Ryota Tomioka, and Milan Vojnovic. 2016. QSGD: Randomized Quantization for Communication-Optimal Stochastic Gradient Descent. arXiv preprint arXiv:1610.02132 (2016)."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1721654.1721672"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-7908-2604-3_16"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"L\u00e9on Bottou. 2012. Stochastic gradient descent tricks. Neural Networks: Tricks of the Trade. 421--436.  L\u00e9on Bottou. 2012. Stochastic gradient descent tricks. Neural Networks: Tricks of the Trade. 421--436.","DOI":"10.1007\/978-3-642-35289-8_25"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1561\/2200000050"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939785"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jalgor.2003.12.001"},{"key":"e_1_3_2_1_9_1","volume-title":"et almbox","author":"Dean Jeffrey","year":"2012","unstructured":"Jeffrey Dean , Greg Corrado , Rajat Monga , et almbox .. 2012 . Large scale distributed deep networks. In Advances in neural information processing systems. 1223--1231. Jeffrey Dean, Greg Corrado, Rajat Monga, et almbox.. 2012. Large scale distributed deep networks. In Advances in neural information processing systems. 1223--1231."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"L Peter Deutsch. 1996. DEFLATE compressed data format specification version 1.3. (1996).  L Peter Deutsch. 1996. DEFLATE compressed data format specification version 1.3. (1996).","DOI":"10.17487\/rfc1951"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/1953048.2021068"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/376284.375670"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2013.01.010"},{"key":"e_1_3_2_1_14_1","volume-title":"Proceedings., 10th International Conference on","volume":"1","author":"Hinds Stuart C","year":"1990","unstructured":"Stuart C Hinds , James L Fisher , and Donald P D'Amato . 1990 . A document skew detection method using run-length encoding and the Hough transform Pattern Recognition, 1990 . Proceedings., 10th International Conference on , Vol. Vol. 1 . IEEE, 464--468. Stuart C Hinds, James L Fisher, and Donald P D'Amato. 1990. A document skew detection method using run-length encoding and the Hough transform Pattern Recognition, 1990. Proceedings., 10th International Conference on, Vol. Vol. 1. IEEE, 464--468."},{"key":"e_1_3_2_1_15_1","volume-title":"Phillip B Gibbons, Garth A Gibson, Greg Ganger, and Eric P Xing.","author":"Ho Qirong","year":"2013","unstructured":"Qirong Ho , James Cipar , Henggang Cui , Seunghak Lee , Jin Kyu Kim , Phillip B Gibbons, Garth A Gibson, Greg Ganger, and Eric P Xing. 2013 . More effective distributed ml via a stale synchronous parallel parameter server Advances in neural information processing systems. 1223--1231. Qirong Ho, James Cipar, Henggang Cui, Seunghak Lee, Jin Kyu Kim, Phillip B Gibbons, Garth A Gibson, Greg Ganger, and Eric P Xing. 2013. More effective distributed ml via a stale synchronous parallel parameter server Advances in neural information processing systems. 1223--1231."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1002\/0471722146"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2742785"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3035933"},{"key":"e_1_3_2_1_19_1","volume-title":"TeslaML: Steering Machine Learning Automatically in Tencent Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data. Springer, 313--318","author":"Jiang Jiawei","year":"2017","unstructured":"Jiawei Jiang , Ming Huang , Jie Jiang , and Bin Cui . 2017 b . TeslaML: Steering Machine Learning Automatically in Tencent Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data. Springer, 313--318 . Jiawei Jiang, Ming Huang, Jie Jiang, and Bin Cui. 2017 b. TeslaML: Steering Machine Learning Automatically in Tencent Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data. Springer, 313--318."},{"key":"e_1_3_2_1_20_1","volume-title":"TencentBoost: A Gradient Boosting Tree System with Parameter Server Data Engineering (ICDE), 2017 IEEE 33rd International Conference on. 281--284","author":"Jiang Jie","year":"2017","unstructured":"Jie Jiang , Jiawei Jiang , Bin Cui , and Ce Zhang . 2017 c . TencentBoost: A Gradient Boosting Tree System with Parameter Server Data Engineering (ICDE), 2017 IEEE 33rd International Conference on. 281--284 . Jie Jiang, Jiawei Jiang, Bin Cui, and Ce Zhang. 2017 c. TencentBoost: A Gradient Boosting Tree System with Parameter Server Data Engineering (ICDE), 2017 IEEE 33rd International Conference on. 281--284."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3041657"},{"key":"e_1_3_2_1_22_1","unstructured":"Rie Johnson and Tong Zhang. 2013. Accelerating stochastic gradient descent using predictive variance reduction Advances in neural information processing systems. 315--323.   Rie Johnson and Tong Zhang. 2013. Accelerating stochastic gradient descent using predictive variance reduction Advances in neural information processing systems. 315--323."},{"key":"e_1_3_2_1_23_1","unstructured":"KDD. 2010. KDD Cup 2010. (2010). http:\/\/www.kdd.org\/kdd-cup\/  KDD. 2010. KDD Cup 2010. (2010). http:\/\/www.kdd.org\/kdd-cup\/"},{"key":"e_1_3_2_1_24_1","volume-title":"https:\/\/www.kaggle.com\/c\/kddcup2012-track1","author":"KDD.","year":"2012","unstructured":"KDD. 2012. KDD Cup 2012. ( 2012 ). https:\/\/www.kaggle.com\/c\/kddcup2012-track1 KDD. 2012. KDD Cup 2012. (2012). https:\/\/www.kaggle.com\/c\/kddcup2012-track1"},{"key":"e_1_3_2_1_25_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik","year":"2014","unstructured":"Diederik Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/0196-6774(85)90036-7"},{"key":"e_1_3_2_1_27_1","unstructured":"Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks Advances in neural information processing systems. 1097--1105.   Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks Advances in neural information processing systems. 1097--1105."},{"key":"e_1_3_2_1_28_1","unstructured":"Yann LeCun. 1998. MNIST. (1998). http:\/\/yann.lecun.com\/exdb\/mnist\/  Yann LeCun. 1998. MNIST. (1998). http:\/\/yann.lecun.com\/exdb\/mnist\/"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2835776.2835781"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2623330.2623612"},{"key":"e_1_3_2_1_31_1","unstructured":"Brendan McMahan and Matthew Streeter. 2014. Delay-tolerant algorithms for asynchronous distributed online learning Advances in Neural Information Processing Systems. 2915--2923.   Brendan McMahan and Matthew Streeter. 2014. Delay-tolerant algorithms for asynchronous distributed online learning Advances in Neural Information Processing Systems. 2915--2923."},{"key":"e_1_3_2_1_32_1","unstructured":"Deanna Needell Rachel Ward and Nati Srebro. 2014. Stochastic gradient descent weighted sampling and the randomized kaczmarz algorithm Advances in Neural Information Processing Systems. 1017--1025.   Deanna Needell Rachel Ward and Nati Srebro. 2014. Stochastic gradient descent weighted sampling and the randomized kaczmarz algorithm Advances in Neural Information Processing Systems. 1017--1025."},{"key":"e_1_3_2_1_33_1","volume-title":"Adding gradient noise improves learning for very deep networks. arXiv preprint arXiv:1511.06807","author":"Neelakantan Arvind","year":"2015","unstructured":"Arvind Neelakantan , Luke Vilnis , Quoc V Le , Ilya Sutskever , Lukasz Kaiser , Karol Kurach , and James Martens . 2015. Adding gradient noise improves learning for very deep networks. arXiv preprint arXiv:1511.06807 ( 2015 ). Arvind Neelakantan, Luke Vilnis, Quoc V Le, Ilya Sutskever, Lukasz Kaiser, Karol Kurach, and James Martens. 2015. Adding gradient noise improves learning for very deep networks. arXiv preprint arXiv:1511.06807 (2015)."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1137\/070704277"},{"key":"e_1_3_2_1_35_1","volume-title":"Doklady AN USSR","volume":"269","author":"Nesterov Yurii","year":"1983","unstructured":"Yurii Nesterov . 1983 . A method for unconstrained convex minimization problem with the rate of convergence O (1\/k 2) . In Doklady AN USSR , Vol. Vol. 269 . 543--547. Yurii Nesterov. 1983. A method for unconstrained convex minimization problem with the rate of convergence O (1\/k 2). In Doklady AN USSR, Vol. Vol. 269. 543--547."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0893-6080(98)00116-6"},{"key":"e_1_3_2_1_37_1","volume-title":"Linear regression analysis","author":"Seber George AF","unstructured":"George AF Seber and Alan J Lee . 2012. Linear regression analysis . Vol. Vol. 936 . John Wiley & Sons . George AF Seber and Alan J Lee. 2012. Linear regression analysis. Vol. Vol. 936. John Wiley & Sons."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Frank Seide Hao Fu Jasha Droppo Gang Li and Dong Yu. 2014. 1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs.. In INTERSPEECH. 1058--1062.  Frank Seide Hao Fu Jasha Droppo Gang Li and Dong Yu. 2014. 1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs.. In INTERSPEECH. 1058--1062.","DOI":"10.21437\/Interspeech.2014-274"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1018628609742"},{"key":"e_1_3_2_1_40_1","volume-title":"Sparse matrices","author":"Tewarson Reginald P","unstructured":"Reginald P Tewarson . 1973. Sparse matrices . Academic Press . Reginald P Tewarson. 1973. Sparse matrices. Academic Press."},{"key":"e_1_3_2_1_41_1","unstructured":"Yahoo. 2004. Data Sketches. (2004). https:\/\/datasketches.github.io\/  Yahoo. 2004. Data Sketches. (2004). https:\/\/datasketches.github.io\/"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.14778\/3137628.3137649"},{"key":"e_1_3_2_1_43_1","volume-title":"ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701","author":"Zeiler Matthew D","year":"2012","unstructured":"Matthew D Zeiler . 2012. ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 ( 2012 ). Matthew D Zeiler. 2012. ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012)."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732977.2733001"},{"key":"e_1_3_2_1_45_1","volume-title":"ZipML: An End-to-end Bitwise Framework for Dense Generalized Linear Models. arXiv:1611.05402","author":"Zhang Hantian","year":"2016","unstructured":"Hantian Zhang , Kaan Kara , Jerry Li , Dan Alistarh , Ji Liu , and Ce Zhang . 2016. ZipML: An End-to-end Bitwise Framework for Dense Generalized Linear Models. arXiv:1611.05402 ( 2016 ). Hantian Zhang, Kaan Kara, Jerry Li, Dan Alistarh, Ji Liu, and Ce Zhang. 2016. ZipML: An End-to-end Bitwise Framework for Dense Generalized Linear Models. arXiv:1611.05402 (2016)."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/SSDBM.2007.27"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526816"},{"key":"e_1_3_2_1_48_1","unstructured":"Martin Zinkevich Markus Weimer Lihong Li and Alex J Smola. 2010. Parallelized stochastic gradient descent. In Advances in neural information processing systems. 2595--2603.   Martin Zinkevich Markus Weimer Lihong Li and Alex J Smola. 2010. Parallelized stochastic gradient descent. In Advances in neural information processing systems. 2595--2603."},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1977.1055714"}],"event":{"name":"SIGMOD\/PODS '18: International Conference on Management of Data","location":"Houston TX USA","acronym":"SIGMOD\/PODS '18","sponsor":["SIGMOD ACM Special Interest Group on Management of Data"]},"container-title":["Proceedings of the 2018 International Conference on Management of Data"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3183713.3196894","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3183713.3196894","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:39:18Z","timestamp":1750210758000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3183713.3196894"}},"subtitle":["Accelerating Distributed Machine Learning with Data Sketches"],"short-title":[],"issued":{"date-parts":[[2018,5,27]]},"references-count":48,"alternative-id":["10.1145\/3183713.3196894","10.1145\/3183713"],"URL":"https:\/\/doi.org\/10.1145\/3183713.3196894","relation":{},"subject":[],"published":{"date-parts":[[2018,5,27]]},"assertion":[{"value":"2018-05-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}