{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T03:47:21Z","timestamp":1772164041347,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":48,"publisher":"ACM","license":[{"start":{"date-parts":[[2015,6,13]],"date-time":"2015-06-13T00:00:00Z","timestamp":1434153600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2015,6,13]]},"DOI":"10.1145\/2749469.2749472","type":"proceedings-article","created":{"date-parts":[[2015,5,26]],"date-time":"2015-05-26T10:36:25Z","timestamp":1432636585000},"page":"27-40","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":122,"title":["DjiNN and Tonic"],"prefix":"10.1145","author":[{"given":"Johann","family":"Hauswald","sequence":"first","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]},{"given":"Yiping","family":"Kang","sequence":"additional","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]},{"given":"Michael A.","family":"Laurenzano","sequence":"additional","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]},{"given":"Quan","family":"Chen","sequence":"additional","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]},{"given":"Cheng","family":"Li","sequence":"additional","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]},{"given":"Trevor","family":"Mudge","sequence":"additional","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]},{"given":"Ronald G.","family":"Dreslinski","sequence":"additional","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]},{"given":"Jason","family":"Mars","sequence":"additional","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]},{"given":"Lingjia","family":"Tang","sequence":"additional","affiliation":[{"name":"University of Michigan - Ann Arbor, MI"}]}],"member":"320","published-online":{"date-parts":[[2015,6,13]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"\"Cuda toolkit documentation \" http:\/\/docs.nvidia.com\/cuda\/profiler-users-guide\/.  \"Cuda toolkit documentation \" http:\/\/docs.nvidia.com\/cuda\/profiler-users-guide\/."},{"key":"e_1_3_2_1_2_1","unstructured":"\"DjiNN and Tonic: DNN as a Service \" http:\/\/djinn.clarity-lab.org.  \"DjiNN and Tonic: DNN as a Service \" http:\/\/djinn.clarity-lab.org."},{"key":"e_1_3_2_1_3_1","unstructured":"\"Facebook's quest to build an artificial brain depends on this guy \" www.wired.com\/2014\/08\/deep-learning-yann-lecun.  \"Facebook's quest to build an artificial brain depends on this guy \" www.wired.com\/2014\/08\/deep-learning-yann-lecun."},{"key":"e_1_3_2_1_4_1","unstructured":"\"Google Glass \" www.google.com\/glass\/start.  \"Google Glass \" www.google.com\/glass\/start."},{"key":"e_1_3_2_1_5_1","unstructured":"\"Inside the artificial brain that's remaking the google empire \" www.wired.com\/2014\/07\/google_brain\/.  \"Inside the artificial brain that's remaking the google empire \" www.wired.com\/2014\/07\/google_brain\/."},{"key":"e_1_3_2_1_6_1","unstructured":"\"Maple 2015. maplesoft a division of waterloo maple inc. waterloo ontario.\" http:\/\/www.maplesoft.com\/.  \"Maple 2015. maplesoft a division of waterloo maple inc. waterloo ontario.\" http:\/\/www.maplesoft.com\/."},{"key":"e_1_3_2_1_7_1","unstructured":"\"Microsoft corp to challenge apple inc with siri alternative: More intelligent and fast enough!\" www.dazeinfo.com\/2013\/06\/18\/microsoft-corp-to-challenge-apple-inc-with-siri-alternative\/-more-intelligent-and-fast-enough.  \"Microsoft corp to challenge apple inc with siri alternative: More intelligent and fast enough!\" www.dazeinfo.com\/2013\/06\/18\/microsoft-corp-to-challenge-apple-inc-with-siri-alternative\/-more-intelligent-and-fast-enough."},{"key":"e_1_3_2_1_8_1","unstructured":"\"Multi-process service \" https:\/\/docs.nvidia.com\/deploy\/pdf\/CUDA_Multi_Process_Service_Overview.pdf.  \"Multi-process service \" https:\/\/docs.nvidia.com\/deploy\/pdf\/CUDA_Multi_Process_Service_Overview.pdf."},{"key":"e_1_3_2_1_9_1","unstructured":"\"Nvidia visual profiler \" https:\/\/developer.nvidia.com\/NVIDIA-visual-profiler.  \"Nvidia visual profiler \" https:\/\/developer.nvidia.com\/NVIDIA-visual-profiler."},{"key":"e_1_3_2_1_10_1","unstructured":"\"Apple's Massive New Data Center Set To Host Nuance Tech \" http:\/\/techcrunch.com\/2011\/05\/09\/apple-nuance-data-center-deal\/ 2011.  \"Apple's Massive New Data Center Set To Host Nuance Tech \" http:\/\/techcrunch.com\/2011\/05\/09\/apple-nuance-data-center-deal\/ 2011."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/2534500"},{"key":"e_1_3_2_1_12_1","volume-title":"Theano: new features and speed improvements,\" Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop","author":"Bastien F.","year":"2012","unstructured":"F. Bastien , P. Lamblin , R. Pascanu , J. Bergstra , I. Goodfellow , A. Bergeron , N. Bouchard , D. Warde-Farley , and Y. Bengio , \" Theano: new features and speed improvements,\" Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop , 2012 . F. Bastien, P. Lamblin, R. Pascanu, J. Bergstra, I. Goodfellow, A. Bergeron, N. Bouchard, D. Warde-Farley, and Y. Bengio, \"Theano: new features and speed improvements,\" Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2013.133"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2541940.2541967"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.58"},{"key":"e_1_3_2_1_16_1","volume-title":"cudnn: Efficient primitives for deep learning,\" arXiv preprint arXiv:1410.0759","author":"Chetlur S.","year":"2014","unstructured":"S. Chetlur , C. Woolley , P. Vandermersch , J. Cohen , J. Tran , B. Catanzaro , and E. Shelhamer , \" cudnn: Efficient primitives for deep learning,\" arXiv preprint arXiv:1410.0759 , 2014 . S. Chetlur, C. Woolley, P. Vandermersch, J. Cohen, J. Tran, B. Catanzaro, and E. Shelhamer, \"cudnn: Efficient primitives for deep learning,\" arXiv preprint arXiv:1410.0759, 2014."},{"key":"e_1_3_2_1_17_1","volume-title":"Project adam: building an efficient and scalable deep learning training system,\" in Operating Systems Design and Implementation(OSDI)","author":"Chilimbi T.","year":"2014","unstructured":"T. Chilimbi , Y. Suzue , J. Apacible , and K. Kalyanaraman , \" Project adam: building an efficient and scalable deep learning training system,\" in Operating Systems Design and Implementation(OSDI) , 2014 . T. Chilimbi, Y. Suzue, J. Apacible, and K. Kalyanaraman, \"Project adam: building an efficient and scalable deep learning training system,\" in Operating Systems Design and Implementation(OSDI), 2014."},{"key":"e_1_3_2_1_18_1","volume-title":"Deep learning with cots hpc systems,\" in International Conference on Machine Learning(ICML)","author":"Coates A.","year":"2013","unstructured":"A. Coates , B. Huval , T. Wang , D. Wu , B. Catanzaro , and N. Andrew , \" Deep learning with cots hpc systems,\" in International Conference on Machine Learning(ICML) , 2013 . A. Coates, B. Huval, T. Wang, D. Wu, B. Catanzaro, and N. Andrew, \"Deep learning with cots hpc systems,\" in International Conference on Machine Learning(ICML), 2013."},{"key":"e_1_3_2_1_19_1","author":"Collobert R.","year":"2011","unstructured":"R. Collobert , J. Weston , L. Bottou , M. Karlen , K. Kavukcuoglu , and P. Kuksa , \"Natural language processing (almost) from scratch,\" The Journal of Machine Learning Research , 2011 . R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu, and P. Kuksa, \"Natural language processing (almost) from scratch,\" The Journal of Machine Learning Research, 2011.","journal-title":"\"Natural language processing (almost) from scratch,\" The Journal of Machine Learning Research"},{"key":"e_1_3_2_1_20_1","volume-title":"Imagenet: A large-scale hierarchical image database,\" in Computer Vision and Pattern Recognition (CVPR)","author":"Deng J.","year":"2009","unstructured":"J. Deng , W. Dong , R. Socher , L.-J. Li , K. Li , and L. Fei-Fei , \" Imagenet: A large-scale hierarchical image database,\" in Computer Vision and Pattern Recognition (CVPR) , 2009 . J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, \"Imagenet: A large-scale hierarchical image database,\" in Computer Vision and Pattern Recognition (CVPR), 2009."},{"key":"e_1_3_2_1_21_1","volume-title":"Pylearn2: a machine learning research library,\" arXiv preprint arXiv:1308.4214","author":"Goodfellow I. J.","year":"2013","unstructured":"I. J. Goodfellow , D. Warde-Farley , P. Lamblin , V. Dumoulin , M. Mirza , R. Pascanu , J. Bergstra , F. Bastien , and Y. Bengio , \" Pylearn2: a machine learning research library,\" arXiv preprint arXiv:1308.4214 , 2013 . I. J. Goodfellow, D. Warde-Farley, P. Lamblin, V. Dumoulin, M. Mirza, R. Pascanu, J. Bergstra, F. Bastien, and Y. Bengio, \"Pylearn2: a machine learning research library,\" arXiv preprint arXiv:1308.4214, 2013."},{"key":"e_1_3_2_1_22_1","volume-title":"Speech and Signal Processing(ICASSp)","author":"Graves A.","year":"2013","unstructured":"A. Graves , A.-R. Mohamed , and G. Hinton , \" Speech recognition with deep recurrent neural networks,\" in International Conference on Acoustics , Speech and Signal Processing(ICASSp) , 2013 . A. Graves, A.-R. Mohamed, and G. Hinton, \"Speech recognition with deep recurrent neural networks,\" in International Conference on Acoustics, Speech and Signal Processing(ICASSp), 2013."},{"key":"e_1_3_2_1_23_1","volume-title":"Speech and Signal Processing (ICASSP)","author":"Hauswald J.","year":"2014","unstructured":"J. Hauswald , T. Manville , Q. Zheng , R. Dreslinski , C. Chakrabarti , and T. Mudge , \" A hybrid approach to offloading mobile image classification,\" in International Conference on Acoustics , Speech and Signal Processing (ICASSP) , 2014 . J. Hauswald, T. Manville, Q. Zheng, R. Dreslinski, C. Chakrabarti, and T. Mudge, \"A hybrid approach to offloading mobile image classification,\" in International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2694344.2694347"},{"key":"e_1_3_2_1_25_1","volume-title":"Adrenaline: Pinpointing and reigning in tail queries with quick voltage boosting,\" in International Symposium on High Performance Computer Architecture (HPCA)","author":"Hsu C.-H.","year":"2015","unstructured":"C.-H. Hsu , Y. Zhang , M. A. Laurenzano , D. Meisner , T. Wenisch , L. Tang , J. Mars , and R. Dreslinski , \" Adrenaline: Pinpointing and reigning in tail queries with quick voltage boosting,\" in International Symposium on High Performance Computer Architecture (HPCA) , 2015 . C.-H. Hsu, Y. Zhang, M. A. Laurenzano, D. Meisner, T. Wenisch, L. Tang, J. Mars, and R. Dreslinski, \"Adrenaline: Pinpointing and reigning in tail queries with quick voltage boosting,\" in International Symposium on High Performance Computer Architecture (HPCA), 2015."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2500887"},{"key":"e_1_3_2_1_27_1","volume-title":"Caffe: Convolutional architecture for fast feature embedding,\" arXiv preprint arXiv:1408.5093","author":"Jia Y.","year":"2014","unstructured":"Y. Jia , E. Shelhamer , J. Donahue , S. Karayev , J. Long , R. Girshick , S. Guadarrama , and T. Darrell , \" Caffe: Convolutional architecture for fast feature embedding,\" arXiv preprint arXiv:1408.5093 , 2014 . Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell, \"Caffe: Convolutional architecture for fast feature embedding,\" arXiv preprint arXiv:1408.5093, 2014."},{"key":"e_1_3_2_1_28_1","volume-title":"Imagenet classification with deep convolutional neural networks,\" in Advances in neural information processing systems","author":"Krizhevsky A.","year":"2012","unstructured":"A. Krizhevsky , I. Sutskever , and G. E. Hinton , \" Imagenet classification with deep convolutional neural networks,\" in Advances in neural information processing systems , 2012 . A. Krizhevsky, I. Sutskever, and G. E. Hinton, \"Imagenet classification with deep convolutional neural networks,\" in Advances in neural information processing systems, 2012."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/VLSIC.2008.4585952"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.21"},{"key":"e_1_3_2_1_31_1","volume-title":"Gradient-based learning applied to document recognition,\" Proceedings of the IEEE","author":"LeCun Y.","year":"1998","unstructured":"Y. LeCun , L. Bottou , Y. Bengio , and P. Haffner , \" Gradient-based learning applied to document recognition,\" Proceedings of the IEEE , 1998 . Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, \"Gradient-based learning applied to document recognition,\" Proceedings of the IEEE, 1998."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2694344.2694358"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.14778\/2212351.2212354"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2485922.2485975"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2155620.2155650"},{"key":"e_1_3_2_1_36_1","volume-title":"Octopus-man: Qos-driven task management for heterogeneous multicore in warehouse scale computers,\" in International Symposium on High Performance Computer Architecture (HPCA)","author":"Petrucci V.","year":"2015","unstructured":"V. Petrucci , M. A. Laurenzano , Y. Zhang , J. Doherty , D. Mosse , J. Mars , and L. Tang , \" Octopus-man: Qos-driven task management for heterogeneous multicore in warehouse scale computers,\" in International Symposium on High Performance Computer Architecture (HPCA) , 2015 . V. Petrucci, M. A. Laurenzano, Y. Zhang, J. Doherty, D. Mosse, J. Mars, and L. Tang, \"Octopus-man: Qos-driven task management for heterogeneous multicore in warehouse scale computers,\" in International Symposium on High Performance Computer Architecture (HPCA), 2015."},{"key":"e_1_3_2_1_37_1","volume-title":"ASRU","author":"Povey D.","year":"2011","unstructured":"D. Povey , A. Ghoshal , G. Boulianne , L. Burget , O. Glembek , N. Goel , M. Hannemann , P. Motlicek , Y. Qian , P. Schwarz et al., \"The kaldi speech recognition toolkit,\" in Proc . ASRU , 2011 . D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz et al., \"The kaldi speech recognition toolkit,\" in Proc. ASRU, 2011."},{"key":"e_1_3_2_1_38_1","volume-title":"Gray et al., \"A reconfigurable fabric for accelerating large-scale datacenter services,\" in International Symposium on Computer Architecture (ISCA)","author":"Putnam A.","year":"2014","unstructured":"A. Putnam , A. M. Caulfield , E. S. Chung , D. Chiou , K. Constantinides , J. Demme , H. Esmaeilzadeh , J. Fowers , G. P. Gopal , J. Gray et al., \"A reconfigurable fabric for accelerating large-scale datacenter services,\" in International Symposium on Computer Architecture (ISCA) , 2014 . A. Putnam, A. M. Caulfield, E. S. Chung, D. Chiou, K. Constantinides, J. Demme, H. Esmaeilzadeh, J. Fowers, G. P. Gopal, J. Gray et al., \"A reconfigurable fabric for accelerating large-scale datacenter services,\" in International Symposium on Computer Architecture (ISCA), 2014."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2485922.2485925"},{"key":"e_1_3_2_1_40_1","volume-title":"Will Exceed 485 Million Annual Shipments by","author":"Research A.","year":"2018","unstructured":"A. Research , \" Wearable Computing Devices , Like Apple iWatch , Will Exceed 485 Million Annual Shipments by 2018 ,\" 2013. A. Research, \"Wearable Computing Devices, Like Apple iWatch, Will Exceed 485 Million Annual Shipments by 2018,\" 2013."},{"key":"e_1_3_2_1_41_1","volume-title":"ImageNet Large Scale Visual Recognition Challenge(ILSVRC)","author":"Russakovsky O.","year":"2014","unstructured":"O. Russakovsky , J. Deng , H. Su , J. Krause , S. Satheesh , S. Ma , Z. Huang , A. Karpathy , A. Khosla , M. Bernstein , A. C. Berg , and L. Fei-Fei , \" ImageNet Large Scale Visual Recognition Challenge(ILSVRC) ,\" 2014 . O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei, \"ImageNet Large Scale Visual Recognition Challenge(ILSVRC),\" 2014."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2749474"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.220"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366231.2337200"},{"key":"e_1_3_2_1_45_1","volume-title":"Deep Learning and Unsupervised Feature Learning NIPS Workshop","author":"Vanhoucke V.","year":"2011","unstructured":"V. Vanhoucke , A. Senior , and M. Z. Mao , \" Improving the speed of neural networks on cpus,\" in Proc . Deep Learning and Unsupervised Feature Learning NIPS Workshop , 2011 . V. Vanhoucke, A. Senior, and M. Z. Mao, \"Improving the speed of neural networks on cpus,\" in Proc. Deep Learning and Unsupervised Feature Learning NIPS Workshop, 2011."},{"key":"e_1_3_2_1_46_1","volume-title":"Automatically tuned linear algebra software,\" in SuperComputing: High Performance Networking and Computing","author":"Whaley R. C.","year":"1998","unstructured":"R. C. Whaley and J. Dongarra , \" Automatically tuned linear algebra software,\" in SuperComputing: High Performance Networking and Computing , 1998 . R. C. Whaley and J. Dongarra, \"Automatically tuned linear algebra software,\" in SuperComputing: High Performance Networking and Computing, 1998."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/2485922.2485974"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.53"}],"event":{"name":"ISCA '15: The 42nd Annual International Symposium on Computer Architecture","location":"Portland Oregon","acronym":"ISCA '15","sponsor":["IEEE TCCA IEEE Computer Society Technical Committee on Computer Architecture","SIGARCH ACM Special Interest Group on Computer Architecture"]},"container-title":["Proceedings of the 42nd Annual International Symposium on Computer Architecture"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2749469.2749472","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2749469.2749472","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T15:04:04Z","timestamp":1750259044000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2749469.2749472"}},"subtitle":["DNN as a service and its implications for future warehouse scale computers"],"short-title":[],"issued":{"date-parts":[[2015,6,13]]},"references-count":48,"alternative-id":["10.1145\/2749469.2749472","10.1145\/2749469"],"URL":"https:\/\/doi.org\/10.1145\/2749469.2749472","relation":{"is-identical-to":[{"id-type":"doi","id":"10.1145\/2872887.2749472","asserted-by":"object"}]},"subject":[],"published":{"date-parts":[[2015,6,13]]},"assertion":[{"value":"2015-06-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}