{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,3]],"date-time":"2025-07-03T05:45:59Z","timestamp":1751521559433,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":28,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,6,2]],"date-time":"2019-06-02T00:00:00Z","timestamp":1559433600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1815899"],"award-info":[{"award-number":["1815899"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,6,2]]},"DOI":"10.1145\/3316781.3317828","type":"proceedings-article","created":{"date-parts":[[2019,5,23]],"date-time":"2019-05-23T18:07:13Z","timestamp":1558634833000},"page":"1-6","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["FLightNNs"],"prefix":"10.1145","author":[{"given":"Ruizhou","family":"Ding","sequence":"first","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, U.S.A."}]},{"given":"Zeye","family":"Liu","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, U.S.A."}]},{"given":"Ting-Wu","family":"Chin","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, U.S.A."}]},{"given":"Diana","family":"Marculescu","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, U.S.A."}]},{"given":"R. D. (Shawn)","family":"Blanton","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, U.S.A."}]}],"member":"320","published-online":{"date-parts":[[2019,6,2]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"\"Nnapi \" https:\/\/developer.android.com\/ndk\/guides\/neuralnetworks\/ accessed: 2018-10-15.  \"Nnapi \" https:\/\/developer.android.com\/ndk\/guides\/neuralnetworks\/ accessed: 2018-10-15."},{"key":"e_1_3_2_1_2_1","first-page":"6071","volume-title":"Designing energy-efficient convolutional neural networks using energy-aware pruning,\" in IEEE CVPR","author":"Yang T.-J.","year":"2017","unstructured":"T.-J. Yang , Y.-H. Chen , and V. Sze , \" Designing energy-efficient convolutional neural networks using energy-aware pruning,\" in IEEE CVPR , pp. 6071 -- 6079 , 2017 . T.-J. Yang, Y.-H. Chen, and V. Sze, \"Designing energy-efficient convolutional neural networks using energy-aware pruning,\" in IEEE CVPR, pp. 6071--6079, 2017."},{"key":"e_1_3_2_1_3_1","first-page":"2849","volume-title":"Fixed point quantization of deep convolutional networks,\" in International Conference on Machine Learning","author":"Lin D.","year":"2016","unstructured":"D. Lin , S. Talathi , S. Talathi , and S. Annapureddy , \" Fixed point quantization of deep convolutional networks,\" in International Conference on Machine Learning , pp. 2849 -- 2858 , 2016 . D. Lin, S. Talathi, S. Talathi, and S. Annapureddy, \"Fixed point quantization of deep convolutional networks,\" in International Conference on Machine Learning, pp. 2849--2858, 2016."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3060403.3060465"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3270689"},{"key":"e_1_3_2_1_6_1","unstructured":"I. Hubara M. Courbariaux D. Soudry R. El-Yaniv and Y Bengio \"Binarized neural networks \" in Advances in Neural Information Processing Systems pp. 4107--4115 2016.   I. Hubara M. Courbariaux D. Soudry R. El-Yaniv and Y Bengio \"Binarized neural networks \" in Advances in Neural Information Processing Systems pp. 4107--4115 2016."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2016.2616357"},{"key":"e_1_3_2_1_8_1","volume-title":"Processing-in-memory for energy-efficient neural network training: A heterogeneous approach,\" in IEEE\/ACM International Symposium on Microarchitecture","author":"Zhao H.","year":"2018","unstructured":"H. Zhao and J. Liu , \" Processing-in-memory for energy-efficient neural network training: A heterogeneous approach,\" in IEEE\/ACM International Symposium on Microarchitecture , 2018 . H. Zhao and J. Liu, \"Processing-in-memory for energy-efficient neural network training: A heterogeneous approach,\" in IEEE\/ACM International Symposium on Microarchitecture, 2018."},{"key":"e_1_3_2_1_9_1","first-page":"1135","volume-title":"Learning both weights and connections for efficient neural network,\" in Advances in Neural Information Processing Systems","author":"Han S.","year":"2015","unstructured":"S. Han , J. Pool , J. Tran , and W. Dally , \" Learning both weights and connections for efficient neural network,\" in Advances in Neural Information Processing Systems , pp. 1135 -- 1143 , 2015 . S. Han, J. Pool, J. Tran, and W. Dally, \"Learning both weights and connections for efficient neural network,\" in Advances in Neural Information Processing Systems, pp. 1135--1143, 2015."},{"key":"e_1_3_2_1_10_1","first-page":"2074","volume-title":"Learning structured sparsity in deep neural networks,\" in Advances in Neural Information Processing Systems","author":"Wen W.","year":"2016","unstructured":"W. Wen , C. Wu , Y. Wang , Y. Chen , and H. Li , \" Learning structured sparsity in deep neural networks,\" in Advances in Neural Information Processing Systems , pp. 2074 -- 2082 , 2016 . W. Wen, C. Wu, Y. Wang, Y. Chen, and H. Li, \"Learning structured sparsity in deep neural networks,\" in Advances in Neural Information Processing Systems, pp. 2074--2082, 2016."},{"key":"e_1_3_2_1_11_1","first-page":"1","volume-title":"Automation and Test (VLSI-DAT), International Symposium","author":"Brooks D. M.","year":"2018","unstructured":"D. M. Brooks , \"Co-designed systems for deep learning hardware accelerators,\" in IEEE VLSI Design , Automation and Test (VLSI-DAT), International Symposium , pp. 1 -- 1 , 2018 . D. M. Brooks, \"Co-designed systems for deep learning hardware accelerators,\" in IEEE VLSI Design, Automation and Test (VLSI-DAT), International Symposium, pp. 1--1, 2018."},{"key":"e_1_3_2_1_12_1","first-page":"4510","article-title":"Mobilenetv2: Inverted residuals and linear bottlenecks","author":"Sandler M.","year":"2018","unstructured":"M. Sandler , A. Howard , M. Zhu , A. Zhmoginov , and L.-C. Chen , \" Mobilenetv2: Inverted residuals and linear bottlenecks ,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 4510 -- 4520 , 2018 . M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, and L.-C. Chen, \"Mobilenetv2: Inverted residuals and linear bottlenecks,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4510--4520, 2018.","journal-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3061639.3062259"},{"key":"e_1_3_2_1_14_1","first-page":"1737","volume-title":"Deep learning with limited numerical precision,\" in Proceedings of the 32nd International Conference on Machine Learning","author":"Gupta S.","year":"2015","unstructured":"S. Gupta , A. Agrawal , K. Gopalakrishnan , and P. Narayanan , \" Deep learning with limited numerical precision,\" in Proceedings of the 32nd International Conference on Machine Learning , pp. 1737 -- 1746 , 2015 . S. Gupta, A. Agrawal, K. Gopalakrishnan, and P. Narayanan, \"Deep learning with limited numerical precision,\" in Proceedings of the 32nd International Conference on Machine Learning, pp. 1737--1746, 2015."},{"key":"e_1_3_2_1_15_1","volume-title":"Dorefa-net: Training low bitwidth convolutional neural networks with low bitwidth gradients,\" arXiv preprint arXiv: 1606.06160","author":"Zhou S.","year":"2016","unstructured":"S. Zhou , Y. Wu , Z. Ni , X. Zhou , H. Wen , and Y. Zou , \" Dorefa-net: Training low bitwidth convolutional neural networks with low bitwidth gradients,\" arXiv preprint arXiv: 1606.06160 , 2016 . S. Zhou, Y. Wu, Z. Ni, X. Zhou, H. Wen, and Y. Zou, \"Dorefa-net: Training low bitwidth convolutional neural networks with low bitwidth gradients,\" arXiv preprint arXiv: 1606.06160, 2016."},{"key":"e_1_3_2_1_16_1","first-page":"3123","volume-title":"Training deep neural networks with binary weights during propagations,\" in Advances in neural information processing systems","author":"Courbariaux M.","year":"2015","unstructured":"M. Courbariaux , Y. Bengio , and J.-P. David , \"Binaryconnect : Training deep neural networks with binary weights during propagations,\" in Advances in neural information processing systems , pp. 3123 -- 3131 , 2015 . M. Courbariaux, Y. Bengio, and J.-P. David, \"Binaryconnect: Training deep neural networks with binary weights during propagations,\" in Advances in neural information processing systems, pp. 3123--3131, 2015."},{"key":"e_1_3_2_1_17_1","volume-title":"Darts: Differentiable architecture search,\" arXiv preprint arXiv:1806.09055","author":"Liu H.","year":"2018","unstructured":"H. Liu , K. Simonyan , and Y. Yang , \" Darts: Differentiable architecture search,\" arXiv preprint arXiv:1806.09055 , 2018 . H. Liu, K. Simonyan, and Y. Yang, \"Darts: Differentiable architecture search,\" arXiv preprint arXiv:1806.09055, 2018."},{"key":"e_1_3_2_1_18_1","volume-title":"Learning sparse neural networks through 1_0 regularization,\" in International Conference on Learning Representations","author":"Louizos C.","year":"2018","unstructured":"C. Louizos , M. Welling , and D. P. Kingma , \" Learning sparse neural networks through 1_0 regularization,\" in International Conference on Learning Representations , 2018 . C. Louizos, M. Welling, and D. P. Kingma, \"Learning sparse neural networks through 1_0 regularization,\" in International Conference on Learning Representations, 2018."},{"key":"e_1_3_2_1_19_1","volume-title":"Estimating or propagating gradients through stochastic neurons for conditional computation,\" arXiv preprint arXiv:1308.3432","author":"Bengio Y.","year":"2013","unstructured":"Y. Bengio , N. L\u00e9onard , and A. Courville , \" Estimating or propagating gradients through stochastic neurons for conditional computation,\" arXiv preprint arXiv:1308.3432 , 2013 . Y. Bengio, N. L\u00e9onard, and A. Courville, \"Estimating or propagating gradients through stochastic neurons for conditional computation,\" arXiv preprint arXiv:1308.3432, 2013."},{"key":"e_1_3_2_1_20_1","first-page":"195","volume-title":"The influence of the sigmoid function parameters on the speed of backpropagation learning,\" in International Workshop on Artificial Neural Networks","author":"Han J.","year":"1995","unstructured":"J. Han and C. Moraga , \" The influence of the sigmoid function parameters on the speed of backpropagation learning,\" in International Workshop on Artificial Neural Networks , pp. 195 -- 201 , 1995 . J. Han and C. Moraga, \"The influence of the sigmoid function parameters on the speed of backpropagation learning,\" in International Workshop on Artificial Neural Networks, pp. 195--201, 1995."},{"issue":"1","key":"e_1_3_2_1_21_1","first-page":"3","article-title":"Rectifier nonlinearities improve neural network acoustic models","volume":"30","author":"Maas A. L.","year":"2013","unstructured":"A. L. Maas , A. Y. Hannun , and A. Y. Ng , \" Rectifier nonlinearities improve neural network acoustic models ,\" in Proc. ICML , vol. 30 , no. 1 , pp. 3 , 2013 . A. L. Maas, A. Y. Hannun, and A. Y. Ng, \"Rectifier nonlinearities improve neural network acoustic models,\" in Proc. ICML, vol. 30, no. 1, pp. 3, 2013.","journal-title":"Proc. ICML"},{"key":"e_1_3_2_1_22_1","volume-title":"Adam: A method for stochastic optimization,\" ICLR","author":"Kingma D. P.","year":"2015","unstructured":"D. P. Kingma and J. Ba , \" Adam: A method for stochastic optimization,\" ICLR , 2015 . D. P. Kingma and J. Ba, \"Adam: A method for stochastic optimization,\" ICLR, 2015."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2684746.2689060"},{"key":"e_1_3_2_1_24_1","unstructured":"Xilinx \"Vivado high-level synthesis \" https:\/\/www.xilinx.com\/products\/designtools\/vivado\/integration\/esl-design.html 2017.  Xilinx \"Vivado high-level synthesis \" https:\/\/www.xilinx.com\/products\/designtools\/vivado\/integration\/esl-design.html 2017."},{"key":"e_1_3_2_1_25_1","volume-title":"Learning accurate low-bit deep neural networks with stochastic quantization,\" BMVC","author":"Dong Y","year":"2017","unstructured":"Y Dong , R. Ni , J. Li , Y. Chen , J. Zhu , and H. Su , \" Learning accurate low-bit deep neural networks with stochastic quantization,\" BMVC , 2017 . Y Dong, R. Ni, J. Li, Y. Chen, J. Zhu, and H. Su, \"Learning accurate low-bit deep neural networks with stochastic quantization,\" BMVC, 2017."},{"key":"e_1_3_2_1_26_1","unstructured":"S. M. U. Manual \"Synopsys inc \" Mountain View CA 2010.  S. M. U. Manual \"Synopsys inc \" Mountain View CA 2010."},{"key":"e_1_3_2_1_27_1","volume-title":"Regularizing activation distribution for training binarized deep networks,\" in IEEE CVPR","author":"Ding R.","year":"2019","unstructured":"R. Ding , T.-W. Chin , D. Marculescu , and Z. Liu , \" Regularizing activation distribution for training binarized deep networks,\" in IEEE CVPR . IEEE , 2019 . R. Ding, T.-W. Chin, D. Marculescu, and Z. Liu, \"Regularizing activation distribution for training binarized deep networks,\" in IEEE CVPR. IEEE, 2019."},{"key":"e_1_3_2_1_28_1","first-page":"895","volume-title":"IEEE","author":"Chen Z.","year":"2018","unstructured":"Z. Chen , R. Ding , T.-W. Chin , and D. Marculescu , \" Understanding the impact of label granularity on cnn-based image classification,\" in 2018 IEEE International Conference on Data Mining Workshops (ICDMW) . IEEE , 2018 , pp. 895 -- 904 . Z. Chen, R. Ding, T.-W. Chin, and D. Marculescu, \"Understanding the impact of label granularity on cnn-based image classification,\" in 2018 IEEE International Conference on Data Mining Workshops (ICDMW). IEEE, 2018, pp. 895--904."}],"event":{"name":"DAC '19: The 56th Annual Design Automation Conference 2019","sponsor":["SIGDA ACM Special Interest Group on Design Automation","IEEE-CEDA","SIGBED ACM Special Interest Group on Embedded Systems"],"location":"Las Vegas NV USA","acronym":"DAC '19"},"container-title":["Proceedings of the 56th Annual Design Automation Conference 2019"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3316781.3317828","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3316781.3317828","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3316781.3317828","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:53:52Z","timestamp":1750204432000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3316781.3317828"}},"subtitle":["Lightweight Quantized Deep Neural Networks for Fast and Accurate Inference"],"short-title":[],"issued":{"date-parts":[[2019,6,2]]},"references-count":28,"alternative-id":["10.1145\/3316781.3317828","10.1145\/3316781"],"URL":"https:\/\/doi.org\/10.1145\/3316781.3317828","relation":{},"subject":[],"published":{"date-parts":[[2019,6,2]]},"assertion":[{"value":"2019-06-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}