{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T16:54:24Z","timestamp":1774630464543,"version":"3.50.1"},"reference-count":68,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2023,12,14]],"date-time":"2023-12-14T00:00:00Z","timestamp":1702512000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Semiconductor Research Corporation (SRC) Tasks","award":["3015.001 and 3148.001"],"award-info":[{"award-number":["3015.001 and 3148.001"]}]},{"name":"National Science Foundation","award":["2326894"],"award-info":[{"award-number":["2326894"]}]},{"name":"CAPES and CNPq, Brazil"},{"name":"FCT\/COMPETE\/FEDER, FCT\/CMU IT Project FLOYD","award":["POCI-01-0247-FEDER-045912"],"award-info":[{"award-number":["POCI-01-0247-FEDER-045912"]}]},{"name":"FCT\/MCTES"},{"name":"ISTAR","award":["UIDB\/04466\/2020, UIDP\/04466\/2020, and DSAIPA\/AI\/0122\/2020"],"award-info":[{"award-number":["UIDB\/04466\/2020, UIDP\/04466\/2020, and DSAIPA\/AI\/0122\/2020"]}]},{"name":"Aim Health Portugal, through national funds and when applicable co-funded EU funds","award":["UIDB\/50008\/2020"],"award-info":[{"award-number":["UIDB\/50008\/2020"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2023,12,31]]},"abstract":"<jats:p>\n            \u2018\u2018Extreme edge\u201d\n            <jats:xref ref-type=\"fn\">\n              <jats:sup>1<\/jats:sup>\n            <\/jats:xref>\n            devices, such as smart sensors, are a uniquely challenging environment for the deployment of machine learning. The tiny energy budgets of these devices lie beyond what is feasible for conventional deep neural networks, particularly in high-throughput scenarios, requiring us to rethink how we approach edge inference. In this work, we propose ULEEN, a model and FPGA-based accelerator architecture based on weightless neural networks (WNNs). WNNs eliminate energy-intensive arithmetic operations, instead using table lookups to perform computation, which makes them theoretically well-suited for edge inference. However, WNNs have historically suffered from poor accuracy and excessive memory usage. ULEEN incorporates algorithmic improvements and a novel training strategy inspired by binary neural networks (BNNs) to make significant strides in addressing these issues. We compare ULEEN against BNNs in software and hardware using the four MLPerf Tiny datasets and MNIST. Our FPGA implementations of ULEEN accomplish classification at 4.0\u201314.3 million inferences per second, improving area-normalized throughput by an average of 3.6\u00d7 and steady-state energy efficiency by an average of 7.1\u00d7 compared to the FPGA-based Xilinx FINN BNN inference platform. While ULEEN is not a universally applicable machine learning model, we demonstrate that it can be an excellent choice for certain applications in energy- and latency-critical edge environments.\n          <\/jats:p>","DOI":"10.1145\/3629522","type":"journal-article","created":{"date-parts":[[2023,10,25]],"date-time":"2023-10-25T21:37:02Z","timestamp":1698269822000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["ULEEN: A Novel Architecture for Ultra-low-energy Edge Neural Networks"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7244-6285","authenticated-orcid":false,"given":"Zachary","family":"Susskind","sequence":"first","affiliation":[{"name":"The University of Texas at Austin, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2547-4424","authenticated-orcid":false,"given":"Aman","family":"Arora","sequence":"additional","affiliation":[{"name":"The University of Texas at Austin, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8555-9609","authenticated-orcid":false,"given":"Igor D. S.","family":"Miranda","sequence":"additional","affiliation":[{"name":"Federal University of Rec\u00f4ncavo da Bahia, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3346-7665","authenticated-orcid":false,"given":"Alan T. L.","family":"Bacellar","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4725-154X","authenticated-orcid":false,"given":"Luis A. Q.","family":"Villon","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7902-3103","authenticated-orcid":false,"given":"Rafael F.","family":"Katopodis","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3631-8761","authenticated-orcid":false,"given":"Leandro S.","family":"de Ara\u00fajo","sequence":"additional","affiliation":[{"name":"Universidade Federal Fluminense, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4262-7242","authenticated-orcid":false,"given":"Diego L. C.","family":"Dutra","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8515-9904","authenticated-orcid":false,"given":"Priscila M. V.","family":"Lima","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8980-6208","authenticated-orcid":false,"given":"Felipe M. G.","family":"Fran\u00e7a","sequence":"additional","affiliation":[{"name":"Instituto de Telecomunica\u00e7\u00f5es, Portugal and Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1752-6255","authenticated-orcid":false,"given":"Mauricio","family":"Breternitz Jr.","sequence":"additional","affiliation":[{"name":"ISCTE\u2013Instituto Universitario de Lisboa, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8747-5214","authenticated-orcid":false,"given":"Lizy K.","family":"John","sequence":"additional","affiliation":[{"name":"The University of Texas at Austin, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,12,14]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3123982"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2016.11"},{"key":"e_1_3_2_4_2","first-page":"299","volume-title":"Proceedings of the 17th European Symposium on Artificial Neural Networks (ESANN\u201909)","author":"Aleksander Igor","year":"2009","unstructured":"Igor Aleksander, Massimo De Gregorio, Felipe Fran\u00e7a, Priscila Lima, and Helen Morton. 2009. A brief introduction to Weightless Neural Systems. In Proceedings of the 17th European Symposium on Artificial Neural Networks (ESANN\u201909). 299\u2013305."},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1108\/eb007637"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2017.7966166"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2017.2682138"},{"key":"e_1_3_2_8_2","article-title":"MLPerf tiny benchmark","author":"Banbury Colby","year":"2021","unstructured":"Colby Banbury, Vijay Janapa Reddi, Peter Torelli, Jeremy Holleman, Nat Jeffries, Csaba Kiraly, Pietro Montino, David Kanter, Sebastian Ahmed, Danilo Pau, et\u00a0al. 2021. MLPerf tiny benchmark. Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS\u201921).","journal-title":"Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS\u201921)"},{"key":"e_1_3_2_9_2","unstructured":"Michael Bayer. 2021. Mako Templates for Python. Retrieved from https:\/\/www.makotemplates.org\/"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS51385.2021.00027"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.48550\/ARXIV.1908.09791"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2015.06.105"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco_a_01149"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/0022-0000(79)90044-8"},{"key":"e_1_3_2_15_2","volume-title":"Proceedings of the European Symposium on Artificial Neural Networks (ESANN\u201913)","author":"Carvalho Danilo","year":"2013","unstructured":"Danilo Carvalho, Hugo Carneiro, Felipe Fran\u00e7a, and Priscila Lima. 2013. B-bleaching: Agile overtraining avoidance in the wisard weightless neural classifier. In Proceedings of the European Symposium on Artificial Neural Networks (ESANN\u201913)."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/iSES50453.2020.00055"},{"key":"e_1_3_2_17_2","first-page":"3123","volume-title":"Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS\u201915)","author":"Courbariaux Matthieu","year":"2015","unstructured":"Matthieu Courbariaux, Yoshua Bengio, and Jean-Pierre David. 2015. BinaryConnect: Training deep neural networks with binary weights during propagations. In Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS\u201915). MIT Press, Cambridge, MA, 3123\u20133131."},{"key":"e_1_3_2_18_2","unstructured":"Matthieu Courbariaux Itay Hubara Daniel Soudry Ran El-Yaniv and Yoshua Bengio. 2016. Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or  \\(-1\\) . Retrieved from https:\/\/arxiv:1602.02830"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.3389\/fnins.2021.611300"},{"key":"e_1_3_2_20_2","first-page":"1","volume-title":"Multiple Classifier Systems","author":"Dietterich Thomas G.","year":"2000","unstructured":"Thomas G. Dietterich. 2000. Ensemble methods in machine learning. In Multiple Classifier Systems. Springer, Berlin, 1\u201315."},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3124552"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.3389\/fncom.2021.584797"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2018.00012"},{"key":"e_1_3_2_24_2","first-page":"379","volume-title":"Proceedings of Machine Learning and Systems","volume":"2","author":"Fromm Joshua","year":"2020","unstructured":"Joshua Fromm, Meghan Cowan, Matthai Philipose, Luis Ceze, and Shwetak Patel. 2020. Riptide: Fast end-to-end binarized neural networks. In Proceedings of Machine Learning and Systems, I. Dhillon, D. Papailiopoulos, and V. Sze (Eds.), Vol. 2. 379\u2013389. Retrieved from https:\/\/proceedings.mlsys.org\/paper\/2020\/file\/2a79ea27c279e471f4d180b08d62b00a-Paper.pdf"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/FCCM.2018.00018"},{"key":"e_1_3_2_26_2","doi-asserted-by":"crossref","unstructured":"Amir Gholami Sehoon Kim Zhen Dong Zhewei Yao Michael W. Mahoney and Kurt Keutzer. 2021. A Survey of Quantization Methods for Efficient Neural Network Inference. Retrieved from https:\/\/arxiv:2103.13630","DOI":"10.1201\/9781003162810-13"},{"key":"e_1_3_2_27_2","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR\u201916)","author":"Han Song","year":"2016","unstructured":"Song Han, Huizi Mao, and William Dally. 2016. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. In Proceedings of the International Conference on Learning Representations (ICLR\u201916)."},{"key":"e_1_3_2_28_2","volume-title":"Advances in Neural Information Processing Systems","author":"Hubara Itay","year":"2016","unstructured":"Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, and Yoshua Bengio. 2016. Binarized neural networks. In Advances in Neural Information Processing Systems, D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett (Eds.), Vol. 29. Curran Associates, Inc. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2016\/file\/d8330f857a17c53d217014ee776bfd50-Paper.pdf"},{"key":"e_1_3_2_29_2","unstructured":"Forrest N. Iandola Matthew W. Moskewicz Khalid Ashraf Song Han William J. Dally and Kurt Keutzer. 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1 MB model size. Retrieved from https:\/\/arxiv.org\/abs\/1602.07360"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.48550\/ARXIV.1712.05877"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3140659.3080246"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/BRACIS.2016.029"},{"key":"e_1_3_2_33_2","article-title":"Adam: A method for stochastic optimization","author":"Kingma Diederik","year":"2014","unstructured":"Diederik Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations.","journal-title":"Proceedings of the International Conference on Learning Representations"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.23919\/DATE.2017.7926982"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/WASPAA.2019.8937164"},{"key":"e_1_3_2_36_2","volume-title":"Learning Multiple Layers of Features from Tiny Images","author":"Krizhevsky Alex","year":"2009","unstructured":"Alex Krizhevsky and Geoffrey Hinton. 2009. Learning Multiple Layers of Features from Tiny Images. Technical Report 0. University of Toronto, Toronto, Ontario."},{"key":"e_1_3_2_37_2","article-title":"MNIST handwritten digit database","author":"LeCun Yann","year":"2010","unstructured":"Yann LeCun and Corinna Cortes. 2010. MNIST handwritten digit database. Retrieved from http:\/\/yann.lecun.com\/exdb\/mnist\/http:\/\/yann.lecun.com\/exdb\/mnist\/","journal-title":"R"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISCAS.2016.7538947"},{"key":"e_1_3_2_39_2","unstructured":"Fengfu Li and Bin Liu. 2016. Ternary weight networks. Retrieved from http:\/\/arxiv.org\/abs\/1605.04711"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2017.09.046"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_2_42_2","unstructured":"Zhouhan Lin Matthieu Courbariaux Roland Memisevic and Yoshua Bengio. 2016. Neural networks with few multiplications. Retrieved from https:\/\/arxiv.org\/abs\/1510.03009"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1145\/3210240.3210337"},{"key":"e_1_3_2_44_2","first-page":"41","article-title":"Weightless neural models: A review of current and past works","volume":"2","author":"Ludermir Teresa","year":"1999","unstructured":"Teresa Ludermir, Andre de Carvalho, Ant\u00f4nio Braga, and M. C. P. Souto. 1999. Weightless neural models: A review of current and past works. Neural Comput. Surveys 2 (Jan. 1999), 41\u201361.","journal-title":"Neural Comput. Surveys"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2019.12.134"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2018.00020"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/ASAP54787.2022.00014"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/3373376.3378534"},{"key":"e_1_3_2_49_2","unstructured":"Alessandro Pappalardo. 2021. Xilinx\/brevitas. Retrieved from DOI:https:\/\/zenodo.org\/records\/8364211"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080254"},{"key":"e_1_3_2_51_2","first-page":"8024","volume-title":"Advances in Neural Information Processing Systems 32","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems 32. Curran Associates, 8024\u20138035. Retrieved from http:\/\/papers.neurips.cc\/paper\/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSEN.2019.2891911"},{"key":"e_1_3_2_53_2","doi-asserted-by":"crossref","unstructured":"Mohammad Rastegari Vicente Ordonez Joseph Redmon and Ali Farhadi. 2016. XNOR-net: ImageNet classification using binary convolutional neural networks. Retrieved from http:\/\/arxiv.org\/abs\/1603.05279","DOI":"10.1007\/978-3-319-46493-0_32"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1145\/3440689"},{"issue":"1","key":"e_1_3_2_55_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0893-6080(97)00062-2","article-title":"The theoretical and experimental status of the n-tuple classifier","volume":"11","author":"Rohwer Richard","year":"1998","unstructured":"Richard Rohwer and Michal Morciniec. 1998. The theoretical and experimental status of the n-tuple classifier. Neural Netw. 11, 1 (Jan. 1998), 1\u201314.","journal-title":"Neural Netw."},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.01.115"},{"key":"e_1_3_2_57_2","unstructured":"Simone. 2020. TinyML or Arduino and STM32: Convolutional Neural Network (CNN) Example. Retrieved from https:\/\/eloquentarduino.github.io\/2020\/11\/tinyml-on-arduino-and-stm32-cnn-convolutional-neural-network-example\/"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1145\/3559009.3569680"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.14428\/esann\/2022.ES2022-55"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1145\/3020078.3021744"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.48550\/ARXIV.1804.03209"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICONIP.2002.1198114"},{"key":"e_1_3_2_63_2","first-page":"643","volume-title":"Proceedings of the 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN\u201920)","author":"Xavier Pedro","year":"2020","unstructured":"Pedro Xavier, Massimo De Gregorio, Felipe M. G. Fran\u00e7a, and Priscila M. V. Lima. 2020. Detection of elementary particles with the WiSARD n-tuple classifier. In Proceedings of the 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN\u201920). 643\u2013648. Retrieved from https:\/\/www.esann.org\/sites\/default\/files\/proceedings\/2020\/ES2020-170.pdf"},{"key":"e_1_3_2_64_2","unstructured":"Penghang Yin Jiancheng Lyu Shuai Zhang Stanley J. Osher Yingyong Qi and Jack Xin. 2019. Understanding straight-through estimator in training activation quantized neural nets. Retrieved from http:\/\/arxiv.org\/abs\/1903.05662"},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080215"},{"key":"e_1_3_2_66_2","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2016.7783723"},{"key":"e_1_3_2_67_2","first-page":"191","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV\u201918)","author":"Zhang Tianyun","year":"2018","unstructured":"Tianyun Zhang, Shaokai Ye, Kaiqi Zhang, Jian Tang, Wujie Wen, Makan Fardad, and Yanzhi Wang. 2018. A systematic DNN weight pruning framework using alternating direction method of multipliers. In Proceedings of the European Conference on Computer Vision (ECCV\u201918), Vittorio Ferrari, Martial Hebert, Cristian Sminchisescu, and Yair Weiss (Eds.). Springer International Publishing, Cham, 191\u2013207."},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2018.00011"},{"key":"e_1_3_2_69_2","volume-title":"Proceedings of the 5th International Conference on Learning Representations (ICLR\u201917)","author":"Zhu Chenzhuo","year":"2017","unstructured":"Chenzhuo Zhu, Song Han, Huizi Mao, and William J. Dally. 2017. Trained ternary quantization. In Proceedings of the 5th International Conference on Learning Representations (ICLR\u201917). OpenReview.net. Retrieved from https:\/\/openreview.net\/forum?id=S1_pAu9xl"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3629522","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3629522","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:01Z","timestamp":1750178161000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3629522"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,14]]},"references-count":68,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,12,31]]}},"alternative-id":["10.1145\/3629522"],"URL":"https:\/\/doi.org\/10.1145\/3629522","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"value":"1544-3566","type":"print"},{"value":"1544-3973","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,14]]},"assertion":[{"value":"2023-04-14","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-10-08","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-12-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}