{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T00:18:03Z","timestamp":1767831483330,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":39,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,2,11]],"date-time":"2022-02-11T00:00:00Z","timestamp":1644537600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62102257"],"award-info":[{"award-number":["62102257"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Alibaba Group","award":["Alibaba Innovative Research Program"],"award-info":[{"award-number":["Alibaba Innovative Research Program"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,2,13]]},"DOI":"10.1145\/3490422.3502367","type":"proceedings-article","created":{"date-parts":[[2022,2,12]],"date-time":"2022-02-12T05:09:21Z","timestamp":1644642561000},"page":"112-122","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":17,"title":["N3H-Core"],"prefix":"10.1145","author":[{"given":"Yu","family":"Gong","sequence":"first","affiliation":[{"name":"Shanghai Qi Zhi Institute, Shanghai, China"}]},{"given":"Zhihan","family":"Xu","sequence":"additional","affiliation":[{"name":"Shanghai Qi Zhi Institute, Shanghai, China"}]},{"given":"Zhezhi","family":"He","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"given":"Weifeng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Alibaba Group US Inc., San Diego, CA, USA"}]},{"given":"Xiaobing","family":"Tu","sequence":"additional","affiliation":[{"name":"Alibaba Group, Hangzhou, China"}]},{"given":"Xiaoyao","family":"Liang","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"given":"Li","family":"Jiang","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2022,2,11]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"et almbox","author":"Brown Tom B","year":"2020","unstructured":"Tom B Brown , Benjamin Mann , Nick Ryder , Melanie Subbiah , Jared Kaplan , Prafulla Dhariwal , Arvind Neelakantan , Pranav Shyam , Girish Sastry , Amanda Askell , et almbox . 2020 . Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020). Tom B Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et almbox. 2020. Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020)."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01318"},{"key":"e_1_3_2_2_3_1","volume-title":"Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework. In 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA) . 208--220","author":"Chang Sung-En","year":"2021","unstructured":"Sung-En Chang , Yanyu Li , Mengshu Sun , Runbin Shi , Hayden K.-H. So , Xuehai Qian , Yanzhi Wang , and Xue Lin . 2021 . Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework. In 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA) . 208--220 . https:\/\/doi.org\/10.1109\/HPCA51647.2021.00027 10.1109\/HPCA51647.2021.00027 Sung-En Chang, Yanyu Li, Mengshu Sun, Runbin Shi, Hayden K.-H. So, Xuehai Qian, Yanzhi Wang, and Xue Lin. 2021. Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework. In 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA) . 208--220. https:\/\/doi.org\/10.1109\/HPCA51647.2021.00027"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001177"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/JETCAS.2019.2910232"},{"key":"e_1_3_2_2_6_1","volume-title":"Vijayalakshmi Srinivasan, and Kailash Gopalakrishnan.","author":"Choi Jungwook","year":"2018","unstructured":"Jungwook Choi , Zhuo Wang , Swagath Venkataramani , Pierce I-Jen Chuang , Vijayalakshmi Srinivasan, and Kailash Gopalakrishnan. 2018 . Pact : Parameterized clipping activation for quantized neural networks. arXiv preprint arXiv:1805.06085 (2018). Jungwook Choi, Zhuo Wang, Swagath Venkataramani, Pierce I-Jen Chuang, Vijayalakshmi Srinivasan, and Kailash Gopalakrishnan. 2018. Pact: Parameterized clipping activation for quantized neural networks. arXiv preprint arXiv:1805.06085 (2018)."},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2019.00363"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00038"},{"key":"e_1_3_2_2_10_1","volume-title":"NeurIPS ML for Systems workshop","author":"Elthakeb Ahmed","year":"2019","unstructured":"Ahmed Elthakeb , Prannoy Pilligundla , FatemehSadat Mireshghallah , Amir Yazdanbakhsh , Sicuan Gao , and Hadi Esmaeilzadeh . 2019 . Releq: An automatic reinforcement learning approach for deep quantization of neural networks . In NeurIPS ML for Systems workshop , 2018 . Ahmed Elthakeb, Prannoy Pilligundla, FatemehSadat Mireshghallah, Amir Yazdanbakhsh, Sicuan Gao, and Hadi Esmaeilzadeh. 2019. Releq: An automatic reinforcement learning approach for deep quantization of neural networks. In NeurIPS ML for Systems workshop, 2018 ."},{"key":"e_1_3_2_2_11_1","volume-title":"Learned step size quantization. ICLR","author":"Esser Steven K","year":"2019","unstructured":"Steven K Esser , Jeffrey L McKinstry , Deepika Bablani , Rathinakumar Appuswamy , and Dharmendra S Modha . 2019. Learned step size quantization. ICLR ( 2019 ). Steven K Esser, Jeffrey L McKinstry, Deepika Bablani, Rathinakumar Appuswamy, and Dharmendra S Modha. 2019. Learned step size quantization. ICLR (2019)."},{"key":"e_1_3_2_2_12_1","volume-title":"Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity. arXiv preprint arXiv:2101.03961","author":"Fedus William","year":"2021","unstructured":"William Fedus , Barret Zoph , and Noam Shazeer . 2021 . Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity. arXiv preprint arXiv:2101.03961 (2021). William Fedus, Barret Zoph, and Noam Shazeer. 2021. Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity. arXiv preprint arXiv:2101.03961 (2021)."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00495"},{"key":"e_1_3_2_2_14_1","volume-title":"Deep learning","author":"Goodfellow Ian","unstructured":"Ian Goodfellow , Yoshua Bengio , Aaron Courville , and Yoshua Bengio . 2016. Deep learning . Vol. 1 . MIT press Cambridge . Ian Goodfellow, Yoshua Bengio, Aaron Courville, and Yoshua Bengio. 2016. Deep learning. Vol. 1. MIT press Cambridge."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2017.2705069"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_48"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01170"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV.2019.00102"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00286"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00523"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ReConFig.2013.6732284"},{"key":"e_1_3_2_2_23_1","volume-title":"Convolutional neural networks using logarithmic data representation. arXiv preprint arXiv:1603.01025","author":"Miyashita Daisuke","year":"2016","unstructured":"Daisuke Miyashita , Edward H Lee , and Boris Murmann . 2016. Convolutional neural networks using logarithmic data representation. arXiv preprint arXiv:1603.01025 ( 2016 ). Daisuke Miyashita, Edward H Lee, and Boris Murmann. 2016. Convolutional neural networks using logarithmic data representation. arXiv preprint arXiv:1603.01025 (2016)."},{"key":"e_1_3_2_2_24_1","unstructured":"Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in pytorch. (2017).  Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in pytorch. (2017)."},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2019.2921159"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00474"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA45697.2020.00086"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2847263.2847276"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/FPL.2018.00059"},{"key":"e_1_3_2_2_30_1","volume-title":"2017 27th International Conference on Field Programmable Logic and Applications, FPL 2017 (2017","author":"Stylianos","year":"2017","unstructured":"Stylianos I. Venieris and Christos Savvas Bouganis. 2017a. Latency-driven design for FPGA-based convolutional neural networks . 2017 27th International Conference on Field Programmable Logic and Applications, FPL 2017 (2017 ). https:\/\/doi.org\/10.23919\/FPL. 2017 .8056828 10.23919\/FPL.2017.8056828 Stylianos I. Venieris and Christos Savvas Bouganis. 2017a. Latency-driven design for FPGA-based convolutional neural networks . 2017 27th International Conference on Field Programmable Logic and Applications, FPL 2017 (2017). https:\/\/doi.org\/10.23919\/FPL.2017.8056828"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.23919\/FPL.2017.8056828"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2014.73"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00881"},{"key":"e_1_3_2_2_34_1","volume-title":"Mixed precision quantization of convnets via differentiable neural architecture search. arXiv preprint arXiv:1812.00090","author":"Wu Bichen","year":"2018","unstructured":"Bichen Wu , Yanghan Wang , Peizhao Zhang , Yuandong Tian , Peter Vajda , and Kurt Keutzer . 2018. Mixed precision quantization of convnets via differentiable neural architecture search. arXiv preprint arXiv:1812.00090 ( 2018 ). Bichen Wu, Yanghan Wang, Peizhao Zhang, Yuandong Tian, Peter Vajda, and Kurt Keutzer. 2018. Mixed precision quantization of convnets via differentiable neural architecture search. arXiv preprint arXiv:1812.00090 (2018)."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/FPL.2019.00030"},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/216585.216588"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"crossref","unstructured":"Chen Zhang Peng Li Guangyu Sun Yijin Guan Bingjun Xiao and Jason Cong. [n. d.]. Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks . ( [n. d.]). https:\/\/doi.org\/10.1145\/2684746.2689060    10.1145\/2684746.2689060\nChen Zhang Peng Li Guangyu Sun Yijin Guan Bingjun Xiao and Jason Cong. [n. d.]. Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks . ( [n. d.]). https:\/\/doi.org\/10.1145\/2684746.2689060","DOI":"10.1145\/2684746.2689060"},{"key":"e_1_3_2_2_38_1","volume-title":"Dorefa-net: Training low bitwidth convolutional neural networks with low bitwidth gradients. arXiv preprint arXiv:1606.06160","author":"Zhou Shuchang","year":"2016","unstructured":"Shuchang Zhou , Yuxin Wu , Zekun Ni , Xinyu Zhou , He Wen , and Yuheng Zou . 2016 . Dorefa-net: Training low bitwidth convolutional neural networks with low bitwidth gradients. arXiv preprint arXiv:1606.06160 (2016). Shuchang Zhou, Yuxin Wu, Zekun Ni, Xinyu Zhou, He Wen, and Yuheng Zou. 2016. Dorefa-net: Training low bitwidth convolutional neural networks with low bitwidth gradients. arXiv preprint arXiv:1606.06160 (2016)."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2018.8486500"}],"event":{"name":"FPGA '22: The 2022 ACM\/SIGDA International Symposium on Field-Programmable Gate Arrays","location":"Virtual Event USA","acronym":"FPGA '22","sponsor":["SIGDA ACM Special Interest Group on Design Automation"]},"container-title":["Proceedings of the 2022 ACM\/SIGDA International Symposium on Field-Programmable Gate Arrays"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3490422.3502367","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3490422.3502367","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:31:03Z","timestamp":1750188663000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3490422.3502367"}},"subtitle":["Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores"],"short-title":[],"issued":{"date-parts":[[2022,2,11]]},"references-count":39,"alternative-id":["10.1145\/3490422.3502367","10.1145\/3490422"],"URL":"https:\/\/doi.org\/10.1145\/3490422.3502367","relation":{},"subject":[],"published":{"date-parts":[[2022,2,11]]},"assertion":[{"value":"2022-02-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}