{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,22]],"date-time":"2025-11-22T17:10:45Z","timestamp":1763831445025,"version":"3.41.0"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"2","funder":[{"name":"National Key Research and Development Program of China","award":["2022YFB4500405"],"award-info":[{"award-number":["2022YFB4500405"]}]},{"name":"3rd Xinjiang Scientific Expedition Program","award":["2021xjkk1300"],"award-info":[{"award-number":["2021xjkk1300"]}]},{"name":"Shenzhen Science and Technology Plan Project","award":["SGDX20220530111001003"],"award-info":[{"award-number":["SGDX20220530111001003"]}]},{"name":"Shenzhen Science and Technology Program","award":["CJGJZD20230724093659004"],"award-info":[{"award-number":["CJGJZD20230724093659004"]}]},{"name":"Regional Innovation and Development Joint Fund of National Natural Science Foundation of China","award":["U22A6001"],"award-info":[{"award-number":["U22A6001"]}]},{"name":"Zhejiang provincial \u201cTen Thousand Talents Program\u201d","award":["2021R52007"],"award-info":[{"award-number":["2021R52007"]}]},{"DOI":"10.13039\/501100020771","name":"Young Scientists Fund of the National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62306286"],"award-info":[{"award-number":["62306286"]}],"id":[{"id":"10.13039\/501100020771","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100018537","name":"National Science and Technology Major Project","doi-asserted-by":"crossref","award":["2021ZD0114300"],"award-info":[{"award-number":["2021ZD0114300"]}],"id":[{"id":"10.13039\/501100018537","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2025,6,30]]},"abstract":"<jats:p>\n            The massive computational and memory requirements of deep convolutional neural networks (DCNNs) have led to the development of neural network (NN) accelerators. However, as DCNN models grow in size, the demands on NN accelerators in terms of performance, memory bandwidth, and power efficiency continue to increase. We, therefore, present\n            <jats:italic toggle=\"yes\">9Ring<\/jats:italic>\n            , a flexible and efficient DCNN accelerator that takes full advantage of 3D-stacked memory, focusing on its hardware architecture, software scheduling, and optimization strategy. In particular, we first show that the mismatch between DCNN accelerators and DCNN models can lead to increased energy consumption and performance bottlenecks. We then present three flexible\n            <jats:italic toggle=\"yes\">dataflow<\/jats:italic>\n            scheduling strategies to mitigate this mismatch. Afterward, we introduce an energy efficiency analysis tool that can automatically search for the optimal scheduling scheme with respect to different DCNN models for energy efficiency. Finally, we conduct an empirical study showing that 9Ring can reduce energy consumption by 31.4% and 43.9% on average, and improve performance by 12% and 10% on average, compared with Tetris and the NN accelerators on conventional low-power DRAM memory systems, respectively.\n          <\/jats:p>","DOI":"10.1145\/3732940","type":"journal-article","created":{"date-parts":[[2025,4,28]],"date-time":"2025-04-28T11:07:07Z","timestamp":1745838427000},"page":"1-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["9Ring: A 3D-Stacked Memory-Based Accelerator for Flexible and Efficient Deep CNN Applications"],"prefix":"10.1145","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7092-6376","authenticated-orcid":false,"given":"Wen","family":"Cheng","sequence":"first","affiliation":[{"name":"Zhejiang Lab","place":["Hangzhou, China"]}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-8916-8469","authenticated-orcid":false,"given":"Qianya","family":"Cheng","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology","place":["Wuhan, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3244-3258","authenticated-orcid":false,"given":"Yi","family":"Liu","sequence":"additional","affiliation":[{"name":"Zhejiang Lab","place":["Hangzhou, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3130-3015","authenticated-orcid":false,"given":"Lingfang","family":"Zeng","sequence":"additional","affiliation":[{"name":"Zhejiang Lab","place":["Hangzhou, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3083-2775","authenticated-orcid":false,"given":"Andre","family":"Brinkmann","sequence":"additional","affiliation":[{"name":"Johannes Gutenberg University Mainz","place":["Mainz, Germany"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9438-6060","authenticated-orcid":false,"given":"Yang","family":"Wang","sequence":"additional","affiliation":[{"name":"Shenzhen Institutes of Advanced Technology Chinese Academy of Sciences","place":["Shenzhen, China"]}]}],"member":"320","published-online":{"date-parts":[[2025,7]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/tvlsi.2016.2570283"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/tpds.2017.2752706"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.isprsjprs.2020.01.023"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.5555\/AAI28152127"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/2541940.2541967"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/micro.2014.58"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/jssc.2016.2616357"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/HCS55958.2022.9895592"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2023.3256796"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750389"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2011.5981829"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2016.7446059"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037702"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/asp-dac58780.2024.10473935"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.2017.10.013"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO56248.2022.00016"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICEPT56209.2022.9873433"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/isca.2016.41"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1145\/3065386"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358252"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/jssc.2014.2360379"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3077294"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1049\/sil2.12078"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCAD.2011.6105405"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/2786763.2694358"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001179"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/hpca.2017.29"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1145\/1065010.1065034"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3649219"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1145\/3020078.3021736"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/3460971"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/3dic52383.2021.9687615"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/lca.2023.3287976"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/2485922.2485963"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","unstructured":"Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In Proceedings of the 3rd International Conference on Learning Representations (ICLR). Computational and Biological Learning Society. DOI:10.48550\/arXiv.1409.1556","DOI":"10.48550\/arXiv.1409.1556"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/3649476.3658732"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/hpca.2019.00027"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1145\/2897937.2897995"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-021-06725-w"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2021.3052138"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.apacoust.2020.107854"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.1109\/jproc.2017.2761740"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TGRS.2022.3169835"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2017.2688340"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2018.2871623"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/2903150.2903478"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3576196"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1145\/1498765.1498785"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/2486001.2486020"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.compind.2022.103737"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","unstructured":"Matthew D. Zeiler and Rob Fergus. 2014. Visualizing and understanding convolutional networks. In Proceedings of the European Conference on Computer Vision (ECCV). Springer. DOI:10.1007\/978-3-319-10590-1_53","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA52012.2021.00061"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3732940","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,1]],"date-time":"2025-07-01T12:31:28Z","timestamp":1751373088000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3732940"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,30]]},"references-count":54,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,6,30]]}},"alternative-id":["10.1145\/3732940"],"URL":"https:\/\/doi.org\/10.1145\/3732940","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"type":"print","value":"1544-3566"},{"type":"electronic","value":"1544-3973"}],"subject":[],"published":{"date-parts":[[2025,6,30]]},"assertion":[{"value":"2023-08-11","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-04-21","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-07-01","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}