{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T15:34:45Z","timestamp":1759937685659,"version":"3.41.0"},"reference-count":26,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2013,12,1]],"date-time":"2013-12-01T00:00:00Z","timestamp":1385856000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001459","name":"Ministry of Education - Singapore","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001459","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Embed. Comput. Syst."],"published-print":{"date-parts":[[2013,12]]},"abstract":"<jats:p>Application-specific system-on-chip platforms create the opportunity to customize the cache configuration for optimal performance with minimal chip area. Simulation, in particular trace-driven simulation, is widely used to estimate cache hit rates. However, simulation is too slow to be deployed in design space exploration, especially when there are hundreds of design points and the traces are huge. In this article, we propose a novel analytical approach for design space exploration of instruction caches. Given the program control flow graph (CFG) annotated only with basic block and control flow edge execution counts, we first model the cache states at each point of the CFG in a probabilistic manner. Then, we exploit the structural similarities among related cache configurations to estimate the cache hit rates for multiple cache configurations in one pass. Experimental results indicate that our analysis is 28--2,500 times faster compared to the fastest known cache simulator while maintaining high accuracy (0.2% average error) in estimating cache hit rates for a large set of popular benchmarks. Moreover, compared to a state-of-the-art cache design space exploration technique, our approach achieves 304--8,086 times speedup and saves up to 62% (average 7%) energy for the evaluated benchmarks.<\/jats:p>","DOI":"10.1145\/2539036.2539039","type":"journal-article","created":{"date-parts":[[2013,12,20]],"date-time":"2013-12-20T19:50:46Z","timestamp":1387569046000},"page":"1-29","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["An analytical approach for fast and accurate design space exploration of instruction caches"],"prefix":"10.1145","volume":"13","author":[{"given":"Yun","family":"Liang","sequence":"first","affiliation":[{"name":"Peking University and Advanced Digital Science Center, University of Illinois at Urbana-Champaign, Beijing, P.R. China"}]},{"given":"Tulika","family":"Mitra","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore"}]}],"member":"320","published-online":{"date-parts":[[2013,12,24]]},"reference":[{"volume-title":"Proceedings of the Real-Time Systems Symposium. 172--181","author":"Arnold R.","key":"e_1_2_1_1_1","unstructured":"Arnold , R. , Mueller , F. , Whalley , D. , and Harmon , M . 1994. Bounding worst-case instruction cache performance . In Proceedings of the Real-Time Systems Symposium. 172--181 . Arnold, R., Mueller, F., Whalley, D., and Harmon, M. 1994. Bounding worst-case instruction cache performance. In Proceedings of the Real-Time Systems Symposium. 172--181."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.982917"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/186025.186027"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/339647.339657"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1027084.1027086"},{"volume-title":"Proceedings of the Conference on Design, Automation and Test in Europe (DATE'07)","author":"Gordon-Ross A.","key":"e_1_2_1_6_1","unstructured":"Gordon-Ross , A. , Viana , P. , Vahid , F. , Najjar , W. , and Barros , E . 2007. A one-shot configurable-cache tuner for improved energy and performance . In Proceedings of the Conference on Design, Automation and Test in Europe (DATE'07) . 755--760. Gordon-Ross, A., Viana, P., Vahid, F., Najjar, W., and Barros, E. 2007. A one-shot configurable-cache tuner for improved energy and performance. In Proceedings of the Conference on Design, Automation and Test in Europe (DATE'07). 755--760."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1023833.1023870"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1128020.1128563"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1629435.1629476"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/12.40842"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1006209.1006227"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/337292.337559"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1391469.1391551"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1450135.1450159"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1837274.1837362"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1878921.1878944"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1147\/sj.92.0078"},{"key":"e_1_2_1_18_1","first-page":"1","article-title":"A 160-mhz, 32-b, 0.5-w cmos risc microprocessor","volume":"9","author":"Montanaro J.","year":"1997","unstructured":"Montanaro , J. , Witek , R. T. Anne , K. , Black , A. J. , Cooper , E. M. , Dobberpuhl , D. W. , Donahue , P. M. , Eno , J. , Farell , A. , Hoeppner , G. W. , Kruckmeyer , D. , Lee , T. H. , Lin , P. C. M , Madden , L. , Murray , D. , Pearce , M. H. , Santhanam , S. , Snyder , K. J. , Stephany , R. , and Thieruf , S. C. 1997 . A 160-mhz, 32-b, 0.5-w cmos risc microprocessor . Digital Tech. J. 9 , 1 . Montanaro, J., Witek, R. T. Anne, K., Black, A. J., Cooper, E. M., Dobberpuhl, D. W., Donahue, P. M., Eno, J., Farell, A., Hoeppner, G. W., Kruckmeyer, D., Lee, T. H., Lin, P. C. M, Madden, L., Murray, D., Pearce, M. H., Santhanam, S., Snyder, K. J., Stephany, R., and Thieruf, S. C. 1997. A 160-mhz, 32-b, 0.5-w cmos risc microprocessor. Digital Tech. J. 9, 1.","journal-title":"Digital Tech. J."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/4.509850"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/200912.200918"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/254180.254184"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/128738.128740"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/301177.301496"},{"volume-title":"Proceeding of the 14th IEEE International Workshop on Rapid System Prototyping. 164","author":"Zhang C.","key":"e_1_2_1_24_1","unstructured":"Zhang , C. and Vahid , F . 2003. Cache configuratoin exploration on prototying platforms . In Proceeding of the 14th IEEE International Workshop on Rapid System Prototyping. 164 . Zhang, C. and Vahid, F. 2003. Cache configuratoin exploration on prototying platforms. In Proceeding of the 14th IEEE International Workshop on Rapid System Prototyping. 164."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/871656.859635"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1162\/106365600568202"}],"container-title":["ACM Transactions on Embedded Computing Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2539036.2539039","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2539036.2539039","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:34:50Z","timestamp":1750232090000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2539036.2539039"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,12]]},"references-count":26,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2013,12]]}},"alternative-id":["10.1145\/2539036.2539039"],"URL":"https:\/\/doi.org\/10.1145\/2539036.2539039","relation":{},"ISSN":["1539-9087","1558-3465"],"issn-type":[{"type":"print","value":"1539-9087"},{"type":"electronic","value":"1558-3465"}],"subject":[],"published":{"date-parts":[[2013,12]]},"assertion":[{"value":"2011-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2012-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-12-24","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}