{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,3,31]],"date-time":"2022-03-31T05:13:56Z","timestamp":1648703636376},"reference-count":9,"publisher":"World Scientific Pub Co Pte Lt","issue":"05","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J CIRCUIT SYST COMP"],"published-print":{"date-parts":[[2014,6]]},"abstract":"<jats:p> Multicore designs have become the dominant organization for future high performance microprocessors. Instead of increasing cache sizes, clock frequencies, pipeline depths or register file (RF) ports, multicore designs tend to make each processor core simple but highly efficient. This new dimension for improving performance and power efficiency in multicore requires us to rethink processor architecture. Multiply-accumulate (MAC) operation is such a performance improvement technique that needs to be reviewed. MAC operation is the fundamentals of many DSP and multimedia applications, but it tends to be awkward to implement in an orthogonal instruction set architecture (ISA) because of operand bandwidth problem, instruction encoding problem, and hardware cost problem. So a big question is that whether we should support MAC or not in high-efficiency processor designs? This paper does a comparative study on this question and introduce data bandwidth relaxing techniques to eliminate narrow bandwidth provided by two-port RFs. The trade-off are also made to solve the instruction coding and hardware cost problem. So, the new design wisdom becomes that if you support multiply (MUL) operation, then support MAC operation. <\/jats:p>","DOI":"10.1142\/s0218126614500686","type":"journal-article","created":{"date-parts":[[2014,3,10]],"date-time":"2014-03-10T02:07:01Z","timestamp":1394417221000},"page":"1450068","source":"Crossref","is-referenced-by-count":0,"title":["MAC OR NON-MAC: NOT A PROBLEM"],"prefix":"10.1142","volume":"23","author":[{"given":"LIBO","family":"HUANG","sequence":"first","affiliation":[{"name":"State Key Laboratory of High Performance Computing and School of Computer, National University of Defense Technology, Changsha 410073, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"LI","family":"SHEN","sequence":"additional","affiliation":[{"name":"State Key Laboratory of High Performance Computing and School of Computer, National University of Defense Technology, Changsha 410073, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"YASHUAI","family":"LV","sequence":"additional","affiliation":[{"name":"State Key Laboratory of High Performance Computing and School of Computer, National University of Defense Technology, Changsha 410073, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"ZHIYING","family":"WANG","sequence":"additional","affiliation":[{"name":"State Key Laboratory of High Performance Computing and School of Computer, National University of Defense Technology, Changsha 410073, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"KUI","family":"DAI","sequence":"additional","affiliation":[{"name":"Department of Science &amp; Technology of Electronics, Huangzhong University of Science and Technology, Wuhan 430074, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2014,5,8]]},"reference":[{"key":"rf3","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2011.77"},{"key":"rf4","volume-title":"Embedded computing: A VLIW Approach to Architecture Compilers and Tools","author":"Fisher Joseph A.","year":"2005"},{"key":"rf6","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2008.915536"},{"key":"rf7","doi-asserted-by":"publisher","DOI":"10.1145\/1968502.1968509"},{"key":"rf8","volume":"3","author":"Seto K.","year":"2010","journal-title":"IPSJ Trans. Syst. LSI Design Methodol."},{"key":"rf9","author":"Lee R. B.","year":"2005","journal-title":"J. VLSI Signal Proces."},{"key":"rf15","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2003.821545"},{"key":"rf19","doi-asserted-by":"publisher","DOI":"10.1109\/4.799851"},{"key":"rf21","author":"Lee C.","year":"1997","journal-title":"Micro."}],"container-title":["Journal of Circuits, Systems and Computers"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218126614500686","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,6]],"date-time":"2019-08-06T15:51:44Z","timestamp":1565106704000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218126614500686"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,5,8]]},"references-count":9,"journal-issue":{"issue":"05","published-online":{"date-parts":[[2014,5,8]]},"published-print":{"date-parts":[[2014,6]]}},"alternative-id":["10.1142\/S0218126614500686"],"URL":"https:\/\/doi.org\/10.1142\/s0218126614500686","relation":{},"ISSN":["0218-1266","1793-6454"],"issn-type":[{"value":"0218-1266","type":"print"},{"value":"1793-6454","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,5,8]]}}}