{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T16:56:39Z","timestamp":1777654599364,"version":"3.51.4"},"reference-count":180,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2018,1,10]],"date-time":"2018-01-10T00:00:00Z","timestamp":1515542400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2019,1,31]]},"abstract":"<jats:p>Approximate computing has gained research attention recently as a way to increase energy efficiency and\/or performance by exploiting some applications\u2019 intrinsic error resiliency. However, little attention has been given to its potential for tackling the communication bottleneck that remains one of the looming challenges to be tackled for efficient parallelism. This article explores the potential benefits of approximate computing for communication reduction by surveying three promising techniques for approximate communication: compression, relaxed synchronization, and value prediction. The techniques are compared based on an evaluation framework composed of communication cost reduction, performance, energy reduction, applicability, overheads, and output degradation. Comparison results demonstrate that lossy link compression and approximate value prediction show great promise for reducing the communication bottleneck in bandwidth-constrained applications. Meanwhile, relaxed synchronization is found to provide large speedups for select error-tolerant applications, but suffers from limited general applicability and unreliable output degradation guarantees. Finally, this article concludes with several suggestions for future research on approximate communication techniques.<\/jats:p>","DOI":"10.1145\/3145812","type":"journal-article","created":{"date-parts":[[2018,1,10]],"date-time":"2018-01-10T16:51:38Z","timestamp":1515603098000},"page":"1-32","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":56,"title":["Approximate Communication"],"prefix":"10.1145","volume":"51","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6180-8490","authenticated-orcid":false,"given":"Filipe","family":"Betzel","sequence":"first","affiliation":[{"name":"University of Minnesota, Minneapolis, MN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Karen","family":"Khatamifard","sequence":"additional","affiliation":[{"name":"University of Minnesota, Minneapolis, MN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Harini","family":"Suresh","sequence":"additional","affiliation":[{"name":"University of Minnesota, Minneapolis, MN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David J.","family":"Lilja","sequence":"additional","affiliation":[{"name":"University of Minnesota, Minneapolis, MN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"John","family":"Sartori","sequence":"additional","affiliation":[{"name":"University of Minnesota, Minneapolis, MN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ulya","family":"Karpuzcu","sequence":"additional","affiliation":[{"name":"University of Minnesota, Minneapolis, MN"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,1,10]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1347375.1347379"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.452.0287"},{"key":"e_1_2_1_3_1","volume-title":"Retrieved","author":"Adams Don","year":"1993"},{"key":"e_1_2_1_4_1","volume-title":"Workshop on Duplicating, Deconstructing and Debunking (WDDD\u201915)","author":"Akturk Ismail"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the 31st Annual International Symposium on Computer Architecture. IEEE, 212--223","author":"Alaa"},{"key":"e_1_2_1_6_1","volume-title":"IEEE 13th International Symposium on High Performance Computer Architecture (HPCA\u201907)","author":"Alaa"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1088149.1088183"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2005.119"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1465482.1465560"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/DCC.2012.21"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ESTIMedia.2013.6704499"},{"key":"e_1_2_1_12_1","volume-title":"Marek Olszewski, Qin Zhao, Alan Edelman, and Saman Amarasinghe.","author":"Ansel Jason","year":"2009"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/2190025.2190056"},{"key":"e_1_2_1_14_1","volume-title":"Chilimbi","author":"Baek Woongki","year":"2010"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2008.2011232"},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of the 1990 ACM\/IEEE Conference on Supercomputing (Supercomputing\u201990)","author":"Carl"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2013.81"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISLPED.2013.6629279"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/PDP.2014.60"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1735688.1735705"},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 26th International Symposium on Computer Architecture. IEEE, 64--74","author":"Calder Brad"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the 33rd Annual ACM\/IEEE International Symposium on Microarchitecture. ACM, 181--190","author":"Canal Ramon"},{"key":"e_1_2_1_24_1","volume-title":"Data Compression and Error Control Techniques with Applications","author":"Cappellini Vito"},{"key":"e_1_2_1_25_1","volume-title":"Rinard","author":"Carbin Michael","year":"2013"},{"key":"e_1_2_1_26_1","volume-title":"47th ACM\/IEEE Design Automation Conference (DAC\u201910)","author":"Srimat"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/NAS.2008.27"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2008.917729"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/CICC.2013.6658433"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2013.2276759"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACSSC.2013.6810241"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2627369.2627645"},{"key":"e_1_2_1_33_1","volume-title":"IEEE\/ACM International Symposium on Low Power Electronics and Design (ISLPED\u201914)","author":"Cho Kyungsang"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.5555\/874076.876479"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00034-011-9318-5"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2011.2163394"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.5555\/527072.822610"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.492.0213"},{"key":"e_1_2_1_39_1","volume-title":"Jaswinder Pal Singh, and Anoop Gupta","author":"Culler David E.","year":"1999"},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of the Design Automation Conference. IEEE, 684--689","author":"William"},{"key":"e_1_2_1_41_1","volume-title":"IEEE 14th International Symposium on High Performance Computer Architecture (HPCA\u201908)","author":"Das Reetuparna"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/1815961.1816026"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1201\/9781482276237"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.728"},{"key":"e_1_2_1_45_1","volume-title":"19th Asia and South Pacific Design Automation Conference (ASP-DAC\u201914)","author":"Du Zidong","year":"2014"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.7873\/DATE.2015.1116"},{"key":"e_1_2_1_47_1","unstructured":"Pradeep Dubey. 2005. Recognition mining and synthesis moves computers to the era of Tera. Technology@ Intel Magazine 9 2 1--10.  Pradeep Dubey. 2005. Recognition mining and synthesis moves computers to the era of Tera. Technology@ Intel Magazine 9 2 1--10."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1955.1055126"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/2000064.2000108"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/2150976.2151008"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2012.48"},{"key":"e_1_2_1_52_1","volume-title":"Patt","author":"Evers Marius","year":"1996"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ATS.2012.57"},{"key":"e_1_2_1_54_1","unstructured":"Eric Freudenthal and Olivier Peze. 1988. Efficient Synchronization Algorithms Using Fetch-and-Add on Multiple Bitfield Integers. Ultracomputer Note 148.  Eric Freudenthal and Olivier Peze. 1988. Efficient Synchronization Algorithms Using Fetch-and-Add on Multiple Bitfield Integers. Ultracomputer Note 148."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/2744769.2744871"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2001.903264"},{"key":"e_1_2_1_57_1","volume-title":"Nguyen","author":"Goiri Inigo","year":"2015"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/2717311"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.5555\/2016802.2016898"},{"key":"e_1_2_1_61_1","volume-title":"Proceedings of the 3rd Workshop on Memory Performance Issues: In Conjunction with the 31st International Symposium on Computer Architecture. ACM, 9--15","author":"Erik"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/165123.165164"},{"key":"e_1_2_1_63_1","unstructured":"T. Hoefler T. Mehlan F. Mietke and W. Rehm. 2004. A survey of barrier algorithms for coarse grained supercomputers. Chemnitzer Informatik Berichte 4 3 (2004).  T. Hoefler T. Mehlan F. Mietke and W. Rehm. 2004. A survey of barrier algorithms for coarse grained supercomputers. Chemnitzer Informatik Berichte 4 3 (2004)."},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/1950365.1950390"},{"key":"e_1_2_1_66_1","doi-asserted-by":"crossref","unstructured":"Chih-Chieh Hsiao Slo-Li Chu and Chen-Yu Chen. 2013. Energy-aware hybrid precision selection framework for mobile GPUs. Computers 8 Graphics 37 5 431--444.  Chih-Chieh Hsiao Slo-Li Chu and Chen-Yu Chen. 2013. Energy-aware hybrid precision selection framework for mobile GPUs. Computers 8 Graphics 37 5 431--444.","DOI":"10.1016\/j.cag.2013.03.003"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/2228360.2228450"},{"key":"e_1_2_1_68_1","volume-title":"Euro-Par 2012 Parallel Processing","author":"Iverson Jeremy"},{"key":"e_1_2_1_69_1","volume-title":"Proceedings of the 41st Annual IEEE\/ACM International Symposium on Microarchitecture. IEEE Computer Society, 354--363","author":"Jin Yuho","year":"2008"},{"key":"e_1_2_1_70_1","volume-title":"Proceedings of the 49th Annual Design Automation Conference. ACM, 820--825","author":"Andrew"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11265-011-0631-9"},{"key":"e_1_2_1_72_1","volume-title":"Workshop on Approximate Computing (WAPCO\u201915)","author":"Keramidas Georgios","year":"2015"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.33"},{"key":"e_1_2_1_74_1","volume-title":"Proceedings of the 5th ACM\/IEEE International Symposium on Networks-on-Chip. ACM, 233--240","author":"Kim Hyungjun"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2006.3"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1109\/VLSID.2011.51"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/103085.103090"},{"key":"e_1_2_1_78_1","volume-title":"Supercomputing\u201998: Proceedings of the 1998 ACM\/IEEE Conference on Supercomputing (CDROM). IEEE Computer Society","author":"Lee Jae Bum","year":"1998"},{"key":"e_1_2_1_79_1","volume-title":"International Conference on Computer Design (ICCD\u201999)","author":"Lee Jang-Soo","year":"1999"},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2005.863753"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0141-9331(02)00041-8"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.3390\/s121114647"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750417"},{"key":"e_1_2_1_84_1","volume-title":"ERSA: Error resilient system architecture for probabilistic applications. In Design, Automation 8 Test in Europe Conference 8 Exhibition (DATE\u201910)","author":"Leem Larkhoon","year":"2010"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1145\/45072.45074"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2004.835151"},{"key":"e_1_2_1_87_1","volume-title":"Proceedings of the 29th Annual ACM\/IEEE International Symposium on Microarchitecture. IEEE Computer Society, 226--237","author":"Mikko"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1145\/248208.237173"},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2010.10"},{"key":"e_1_2_1_90_1","volume-title":"Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS\u201909)","author":"Liu Song"},{"key":"e_1_2_1_91_1","volume-title":"Workshop on Near-Data Processing (WoNDP\u201913)","author":"Loh Gabriel H.","year":"2013"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1109\/TGRS.2003.811556"},{"key":"e_1_2_1_93_1","volume-title":"Proceedings of the 34th Annual ACM\/IEEE International Symposium on Microarchitecture. IEEE Computer Society, 328--337","author":"Martin Milo M. K."},{"key":"e_1_2_1_94_1","doi-asserted-by":"publisher","DOI":"10.1145\/103727.103729"},{"key":"e_1_2_1_95_1","volume-title":"IEEE International Symposium on Parallel 8 Distributed Processing (IPDPS\u201909)","author":"Meng Jiayuan","year":"2009"},{"key":"e_1_2_1_96_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2010.5470469"},{"key":"e_1_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.22"},{"key":"e_1_2_1_98_1","volume-title":"Rinard","author":"Misailovic Sasa","year":"2014"},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.1145\/2465787.2465790"},{"key":"e_1_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.1145\/1806799.1806808"},{"key":"e_1_2_1_101_1","volume-title":"Proceedings of the 2012 ACM Workshop on Relaxing Synchronization for Multicore and Manycore Scalability. ACM, 51--60","author":"Misailovic Sasa"},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.1145\/2893356"},{"key":"e_1_2_1_103_1","volume-title":"Design, Automation 8 Test in Europe Conference 8 Exhibition (DATE\u201911)","author":"Mohapatra Debabrata"},{"key":"e_1_2_1_104_1","doi-asserted-by":"publisher","DOI":"10.1145\/1594233.1594282"},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2015.7056066"},{"key":"e_1_2_1_106_1","volume-title":"The GSM System for Mobile Communications","author":"Mouly Michel"},{"key":"e_1_2_1_107_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.1999.744311"},{"key":"e_1_2_1_108_1","volume-title":"Proceedings of the Conference on Design, Automation and Test in Europe. European Design and Automation Association, 335--338","author":"Narayanan Sriram"},{"key":"e_1_2_1_109_1","volume-title":"Proceedings of the 14th International Symposium on Parallel and Distributed Processing (IPDPS\u201900)","author":"Nikolopoulos D."},{"key":"e_1_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1109\/79.618009"},{"key":"e_1_2_1_111_1","volume-title":"Retrieved","author":"NVIDIA.","year":"2014"},{"key":"e_1_2_1_112_1","doi-asserted-by":"publisher","DOI":"10.1109\/VLSID.2006.103"},{"key":"e_1_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.1109\/78.984749"},{"key":"e_1_2_1_114_1","volume-title":"IEEE 20th International Symposium on High Performance Computer Architecture (HPCA\u201914)","author":"Palframan David J."},{"key":"e_1_2_1_115_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTCHIPS.2011.7477494"},{"key":"e_1_2_1_116_1","doi-asserted-by":"publisher","DOI":"10.1109\/LCA.2015.2430853"},{"key":"e_1_2_1_117_1","volume-title":"IEEE International Symposium on High Performance Computer Architecture (HPCA\u201916)","author":"Pekhimenko Gennady"},{"key":"e_1_2_1_118_1","doi-asserted-by":"publisher","DOI":"10.1145\/2678373.2665742"},{"key":"e_1_2_1_119_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2014.6835952"},{"key":"e_1_2_1_120_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2015.7056018"},{"key":"e_1_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDCS.2005.37"},{"key":"e_1_2_1_122_1","volume-title":"Proceedings of the 2015 Design, Automation 8 Test in Europe Conference 8 Exhibition. EDA Consortium, 1497--1502","author":"Rahimi Abbas"},{"key":"e_1_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.1006\/jpdc.1999.1556"},{"key":"e_1_2_1_124_1","doi-asserted-by":"publisher","DOI":"10.1145\/1356058.1356082"},{"key":"e_1_2_1_125_1","doi-asserted-by":"publisher","DOI":"10.1145\/2744769.2744799"},{"key":"e_1_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.1145\/2414729.2414737"},{"key":"e_1_2_1_127_1","unstructured":"Martin Rinard. 2013. Parallel synchronization-free approximate data structure construction. In HotPar.  Martin Rinard. 2013. Parallel synchronization-free approximate data structure construction. In HotPar."},{"key":"e_1_2_1_128_1","volume-title":"RACES Workshop.","author":"Rinard Martin C.","year":"2012"},{"key":"e_1_2_1_129_1","volume-title":"17th IEEE Symposium on Field Programmable Custom Computing Machines (FCCM\u201909)","author":"Roldao-Lopes Antonio"},{"key":"e_1_2_1_130_1","doi-asserted-by":"publisher","DOI":"10.1145\/2503210.2503296"},{"key":"e_1_2_1_131_1","volume-title":"Data Compression: The Complete Reference","author":"Salomon David","year":"2004"},{"key":"e_1_2_1_132_1","volume-title":"Janghaeng Lee, and Scott Mahlke.","author":"Samadi Mehrzad","year":"2014"},{"key":"e_1_2_1_133_1","doi-asserted-by":"publisher","DOI":"10.1145\/2540708.2540711"},{"key":"e_1_2_1_134_1","doi-asserted-by":"publisher","DOI":"10.1145\/1993498.1993518"},{"key":"e_1_2_1_135_1","doi-asserted-by":"publisher","DOI":"10.1145\/2644808"},{"key":"e_1_2_1_136_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2006.23"},{"key":"e_1_2_1_137_1","volume-title":"Workshop on Approximate Computing Across the System Stack.","author":"Miguel Joshua San"},{"key":"e_1_2_1_138_1","doi-asserted-by":"crossref","unstructured":"J. Sartori and R. Kumar. 2010. Low-overhead high-speed multi-core barrier synchronization. In HiPEAC. 18--34.  J. Sartori and R. Kumar. 2010. Low-overhead high-speed multi-core barrier synchronization. In HiPEAC. 18--34.","DOI":"10.1007\/978-3-642-11515-8_4"},{"key":"e_1_2_1_139_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2012.2232647"},{"key":"e_1_2_1_140_1","doi-asserted-by":"publisher","DOI":"10.1145\/2370816.2370864"},{"key":"e_1_2_1_141_1","volume-title":"Proceedings of the 30th Annual IEEE\/ACM International Symposium on Microarchitecture. IEEE, 248--258","author":"Sazeides Yiannakis"},{"key":"e_1_2_1_142_1","doi-asserted-by":"publisher","DOI":"10.1145\/248208.237144"},{"key":"e_1_2_1_143_1","volume-title":"Processing-in-memory: Exploring the design space. In Architecture of Computing Systems (ARCS\u201915)","author":"Scrbak Marko","year":"2015"},{"key":"e_1_2_1_144_1","doi-asserted-by":"publisher","DOI":"10.1145\/2155620.2155635"},{"key":"e_1_2_1_145_1","volume-title":"IEEE 20th International Symposium on High Performance Computer Architecture (HPCA\u201914)","author":"Shafiee Ali"},{"key":"e_1_2_1_146_1","volume-title":"Proceedings of the 9th International Symposium on High-Performance Computer Architecture (HPCA\u201903)","author":"Shang Li"},{"key":"e_1_2_1_147_1","doi-asserted-by":"publisher","DOI":"10.1109\/71.388040"},{"key":"e_1_2_1_148_1","doi-asserted-by":"publisher","DOI":"10.1109\/LES.2015.2393860"},{"key":"e_1_2_1_149_1","doi-asserted-by":"publisher","DOI":"10.1145\/2025113.2025133"},{"key":"e_1_2_1_150_1","volume-title":"European Conference on Parallel Processing. Springer, 465--476","author":"Soler Mar\u00eda","year":"2013"},{"key":"e_1_2_1_151_1","doi-asserted-by":"publisher","DOI":"10.1145\/2678373.2665746"},{"key":"e_1_2_1_152_1","volume-title":"Proceedings of the 8th International Symposium on High-Performance Computer Architecture. IEEE, 65--75","author":"Steffan J. Gregory"},{"key":"e_1_2_1_153_1","doi-asserted-by":"publisher","DOI":"10.1145\/1366224.1366227"},{"key":"e_1_2_1_154_1","volume-title":"Workshop on Approximate Computing Across the Stack.","author":"Sutherland Mark","year":"2015"},{"key":"e_1_2_1_155_1","volume-title":"Design Automation Conference.","author":"Taylor M. B.","year":"2012"},{"key":"e_1_2_1_156_1","doi-asserted-by":"publisher","DOI":"10.1145\/2628071.2628110"},{"key":"e_1_2_1_157_1","doi-asserted-by":"publisher","DOI":"10.1145\/2742060.2743759"},{"key":"e_1_2_1_158_1","volume-title":"Nikolopoulos","author":"Vassiliadis Vassilis","year":"2015"},{"key":"e_1_2_1_159_1","doi-asserted-by":"publisher","DOI":"10.1145\/2744769.2751163"},{"key":"e_1_2_1_160_1","doi-asserted-by":"publisher","DOI":"10.1145\/2540708.2540710"},{"key":"e_1_2_1_161_1","doi-asserted-by":"publisher","DOI":"10.1145\/2627369.2627613"},{"key":"e_1_2_1_162_1","doi-asserted-by":"publisher","DOI":"10.7873\/DATE.2013.280"},{"key":"e_1_2_1_163_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750399"},{"key":"e_1_2_1_164_1","doi-asserted-by":"publisher","DOI":"10.1145\/1450095.1450110"},{"key":"e_1_2_1_165_1","doi-asserted-by":"publisher","DOI":"10.1109\/30.125072"},{"key":"e_1_2_1_166_1","doi-asserted-by":"publisher","DOI":"10.5555\/266800.266827"},{"key":"e_1_2_1_167_1","volume-title":"Conference Record of the 37th Asilomar Conference on Signals, Systems and Computers","volume":"2","author":"Wang Zhou"},{"key":"e_1_2_1_168_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.1984.1659158"},{"key":"e_1_2_1_169_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER.2011.80"},{"key":"e_1_2_1_170_1","doi-asserted-by":"publisher","DOI":"10.1145\/1075395.1075396"},{"key":"e_1_2_1_171_1","volume-title":"Approximate computing: A survey","author":"Xu Qiang"},{"key":"e_1_2_1_172_1","doi-asserted-by":"publisher","DOI":"10.1109\/TR.2014.2354931"},{"key":"e_1_2_1_173_1","doi-asserted-by":"publisher","DOI":"10.1145\/2836168"},{"key":"e_1_2_1_174_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCAD.2013.6691096"},{"key":"e_1_2_1_175_1","doi-asserted-by":"publisher","DOI":"10.1145\/2694344.2694354"},{"key":"e_1_2_1_176_1","volume-title":"Design, Automation 8 Test in Europe Conference 8 Exhibition (DATE\u201913)","author":"Yetim Yavuz"},{"key":"e_1_2_1_177_1","doi-asserted-by":"publisher","DOI":"10.1109\/HiPINEB.2017.10"},{"key":"e_1_2_1_178_1","doi-asserted-by":"publisher","DOI":"10.1109\/HIPINEB.2016.13"},{"key":"e_1_2_1_179_1","doi-asserted-by":"publisher","DOI":"10.1145\/2593069.2593156"},{"key":"e_1_2_1_180_1","doi-asserted-by":"publisher","DOI":"10.5555\/2755753.2755913"},{"key":"e_1_2_1_181_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2005.117"},{"key":"e_1_2_1_182_1","volume-title":"A 3D-stacked logic-in-memory accelerator for application-specific data intensive computing","author":"Zhu Qiuling"},{"key":"e_1_2_1_183_1","volume-title":"Proceedings of the 34th Annual International Symposium on Computer Architecture (ISCA\u201907)","author":"Zhu Weirong"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3145812","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3145812","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:13:32Z","timestamp":1750212812000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3145812"}},"subtitle":["Techniques for Reducing Communication Bottlenecks in Large-Scale Parallel Systems"],"short-title":[],"issued":{"date-parts":[[2018,1,10]]},"references-count":180,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,1,31]]}},"alternative-id":["10.1145\/3145812"],"URL":"https:\/\/doi.org\/10.1145\/3145812","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,1,10]]},"assertion":[{"value":"2016-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-09-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-01-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}