{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:33:05Z","timestamp":1750221185609,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":34,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,8,13]],"date-time":"2018-08-13T00:00:00Z","timestamp":1534118400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,8,13]]},"DOI":"10.1145\/3225058.3225062","type":"proceedings-article","created":{"date-parts":[[2018,8,8]],"date-time":"2018-08-08T19:13:06Z","timestamp":1533755586000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Memory Coalescing for Hybrid Memory Cube"],"prefix":"10.1145","author":[{"given":"Xi","family":"Wang","sequence":"first","affiliation":[{"name":"Texas Tech University, Lubbock, Texas"}]},{"given":"John D.","family":"Leidel","sequence":"additional","affiliation":[{"name":"Texas Tech University, Lubbock, Texas"}]},{"given":"Yong","family":"Chen","sequence":"additional","affiliation":[{"name":"Texas Tech University, Lubbock, Texas"}]}],"member":"320","published-online":{"date-parts":[[2018,8,13]]},"reference":[{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080232"},{"volume-title":"Selective GPU caches to eliminate CPU-GPU HW cache coherence","author":"Agarwal Neha","key":"e_1_3_2_1_7_1","unstructured":"Neha Agarwal , David Nellans , Eiman Ebrahimi , Thomas F Wenisch , John Danskin , and Stephen W Keckler . 2016. Selective GPU caches to eliminate CPU-GPU HW cache coherence . In HPCA. IEEE , 494--506. Neha Agarwal, David Nellans, Eiman Ebrahimi, Thomas F Wenisch, John Danskin, and Stephen W Keckler. 2016. Selective GPU caches to eliminate CPU-GPU HW cache coherence. In HPCA. IEEE, 494--506."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750386"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750385"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/11602569_48"},{"volume-title":"Proceedings of the 1992 ACM\/IEEE Conference on Supercomputing (Supercomputing '92)","author":"Bailey D. H.","key":"e_1_3_2_1_11_1","unstructured":"D. H. Bailey , L. Dagum , E. Barszcz , and H. D. Simon . 1992. NAS Parallel Benchmark Results . In Proceedings of the 1992 ACM\/IEEE Conference on Supercomputing (Supercomputing '92) . IEEE Computer Society Press, Los Alamitos, CA, USA, 386--393. http:\/\/dl.acm.org\/citation.cfm?id=147877.148032 D. H. Bailey, L. Dagum, E. Barszcz, and H. D. Simon. 1992. NAS Parallel Benchmark Results. In Proceedings of the 1992 ACM\/IEEE Conference on Supercomputing (Supercomputing '92). IEEE Computer Society Press, Los Alamitos, CA, USA, 386--393. http:\/\/dl.acm.org\/citation.cfm?id=147877.148032"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1468075.1468121"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063384.2063401"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2016.13"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897937.2897966"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2009.64"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/eScience.2008.59"},{"key":"e_1_3_2_1_18_1","volume-title":"Katherine Morrow, and Nam Sung Kim.","author":"Farmahini-Farahani Amin","year":"2015","unstructured":"Amin Farmahini-Farahani , Jung Ho Ahn , Katherine Morrow, and Nam Sung Kim. 2015 . NDA : Near-DRAM acceleration architecture leveraging commodity DRAM devices and standard memory modules. In 21st HPCA. IEEE , 283--295. Amin Farmahini-Farahani, Jung Ho Ahn, Katherine Morrow, and Nam Sung Kim. 2015. NDA: Near-DRAM acceleration architecture leveraging commodity DRAM devices and standard memory modules. In 21st HPCA. IEEE, 283--295."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/PACT.2015.22"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2833179.2833184"},{"volume-title":"20th","author":"Greb Alexander","key":"e_1_3_2_1_21_1","unstructured":"Alexander Greb and Gabriel Zachmann . 2006. GPU-ABiSort: Optimal parallel sorting on stream architectures . In 20th IPDPS. IEEE , 10-pp. Alexander Greb and Gabriel Zachmann. 2006. GPU-ABiSort: Optimal parallel sorting on stream architectures. In 20th IPDPS. IEEE, 10-pp."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2155620.2155660"},{"key":"e_1_3_2_1_23_1","volume-title":"Kartikay Garg, Tushar Krishna, and Hyesoon Kim.","author":"Hadidi Ramyad","year":"2017","unstructured":"Ramyad Hadidi , Bahar Asgari , Jeffrey Young , Burhan Ahmad Mudassar , Kartikay Garg, Tushar Krishna, and Hyesoon Kim. 2017 . Performance Implications of NoCs on 3D-Stacked Memories : Insights from the Hybrid Memory Cube . arXiv preprint arXiv:1707.05399 (2017). Ramyad Hadidi, Bahar Asgari, Jeffrey Young, Burhan Ahmad Mudassar, Kartikay Garg, Tushar Krishna, and Hyesoon Kim. 2017. Performance Implications of NoCs on 3D-Stacked Memories: Insights from the Hybrid Memory Cube. arXiv preprint arXiv:1707.05399 (2017)."},{"volume-title":"21st","author":"Hayes Timothy","key":"e_1_3_2_1_24_1","unstructured":"Timothy Hayes , Oscar Palomar , Osman Unsal , Adrian Cristal , and Mateo Valero . 2015. VSR sort: A novel vectorised sorting algorithm & architecture extensions for future microprocessors . In 21st HPCA. IEEE , 26--38. Timothy Hayes, Oscar Palomar, Osman Unsal, Adrian Cristal, and Mateo Valero. 2015. VSR sort: A novel vectorised sorting algorithm & architecture extensions for future microprocessors. In 21st HPCA. IEEE, 26--38."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2016.27"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2010.107"},{"key":"e_1_3_2_1_27_1","volume-title":"Jung Ho Ahn, and Jaeha Kim","author":"Kim Gwangsun","year":"2013","unstructured":"Gwangsun Kim , John Kim , Jung Ho Ahn, and Jaeha Kim . 2013 . Memory-centric system interconnect design with hybrid memory cubes. In 22nd PACT. IEEE Press , 145--156. Gwangsun Kim, John Kim, Jung Ho Ahn, and Jaeha Kim. 2013. Memory-centric system interconnect design with hybrid memory cubes. In 22nd PACT. IEEE Press, 145--156."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2830772.2830830"},{"volume-title":"8th","author":"Kroft David","key":"e_1_3_2_1_29_1","unstructured":"David Kroft . 1981. Lockup-free instruction fetch\/prefetch cache organization . In 8th ISCA. IEEE Computer Society Press , 81--87. David Kroft. 1981. Lockup-free instruction fetch\/prefetch cache organization. In 8th ISCA. IEEE Computer Society Press, 81--87."},{"key":"e_1_3_2_1_30_1","unstructured":"John D. McCalpin. 1995. A Survey of Memory Bandwidth and Machine Balance in Current High Performance Computers. (1995).  John D. McCalpin. 1995. A Survey of Memory Bandwidth and Machine Balance in Current High Performance Computers. (1995)."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2464996.2465019"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2540708.2540717"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2006.44"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2989081.2989128"},{"key":"e_1_3_2_1_37_1","volume-title":"OpenMP Memkind: An Extension for Heterogeneous Physical Memories. In Parallel Processing Workshops (ICPPW), 2017 46th International Conference on. IEEE, 220--227","author":"Wang Xi","year":"2017","unstructured":"Xi Wang , John D Leidel , and Yong Chen . 2017 . OpenMP Memkind: An Extension for Heterogeneous Physical Memories. In Parallel Processing Workshops (ICPPW), 2017 46th International Conference on. IEEE, 220--227 . Xi Wang, John D Leidel, and Yong Chen. 2017. OpenMP Memkind: An Extension for Heterogeneous Physical Memories. In Parallel Processing Workshops (ICPPW), 2017 46th International Conference on. IEEE, 220--227."},{"volume-title":"High performance comparison-based sorting algorithm on many-core GPUs","author":"Ye Xiaochun","key":"e_1_3_2_1_39_1","unstructured":"Xiaochun Ye , Dongrui Fan , Wei Lin , Nan Yuan , and Paolo Ienne . 2010. High performance comparison-based sorting algorithm on many-core GPUs . In IPDPS. IEEE , 1--10. Xiaochun Ye, Dongrui Fan, Wei Lin, Nan Yuan, and Paolo Ienne. 2010. High performance comparison-based sorting algorithm on many-core GPUs. In IPDPS. IEEE, 1--10."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2000064.2000100"},{"key":"e_1_3_2_1_41_1","volume-title":"Xu Zhao, Yongle Zhang, Pranay Jain, and Michael Stumm.","author":"Yuan Ding","year":"2014","unstructured":"Ding Yuan , Yu Luo , Xin Zhuang , Guilherme Renna Rodrigues , Xu Zhao, Yongle Zhang, Pranay Jain, and Michael Stumm. 2014 . Simple Testing Can Prevent Most Critical Failures: An Analysis of Production Failures in Distributed Data-Intensive Systems.. In OSDI. 249--265. Ding Yuan, Yu Luo, Xin Zhuang, Guilherme Renna Rodrigues, Xu Zhao, Yongle Zhang, Pranay Jain, and Michael Stumm. 2014. Simple Testing Can Prevent Most Critical Failures: An Analysis of Production Failures in Distributed Data-Intensive Systems.. In OSDI. 249--265."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2600212.2600213"}],"event":{"name":"ICPP 2018: 47th International Conference on Parallel Processing","sponsor":["University of Oregon University of Oregon"],"location":"Eugene OR USA","acronym":"ICPP 2018"},"container-title":["Proceedings of the 47th International Conference on Parallel Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3225058.3225062","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3225058.3225062","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:39:06Z","timestamp":1750210746000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3225058.3225062"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,8,13]]},"references-count":34,"alternative-id":["10.1145\/3225058.3225062","10.1145\/3225058"],"URL":"https:\/\/doi.org\/10.1145\/3225058.3225062","relation":{},"subject":[],"published":{"date-parts":[[2018,8,13]]},"assertion":[{"value":"2018-08-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}