{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:16:13Z","timestamp":1750306573984,"version":"3.41.0"},"reference-count":11,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2014,12,3]],"date-time":"2014-12-03T00:00:00Z","timestamp":1417564800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGARCH Comput. Archit. News"],"published-print":{"date-parts":[[2014,12,3]]},"abstract":"<jats:p>A GPU cluster in which each node provides a few GPUs connected with PCIe (PCI Express) is commonly used for acceleration of a large application program requiring the performance beyond a single GPU. However, in such a system, programmers are required to describe two parallel programming between nodes in MPIs or other message passing library as well as the fine grained parallel programming for intra-GPUs. As a cost effective alternative of such clusters, we propose a novel multi-GPU system with ExpEther, a virtualization technique which extends PCIe of a host CPU to Ethernet. All devices connected by ExpEther can be treated as if they were directly connected to the host. Evaluation with two application programs with and without GPU-GPU communication revealed that the proposed system with four GPUs achieved 3.88 and 3.29 times performance improvement respectively compared with a single GPU system. 
Compared with GPU cluster system in which each node provides a GPU, the proposed system achieved about 7% and 30% performance improvement, respectively.<\/jats:p>","DOI":"10.1145\/2693714.2693717","type":"journal-article","created":{"date-parts":[[2014,12,8]],"date-time":"2014-12-08T16:17:14Z","timestamp":1418055434000},"page":"9-14","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Performance Analysis of the Multi-GPU System with ExpEther"],"prefix":"10.1145","volume":"42","author":[{"given":"Shimpei","family":"Nomura","sequence":"first","affiliation":[{"name":"NEC Corporation, Kanagawa, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Takuji","family":"Mitsuishi","sequence":"additional","affiliation":[{"name":"NEC Corporation, Kanagawa, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jun","family":"Suzuki","sequence":"additional","affiliation":[{"name":"NEC Corporation, Kanagawa, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuki","family":"Hayashi","sequence":"additional","affiliation":[{"name":"NEC Corporation, Kanagawa, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Masaki","family":"Kan","sequence":"additional","affiliation":[{"name":"NEC Corporation, Kanagawa, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hideharu","family":"Amano","sequence":"additional","affiliation":[{"name":"Keio University, Yokohama, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2014,12,3]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICNC.2011.28"},{"key":"e_1_2_1_2_1","unstructured":"GSIC. Tsubame computing services. http:\/\/tsubame.gsic.titech.ac.jp\/en."},{"key":"e_1_2_1_3_1","unstructured":"T. Hamada. 
Degima: The greenest accelerator-based supercomputer in the top500 list. http:\/\/www.cs.tsukuba.ac.jp\/~yoshiki\/heart\/HEART2012\/keynote\/HEART2012-Hamada.pdf June 2012."},{"key":"e_1_2_1_4_1","unstructured":"Integrated Device Technology. Pci express switches. http:\/\/www.idt.com\/products\/interfaceconnectivity\/pci-express-solutions\/pci-expressswitches."},{"volume-title":"November","year":"2013","key":"e_1_2_1_5_1","unstructured":"Khronos. The opencl specification version: 2.0, November 2013."},{"key":"e_1_2_1_6_1","unstructured":"NEC Corporation. http:\/\/www.nec.co.jp."},{"key":"e_1_2_1_7_1","unstructured":"NVIDIA. CUDA Toolkit Documentation. http:\/\/docs.nvidia.com\/cuda\/index.html."},{"key":"e_1_2_1_8_1","unstructured":"PCI-SIG. Pci express. 
http:\/\/www.pcisig.com\/specifications\/pciexpress\/."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPA.2011.28"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTI.2006.12"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2159430.2159433"}],"container-title":["ACM SIGARCH Computer Architecture News"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2693714.2693717","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2693714.2693717","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:13:31Z","timestamp":1750227211000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2693714.2693717"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,12,3]]},"references-count":11,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2014,12,3]]}},"alternative-id":["10.1145\/2693714.2693717"],"URL":"https:\/\/doi.org\/10.1145\/2693714.2693717","relation":{},"ISSN":["0163-5964"],"issn-type":[{"type":"print","value":"0163-5964"}],"subject":[],"published":{"date-parts":[[2014,12,3]]},"assertion":[{"value":"2014-12-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}