{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,3,29]],"date-time":"2022-03-29T09:42:11Z","timestamp":1648546931936},"reference-count":2,"publisher":"World Scientific Pub Co Pte Lt","issue":"04","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Parallel Process. Lett."],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:p> The achievable performance on Infiniband networks is governed by the latencies and bandwidths of communication channels as well as by contention within the network. Currently Infiniband uses static routing to transfer messages and thus does not take into account dynamic loading of the channels. By interrogating the network routing tables we quantify the contention that occurs for a number of communication patterns using a large-scale (1024 processor) system. Empirical data confirms our contention calculation almost exactly. Custom routing tables are defined that provide both optimum and worst-case performance for a large-range of communication patterns. Performance differences can be as large as 12\u00d7 (from optimum to worst-case). Two large-scale applications show a run-time improvement of between 10-20% and up to a 40% improvement in just their communication time when using optimized routing tables. The approach taken is applicable to many Infiniband systems, and we expect the performance improvements to be even greater on larger-scale systems. <\/jats:p>","DOI":"10.1142\/s0129626408003582","type":"journal-article","created":{"date-parts":[[2008,12,4]],"date-time":"2008-12-04T11:40:57Z","timestamp":1228390857000},"page":"589-608","source":"Crossref","is-referenced-by-count":5,"title":["INFINIBAND ROUTING TABLE OPTIMIZATIONS FOR SCIENTIFIC APPLICATIONS"],"prefix":"10.1142","volume":"18","author":[{"given":"DARREN J.","family":"KERBYSON","sequence":"first","affiliation":[{"name":"Performance and Architecture Lab (PAL), Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA"}]},{"given":"MIKE","family":"LANG","sequence":"additional","affiliation":[{"name":"Performance and Architecture Lab (PAL), Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA"}]},{"given":"GREGORY","family":"JOHNSON","sequence":"additional","affiliation":[{"name":"Performance and Architecture Lab (PAL), Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA"}]}],"member":"219","published-online":{"date-parts":[[2011,11,21]]},"reference":[{"key":"rf5","first-page":"34","volume":"24","author":"Beecroft J.","journal-title":"IEEE Micro"},{"key":"rf19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.13182\/NSE02-A2262","volume":"141","author":"Baker R. S.","journal-title":"Nuclear Science & Engineering"}],"container-title":["Parallel Processing Letters"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0129626408003582","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,6]],"date-time":"2019-08-06T16:17:04Z","timestamp":1565108224000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0129626408003582"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,12]]},"references-count":2,"journal-issue":{"issue":"04","published-online":{"date-parts":[[2011,11,21]]},"published-print":{"date-parts":[[2008,12]]}},"alternative-id":["10.1142\/S0129626408003582"],"URL":"https:\/\/doi.org\/10.1142\/s0129626408003582","relation":{},"ISSN":["0129-6264","1793-642X"],"issn-type":[{"value":"0129-6264","type":"print"},{"value":"1793-642X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,12]]}}}