{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,2]],"date-time":"2022-04-02T04:27:22Z","timestamp":1648873642989},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2009,3,25]],"date-time":"2009-03-25T00:00:00Z","timestamp":1237939200000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J Supercomput"],"published-print":{"date-parts":[[2012,4]]},"DOI":"10.1007\/s11227-009-0280-y","type":"journal-article","created":{"date-parts":[[2009,3,24]],"date-time":"2009-03-24T18:04:35Z","timestamp":1237917875000},"page":"4-30","source":"Crossref","is-referenced-by-count":1,"title":["Communication-free data alignment for arrays with\u00a0exponential references in parallelizing compilers for\u00a0scalable parallel systems"],"prefix":"10.1007","volume":"60","author":[{"given":"Minyi","family":"Guo","sequence":"first","affiliation":[]},{"given":"Weng-Long","family":"Chang","sequence":"additional","affiliation":[]},{"given":"Bo","family":"Jiang","sequence":"additional","affiliation":[]},{"given":"Shu-Chien","family":"Huang","sequence":"additional","affiliation":[]},{"given":"Sien-Tang","family":"Tsai","sequence":"additional","affiliation":[]},{"given":"Michael","family":"Ho","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2009,3,25]]},"reference":[{"issue":"2","key":"280_CR1","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1145\/1011528.1011529","volume":"1","author":"A Alex","year":"2004","unstructured":"Alex\u00a0A, Codina MJ, Alez GA, Kaeli\u00a0D (2004) Removing communications in clustered micro-architectures through instruction replication. ACM Trans Archit Code Optim 1(2):127\u2013151","journal-title":"ACM Trans Archit Code Optim"},{"key":"280_CR2","unstructured":"Bau\u00a0D, Kodukula\u00a0I, Kotlyar\u00a0V, Pingali\u00a0K, Stodghill\u00a0P (1994) Solving alignment using elementary linear algebra. In: Conference record of the 7th workshop on languages and compilers for parallel computing, pp 46\u201360"},{"key":"280_CR3","unstructured":"Boudet\u00a0V, Rastello\u00a0F, Yves\u00a0R (1998) Alignment and distribution is NOT (always) NP-hard. In: Proceeding of 1998 international conference on parallel and distributed systems, vol\u00a05(9), 1998, pp 648\u2013657"},{"issue":"1","key":"280_CR4","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1023\/A:1011144404437","volume":"20","author":"W-L Chang","year":"2001","unstructured":"Chang W-L, Chu C-P, Wu J-H (2001) Communication-free alignment for array references with linear subscripts in three loop index variables or quadratic subscripts. J\u00a0Supercomput 20(1):67\u201383","journal-title":"J\u00a0Supercomput"},{"issue":"1","key":"280_CR5","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1109\/TPDS.2004.1264783","volume":"15","author":"W-L Chang","year":"2004","unstructured":"Chang W-L, Huang J-W, Chu C-P (2004) Using elementary linear algebra to solve data alignment for arrays with linear or quadratic references. IEEE Trans Parallel Distrib Syst 15(1):28\u201339","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"280_CR6","unstructured":"Chu C-P, Chang W-L, Chen\u00a0I, Chen P-S (1998) Communication-free alignment for array references with linear subscripts in two loop index variables or quadratic subscripts. In: Proceedings of the second IASTED international conference on parallel and distributed computing and networks (PDCN\u201998), Australia, 1998, pp 571\u2013576"},{"issue":"10","key":"280_CR7","doi-asserted-by":"crossref","first-page":"1373","DOI":"10.1016\/S0167-8191(96)00049-X","volume":"22","author":"M Dion","year":"1996","unstructured":"Dion\u00a0M, Yves\u00a0R (1996) Mapping Affine loop nests: new results. Parallel Comput 22(10):1373\u20131397","journal-title":"Parallel Comput"},{"issue":"4","key":"280_CR8","doi-asserted-by":"crossref","first-page":"241","DOI":"10.6028\/jres.071B.033","volume":"71","author":"J Edmonds","year":"1967","unstructured":"Edmonds\u00a0J (1967) Systems of distinct representative and linear algebra. J\u00a0Res Nat B Stand Sect\u00a0B 71(4):241\u2013245","journal-title":"J\u00a0Res Nat B Stand Sect\u00a0B"},{"issue":"3","key":"280_CR9","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1109\/TPDS.2005.26","volume":"16","author":"B Franke","year":"2005","unstructured":"Franke\u00a0B, O\u2019Boyle MFP (2005) A complete compiler approach to auto-parallelizing C programs for multi-DSP systems. IEEE Trans Parallel Distrib Syst 16(3):234\u2013245","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"280_CR10","unstructured":"Feautrier\u00a0P (1993) Toward automatic partitioning of arrays on distributed memory computers. In: ACM international conference on supercomputing, 1993, pp 175\u2013184"},{"issue":"2","key":"280_CR11","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1109\/MM.2006.41","volume":"26","author":"M Gschwind","year":"2006","unstructured":"Gschwind\u00a0M, Hofstee HP, Flachs\u00a0B, Hopkins\u00a0M, Watanabe\u00a0Y, Yamazaki\u00a0T (2006) Synergistic processing in cell\u2019s multicore architecture. IEEE Micro 26(2):10\u201324","journal-title":"IEEE Micro"},{"issue":"4","key":"280_CR12","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1109\/MC.2007.124","volume":"40","author":"J Gebis","year":"2007","unstructured":"Gebis\u00a0J, Patterson\u00a0D (2007) Embracing and extending 20th-century instruction set architectures. Computer 40(4):68\u201375","journal-title":"Computer"},{"issue":"11","key":"280_CR13","first-page":"1195","volume":"E81-D","author":"M Guo","year":"1998","unstructured":"Guo\u00a0M, Yamashita\u00a0Y, Nakata\u00a0I (1998) Efficient implementation of multi-dimensional array redistribution. IEICE Trans Inf Syst E81-D(11):1195\u20131204","journal-title":"IEICE Trans Inf Syst"},{"issue":"8","key":"280_CR14","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1016\/S0167-8191(00)00027-2","volume":"26","author":"M Guo","year":"2000","unstructured":"Guo\u00a0M, Nakata\u00a0I, Yamashita\u00a0Y (2000) Contention-free communication scheduling for array redistribution. Parallel Comput 26(8):1325\u20131343","journal-title":"Parallel Comput"},{"issue":"3","key":"280_CR15","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1023\/A:1011602732570","volume":"20","author":"M Guo","year":"2001","unstructured":"Guo\u00a0M, Nakata\u00a0I (2001) A framework for efficient array redistribution on distributed memory multicomputers. J\u00a0Supercomput 20(3):243\u2013265","journal-title":"J\u00a0Supercomput"},{"issue":"9","key":"280_CR16","first-page":"1825","volume":"E86-D","author":"M Guo","year":"2003","unstructured":"Guo\u00a0M (2003) Efficient loop partitioning for parallel codes of irregular scientific computations. IEICE Trans Inf Syst E86-D(9):1825\u20131834","journal-title":"IEICE Trans Inf Syst"},{"issue":"3","key":"280_CR17","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1023\/A:1024262610201","volume":"25","author":"M Guo","year":"2003","unstructured":"Guo\u00a0M (2003) Communication generation for irregular codes. J\u00a0Supercomput 25(3):199\u2013214","journal-title":"J\u00a0Supercomput"},{"key":"280_CR18","unstructured":"Hoeflinger\u00a0J (1998) Interprocedural parallelization using memory classification analysis. PhD thesis, Univ of Illinois at Urbana-Champaign, Center for Supercomputing Res & Dev"},{"issue":"12","key":"280_CR19","doi-asserted-by":"crossref","first-page":"1201","DOI":"10.1109\/71.895789","volume":"11","author":"C-H Hsu","year":"2000","unstructured":"Hsu C-H, Bai S-W, Chung Y-C, Yang C-S (2000) A generalized basic-cycle calculation method for array redistribution. IEEE Trans Parallel Distrib Syst 11(12):1201\u20131216","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"280_CR20","series-title":"Lecture notes in computer science","volume-title":"EUC 2006 proceedings","author":"C-H Hsu","year":"2006","unstructured":"Hsu C-H, Lan C-Y, Chen S-C (2006) Optimizing scheduling stability for runtime data alignment. In: EUC 2006 proceedings. Lecture notes in computer science, vol\u00a04097. Springer, Berlin"},{"key":"280_CR21","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1016\/S0167-8191(99)00037-X","volume":"25","author":"G-H Hwang","year":"1999","unstructured":"Hwang G-H, Lee JK (1999) An expression-rewriting framework to generate communication sets for HPF programs with block-cyclic distribution. Parallel Comput 25:1105\u20131139","journal-title":"Parallel Comput"},{"key":"280_CR22","unstructured":"Kandemir\u00a0M, Choudhary\u00a0A, Shenoy\u00a0N, Banerjee\u00a0P, Ramanujam\u00a0J (1998) A hyperplane based approach for optimizing spatial locality in loop nests. In: Proc 12th ACM int conf supercomputing, 1998, pp 69\u201376"},{"key":"280_CR23","unstructured":"Kandemir\u00a0M, Ramanujam\u00a0J, Choudhary\u00a0A, Banerjee\u00a0P (1998) A loop transformation algorithm based on explicit data layout representation for optimizing locality. In: Proc 11th international workshop, LCPC\u201998, Chapel Hill, NC, USA, 1998, pp 34\u201350"},{"issue":"3\u20134","key":"280_CR24","first-page":"445","volume":"24","author":"AW Lam","year":"1998","unstructured":"Lam AW, Lam MS (1998) Maximizing parallelism and minimizing synchronization with affine partitions. Parallel Comput 24(3\u20134):445\u2013475","journal-title":"Parallel Comput"},{"key":"280_CR25","doi-asserted-by":"crossref","unstructured":"Lam AW, Cheong GI, Lam MS (1999) An affine partitioning algorithm to maximize parallelism and minimize communication. In: 13th ACM international conference on supercomputing, Rhodes, Greece, 1999, pp 228\u2013237","DOI":"10.1145\/305138.305197"},{"issue":"8","key":"280_CR26","doi-asserted-by":"crossref","first-page":"825","DOI":"10.1109\/71.605769","volume":"8","author":"PZ Lee","year":"1997","unstructured":"Lee PZ (1997) Efficient algorithms for data distribution on distributed memory parallel computers. IEEE Trans Parallel Distrib Syst 8(8):825\u2013839","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"280_CR27","volume-title":"Linear and nonlinear programming","author":"DG Luenberger","year":"1984","unstructured":"Luenberger DG (1984) Linear and nonlinear programming. Addison-Wesley, Reading"},{"key":"280_CR28","doi-asserted-by":"crossref","unstructured":"Ozcan\u00a0E, Onbasioglu\u00a0E (2007) Memetic algorithms for parallel code optimization. Int J Parallel Program 35(1)","DOI":"10.1007\/s10766-006-0026-x"},{"key":"280_CR29","unstructured":"Paek\u00a0Y (1997) Compiling for distributed memory multiprocessors based on access region analysis. PhD thesis, Univ of Illinois at Urbana-Champaign, Center for Supercomputing Res & Dev"},{"key":"280_CR30","unstructured":"Pan\u00a0L, Xue\u00a0J, Lai MK (2007) Toward automatic data distribution for migrating computations. In: The proceedings of 2007 international conference on parallel processing, September 2007"},{"issue":"11","key":"280_CR31","doi-asserted-by":"crossref","first-page":"1121","DOI":"10.1109\/71.544354","volume":"7","author":"MP Petersen","year":"1996","unstructured":"Petersen MP, Padua AD (1996) Static and dynamic evaluation of data dependence analysis techniques. IEEE Trans Parallel Distrib Syst 7(11):1121\u20131132","journal-title":"IEEE Trans Parallel Distrib Syst"},{"issue":"4","key":"280_CR32","doi-asserted-by":"crossref","first-page":"472","DOI":"10.1109\/71.97903","volume":"2","author":"J Ramanujam","year":"1991","unstructured":"Ramanujam\u00a0J, Sadayappan\u00a0P (1991) Compile-time techniques for data distributed in distributed memory machines. IEEE Trans Parallel Distrib Syst 2(4):472\u2013482","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"280_CR33","unstructured":"Reilly\u00a0J (1995) SPEC95 products and benchmarks. SPEC Newsletter"},{"issue":"4","key":"280_CR34","doi-asserted-by":"crossref","first-page":"902","DOI":"10.1109\/JSSC.2008.917531","volume":"43","author":"H Shikano","year":"2008","unstructured":"Shikano\u00a0H, Ito\u00a0M (2008) Heterogeneous multi-core architecture that enables 54x AAC-LC stereo encoding. IEEE J Solid-State Circuits 43(4):902\u2013910","journal-title":"IEEE J Solid-State Circuits"},{"issue":"3","key":"280_CR35","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1023\/A:1008112110494","volume":"15","author":"K-P Shih","year":"2000","unstructured":"Shih K-P, Sheu J-P, Huang C-H (2000) Statement-level communication-free partitioning techniques for parallelizing compilers. J\u00a0Supercomput 15(3):243\u2013269","journal-title":"J\u00a0Supercomput"},{"key":"280_CR36","doi-asserted-by":"crossref","unstructured":"Wu J-H, Chu C-P (2007) An exact data dependence testing method for quadratic expressions. Inf Sci 177(23)","DOI":"10.1016\/j.ins.2007.06.006"},{"key":"280_CR37","volume-title":"High performance compilers for parallel computing","author":"M Wolfe","year":"1996","unstructured":"Wolfe\u00a0M (1996) High performance compilers for parallel computing. Addison-Wesley, Reading"},{"key":"280_CR38","volume-title":"Lecture notes in computer science","author":"Y Zhao","year":"2007","unstructured":"Zhao\u00a0Y, Kennedy\u00a0K (2007) Dependence-based code generation for a CELL processor. In: Lecture notes in computer science, vol\u00a04382. Springer, Berlin"}],"container-title":["The Journal of Supercomputing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-009-0280-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11227-009-0280-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-009-0280-y","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,1]],"date-time":"2019-06-01T10:23:58Z","timestamp":1559384638000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11227-009-0280-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,3,25]]},"references-count":38,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,4]]}},"alternative-id":["280"],"URL":"https:\/\/doi.org\/10.1007\/s11227-009-0280-y","relation":{},"ISSN":["0920-8542","1573-0484"],"issn-type":[{"value":"0920-8542","type":"print"},{"value":"1573-0484","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,3,25]]}}}