{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,24]],"date-time":"2026-02-24T18:13:51Z","timestamp":1771956831044,"version":"3.50.1"},"reference-count":41,"publisher":"Elsevier","isbn-type":[{"value":"9780120121458","type":"print"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[1997]]},"DOI":"10.1016\/s0065-2458(08)60706-8","type":"book-chapter","created":{"date-parts":[[2008,5,30]],"date-time":"2008-05-30T04:21:49Z","timestamp":1212121309000},"page":"53-103","source":"Crossref","is-referenced-by-count":27,"title":["Parallelization of DOALL and DOACROSS Loops\u2014a Survey"],"prefix":"10.1016","author":[{"given":"A.R.","family":"Hurson","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joford T.","family":"Lim","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Krishna M.","family":"Kavi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ben","family":"Lee","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"78","reference":[{"key":"10.1016\/S0065-2458(08)60706-8_bib1","first-page":"620","article-title":"An empirical study on DOACROSS loops","author":"Chen","year":"1991","journal-title":"Proceedings Supercomputing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib2","doi-asserted-by":"crossref","unstructured":"Chen, D.K. & Yew, P.C. (1994a). Redundant synchronization elimination for DOACROSS loops. Proceedings 8th International Parallel Processing Symposium, pp. 477\u2013481","DOI":"10.1109\/IPPS.1994.288260"},{"key":"10.1016\/S0065-2458(08)60706-8_bib3","doi-asserted-by":"crossref","unstructured":"Chen, D.K. & Yew, P.C. (1994b). Statement re-ordering for DOACROSS loops. Proceedings International Conference on Parallel Processing.","DOI":"10.1109\/ICPP.1994.186"},{"key":"10.1016\/S0065-2458(08)60706-8_bib4","unstructured":"Cytron, R. (1986). DOACROSS: beyond vectorization for multiprocessors. Proceedings International Conference on Parallel Processing, pp. 836\u2013844"},{"key":"10.1016\/S0065-2458(08)60706-8_bib5","doi-asserted-by":"crossref","first-page":"1285","DOI":"10.1016\/S0167-8191(05)80039-0","article-title":"Performance of the Intel iPSC\/860 and Ncube 6400 hypercubes","volume":"17","author":"Dunigan","year":"1991","journal-title":"Parallel Computing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib6","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1016\/0167-8191(88)90037-3","article-title":"An analysis of the computational and parallel complexity of the Livermore loops","volume":"7","author":"Feo","year":"1988","journal-title":"Parallel Computing"},{"issue":"8","key":"10.1016\/S0065-2458(08)60706-8_bib7","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1145\/135226.135232","article-title":"Factoring: a method for scheduling parallel loops","volume":"35","author":"Hummel","year":"1992","journal-title":"Communications of the ACM"},{"key":"10.1016\/S0065-2458(08)60706-8_bib8","doi-asserted-by":"crossref","unstructured":"Hurson, A.R., Lim, J.T., Kavi, K. & Shirazi, B. (1994a). Loop allocation scheme for multithreaded dataflow computers. Proceedings 8th International Parallel Processing Symposium, 316\u2013322","DOI":"10.1109\/IPPS.1994.288283"},{"key":"10.1016\/S0065-2458(08)60706-8_bib9","unstructured":"Hurson, A.R., Lim, J.T. & Lee, B. (1994b). Extended staggered scheme: a loop allocation policy. Invited Paper, World IMACS Conference, pp. 1321\u20131325"},{"key":"10.1016\/S0065-2458(08)60706-8_bib10","doi-asserted-by":"crossref","unstructured":"Krothapalli, V.P. & Sadayappan, P. (1990). Dynamic scheduling of DOACROSS loops for multiprocessors. Proceedings Parbase-90: International Conference on Databases and Parallel Architectures, pp. 66\u201375","DOI":"10.1109\/PARBSE.1990.77118"},{"issue":"10","key":"10.1016\/S0065-2458(08)60706-8_bib11","doi-asserted-by":"crossref","first-page":"1001","DOI":"10.1109\/TSE.1985.231547","article-title":"Allocating independent subtasks on parallel processors","volume":"SE-11","author":"Kruskal","year":"1985","journal-title":"IEEE Transactions on Software Engineering"},{"key":"10.1016\/S0065-2458(08)60706-8_bib12","doi-asserted-by":"crossref","unstructured":"Kwan, T. TTotty, B.K. & Read, D.A. (1993). Communication and computation performance of the CM-5. Proceedings International Conference on Supercomputing, 192\u2013201","DOI":"10.1145\/169627.169691"},{"key":"10.1016\/S0065-2458(08)60706-8_bib13","first-page":"140","article-title":"Locality and loop scheduling on NUMA multiprocessors","volume":"II","author":"Li","year":"1993","journal-title":"Proceedings International Conference on Parallel Processing"},{"issue":"2","key":"10.1016\/S0065-2458(08)60706-8_bib14","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1109\/2.261915","article-title":"Exploiting the parallelism available in loops","volume":"27","author":"Lilja","year":"1994","journal-title":"IEEE Computer"},{"key":"10.1016\/S0065-2458(08)60706-8_bib15","doi-asserted-by":"crossref","unstructured":"Lim, J.T., Hurson, A.R., Lee, B. & Shirazi, B. (1992). Staggered distribution: a loop allocation scheme for dataflow multiprocessor systems. The Fourth Symposium on the Frontiers of Massively Parallel Computation, pp. 310\u2013317","DOI":"10.1109\/FMPC.1992.234944"},{"key":"10.1016\/S0065-2458(08)60706-8_bib16","doi-asserted-by":"crossref","unstructured":"Markatos, E.P. & LeBlanc, T.J. (1992). Using processor affinity in loop scheduling on shared-memory multiprocessors. Proceedings Supercomputing, pp. 104\u2013113","DOI":"10.1109\/SUPERC.1992.236705"},{"issue":"4","key":"10.1016\/S0065-2458(08)60706-8_bib17","doi-asserted-by":"crossref","first-page":"410","DOI":"10.1109\/TC.1987.1676923","article-title":"Processor allocation for horizontal and vertical parallelism and related speedup bounds","volume":"C-36","author":"Polychronopoulos","year":"1987","journal-title":"IEEE Transactions on Computers"},{"issue":"12","key":"10.1016\/S0065-2458(08)60706-8_bib18","doi-asserted-by":"crossref","first-page":"1425","DOI":"10.1109\/TC.1987.5009495","article-title":"Guided self-scheduling: a practical scheduling scheme for parallel supercomputers","volume":"C-36","author":"Polychronopoulos","year":"1987","journal-title":"IEEE Transactions on Computers"},{"key":"10.1016\/S0065-2458(08)60706-8_bib19","unstructured":"Polychronopoulos, C.D., Kuck, D.J. & Padua, D.A. (1986). Execution of Parallel Loops on Parallel Processor Systems. Proceedings International Conference on Parallel Processing, pp. 519\u2013527"},{"key":"10.1016\/S0065-2458(08)60706-8_bib20","first-page":"842","article-title":"Efficient doacross execution on distributed shared-memory multiprocessors","author":"Su","year":"1991","journal-title":"Proceedings Supercomputing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib21","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1109\/SUPERC.1994.344281","article-title":"Affinity scheduling of unbalanced workloads","author":"Subramaniam","year":"1994","journal-title":"Proceedings Supercomputing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib22","first-page":"528","article-title":"Processor self-Scheduling for multiple-nested parallel loops","author":"Tang","year":"1986","journal-title":"Proceedings International Conference on Parallel Processing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib23","first-page":"247","article-title":"Dynamic loop scheduling for shared-memory multi-processors","volume":"II","author":"Tzen","year":"1991","journal-title":"Proceedings International Conference on Parallel Processing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib24","unstructured":"Yue, K.K. & Lilja, D.J. (1994a). Parallel Loop Scheduling for High-Performance Computers. Technical Report No. HPPC-94\u201312, Department of Computer Science, University of Minnesota"},{"key":"10.1016\/S0065-2458(08)60706-8_bib25","doi-asserted-by":"crossref","first-page":"726","DOI":"10.1109\/TSE.1987.233477","article-title":"A scheme to enforce data dependence on large multi-processor systems","volume":"SE-13","author":"Zhu","year":"1987","journal-title":"IEEE Transactions on Software Engineering"},{"issue":"3","key":"10.1016\/S0065-2458(08)60706-8_bib26","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1109\/71.86107","article-title":"Compile-time partitioning of iterative parallel loops to reduce cache coherency traffic","volume":"2","author":"Abraham","year":"1991","journal-title":"IEEE Transactions on Parallel and Distributed Systems"},{"key":"10.1016\/S0065-2458(08)60706-8_bib27","first-page":"285","article-title":"A scheme for effective execution of irregular DOACROSS loops","volume":"II","author":"Chen","year":"1992","journal-title":"Proceedings International Conference on Parallel Processing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib28","unstructured":"Cytron, R. (1987). Limited processor scheduling of doacross loops. Proceedings International Conference on Parallel Processing, pp. 226\u2013234"},{"key":"10.1016\/S0065-2458(08)60706-8_bib29","first-page":"39","article-title":"Self-adjusting scheduling: an on-line optimization technique for locality management and load balancing","volume":"II","author":"Hamidzadeh","year":"1994","journal-title":"Proceedings International Conference on Parallel Processing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib30","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1016\/0743-7315(92)90050-W","article-title":"Compile-time optimization of near-neighbor communication for scalable shared-memory multiprocessors","volume":"15","author":"Hudak","year":"1992","journal-title":"Journal of Parallel and Distributed Computing"},{"issue":"3","key":"10.1016\/S0065-2458(08)60706-8_bib31","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1109\/71.86104","article-title":"Removal of redundant dependencies in DOACROSS loops with constant dependencies","volume":"2","author":"Krothapalli","year":"1991","journal-title":"IEEE Transactions on Parallel and Distributed Systems"},{"issue":"6","key":"10.1016\/S0065-2458(08)60706-8_bib32","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1109\/71.285604","article-title":"The impact of parallel loop scheduling strategies on prefetching in a shared-memory multiprocessor","volume":"5","author":"Lilja","year":"1994","journal-title":"IEEE Transactions on Parallel and Distributed Systems"},{"key":"10.1016\/S0065-2458(08)60706-8_bib33","doi-asserted-by":"crossref","unstructured":"Polychronopoulos, C.D. (1987a). Advanced loop optimizations for parallel computers. In Lecture Notes in Computer Science No. 297: Proceedings International Conference on Supercomputing, 255\u2013277","DOI":"10.1007\/3-540-18991-2_15"},{"key":"10.1016\/S0065-2458(08)60706-8_bib34","doi-asserted-by":"crossref","unstructured":"Polychronopoulos, C.D. (1987b). Automatic restructuring of Fortran programs for parallel execution. Proceedings 4th International DFVLR Seminar on Parallel Computing in Science and Engineering, pp. 107\u2013130","DOI":"10.1007\/3-540-18923-8_17"},{"key":"10.1016\/S0065-2458(08)60706-8_bib35","doi-asserted-by":"crossref","unstructured":"Rudolph, D.C. & Polychronopoulos, C.D. (1989). An efficient message-passing scheduler based on guided self scheduling. Proceedings International Conference on Supercomputing, pp. 50\u201361","DOI":"10.1145\/318789.318796"},{"key":"10.1016\/S0065-2458(08)60706-8_bib36","unstructured":"Saltz, J.H. & Mirchandaney, R. (1991). The preprocessed DOACROSS Loop. Proceedings International Conference on Parallel Processing, II, 174\u2013179"},{"key":"10.1016\/S0065-2458(08)60706-8_bib37","doi-asserted-by":"crossref","unstructured":"Saltz, J.H., Mirchandaney, R. & Crowley, K. (1989). The DoConsider loop. Proceedings, International Conference on Supercomputing, pp. 29\u201340","DOI":"10.1145\/318789.318794"},{"key":"10.1016\/S0065-2458(08)60706-8_bib38","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1016\/0743-7315(90)90129-D","article-title":"Runtime scheduling and execution of loops on message passing machines","volume":"8","author":"Saltz","year":"1990","journal-title":"Journal of Parallel and Distributed Computing"},{"issue":"5","key":"10.1016\/S0065-2458(08)60706-8_bib39","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1109\/12.88484","article-title":"Runtime parallelization and scheduling of loops","volume":"40","author":"Saltz","year":"1991","journal-title":"IEEE Transactions on Computers"},{"key":"10.1016\/S0065-2458(08)60706-8_bib40","first-page":"91","article-title":"Data dependence analysis and uniformization for doubly nested loops","volume":"II","author":"Tzen","year":"1992","journal-title":"Proceedings International Conference on Parallel Processing"},{"key":"10.1016\/S0065-2458(08)60706-8_bib41","unstructured":"Yue, K.K. & Lilja, D.J. (1994b). Parameter Estimation for a Generalized Parallel Loop Scheduling Algorithm. Technical Report No. HPPC-94\u201318, Department of Computer Science, University of Minnesota"}],"container-title":["Advances in Computers","Emphasizing Parallel Programming Techniques"],"original-title":[],"language":"en","deposited":{"date-parts":[[2019,5,11]],"date-time":"2019-05-11T14:52:34Z","timestamp":1557586354000},"score":1,"resource":{"primary":{"URL":"https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0065245808607068"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1997]]},"ISBN":["9780120121458"],"references-count":41,"URL":"https:\/\/doi.org\/10.1016\/s0065-2458(08)60706-8","relation":{},"ISSN":["0065-2458"],"issn-type":[{"value":"0065-2458","type":"print"}],"subject":[],"published":{"date-parts":[[1997]]}}}