{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,4]],"date-time":"2024-09-04T20:01:19Z","timestamp":1725480079541},"publisher-location":"Berlin, Heidelberg","reference-count":38,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"type":"print","value":"9783540404354"},{"type":"electronic","value":"9783540450092"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2003]]},"DOI":"10.1007\/3-540-45009-2_6","type":"book-chapter","created":{"date-parts":[[2007,2,28]],"date-time":"2007-02-28T07:13:46Z","timestamp":1172646826000},"page":"69-83","source":"Crossref","is-referenced-by-count":4,"title":["Evaluation of OpenMP for the Cyclops Multithreaded Architecture"],"prefix":"10.1007","author":[{"given":"George","family":"Almasi","sequence":"first","affiliation":[]},{"given":"Eduard","family":"Ayguad\u00e9","sequence":"additional","affiliation":[]},{"given":"C\u0103lin","family":"Ca\u015fcaval","sequence":"additional","affiliation":[]},{"given":"Jos\u00e9","family":"Casta\u00f1os","sequence":"additional","affiliation":[]},{"given":"Jes\u00fas","family":"Labarta","sequence":"additional","affiliation":[]},{"given":"Francisco","family":"Mart\u00ednez","sequence":"additional","affiliation":[]},{"given":"Xavier","family":"Martorell","sequence":"additional","affiliation":[]},{"given":"Jos\u00e9","family":"Moreira","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2003,5,27]]},"reference":[{"key":"6_CR1","doi-asserted-by":"crossref","unstructured":"Anant Agarwal. Raw computation. Scientific American, August 1999.","DOI":"10.1038\/scientificamerican0899-60"},{"key":"6_CR2","doi-asserted-by":"crossref","unstructured":"George Alm\u00e1si, C\u0103lin Ca\u015fcaval, Jos\u00e9 G. Casta\u00f1os, Monty Denneau, Derek Lieber, Jos\u00e9 E. Moreira, and Henry S. Warren, Jr. 
Dissecting Cyclops: A detailed analysis of a multithreaded architecture. In MEDEA Workshop on On-Chip Multiprocessor: Processor Architecture and Memory Hierarchy related Issues, September 2002.","DOI":"10.1145\/773365.773369"},{"key":"6_CR3","unstructured":"D. Bailey, T. Harris, W. Saphir, R. van der Wijngaart, A. Woo, and Maurice Yarrow. The NAS parallel benchmarks 2.0. Technical Report NAS-95-020, NASA Ames Research Center, December 1995."},{"key":"6_CR4","doi-asserted-by":"crossref","unstructured":"L. Barroso, K. Gharachorloo, R. McNamara, A. Nowatzyk, S. Qadeer, B. Sano, S. Smith, R. Stets, and B. Verghese. Piranha: A scalable architecture based on single-chip multiprocessing. In 27th Annual International Symposium on Computer Architecture, pages 282\u2013293, June 2000.","DOI":"10.1145\/339647.339696"},{"key":"6_CR5","unstructured":"C\u0103lin Ca\u015fcaval, Jos\u00e9 Casta\u00f1os, Luis Ceze, Monty Denneau, Manish Gupta, Derek Lieber, Jos\u00e9 E. Moreira, Karin Strauss, and Henry S. Warren, Jr. Evaluation of a multithreaded architecture for cellular computing. In Proceedings of the 8th International Symposium of High Performance Computer Architecture, February 2002."},{"key":"6_CR6","unstructured":"Intel Corporation. Intel hyperthreading technology. http:\/\/www.intel.com\/info\/hyperthreading . 2003."},{"key":"6_CR7","unstructured":"D. Bailey, E. Barszcz, J. Barton, D. Browning, R. Carter, L. Dagum, R. Fatoohi, S. Fineberg, R. Frederickson, T. Lasinski, R. Schreiber, H. Simon, V. Venkatakrishnan, and S. Weeratunga. The NAS parallel benchmarks. Technical Report RNR-94-007, NASA Ames Research Center, March 1994."},{"key":"6_CR8","doi-asserted-by":"crossref","unstructured":"Susan Eggers, Joel Emer, Henry Levy, Jack Lo, Rebecca Stamm, and Dean Tullsen. Simultaneous multithreading: A platform for next-generation processors. 
IEEE Micro, pages 12\u201318, September\/October 1997.","DOI":"10.1109\/40.621209"},{"issue":"2","key":"6_CR9","doi-asserted-by":"publisher","first-page":"310","DOI":"10.1147\/sj.402.0310","volume":"40","author":"F. Allen","year":"2001","unstructured":"Frances Allen et al. Blue gene: A vision for protein science using a petaflop supercomputer. IBM Systems Journal, 40(2):310\u2013328, 2001.","journal-title":"IBM Systems Journal"},{"key":"6_CR10","doi-asserted-by":"crossref","unstructured":"M. Gonzalez, E. Ayguad\u00e9, X. Martorell, J. Labarta, N. Navarro, and J. Oliver. NanosCompiler: Supporting flexible multilevel parallelism in OpenMP. Concurrency: Practice and Experience, 12(9), August 2000.","DOI":"10.1002\/1096-9128(200010)12:12<1205::AID-CPE524>3.3.CO;2-U"},{"key":"6_CR11","doi-asserted-by":"crossref","unstructured":"M. W. Hall, P. Kogge, J. Koller, P. Diniz, J. Chame, J. Draper, J. LaCross, J. Brockman, W. Athas, A. Srivasava, V. Freech, J. Shin, and J. Park. Mapping irregular applications to DIVA, a PIM-based data-intensive architecture. In Proceedings of SC99, November 1999.","DOI":"10.1145\/331532.331589"},{"key":"6_CR12","unstructured":"H. Jin, M. Frumkin, and J. Yan. The OpenMP implementation of the NAS parallel benchmarks and its performance. Technical Report NAS-99-011, NASA Ames Research Center, October 1999."},{"key":"6_CR13","unstructured":"Yi Kang, Michael Huang, Seung-Moon Yoo, Zhenzho Ge, Diana Keen, Vinh Lam, Prattap Pattnaik, and Josep Torrellas. FlexRAM: Toward an advanced intelligent memory system. In International Conference on Computer Design (ICCD), October 1999."},{"key":"6_CR14","unstructured":"P. Kogge, S. Bass, J. Brockman, D. Chen, and E. Sha. Pursuing a petaflop: Point designs for 100 TF computers using PIM technologies. In Frontiers of Massively Parallel Computation Symposium, 1996."},{"key":"6_CR15","unstructured":"Peter M. Kogge. The EXECUBE approach to massively parallel processing. In Intl. Conf. 
on Parallel Processing, August 1994."},{"key":"6_CR16","unstructured":"Jack L. Lo, Susan J. Eggers, Henry M. Levy, Sujay S. Parekh, and Dean M. Tullsen. Tuning compiler optimizations for simultaneous multithreading. In International Symposium on Microarchitecture, pages 114\u2013124, 1997."},{"key":"6_CR17","unstructured":"H. Lu, Y. C. Hu, and W. Zwaenepoel. OpenMP on network of workstations. In Proc. of Supercomputing\u201998, 1998."},{"key":"6_CR18","doi-asserted-by":"crossref","unstructured":"X. Martorell, E. Ayguad\u00e9, J.I. Navarro, J. Corbal\u00e1n, M. Gonz\u00e1lez, and J. Labarta. Thread fork\/join techniques for multi-level parallelism exploitation in NUMA multiprocessors. In Proceedings of the 13th Int. Conference on Supercomputing ICS\u201999, June 1999.","DOI":"10.1145\/305138.305206"},{"key":"6_CR19","doi-asserted-by":"crossref","unstructured":"X. Martorell, J. Labarta, J.I. Navarro, and E. Ayguad\u00e9. A library implementation of the nano-threads programming model. In Proceedings of Euro-Par\u201996, August 1996.","DOI":"10.1007\/BFb0024761"},{"key":"6_CR20","unstructured":"OpenMP Organization. OpenMP Fortran application interface, v. 2.0. http:\/\/www.openmp.org , June 2000."},{"key":"6_CR21","doi-asserted-by":"crossref","unstructured":"Mark Oskin, Frederic T. Chong, and Timothy Sherwood. Active Pages: A computation model for intelligent memory. In International Symposium on Computer Architecture, pages 192\u2013203, 1998.","DOI":"10.1109\/ISCA.1998.694774"},{"key":"6_CR22","doi-asserted-by":"crossref","unstructured":"David Patterson, Thomas Anderson, Neal Cardwell, Richard Fromm, Kimberly Keeton, Christoforos Kozyrakis, Randi Thomas, and Katherine Yelick. A case for intelligent RAM: IRAM. In Proceedings of IEEE Micro, April 1997.","DOI":"10.1109\/40.592312"},{"key":"6_CR23","first-page":"39","volume":"II","author":"C. D. Polychronopoulos","year":"1989","unstructured":"Constantine D. Polychronopoulos, Milind B. 
Girkar, Mohammed Resa Haghighat, Chia Ling Lee, Bruce P. Leung, and Dale A. Schouten. Parafrase-2: An environment for parallelizing, partitioning, synchronizing and scheduling programs on multiprocessors. In 1989 International Conference on Parallel Processing, volume II, pages 39\u201348, St. Charles, Ill., 1989.","journal-title":"1989 International Conference on Parallel Processing"},{"key":"6_CR24","unstructured":"William H. Press, Saul A. Teukolsky, William T. Vetterling, and Brian P. Flannery. Numerical recipes in C. In Cambridge University Press, 1992."},{"key":"6_CR25","doi-asserted-by":"crossref","unstructured":"S. Rixner, W.J. Dally, U.J. Kapasi, B. Khailany, A. Lopez-Lagunas, P.R. Mattson, and J.D. Owens. A bandwidth-efficient architecture for media processing. In 31st International Symposium on Microarchitecture, November 1998.","DOI":"10.1109\/MICRO.1998.742118"},{"key":"6_CR26","unstructured":"M. Sato, S. Satoh, K. Kusano, and Y. Tanaka. Design of OpenMP compiler for an smp cluster, 1999."},{"key":"6_CR27","unstructured":"Scientific Computing Associates, Inc. PCGPACK user\u2019s guide."},{"key":"6_CR28","doi-asserted-by":"crossref","unstructured":"A. Snavely, L. Carter, J. Boisseau, A. Majumdar, K. S. Gatlin, N. Mitchel, J. Feo, and B. Koblenz. Multiprocessor performance on the Tera MTA. In Proceedings Supercomputing\u2019 98, Orlando, Florida, Nov. 7\u201313 1998.","DOI":"10.1109\/SC.1998.10049"},{"key":"6_CR29","unstructured":"A. Snavely, G. Johnson, and J. Genetti. Data intensive volume visualization on the Tera MTA and Cray T3E. In Proceedings of the High Performance Computing Symposium-HPC\u2019 99, pages 59\u201364, 1999."},{"key":"6_CR30","unstructured":"Silicon Graphics Computer Systems. Origin2000 and Onyx2 performance tuning and optimization guide. Technical Report Doc. num. 007-3430-002, 1998."},{"issue":"1","key":"6_CR31","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1147\/rd.461.0005","volume":"46","author":"J. M. 
Tendler","year":"2002","unstructured":"J. M. Tendler, J. S. Dodson, Jr. J. S. Fields, H. Le, and B. Sinharoy. POWER4 system microarchitecture. IBM Journal of Research and Development, 46(1):5\u201326, 2002.","journal-title":"IBM Journal of Research and Development"},{"key":"6_CR32","unstructured":"Josep Torrellas, Liuxi Yang, and Anthony-Trung Nguyen. Toward a cost-effective DSM organization that exploits processor-memory integration. In Sixth International Symposium on High-Performance Computer Architecture, January 2000."},{"key":"6_CR33","unstructured":"M. Tremblay. MAJC: Microprocessor architecture for Java computing. In Hot Chips, August 1999."},{"key":"6_CR34","doi-asserted-by":"crossref","unstructured":"Dean M. Tullsen, Susan J. Eggers, and Henry M. Levy. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of the 22nd Annual International Symposium on Computer Architecture, pages 392\u2013403, June 1995.","DOI":"10.1145\/225830.224449"},{"key":"6_CR35","doi-asserted-by":"crossref","unstructured":"Dean M. Tullsen, Jack L. Lo, Susan J. Eggers, and Henry M. Levy. Supporting fine-grained synchronization on a simultaneous multithreading processor. In HPCA, pages 54\u201358, 1999.","DOI":"10.1109\/HPCA.1999.744326"},{"key":"6_CR36","doi-asserted-by":"crossref","unstructured":"Elliot Waingold, Michael Taylor, Devabhaktuni Srikrishna, Vivek Sarkar, Walter Lee, Victor Lee, Jang Kim, Matthew Frank, Peter Finch, Rajeev Barua, Jonathan Babb, Saman Amarasinghe, and Anant Agarwal. Baring it all to software: Raw machines. IEEE Computer, pages 86\u201393, September 1997.","DOI":"10.1109\/2.612254"},{"key":"6_CR37","doi-asserted-by":"crossref","unstructured":"M. Yankelevsky and C. D. Polychronopoulos. \u03b1-Coral: A multigrain, multithreading processor architecture. In Proceedings of International Conference on Supercomputing\u201901, 2001.","DOI":"10.1145\/377792.377861"},{"key":"6_CR38","unstructured":"H. P. Zima and T. Sterling. 
The Gilgamesh processor-in-memory architecture and its execution model. In Workshop on Compilers for Parallel Computers, June 2001."}],"container-title":["Lecture Notes in Computer Science","OpenMP Shared Memory Parallel Programming"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/3-540-45009-2_6","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,4,24]],"date-time":"2019-04-24T20:15:34Z","timestamp":1556136934000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/3-540-45009-2_6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2003]]},"ISBN":["9783540404354","9783540450092"],"references-count":38,"URL":"https:\/\/doi.org\/10.1007\/3-540-45009-2_6","relation":{},"ISSN":["0302-9743"],"issn-type":[{"type":"print","value":"0302-9743"}],"subject":[],"published":{"date-parts":[[2003]]}}}