{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:14:08Z","timestamp":1750306448453,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":41,"publisher":"ACM","license":[{"start":{"date-parts":[[2015,11,15]],"date-time":"2015-11-15T00:00:00Z","timestamp":1447545600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2015,11,15]]},"DOI":"10.1145\/2832087.2832090","type":"proceedings-article","created":{"date-parts":[[2015,11,11]],"date-time":"2015-11-11T13:07:06Z","timestamp":1447247226000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Examining recent many-core architectures and programming models using SHOC"],"prefix":"10.1145","author":[{"given":"M. Graham","family":"Lopez","sequence":"first","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN"}]},{"given":"Jeffrey","family":"Young","sequence":"additional","affiliation":[{"name":"Georgia Institute of Technology, Atlanta, GA"}]},{"given":"Jeremy S.","family":"Meredith","sequence":"additional","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN"}]},{"given":"Philip C.","family":"Roth","sequence":"additional","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN"}]},{"given":"Mitchel","family":"Horton","sequence":"additional","affiliation":[{"name":"University of Tennessee-Knoxville, Knoxville, TN"}]},{"given":"Jeffrey S.","family":"Vetter","sequence":"additional","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN"}]}],"member":"320","published-online":{"date-parts":[[2015,11,15]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1654059.1654078"},{"key":"e_1_3_2_1_2_1","volume-title":"Online","author":"Berkeley View Group","year":"2015","unstructured":"Berkeley View Group . Dwarf mine overview . Online , Sept. 2015 , 2015. http:\/\/view.eecs.berkeley.edu\/wiki\/Dwarf_Mine. Berkeley View Group. Dwarf mine overview. Online, Sept. 2015, 2015. http:\/\/view.eecs.berkeley.edu\/wiki\/Dwarf_Mine."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2012.6402918"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2013.6704684"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2009.5306797"},{"key":"e_1_3_2_1_6_1","volume-title":"GPU Technology Conference, 2015","author":"Choi J.","year":"2015","unstructured":"J. Choi . Evaluation of the Jetson TK1 development board for power and performance . GPU Technology Conference, 2015 . http:\/\/on-demand.gputechconf.com\/gtc\/ 2015 \/presentation\/S5407-Jee-Choi.pdf. J. Choi. Evaluation of the Jetson TK1 development board for power and performance. GPU Technology Conference, 2015. http:\/\/on-demand.gputechconf.com\/gtc\/2015\/presentation\/S5407-Jee-Choi.pdf."},{"key":"e_1_3_2_1_7_1","first-page":"2012","article-title":"Intel Xeon Phi coprocessor - the architecture (white paper)","author":"Chryos G.","year":"2012","unstructured":"G. Chryos . Intel Xeon Phi coprocessor - the architecture (white paper) . HotChips 2012 , 2012 . https:\/\/software.intel.com\/en-us\/articles\/intel-xeon-phi-coprocessor-codename-knights-corner. G. Chryos. Intel Xeon Phi coprocessor - the architecture (white paper). HotChips 2012, 2012. https:\/\/software.intel.com\/en-us\/articles\/intel-xeon-phi-coprocessor-codename-knights-corner.","journal-title":"HotChips"},{"key":"e_1_3_2_1_8_1","volume-title":"Defining software requirements for scientific computing. DARPA HPCS presentation","author":"Colella P.","year":"2004","unstructured":"P. Colella . Defining software requirements for scientific computing. DARPA HPCS presentation , 2004 . P. Colella. Defining software requirements for scientific computing. DARPA HPCS presentation, 2004."},{"key":"e_1_3_2_1_9_1","unstructured":"T. P. P. Council. TPC Benchmark H (Decision Support) Standard Specification Revision 2.17.0. 2013. http:\/\/www.tpc.org\/tpch\/spec\/tpch2.17.0.pdf.  T. P. P. Council. TPC Benchmark H (Decision Support) Standard Specification Revision 2.17.0. 2013. http:\/\/www.tpc.org\/tpch\/spec\/tpch2.17.0.pdf."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1735688.1735702"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2012.99"},{"key":"e_1_3_2_1_12_1","unstructured":"GCC likely to support both OpenACC and Intel Xeon Phi offload pragmas in 2015. 2014. http:\/\/www.techenablement.com\/gcc-likely-to-support-both-openacc-and-intel-xeon-phi-offload-pragmas-in-2015.  GCC likely to support both OpenACC and Intel Xeon Phi offload pragmas in 2015. 2014. http:\/\/www.techenablement.com\/gcc-likely-to-support-both-openacc-and-intel-xeon-phi-offload-pragmas-in-2015."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/16\/1\/009"},{"key":"e_1_3_2_1_14_1","volume-title":"Exploring expression data: Identification and analysis of coexpressed genes. Genome Research, (11):1106--1115","author":"Heyer L. J.","year":"1999","unstructured":"L. J. Heyer , S. Kruglyak , and S. Yooseph . Exploring expression data: Identification and analysis of coexpressed genes. Genome Research, (11):1106--1115 , 1999 . L. J. Heyer, S. Kruglyak, and S. Yooseph. Exploring expression data: Identification and analysis of coexpressed genes. Genome Research, (11):1106--1115, 1999."},{"key":"e_1_3_2_1_15_1","unstructured":"Intel Math Kernel Library. 2014. https:\/\/software.intel.com\/en-us\/intel-mkl.  Intel Math Kernel Library. 2014. https:\/\/software.intel.com\/en-us\/intel-mkl."},{"key":"e_1_3_2_1_16_1","volume-title":"Offload using a pragma","author":"XE","year":"2013","unstructured":"Intel C++ compiler XE 13.0 reference : Offload using a pragma . 2013 . https:\/\/software.intel.com\/sites\/products\/documentation\/doclib\/stdxe\/2013\/composerxe\/compiler\/cpp-lin\/index.htm#GUID-44F5B8E2-8EFD-4C51-ACF8-357900798834.htm. Intel C++ compiler XE 13.0 reference: Offload using a pragma. 2013. https:\/\/software.intel.com\/sites\/products\/documentation\/doclib\/stdxe\/2013\/composerxe\/compiler\/cpp-lin\/index.htm#GUID-44F5B8E2-8EFD-4C51-ACF8-357900798834.htm."},{"key":"e_1_3_2_1_17_1","volume-title":"Technical Report CNA-150","author":"PACK","year":"1979","unstructured":"IT PACK 2.0 user's guide. Technical Report CNA-150 , Center for Numerical Analysis , University of Texas, 1979 . ITPACK 2.0 user's guide. Technical Report CNA-150, Center for Numerical Analysis, University of Texas, 1979."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/2606265.2606942"},{"key":"e_1_3_2_1_19_1","volume-title":"The MNIST database of handwritten digits","author":"LeCun Y.","year":"2014","unstructured":"Y. LeCun , C. Cortes , and C. J. Burges . The MNIST database of handwritten digits . 2014 . http:\/\/yann.lecun.com\/exdb\/mnist\/. Y. LeCun, C. Cortes, and C. J. Burges. The MNIST database of handwritten digits. 2014. http:\/\/yann.lecun.com\/exdb\/mnist\/."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2600212.2600704"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/2388996.2389017"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40698-0_7"},{"key":"e_1_3_2_1_23_1","unstructured":"The MD5 message-digest algorithm. RFC1321 1992. https:\/\/www.ietf.org\/rfc\/rfc1321.txt.  The MD5 message-digest algorithm. RFC1321 1992. https:\/\/www.ietf.org\/rfc\/rfc1321.txt."},{"key":"e_1_3_2_1_24_1","unstructured":"A. Ng. Stanford online machine learning course. April 2014. http:\/\/online.stanford.edu\/course\/machine-learning.  A. Ng. Stanford online machine learning course. April 2014. http:\/\/online.stanford.edu\/course\/machine-learning."},{"key":"e_1_3_2_1_25_1","volume-title":"Neural networks and deep learning","author":"Nielsen M.","year":"2014","unstructured":"M. Nielsen . Neural networks and deep learning . October 2014 . https:\/\/github.com\/mnielsen\/neural-networks-and-deep-learning. M. Nielsen. Neural networks and deep learning. October 2014. https:\/\/github.com\/mnielsen\/neural-networks-and-deep-learning."},{"key":"e_1_3_2_1_26_1","unstructured":"NVIDIA cuSPARSE. 2014. https:\/\/developer.nvidia.com\/cuSPARSE.  NVIDIA cuSPARSE. 2014. https:\/\/developer.nvidia.com\/cuSPARSE."},{"key":"e_1_3_2_1_27_1","unstructured":"The OpenACC Application Programming Interface. 2011. http:\/\/www.openacc.org\/sites\/default\/files\/OpenACC.1.0_0.pdf.  The OpenACC Application Programming Interface. 2011. http:\/\/www.openacc.org\/sites\/default\/files\/OpenACC.1.0_0.pdf."},{"volume-title":"GPU Technology Conference, 2014","year":"2014","key":"e_1_3_2_1_28_1","unstructured":"OpenUH : open source OpenACC compiler . GPU Technology Conference, 2014 . http:\/\/on-demand. gputechconf.com\/gtc\/ 2014 \/presentations\/S4343-openuh-open-source-openacc-compiler.pdf. OpenUH: open source OpenACC compiler. GPU Technology Conference, 2014. http:\/\/on-demand. gputechconf.com\/gtc\/2014\/presentations\/S4343-openuh-open-source-openacc-compiler.pdf."},{"key":"e_1_3_2_1_29_1","volume-title":"Whitepaper","author":"Fortran PGI","year":"2009","unstructured":"PGI Fortran and C accelerator programming model . Whitepaper , 2009 . http:\/\/www.pgroup.com\/lit\/whitepapers\/pgi_accel_prog_model_1.1.pdf. PGI Fortran and C accelerator programming model. Whitepaper, 2009. http:\/\/www.pgroup.com\/lit\/whitepapers\/pgi_accel_prog_model_1.1.pdf."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2726935.2726943"},{"key":"e_1_3_2_1_31_1","volume-title":"Live: Jen-Hsuang Huang kicks off NVIDIA's 2014 GPU Technology Conference","author":"Sherman B.","year":"2014","unstructured":"B. Sherman . Live: Jen-Hsuang Huang kicks off NVIDIA's 2014 GPU Technology Conference . March 2014 . http:\/\/blogs.nvidia.com\/blog\/2014\/03\/25\/live-jen-hsun-huang-gtc-14\/. B. Sherman. Live: Jen-Hsuang Huang kicks off NVIDIA's 2014 GPU Technology Conference. March 2014. http:\/\/blogs.nvidia.com\/blog\/2014\/03\/25\/live-jen-hsun-huang-gtc-14\/."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/1884795.1884812"},{"key":"e_1_3_2_1_33_1","volume-title":"Fast collision attack on MD5. IACR Cryptology ePrint Archive","author":"Stevens M.","year":"2006","unstructured":"M. Stevens . Fast collision attack on MD5. IACR Cryptology ePrint Archive , 2006 :104, 2006. M. Stevens. Fast collision attack on MD5. IACR Cryptology ePrint Archive, 2006:104, 2006."},{"key":"e_1_3_2_1_34_1","volume-title":"Parboil: A revised benchmark suite for scientific and commercial throughput computing","author":"Stratton J. A.","year":"2012","unstructured":"J. A. Stratton , C. Rodrigues , I.-J. Sung , N. Obeid , L.-W. Chang , N. Anssari , G. D. Liu , and W.-M. W. Hwu . Parboil: A revised benchmark suite for scientific and commercial throughput computing . Center for Reliable and High-Performance Computing , 2012 . J. A. Stratton, C. Rodrigues, I.-J. Sung, N. Obeid, L.-W. Chang, N. Anssari, G. D. Liu, and W.-M. W. Hwu. Parboil: A revised benchmark suite for scientific and commercial throughput computing. Center for Reliable and High-Performance Computing, 2012."},{"key":"e_1_3_2_1_35_1","volume-title":"GPU, and CPU. CoRR, abs\/1311.0378","author":"Teodoro G.","year":"2013","unstructured":"G. Teodoro , T. M. Kur\u00e7 , J. Kong , L. A. D. Cooper , and J. H. Saltz . Comparative performance analysis of Intel Xeon Phi , GPU, and CPU. CoRR, abs\/1311.0378 , 2013 . G. Teodoro, T. M. Kur\u00e7, J. Kong, L. A. D. Cooper, and J. H. Saltz. Comparative performance analysis of Intel Xeon Phi, GPU, and CPU. CoRR, abs\/1311.0378, 2013."},{"key":"e_1_3_2_1_36_1","volume-title":"GTC15 keynote highlights 10x GPU computing growth","author":"Trader T.","year":"2015","unstructured":"T. Trader . GTC15 keynote highlights 10x GPU computing growth . March 2015 . http:\/\/www.hpcwire.com\/2015\/03\/17\/gtc15-keynote-highlights-10x-gpu-computing-growth\/. T. Trader. GTC15 keynote highlights 10x GPU computing growth. March 2015. http:\/\/www.hpcwire.com\/2015\/03\/17\/gtc15-keynote-highlights-10x-gpu-computing-growth\/."},{"key":"e_1_3_2_1_37_1","volume-title":"Online","author":"Vetter J. S.","year":"2015","unstructured":"J. S. Vetter . SHOC GitHub data repository . Online , Sept. 2015 , 2015. https:\/\/github.com\/vetter\/shoc\/tree\/master\/data. J. S. Vetter. SHOC GitHub data repository. Online, Sept. 2015, 2015. https:\/\/github.com\/vetter\/shoc\/tree\/master\/data."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2008.12.006"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2012.19"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2013.263"},{"key":"e_1_3_2_1_41_1","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1007\/978-3-319-17473-0_5","volume-title":"J. Brodman and P. Tu","author":"Xu R.","year":"2015","unstructured":"R. Xu , X. Tian , S. Chandrasekaran , Y. Yan , and B. Chapman . Nas parallel benchmarks for gpgpus using a directive-based programming model . In J. Brodman and P. Tu , editors, Languages and Compilers for Parallel Computing, volume 8967 of Lecture Notes in Computer Science , pages 67 -- 81 . Springer International Publishing , 2015 . R. Xu, X. Tian, S. Chandrasekaran, Y. Yan, and B. Chapman. Nas parallel benchmarks for gpgpus using a directive-based programming model. In J. Brodman and P. Tu, editors, Languages and Compilers for Parallel Computing, volume 8967 of Lecture Notes in Computer Science, pages 67--81. Springer International Publishing, 2015."}],"event":{"name":"SC15: The International Conference for High Performance Computing, Networking, Storage and Analysis","sponsor":["SIGHPC ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing","SIGARCH ACM Special Interest Group on Computer Architecture","IEEE-CS\\DATC IEEE Computer Society"],"location":"Austin Texas","acronym":"SC15"},"container-title":["Proceedings of the 6th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computing Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2832087.2832090","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2832087.2832090","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:43:04Z","timestamp":1750225384000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2832087.2832090"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,11,15]]},"references-count":41,"alternative-id":["10.1145\/2832087.2832090","10.1145\/2832087"],"URL":"https:\/\/doi.org\/10.1145\/2832087.2832090","relation":{},"subject":[],"published":{"date-parts":[[2015,11,15]]},"assertion":[{"value":"2015-11-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}