{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,9,18]],"date-time":"2023-09-18T05:02:28Z","timestamp":1695013348897},"reference-count":32,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2013,4,5]],"date-time":"2013-04-05T00:00:00Z","timestamp":1365120000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J Supercomput"],"published-print":{"date-parts":[[2013,10]]},"DOI":"10.1007\/s11227-013-0915-x","type":"journal-article","created":{"date-parts":[[2013,4,4]],"date-time":"2013-04-04T09:20:43Z","timestamp":1365067243000},"page":"406-430","source":"Crossref","is-referenced-by-count":13,"title":["Analysis of scalable data-privatization threading algorithms for hybrid MPI\/OpenMP parallelization of molecular dynamics"],"prefix":"10.1007","volume":"66","author":[{"given":"Manaschai","family":"Kunaseth","sequence":"first","affiliation":[]},{"given":"David F.","family":"Richards","sequence":"additional","affiliation":[]},{"given":"James N.","family":"Glosli","sequence":"additional","affiliation":[]},{"given":"Rajiv K.","family":"Kalia","sequence":"additional","affiliation":[]},{"given":"Aiichiro","family":"Nakano","sequence":"additional","affiliation":[]},{"given":"Priya","family":"Vashishta","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2013,4,5]]},"reference":[{"key":"915_CR1","volume-title":"Supercomputing","author":"JC Phillips","year":"2002","unstructured":"Phillips JC, Zheng G, Kumar S, Kale\u2019 LV (2002) NAMD: biomolecular simulations on thousands of processors. In: Supercomputing, Los Alamitos, CA"},{"issue":"1","key":"915_CR2","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1016\/j.jcp.2006.06.014","volume":"221","author":"KJ Bowers","year":"2007","unstructured":"Bowers KJ, Dror RO, Shaw DE (2007) Zonal methods for the parallel execution of range-limited N-body simulations. J Comput Phys 221(1):303\u2013329","journal-title":"J Comput Phys"},{"issue":"3","key":"915_CR3","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1021\/ct700301q","volume":"4","author":"B Hess","year":"2008","unstructured":"Hess B, Kutzner C, van\u00a0der Spoel D, Lindahl E (2008) GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput 4(3):435\u2013447","journal-title":"J Chem Theory Comput"},{"key":"915_CR4","volume-title":"Supercomputing","author":"DE Shaw","year":"2009","unstructured":"Shaw DE, Dror RO, Salmon JK, Grossman JP, Mackenzie KM, Bank JA, Young C, Deneroff MM, Batson B, Bowers KJ, Chow E, Eastwood MP, Ierardi DJ, Klepeis JL, Kuskin JS, Larson RH, Lindorff-Larsen K, Maragakis P, Moraes MA, Piana S, Shan Y, Towles B (2009) Millisecond-scale molecular dynamics simulations on Anton. In: Supercomputing, Portland, OR"},{"key":"915_CR5","volume-title":"International parallel and distributed processing symposium","author":"K Nomura","year":"2009","unstructured":"Nomura K, Dursun H, Seymour R, Wang W, Kalia RK, Nakano A, Vashishta P, Shimojo F, Yang LH (2009) A metascalable computing framework for large spatiotemporal-scale atomistic simulations. In: International parallel and distributed processing symposium"},{"key":"915_CR6","doi-asserted-by":"crossref","DOI":"10.1063\/1.3139006","volume":"130","author":"A Kushima","year":"2009","unstructured":"Kushima A, Lin X, Li J, Eapen J, Mauro JC, Qian X, Diep P, Yip S (2009) Computing the viscosity of supercooled liquids. J Chem Phys 130:224501","journal-title":"J Chem Phys"},{"key":"915_CR7","volume-title":"Material research society symposium proceeding","author":"W Wang","year":"2009","unstructured":"Wang W, Clark R, Nakano A, Kalia RK, Vashishta P (2009) Multi-million atom molecular dynamics study of combustion mechanism of aluminum nanoparticle. In: Material research society symposium proceeding"},{"key":"915_CR8","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1088\/1742-6596\/46\/1\/037","volume":"46","author":"FH Streitz","year":"2006","unstructured":"Streitz FH, Glosli JN, Patel MV, Chan B, Yates RK, de Supinski BR, Sexton J, Gunnels JA (2006) Simulating solidification in metals at high pressure: the drive to petascale computing. J Phys Conf Ser 46:254\u2013267","journal-title":"J Phys Conf Ser"},{"issue":"3","key":"915_CR9","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1016\/j.cpc.2011.10.012","volume":"183","author":"WM Brown","year":"2012","unstructured":"Brown WM, Kohlmeyer A, Plimpton SJ, Tharrington AN (2012) Implementing molecular dynamics on hybrid high performance computers\u2014particle-particle particle-mesh. Comput Phys Commun 183(3):449\u2013459","journal-title":"Comput Phys Commun"},{"key":"915_CR10","volume-title":"International parallel and distributed processing symposium","author":"SR Alam","year":"2008","unstructured":"Alam SR, Agarwal PK, Hampton SS, Ong H, Vetter JS (2008) Impact of multicores on large-scale molecular dynamics simulations. In: International parallel and distributed processing symposium, Miami, FL"},{"issue":"1","key":"915_CR11","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1109\/MC.2011.15","volume":"44","author":"SH Fuller","year":"2011","unstructured":"Fuller SH, Millett LI (2011) Computing performance: game over or next level? Computer 44(1):31\u201338","journal-title":"Computer"},{"key":"915_CR12","volume-title":"International conference on parallel and distributed processing techniques and applications","author":"L Peng","year":"2009","unstructured":"Peng L, Kunaseth M, Dursun H, Nomura K, Wang W, Kalia RK, Nakano A, Vashishta P (2009) A\u00a0scalable hierarchical parallelization framework for molecular dynamics simulation on multicore clusters. In: International conference on parallel and distributed processing techniques and applications, Las Vegas, NV"},{"issue":"3","key":"915_CR13","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1177\/1094342009106188","volume":"23","author":"MJ Chorley","year":"2009","unstructured":"Chorley MJ, Walker DW, Guest MF (2009) Hybrid message-passing and shared-memory programming in a molecular dynamics application on multicore clusters. Int J High Perform C 23(3):196\u2013211","journal-title":"Int J High Perform C"},{"key":"915_CR14","first-page":"427","volume-title":"Euromicro workshop","author":"R Rabenseifner","year":"2009","unstructured":"Rabenseifner R, Hager G, Jost G (2009) Hybrid MPI\/OpenMP parallel programming on clusters of multi-core SMP nodes. In: Euromicro workshop, pp 427\u2013436"},{"key":"915_CR15","volume-title":"International symposium on parallel and distributed processing with applications","author":"C Osthoff","year":"2011","unstructured":"Osthoff C, Grunmann P, Boito F, Kassick R, Pilla L, Navaux P, Schepke C, Panetta J, Maillard N, Silva Dias PL, Walko R (2011) Improving performance on atmospheric models through a hybrid OpenMP\/MPI implementation. In: International symposium on parallel and distributed processing with applications"},{"key":"915_CR16","volume-title":"Supercomputing","author":"JN Glosli","year":"2007","unstructured":"Glosli JN, Richards DF, Caspersen KJ, Rudd RE, Gunnels JA, Streitz FH (2007) Extending stability beyond CPU millennium: a micron-scale atomistic simulation of Kelvin\u2013Helmholtz instability. In: Supercomputing, Reno, NV"},{"issue":"4","key":"915_CR17","doi-asserted-by":"crossref","first-page":"3298","DOI":"10.1063\/1.467576","volume":"101","author":"D York","year":"1994","unstructured":"York D, Yang W (1994) The fast Fourier Poisson method for calculating Ewald sums. J Chem Phys 101(4):3298\u20133300","journal-title":"J Chem Phys"},{"key":"915_CR18","volume-title":"Computer simulation using particles","author":"R Hockney","year":"1981","unstructured":"Hockney R, Eastwood J (1981) Computer simulation using particles. McGraw-Hill, New York"},{"issue":"12","key":"915_CR19","doi-asserted-by":"crossref","first-page":"10089","DOI":"10.1063\/1.464397","volume":"98","author":"T Darden","year":"1993","unstructured":"Darden T, York D, Pedersen L (1993) Particle mesh Ewald: an Nlog(N) method for Ewald sums in large systems. J Chem Phys 98(12):10089\u201310092","journal-title":"J Chem Phys"},{"key":"915_CR20","volume-title":"Supercomputing","author":"DF Richards","year":"2009","unstructured":"Richards DF, Glosli JN, Chan B, Dorr MR, Draeger EW, Fattebert J-L, Krauss WD, Spelce T, Streitz FH, Surh MP, Gunnels JA (2009) Beyond homogeneous decomposition: scaling long-range forces on massively parallel systems. In: Supercomputing, Portland, OR"},{"issue":"12","key":"915_CR21","doi-asserted-by":"crossref","first-page":"2608","DOI":"10.1016\/j.cpc.2012.07.013","volume":"183","author":"J-L Fattebert","year":"2012","unstructured":"Fattebert J-L, Richards DF, Glosli JN (2012) Dynamic load balancing algorithm for molecular dynamics based on Voronoi cells domain decompositions. Comput Phys Commun 183(12):2608\u20132615","journal-title":"Comput Phys Commun"},{"issue":"3","key":"915_CR22","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1023\/A:1011119519789","volume":"29","author":"J Mellor-Crummey","year":"2001","unstructured":"Mellor-Crummey J, Whalley D, Kennedy K (2001) Improving memory hierarchy performance for irregular applications using data and computation reorderings. Int J Parallel Program 29(3):217\u2013247","journal-title":"Int J Parallel Program"},{"issue":"1","key":"915_CR23","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1007\/s11227-011-0560-1","volume":"57","author":"L Peng","year":"2011","unstructured":"Peng L, Kunaseth M, Dursun H, Nomura K, Wang WQ, Kalia RK, Nakano A, Vashishta P (2011) Exploiting hierarchical parallelisms for molecular dynamics simulation on multicore clusters. J Supercomput 57(1):20\u201333","journal-title":"J Supercomput"},{"key":"915_CR24","volume-title":"International parallel and distributed processing","author":"S Penmatsa","year":"2007","unstructured":"Penmatsa S, Chronopoulos AT, Karonis NT, Toonen B (2007) Implementation of distributed loop scheduling schemes on the TeraGrid. In: International parallel and distributed processing, Long Beach, CA"},{"key":"915_CR25","volume-title":"International parallel and distributed processing","author":"FM Ciorba","year":"2006","unstructured":"Ciorba FM, Andronikos T, Riakiotakis AT, Papakonstantinou G (2006) Dynamic multi-phase scheduling for heterogeneous clusters. In: International parallel and distributed processing, Rhodes, Greece"},{"issue":"15","key":"915_CR26","doi-asserted-by":"crossref","first-page":"5486","DOI":"10.1016\/j.jcp.2010.03.047","volume":"229","author":"A Sunarso","year":"2010","unstructured":"Sunarso A, Tsuji T, Chono S (2010) GPU-accelerated molecular dynamics simulation for study of liquid crystalline flows. J Comput Phys 229(15):5486\u20135497","journal-title":"J Comput Phys"},{"issue":"2","key":"915_CR27","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1016\/j.jcp.2006.06.039","volume":"221","author":"J Yang","year":"2007","unstructured":"Yang J, Wang Y, Chen Y (2007) GPU accelerated molecular dynamics simulation of thermal conductivities. J Comput Phys 221(2):799\u2013804","journal-title":"J Comput Phys"},{"key":"915_CR28","volume-title":"International conference on parallel processing workshops","author":"C Hu","year":"2009","unstructured":"Hu C, Liu Y, Li J (2009) Efficient parallel implementation of molecular dynamics with embedded atom method on multi-core platforms. In: International conference on parallel processing workshops"},{"issue":"2","key":"915_CR29","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1016\/j.cpc.2009.10.009","volume":"181","author":"DW Holmes","year":"2010","unstructured":"Holmes DW, Williams JR, Tilke P (2010) An events based algorithm for distributing concurrent tasks on multi-core architectures. Comput Phys Commun 181(2):341\u2013354","journal-title":"Comput Phys Commun"},{"key":"915_CR30","volume-title":"Supercomputing","author":"K Madduri","year":"2009","unstructured":"Madduri K, Williams S, Ethier S, Oliker L, Shalf J, Strohmaier E, Yelicky K (2009) Memory-efficient optimization of gyrokinetic particle-to-grid interpolation for multicore processors. In: Supercomputing, Portland, OR"},{"key":"915_CR31","volume-title":"Algorithm design","author":"J Kleinberg","year":"2005","unstructured":"Kleinberg J, Tardos E (2005) Algorithm design, 2 edn. Pearson Education, Upper Saddle River","edition":"2"},{"key":"915_CR32","volume-title":"International parallel and distributed processing symposium","author":"UV Catalyurek","year":"2007","unstructured":"Catalyurek UV, Boman EG, Devine KD, Bozdag D, Heaphy R, Riesen L (2007) A hypergraph-based dynamic load balancing for adaptive scientific computations. In: International parallel and distributed processing symposium"}],"container-title":["The Journal of Supercomputing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-013-0915-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11227-013-0915-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-013-0915-x","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,1]],"date-time":"2019-06-01T10:24:09Z","timestamp":1559384649000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11227-013-0915-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,4,5]]},"references-count":32,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,10]]}},"alternative-id":["915"],"URL":"https:\/\/doi.org\/10.1007\/s11227-013-0915-x","relation":{},"ISSN":["0920-8542","1573-0484"],"issn-type":[{"value":"0920-8542","type":"print"},{"value":"1573-0484","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,4,5]]}}}