{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T21:53:40Z","timestamp":1768341220275,"version":"3.49.0"},"reference-count":31,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2025,5,1]],"date-time":"2025-05-01T00:00:00Z","timestamp":1746057600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computers"],"abstract":"<jats:p>The structure and memory organization of graphics processor units (GPUs) manufactured by NVIDIA and the use of CUDA programming technology to solve computational fluid dynamics (CFD) problems is reviewed and discussed. The potential of using a general-purpose GPU to solve fluid dynamics problems is examined. The code optimization with the utilization of various memory types is considered. Some CFD benchmark problems focused on simulation of viscous incompressible fluid flows are solved on GPUs. Consideration is given to the application of the finite volume method and projection method. Programming implementation of various components of the computational procedure, solution of Poisson equation for pressure and multigrid method to solve the system of algebraic equations, is provided. By using meshes of varying resolutions and different techniques for dividing up the input data into blocks, the speedup of the GPU solution is compared to the CPU approach.<\/jats:p>","DOI":"10.3390\/computers14050170","type":"journal-article","created":{"date-parts":[[2025,5,1]],"date-time":"2025-05-01T06:49:02Z","timestamp":1746082142000},"page":"170","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Application of Graphics Processor Unit Computing Resources to Solution of Incompressible Fluid Dynamics Problems"],"prefix":"10.3390","volume":"14","author":[{"given":"Redha","family":"Benhadj-Djilali","sequence":"first","affiliation":[{"name":"Department of Mechanical Engineering, Faculty of Engineering, Computing and the Environment, Kingston University, London SW15 3DW, UK"}]},{"given":"Arturas","family":"Gulevskis","sequence":"additional","affiliation":[{"name":"Department of Mechanical Engineering, Faculty of Engineering, Computing and the Environment, Kingston University, London SW15 3DW, UK"}]},{"given":"Konstantin","family":"Volkov","sequence":"additional","affiliation":[{"name":"Department of Mechanical Engineering, Faculty of Engineering, Computing and the Environment, Kingston University, London SW15 3DW, UK"}]}],"member":"1968","published-online":{"date-parts":[[2025,5,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1111\/j.1467-8659.2007.01012.x","article-title":"A survey of general-purpose computation on graphics hardware","volume":"26","author":"Owens","year":"2007","journal-title":"Comput. Graph. Forum"},{"key":"ref_2","unstructured":"Boreskov, A.V., and Kharlamov, A. (2010). Basics of Working with CUDA Technology, DMK-Press Publishing House."},{"key":"ref_3","unstructured":"Gorobets, A.V., Sukov, S.A., Zheleznyakov, A.O., Bogdanov, P.B., and Chetverushkin, B.N. (2011). Application of GPU in the Framework of Hybrid Two-Level Parallelization MPI+OpenMP on Heterogeneous Computing Systems, Publishing center of SUSU."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"102265","DOI":"10.1016\/j.jocs.2024.102265","article-title":"A GPU-accelerated computational fluid dynamics solver for assessing shear-driven indoor airflow and virus transmission by scale-resolved simulations","volume":"78","author":"Korhonen","year":"2024","journal-title":"J. Comput. Sci."},{"key":"ref_5","first-page":"36","article-title":"Using GPUs to improve multigrid solver performance on a cluster","volume":"4","author":"Goddeke","year":"2008","journal-title":"Int. J. Comput. Sci. Eng."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1016\/j.actaastro.2019.01.010","article-title":"Simulation of hypersonic flows with equilibrium chemical reactions on graphics processor units","volume":"163","author":"Emelyanov","year":"2019","journal-title":"Acta Astronaut."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"745","DOI":"10.1016\/j.actaastro.2022.09.044","article-title":"Simulation of high-temperature flowfield around hypersonic waverider using graphics processor units","volume":"204","author":"Dobrov","year":"2023","journal-title":"Acta Astronaut."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"112052","DOI":"10.1016\/j.jcp.2023.112052","article-title":"A unified framework for non-linear reconstruction schemes in a compact stencil. Part 1: Beyond second order","volume":"481","author":"Deng","year":"2023","journal-title":"J. Comput. Phys."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"113575","DOI":"10.1016\/j.jcp.2024.113575","article-title":"Efficient ROUND schemes on non-uniform grids applied to discontinuous Galerkin schemes with Godunov-type finite volume sub-cell limiting","volume":"522","author":"Deng","year":"2025","journal-title":"J. Comput. Phys."},{"key":"ref_10","unstructured":"Cohen, J.M., and Molemaker, M.J. (2009, January 18\u201322). A fast double precision CFD code using CUDA. Proceedings of the Parallel CFD Conference, Moffet Field, CA, USA."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"295","DOI":"10.20537\/2076-7633-2010-2-3-295-308","article-title":"Fundamentals of CUDA technology","volume":"2","author":"Kazennov","year":"2010","journal-title":"Comput. Res. Model."},{"key":"ref_12","first-page":"77","article-title":"Some technological aspects of using high-performance computing on graphics processors in applied software systems","volume":"11","author":"Galimov","year":"2010","journal-title":"Comput. Methods Program."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Jacobsen, D.A., and Senocak, I. (2011, January 4\u20137). A full-depth amalgamated parallel 3D geometric multigrid solver for GPU clusters. Proceedings of the 49th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition, Orlando, FL, USA.","DOI":"10.2514\/6.2011-946"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"2526","DOI":"10.1016\/j.cpc.2008.11.005","article-title":"Accelerating scientific computations with mixed precision algorithm","volume":"180","author":"Baboulin","year":"2009","journal-title":"Comput. Phys. Commun."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1016\/j.cma.2009.11.001","article-title":"CFD-based analysis and two-level aerodynamic optimization on graphics processing units","volume":"199","author":"Kampolis","year":"2010","journal-title":"Comput. Methods Appl. Mech. Eng."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1016\/j.jpdc.2012.04.003","article-title":"GPU programming strategies and trends in GPU computing","volume":"73","author":"Brodtkorb","year":"2012","journal-title":"J. Parallel Distrib. Comput."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1002\/fld.2254","article-title":"Running unstructured grid-based CFD solvers on modern graphics hardware","volume":"66","author":"Corrigan","year":"2011","journal-title":"Int. J. Numer. Methods Fluids"},{"key":"ref_18","first-page":"269","article-title":"An implementation of the splitting scheme on staggered grids for computing nonstationary flows of viscous incompressible fluid","volume":"6","author":"Volkov","year":"2005","journal-title":"Comput. Methods Program."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"745","DOI":"10.1090\/S0025-5718-1968-0242392-2","article-title":"Numerical solution of Navier\u2013Stokes equations","volume":"22","author":"Chorin","year":"1968","journal-title":"Math. Comput."},{"key":"ref_20","unstructured":"Belotserkovsky, O.M. (1994). Numerical Modeling in Continuous Media Mechanics, Fizmatlit."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Thibault, J.C., and Senocak, I. (2009, January 5\u20138). CUDA implementation of a Navier\u2013Stokes solver on multi-GPU desktop platforms for incompressible flows. Proceedings of the 47th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition, Orlando, FL, USA.","DOI":"10.2514\/6.2009-758"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1016\/0021-9991(82)90058-4","article-title":"High-Re solutions for incompressible flow using the Navier\u2013Stokes equations and a multigrid method","volume":"48","author":"Ghia","year":"1982","journal-title":"J. Comput. Phys."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1016\/0021-9991(83)90129-8","article-title":"Driven cavity flows by efficient numerical techniques","volume":"49","author":"Schreiber","year":"1983","journal-title":"J. Comput. Phys."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1016\/0021-9991(86)90008-2","article-title":"Block-implicit multigrid solution of Navier\u2013Stokes equations in primitive variables","volume":"158","author":"Vanka","year":"1986","journal-title":"J. Comput. Phys."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1006\/jcph.1995.1103","article-title":"Simulation of cavity flows by the lattice Boltzmann method","volume":"118","author":"Hou","year":"1995","journal-title":"J. Comput. Phys."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.jcp.2005.01.002","article-title":"A new paradigm for solving Navier\u2013Stokes equations: Stream function\u2013velocity formulation","volume":"207","author":"Gupta","year":"2008","journal-title":"J. Comput. Phys."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1007\/s10915-008-9186-8","article-title":"On the use of compact stream function-velocity formulation of steady Navier\u2013Stokes equations on geometries beyond rectangular","volume":"36","author":"Pandit","year":"2008","journal-title":"J. Sci. Comput."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1016\/j.compfluid.2010.12.012","article-title":"Multi relaxation time lattice Boltzmann simulations of deep lid driven cavity flows at different aspect ratios","volume":"45","author":"Lin","year":"2011","journal-title":"Comput. Fluids"},{"key":"ref_29","first-page":"554","article-title":"Exponential compact higher order scheme for steady incompressible Navier\u2013Stokes equations","volume":"4","author":"Mishra","year":"2012","journal-title":"Eng. Appl. Comput. Fluid Mech."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1016\/0021-9991(90)90149-U","article-title":"An efficient scheme for solving steady incompressible Navier\u2013Stokes equations","volume":"89","author":"Bruneau","year":"1990","journal-title":"J. Comput. Phys."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1016\/S0045-7930(98)00002-4","article-title":"Benchmark spectral results on the lid-driven cavity flow","volume":"27","author":"Botella","year":"1998","journal-title":"Comput. Fluids"}],"container-title":["Computers"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-431X\/14\/5\/170\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:25:59Z","timestamp":1760030759000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-431X\/14\/5\/170"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5,1]]},"references-count":31,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2025,5]]}},"alternative-id":["computers14050170"],"URL":"https:\/\/doi.org\/10.3390\/computers14050170","relation":{},"ISSN":["2073-431X"],"issn-type":[{"value":"2073-431X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,5,1]]}}}