{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,29]],"date-time":"2025-12-29T13:49:09Z","timestamp":1767016149181,"version":"build-2065373602"},"reference-count":30,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2024,4,17]],"date-time":"2024-04-17T00:00:00Z","timestamp":1713312000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Coordena\u00e7\u00e3o de Aperfei\u00e7oamento de Pessoal de N\u00edvel Superior","award":["88887.512576\/2020\u201300","402673\/2021\u20132"],"award-info":[{"award-number":["88887.512576\/2020\u201300","402673\/2021\u20132"]}]},{"DOI":"10.13039\/501100003593","name":"Conselho Nacional de Desenvolvimento Cient\u00edfico e Tecnol\u00f3gico","doi-asserted-by":"publisher","award":["88887.512576\/2020\u201300","402673\/2021\u20132"],"award-info":[{"award-number":["88887.512576\/2020\u201300","402673\/2021\u20132"]}],"id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Petr\u00f3leo Brasileiro S.A.","award":["88887.512576\/2020\u201300","402673\/2021\u20132"],"award-info":[{"award-number":["88887.512576\/2020\u201300","402673\/2021\u20132"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computation"],"abstract":"<jats:p>The maximum number of parallel threads in traditional CFD solutions is limited by the Central Processing Unit (CPU) capacity, which is lower than the capabilities of a modern Graphics Processing Unit (GPU). In this context, the GPU allows for simultaneous processing of several parallel threads with double-precision floating-point formatting. The present study was focused on evaluating the advantages and drawbacks of implementing LASER Beam Welding (LBW) simulations using the CUDA platform. The performance of the developed code was compared to that of three top-rated commercial codes executed on the CPU. The unsteady three-dimensional heat conduction Partial Differential Equation (PDE) was discretized in space and time using the Finite Volume Method (FVM). The Volumetric Thermal Capacitor (VTC) approach was employed to model the melting-solidification. The GPU solutions were computed using a CUDA-C language in-house code, running on a Gigabyte Nvidia GeForce RTX\u2122 3090 video card and an MSI 4090 video card (both made in Hsinchu, Taiwan), each with 24 GB of memory. The commercial solutions were executed on an Intel\u00ae Core\u2122 i9-12900KF CPU (made in Hillsboro, Oregon, United States of America) with a 3.6 GHz base clock and 16 cores. The results demonstrated that GPU and CPU processing achieve similar precision, but the GPU solution exhibited significantly faster speeds and greater power efficiency, resulting in speed-ups ranging from 75.6 to 1351.2 times compared to the CPU solutions. The in-house code also demonstrated optimized memory usage, with an average of 3.86 times less RAM utilization. Therefore, adopting parallelized algorithms run on GPU can lead to reduced CFD computational costs compared to traditional codes while maintaining high accuracy.<\/jats:p>","DOI":"10.3390\/computation12040083","type":"journal-article","created":{"date-parts":[[2024,4,17]],"date-time":"2024-04-17T09:15:20Z","timestamp":1713345320000},"page":"83","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["An Implementation of LASER Beam Welding Simulation on Graphics Processing Unit Using CUDA"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5624-183X","authenticated-orcid":false,"given":"Ernandes","family":"Nascimento","sequence":"first","affiliation":[{"name":"Aeronautics Institute of Technology\u2014ITA, S\u00e3o Jos\u00e9 dos Campos 12228-900, SP, Brazil"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0572-4943","authenticated-orcid":false,"given":"Elisan","family":"Magalh\u00e3es","sequence":"additional","affiliation":[{"name":"Aeronautics Institute of Technology\u2014ITA, S\u00e3o Jos\u00e9 dos Campos 12228-900, SP, Brazil"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9671-1493","authenticated-orcid":false,"given":"Arthur","family":"Azevedo","sequence":"additional","affiliation":[{"name":"Aeronautics Institute of Technology\u2014ITA, S\u00e3o Jos\u00e9 dos Campos 12228-900, SP, Brazil"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1897-9942","authenticated-orcid":false,"given":"Luiz E. S.","family":"Paes","sequence":"additional","affiliation":[{"name":"Faculty of Mechanical Engineering, Federal University of Uberl\u00e2ndia\u2014UFU, Uberl\u00e2ndia 38410-337, MG, Brazil"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-8339-2168","authenticated-orcid":false,"given":"Ariel","family":"Oliveira","sequence":"additional","affiliation":[{"name":"Aeronautics Institute of Technology\u2014ITA, S\u00e3o Jos\u00e9 dos Campos 12228-900, SP, Brazil"}]}],"member":"1968","published-online":{"date-parts":[[2024,4,17]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0377-0427(00)00507-0","article-title":"From Finite Differences to Finite Elements","volume":"128","year":"2001","journal-title":"J. Comput. Appl. Math."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Moukalled, F., Mangani, L., and Darwish, M. (2016). The Finite Volume Method in Computational Fluid Dynamics, Springer International Publishing. Fluid Mechanics and Its Applications.","DOI":"10.1007\/978-3-319-16874-6"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"949","DOI":"10.1109\/JLT.2022.3205712","article-title":"16-Bit (4 \u00d7 4) Optical Random Access Memory (RAM) Bank","volume":"41","author":"Pappas","year":"2023","journal-title":"J. Light. Technol."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"e6018","DOI":"10.1002\/cpe.6018","article-title":"Comparing Unified, Pinned, and Host\/Device Memory Allocations for Memory-Intensive Workloads on Tegra SoC","volume":"33","author":"Choi","year":"2021","journal-title":"Concurr. Comput."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1007\/s00162-023-00674-x","article-title":"GPU Computing of Yield Stress Fluid Flows in Narrow Gaps","volume":"37","author":"Frigaard","year":"2023","journal-title":"Theor. Comput. Fluid Dyn."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"106874","DOI":"10.1016\/j.cpc.2019.106874","article-title":"A GPU-Accelerated Package for Simulation of Flow in Nanoporous Source Rocks with Many-Body Dissipative Particle Dynamics","volume":"247","author":"Xia","year":"2020","journal-title":"Comput. Phys. Commun."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"108248","DOI":"10.1016\/j.cpc.2021.108248","article-title":"FSEI-GPU: GPU Accelerated Simulations of the Fluid\u2013Structure\u2013Electrophysiology Interaction in the Left Heart","volume":"273","author":"Viola","year":"2022","journal-title":"Comput. Phys. Commun."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"108548","DOI":"10.1016\/j.cpc.2022.108548","article-title":"Development and Performance of a HemeLB GPU Code for Human-Scale Blood Flow Simulation","volume":"282","author":"Zacharoudiou","year":"2023","journal-title":"Comput. Phys. Commun."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"103312","DOI":"10.1016\/j.jfluidstructs.2021.103312","article-title":"A Fluid\u2013Structure Interaction Model for Free-Surface Flows and Flexible Structures Using Smoothed Particle Hydrodynamics on a GPU","volume":"104","author":"Rogers","year":"2021","journal-title":"J. Fluids Struct."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2376","DOI":"10.1016\/j.cpc.2012.06.005","article-title":"Optimizations of a GPU Accelerated Heat Conduction Equation by a Programming of CUDA Fortran from an Analysis of a PTX File","volume":"183","author":"Satake","year":"2012","journal-title":"Comput. Phys. Commun."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.jmatprotec.2015.06.016","article-title":"A Rapid GPU-Based Heat Transfer and Solidification Model for Dynamic Computer Simulations of Continuous Steel Casting","volume":"226","year":"2015","journal-title":"J. Mater. Process. Technol."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"e138","DOI":"10.7717\/peerj-cs.138","article-title":"Solving the Inverse Heat Conduction Problem Using NVLink Capable Power Architecture","volume":"3","year":"2017","journal-title":"PeerJ Comput. Sci."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3846\/mla.2020.13500","article-title":"Comparison of GPU and CPU Efficiency While Solving Heat Conduction Problems","volume":"12","author":"Semenenko","year":"2020","journal-title":"Sci. Future Lith."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1007\/s10973-020-09924-0","article-title":"SADI Approach Programming on GPU: Convective Heat Transfer of Nanofluids Flow inside a Wavy Channel","volume":"146","author":"Taghavi","year":"2021","journal-title":"J. Therm. Anal. Calorim."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"107680","DOI":"10.1016\/j.jqsrt.2021.107680","article-title":"A Fast GPU Monte Carlo Implementation for Radiative Heat Transfer in Graded-Index Media","volume":"269","author":"Shao","year":"2021","journal-title":"J. Quant. Spectrosc. Radiat. Transf."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"102135","DOI":"10.1016\/j.csite.2022.102135","article-title":"A Comparison between Nonlinear and Constant Thermal Properties Approaches to Estimate the Temperature in LASER Welding Simulation","volume":"35","year":"2022","journal-title":"Case Stud. Therm. Eng."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"106672","DOI":"10.1016\/j.icheatmasstransfer.2023.106672","article-title":"The Volumetric Thermal Capacitor Method for Nonlinear Heat Transfer in Phase-Change Materials","volume":"142","year":"2023","journal-title":"Int. Commun. Heat Mass Transf."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2917","DOI":"10.1007\/s00170-023-11253-z","article-title":"A Literature Review in Heat Source Thermal Modeling Applied to Welding and Similar Processes","volume":"126","author":"Nascimento","year":"2023","journal-title":"Int. J. Adv. Manuf. Technol."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"4465","DOI":"10.1016\/j.matpr.2020.12.842","article-title":"Determination of Heat Source Model for Simulating Full Penetration Laser Welding of 316 LN Stainless Steel by Computational Fluid Dynamics","volume":"45","author":"Unni","year":"2021","journal-title":"Mater. Today Proc."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Beh\u00falov\u00e1, M., and Babalov\u00e1, E. (2023). Numerical Simulation of Temperature Fields during Laser Welding\u2013Brazing of Al\/Ti Plates. Materials, 16.","DOI":"10.3390\/ma16062258"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"2300309","DOI":"10.1002\/adem.202300309","article-title":"Influence of Heat Input on Microstructure and Mechanical Properties of Laser Welding GH4169 Bolt Assembly\u2014Numerical and Experimental Analysis","volume":"25","author":"Deng","year":"2023","journal-title":"Adv. Eng. Mater."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"121002","DOI":"10.1115\/1.4054903","article-title":"Evaluation of Phase Transformation Strain and Its Influence on Residual Stress Generation in Laser Welded Ti\u20136Al\u20134V Alloy","volume":"144","author":"Kumar","year":"2022","journal-title":"J. Manuf. Sci. Eng."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"106780","DOI":"10.1016\/j.ijthermalsci.2020.106780","article-title":"Modified Nonlinear Function Specification and Temperature Moving Sensor to Estimate the Heat Supply in a Welding Process","volume":"162","year":"2021","journal-title":"Int. J. Therm. Sci."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"107846","DOI":"10.1016\/j.ijthermalsci.2022.107846","article-title":"Estimating the Absorption Efficiency in a Laser Welding Process Using a Nonlinear Inverse Problem Method","volume":"183","author":"Pereira","year":"2023","journal-title":"Int. J. Therm. Sci."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"121857","DOI":"10.1016\/j.ijheatmasstransfer.2021.121857","article-title":"A Quadrilateral Optimization Method for Non-Linear Thermal Properties Determination in Materials at High Temperatures","volume":"181","year":"2021","journal-title":"Int. J. Heat Mass Transf."},{"key":"ref_26","unstructured":"Versteeg, H.K., and Malalasekera, W. (2007). An Introduction to Computational Fluid Dynamics, Pearson Education Limited. [2nd ed.]."},{"key":"ref_27","unstructured":"Crank, J. (1984). Free and Moving Boundary Problems, Clarendon Press."},{"key":"ref_28","unstructured":"Bergman, T.L., Lavine, A.S., Incropera, F.P., and Dewitt, D.P. (2011). Fundamentals of Heat and Mass Transfer, John Wiley & Sons. [7th ed.]."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1016\/j.icheatmasstransfer.2018.02.014","article-title":"A Thermal Analysis in Laser Welding Using Inverse Problems","volume":"92","author":"Paes","year":"2018","journal-title":"Int. Commun. Heat Mass Transf."},{"key":"ref_30","unstructured":"Codes for the Representation of Currencies and Funds. ISO: 2015 (Standard No. ISO 4217:2015). Available online: https:\/\/www.iso.org\/iso-4217-currency-codes.html."}],"container-title":["Computation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2079-3197\/12\/4\/83\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:29:13Z","timestamp":1760106553000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2079-3197\/12\/4\/83"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,17]]},"references-count":30,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2024,4]]}},"alternative-id":["computation12040083"],"URL":"https:\/\/doi.org\/10.3390\/computation12040083","relation":{},"ISSN":["2079-3197"],"issn-type":[{"type":"electronic","value":"2079-3197"}],"subject":[],"published":{"date-parts":[[2024,4,17]]}}}