{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T02:05:47Z","timestamp":1775786747630,"version":"3.50.1"},"reference-count":54,"publisher":"Springer Science and Business Media LLC","issue":"9","license":[{"start":{"date-parts":[[2024,2,22]],"date-time":"2024-02-22T00:00:00Z","timestamp":1708560000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,2,22]],"date-time":"2024-02-22T00:00:00Z","timestamp":1708560000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004837","name":"Ministerio de Ciencia e Innovaci\u00f3n","doi-asserted-by":"publisher","award":["PID2020-113656RB-C21"],"award-info":[{"award-number":["PID2020-113656RB-C21"]}],"id":[{"id":"10.13039\/501100004837","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004837","name":"Ministerio de Ciencia e Innovaci\u00f3n","doi-asserted-by":"publisher","award":["PID2022-1370480A-C43"],"award-info":[{"award-number":["PID2022-1370480A-C43"]}],"id":[{"id":"10.13039\/501100004837","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004837","name":"Ministerio de Ciencia e Innovaci\u00f3n","doi-asserted-by":"publisher","award":["PID2022-138696OB-C21"],"award-info":[{"award-number":["PID2022-138696OB-C21"]}],"id":[{"id":"10.13039\/501100004837","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100012818","name":"Comunidad de Madrid","doi-asserted-by":"publisher","award":["MIMACUHSPACE-CM-UC3M"],"award-info":[{"award-number":["MIMACUHSPACE-CM-UC3M"]}],"id":[{"id":"10.13039\/100012818","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004834","name":"Universitat Jaume I","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100004834","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Supercomput"],"published-print":{"date-parts":[[2024,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Graphics processing units (GPUs) have become integral to embedded systems and supercomputing centres due to their large memory, cutting-edge technology and high performance per watt. However, their susceptibility to transient errors requires a comprehensive analysis of error sensitivity, as well as the development of error mitigation techniques and fault-tolerant algorithms. This study focuses on evaluating the soft-error sensitivity of two distinct versions of LU decomposition algorithms implemented on two very different GPUs\u2014a low-power SoC embedded GPU and a high-performance massively parallel GPU. Through extensive fault injection campaigns on both GPUs, we examine the vulnerability of the algorithms, identify error causes, and determine critical code components requiring enhanced protection. The experiments reveal that most single bit flip fault injections in the instruction results lead to erroneous outcomes or unrecoverable errors. Notably, efficient GPU resource utilisation can increase the number of masked errors, thereby enhancing error resilience. Additionally, while different parts of the code exhibit similar error occurrence types and rates, the propagation of errors to elements within the result matrix differs significantly.<\/jats:p>","DOI":"10.1007\/s11227-024-05925-0","type":"journal-article","created":{"date-parts":[[2024,2,22]],"date-time":"2024-02-22T01:03:50Z","timestamp":1708563830000},"page":"12844-12862","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Comparative analysis of soft-error sensitivity in LU decomposition algorithms on diverse GPUs"],"prefix":"10.1007","volume":"80","author":[{"given":"German","family":"Leon","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jose M.","family":"Badia","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jose A.","family":"Belloch","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Almudena","family":"Lindoso","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luis","family":"Entrena","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,2,22]]},"reference":[{"issue":"3","key":"5925_CR1","doi-asserted-by":"publisher","first-page":"5068","DOI":"10.1109\/JIOT.2019.2895742","volume":"6","author":"JA Belloch","year":"2019","unstructured":"Belloch JA, Bad\u00eda JM, Igual FD, Cobos M (2019) Practical considerations for acoustic source localization in the IoT era: platforms, energy efficiency, and performance. IEEE Internet Things J 6(3):5068\u20135079. https:\/\/doi.org\/10.1109\/JIOT.2019.2895742","journal-title":"IEEE Internet Things J"},{"key":"5925_CR2","doi-asserted-by":"crossref","unstructured":"Rech P, Pilla LL, Navaux POA, Carro L (2014) Impact of GPUs parallelism management on safety-critical and HPC applications reliability. In: 2014 44th Annual IEEE\/IFIP International Conference on Dependable Systems and Networks. IEEE, pp 455\u2013466","DOI":"10.1109\/DSN.2014.49"},{"key":"5925_CR3","doi-asserted-by":"crossref","unstructured":"Gomez LB, Cappello F, Carro L, DeBardeleben N, Fang B, Gurumurthi S, Pattabiraman K, Rech P, Reorda MS (2014) Gpgpus: how to combine high computational power with high reliability. In: 2014 Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, pp 1\u20139","DOI":"10.7873\/DATE2014.354"},{"key":"5925_CR4","doi-asserted-by":"publisher","first-page":"49562","DOI":"10.1109\/ACCESS.2020.2979489","volume":"8","author":"JA Belloch","year":"2020","unstructured":"Belloch JA, Ramos G, Badia JM, Cobos M (2020) An efficient implementation of parallel parametric HRTF models for binaural sound synthesis in mobile multimedia. IEEE Access 8:49562\u201349573","journal-title":"IEEE Access"},{"issue":"5","key":"5925_CR5","doi-asserted-by":"publisher","first-page":"1614","DOI":"10.1109\/TCSI.2017.2761909","volume":"65","author":"JA Belloch","year":"2018","unstructured":"Belloch JA, Bad\u00eda JM, Igual FD, Gonzalez A, Quintana-Ort\u00ed ES (2018) Optimized fundamental signal processing operations for energy minimization on heterogeneous mobile devices. IEEE Trans Circuits Syst I Regul Pap 65(5):1614\u20131627","journal-title":"IEEE Trans Circuits Syst I Regul Pap"},{"issue":"2","key":"5925_CR6","doi-asserted-by":"publisher","first-page":"591","DOI":"10.1007\/s11831-019-09321-3","volume":"27","author":"KV Sakhare","year":"2020","unstructured":"Sakhare KV, Tewari T, Vyas V (2020) Review of vehicle detection systems in advanced driver assistant systems. Arch Comput Methods Eng 27(2):591\u2013610","journal-title":"Arch Comput Methods Eng"},{"key":"5925_CR7","doi-asserted-by":"publisher","first-page":"24411","DOI":"10.1109\/ACCESS.2018.2830661","volume":"6","author":"WG Hatcher","year":"2018","unstructured":"Hatcher WG, Yu W (2018) A survey of deep learning: platforms, applications and emerging research trends. IEEE Access 6:24411\u201324432","journal-title":"IEEE Access"},{"key":"5925_CR8","doi-asserted-by":"crossref","unstructured":"Furano G, Menicucci A (2018) Roadmap for on-board processing and data handling systems in space. In: Dependable Multicore Architectures at Nanoscale. Springer, pp 253\u2013281","DOI":"10.1007\/978-3-319-54422-9_10"},{"issue":"6","key":"5925_CR9","doi-asserted-by":"publisher","first-page":"46","DOI":"10.1109\/MM.2018.2873870","volume":"38","author":"S Alcaide","year":"2018","unstructured":"Alcaide S, Kosmidis L, Tabani H, Hernandez C, Abella J, Cazorla FJ (2018) Safety-related challenges and opportunities for GPUs in the automotive domain. IEEE Micro 38(6):46\u201355. https:\/\/doi.org\/10.1109\/MM.2018.2873870","journal-title":"IEEE Micro"},{"key":"5925_CR10","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1007\/978-3-642-36949-0_20","volume-title":"Euro-Par 2012: parallel processing workshops","author":"I Demeshko","year":"2013","unstructured":"Demeshko I, Maruyama N, Tomita H, Matsuoka S (2013) Multi-GPU implementation of the NICAM atmospheric model. In: Caragiannis I, Alexander M, Badia RM, Cannataro M, Costan A, Danelutto M, Desprez F, Krammer B, Sahuquillo J, Scott SL, Weidendorfer J (eds) Euro-Par 2012: parallel processing workshops. Springer, pp 175\u2013184"},{"key":"5925_CR11","doi-asserted-by":"publisher","first-page":"94719","DOI":"10.1109\/ACCESS.2020.2993103","volume":"8","author":"JM Bad\u00eda","year":"2020","unstructured":"Bad\u00eda JM, Amor-Martin A, Belloch JA, Garc\u00eda-Castillo LE (2020) GPU acceleration of a non-standard finite element mesh truncation technique for electromagnetics. IEEE Access 8:94719\u201394730","journal-title":"IEEE Access"},{"key":"5925_CR12","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1016\/j.jpdc.2021.07.006","volume":"157","author":"GN Barreales","year":"2021","unstructured":"Barreales GN, Novalbos M, Otaduy MA, Sanchez A (2021) Mdscale: Scalable multi-GPU bonded and short-range molecular dynamics. J Parall Distrib Comput 157:243\u2013255. https:\/\/doi.org\/10.1016\/j.jpdc.2021.07.006","journal-title":"J Parall Distrib Comput"},{"issue":"2","key":"5925_CR13","doi-asserted-by":"publisher","first-page":"663","DOI":"10.1109\/TR.2018.2878387","volume":"68","author":"FF dos Santos","year":"2019","unstructured":"dos Santos FF, Pimenta PF, Lunardi C, Draghetti L, Carro L, Kaeli D, Rech P (2019) Analyzing and increasing the reliability of convolutional neural networks on GPUs. IEEE Trans Reliab 68(2):663\u2013677","journal-title":"IEEE Trans Reliab"},{"issue":"5","key":"5925_CR14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3403956","volume":"53","author":"R Canal","year":"2020","unstructured":"Canal R, Hernandez C, Tornero R, Cilardo A, Massari G, Reghenzani F, Fornaciari W, Zapater M, Atienza D, Oleksiak A et al (2020) Predictive reliability and fault management in exascale systems: state of the art and perspectives. ACM Comput Surv (CSUR) 53(5):1\u201332","journal-title":"ACM Comput Surv (CSUR)"},{"key":"5925_CR15","doi-asserted-by":"crossref","unstructured":"Tiwari D, Gupta S, Rogers J, Maxwell D, Rech P, Vazhkudai S, Oliveira D, Londo D, DeBardeleben N, Navaux P, et al (2015) Understanding gpu errors on large-scale hpc systems and the implications for system design and operation. In: 2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA). IEEE, pp 331\u2013342","DOI":"10.1109\/HPCA.2015.7056044"},{"key":"5925_CR16","unstructured":"Sastry Hari SK, Rech P, Tsai T, Stephenson M, Zulfiqar A, Sullivan M, Shirvani P, Racunas P, Emer J, Keckler SW (2020) Estimating silent data corruption rates using a two-level model. arXiv e-prints, 2005"},{"key":"5925_CR17","doi-asserted-by":"crossref","unstructured":"dos Santos FF, Hari SKS, Basso PM, Carro L, Rech P (2021) Demystifying GPU reliability: comparing and combining beam experiments, fault simulation, and profiling. In: 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, pp 289\u2013298","DOI":"10.1109\/IPDPS49936.2021.00037"},{"key":"5925_CR18","doi-asserted-by":"publisher","first-page":"113856","DOI":"10.1016\/j.microrel.2020.113856","volume":"114","author":"G Le\u00f3n","year":"2020","unstructured":"Le\u00f3n G, Bad\u00eda JM, Belloch JA, Lindoso A, Entrena L (2020) Evaluating the soft error sensitivity of a GPU-based SoC for matrix multiplication. Microelectron Reliab 114:113856","journal-title":"Microelectron Reliab"},{"key":"5925_CR19","doi-asserted-by":"crossref","unstructured":"Che S, Boyer M, Meng J, Tarjan D, Sheaffer JW, Lee S-H, Skadron K (2009) Rodinia: a benchmark suite for heterogeneous computing. In: 2009 IEEE International Symposium on Workload Characterization (IISWC). IEEE, pp 44\u201354","DOI":"10.1109\/IISWC.2009.5306797"},{"issue":"1","key":"5925_CR20","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1109\/TR.2018.2793098","volume":"67","author":"M Didehban","year":"2018","unstructured":"Didehban M, Shrivastava A (2018) A compiler technique for processor-wide protection from soft errors in multithreaded environments. IEEE Trans Reliab 67(1):249\u2013263. https:\/\/doi.org\/10.1109\/TR.2018.2793098","journal-title":"IEEE Trans Reliab"},{"key":"5925_CR21","doi-asserted-by":"publisher","unstructured":"Bodmann P, Papadimitriou G, Gizopoulos D, Rech P (2021) The impact of SoC integration and OS deployment on the reliability of ARM processors. In: 2021 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp 223\u2013225. https:\/\/doi.org\/10.1109\/ISPASS51385.2021.00040. IEEE","DOI":"10.1109\/ISPASS51385.2021.00040"},{"issue":"1","key":"5925_CR22","doi-asserted-by":"publisher","first-page":"346","DOI":"10.1109\/TR.2019.2954384","volume":"70","author":"D Cotroneo","year":"2021","unstructured":"Cotroneo D, Iannillo AK, Natella R, Rosiello S (2021) Dependability assessment of the android OS through fault injection. IEEE Trans Reliab 70(1):346\u2013361. https:\/\/doi.org\/10.1109\/TR.2019.2954384","journal-title":"IEEE Trans Reliab"},{"issue":"3","key":"5925_CR23","doi-asserted-by":"publisher","first-page":"791","DOI":"10.1109\/TC.2015.2444855","volume":"65","author":"DAGG de Oliveira","year":"2016","unstructured":"de Oliveira DAGG, Pilla LL, Santini T, Rech P (2016) Evaluation and mitigation of radiation-induced soft errors in graphics processing units. IEEE Trans Comput 65(3):791\u2013804","journal-title":"IEEE Trans Comput"},{"key":"5925_CR24","doi-asserted-by":"publisher","first-page":"42674","DOI":"10.1109\/ACCESS.2020.2975832","volume":"8","author":"Y Zhu","year":"2020","unstructured":"Zhu Y, Liu Y, Zhang G (2020) FT-PBLAS: PBLAS-based fault-tolerant linear algebra computation on high-performance computing systems. IEEE Access 8:42674\u201342688. https:\/\/doi.org\/10.1109\/ACCESS.2020.2975832","journal-title":"IEEE Access"},{"issue":"4","key":"5925_CR25","doi-asserted-by":"publisher","first-page":"1874","DOI":"10.1109\/TNS.2014.2301768","volume":"61","author":"LL Pilla","year":"2014","unstructured":"Pilla LL, Rech P, Silvestri F, Frost C, Navaux POA, Reorda MS, Carro L (2014) Software-based hardening strategies for neutron sensitive FFT algorithms on GPUs. IEEE Trans Nucl Sci 61(4):1874\u20131880","journal-title":"IEEE Trans Nucl Sci"},{"key":"5925_CR26","doi-asserted-by":"publisher","unstructured":"Condia JER, dos Santos FF, Reorda MS, Rech P (2021) Combining architectural simulation and software fault injection for a fast and accurate CNNs reliability evaluation on GPUs. In: 2021 IEEE 39th VLSI Test Symposium (VTS). IEEE, pp 62\u201368. https:\/\/doi.org\/10.1109\/VTS50974.2021.9441044","DOI":"10.1109\/VTS50974.2021.9441044"},{"issue":"7","key":"5925_CR27","doi-asserted-by":"publisher","first-page":"1560","DOI":"10.1109\/TNS.2020.2977583","volume":"67","author":"PM Basso","year":"2020","unstructured":"Basso PM, Santos FFD, Rech P (2020) Impact of tensor cores and mixed precision on the reliability of matrix multiplication in GPUs. IEEE Trans Nucl Sci 67(7):1560\u20131565. https:\/\/doi.org\/10.1109\/TNS.2020.2977583","journal-title":"IEEE Trans Nucl Sci"},{"key":"5925_CR28","doi-asserted-by":"publisher","unstructured":"Condia JER, Rech P, dos Santos FF, Carrot L, Reorda MS (2021) Protecting GPU\u2019s Microarchitectural Vulnerabilities via Effective Selective Hardening. In: 2021 IEEE 27th International Symposium on On-Line Testing and Robust System Design (IOLTS), pp. 1\u20137. https:\/\/doi.org\/10.1109\/IOLTS52814.2021.9486703. IEEE","DOI":"10.1109\/IOLTS52814.2021.9486703"},{"key":"5925_CR29","doi-asserted-by":"publisher","unstructured":"Abdelfattah A, Haidar A, Tomov S, Dongarra J (2018) Optimizing GPU kernels for irregular batch workloads: a case study for cholesky factorization. In: 2018 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, pp 450\u2013456. https:\/\/doi.org\/10.1109\/HPEC.2018.8547576","DOI":"10.1109\/HPEC.2018.8547576"},{"issue":"1","key":"5925_CR30","first-page":"1051","volume":"75","author":"FJ Alventosa","year":"2018","unstructured":"Alventosa FJ, Alonso P, Vidal AM, Pi\u00f1ero G, Quintana-Orti ES (2018) Fast block QR update in digital signal processing. J Supercomput 75(1):1051\u20131064","journal-title":"J Supercomput"},{"issue":"10","key":"5925_CR31","doi-asserted-by":"publisher","first-page":"817","DOI":"10.17743\/jaes.2017.0029","volume":"65","author":"B Bank","year":"2017","unstructured":"Bank B, Belloch JA, V\u00e4lim\u00e4ki V (2017) Efficient design of a parallel graphic equalizer. J Audio Eng Soc 65(10):817\u2013825. https:\/\/doi.org\/10.17743\/jaes.2017.0029","journal-title":"J Audio Eng Soc"},{"issue":"6","key":"5925_CR32","doi-asserted-by":"publisher","first-page":"4975","DOI":"10.1109\/TPWRS.2017.2662322","volume":"32","author":"G Zhou","year":"2017","unstructured":"Zhou G, Bo R, Chien L, Zhang X, Shi F, Xu C, Feng Y (2017) GPU-based batch LU-factorization solver for concurrent analysis of massive power flows. IEEE Trans Power Syst 32(6):4975\u20134977. https:\/\/doi.org\/10.1109\/TPWRS.2017.2662322","journal-title":"IEEE Trans Power Syst"},{"key":"5925_CR33","doi-asserted-by":"crossref","unstructured":"Wu P, DeBardeleben N, Guan Q, Blanchard S, Chen J, Tao D, Liang X, Ouyang K, Chen Z (2017) Silent data corruption resilient two-sided matrix factorizations. In: Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp 415\u2013427","DOI":"10.1145\/3018743.3018750"},{"key":"5925_CR34","doi-asserted-by":"crossref","unstructured":"Wu P, Guan Q, DeBardeleben N, Blanchard S, Tao D, Liang X, Chen J, Chen Z (2016) Towards practical algorithm based fault tolerance in dense linear algebra. In: Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, pp 31\u201342","DOI":"10.1145\/2907294.2907315"},{"key":"5925_CR35","doi-asserted-by":"crossref","unstructured":"Chen J, Li H, Li S, Liang X, Wu P, Tao D, Ouyang K, Liu Y, Zhao K, Guan Q, et al (2018) Fault tolerant one-sided matrix decompositions on heterogeneous systems with GPUs. In: SC18: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE, pp 854\u2013865","DOI":"10.1109\/SC.2018.00071"},{"issue":"6","key":"5925_CR36","doi-asserted-by":"publisher","first-page":"518","DOI":"10.1109\/TC.1984.1676475","volume":"100","author":"K-H Huang","year":"1984","unstructured":"Huang K-H, Abraham JA (1984) Algorithm-based fault tolerance for matrix operations. IEEE Trans Comput 100(6):518\u2013528","journal-title":"IEEE Trans Comput"},{"key":"5925_CR37","doi-asserted-by":"crossref","unstructured":"Davies T, Chen Z (2013) Correcting soft errors online in LU factorization. In: Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing. ACM, pp 167\u2013178","DOI":"10.1145\/2462902.2462920"},{"key":"5925_CR38","doi-asserted-by":"crossref","unstructured":"Wu P, Chen Z (2014) FT-ScaLAPACK: correcting soft errors on-line for ScaLAPACK Cholesky, QR, and LU factorization routines. In: Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing. ACM, pp 49\u201360","DOI":"10.1145\/2600212.2600232"},{"key":"5925_CR39","doi-asserted-by":"crossref","unstructured":"Tselonis S, Gizopoulos D (2016) GUFI: A framework for GPUs reliability assessment. In: 2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). IEEE, pp 90\u2013100","DOI":"10.1109\/ISPASS.2016.7482077"},{"key":"5925_CR40","doi-asserted-by":"crossref","unstructured":"Hari SKS, Tsai T, Stephenson M, Keckler SW, Emer J (2017) SASSIFI: an architecture-level fault injection tool for GPU application resilience evaluation. In: 2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). IEEE, pp 249\u2013258","DOI":"10.1109\/ISPASS.2017.7975296"},{"key":"5925_CR41","doi-asserted-by":"crossref","unstructured":"Previlon FG, Kalra C, Tiwari D, Kaeli DR (2019) PCFI: Program counter guided fault injection for accelerating gpu reliability assessment. In: 2019 Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, pp 308\u2013311","DOI":"10.23919\/DATE.2019.8714781"},{"key":"5925_CR42","doi-asserted-by":"crossref","unstructured":"Fickenscher J, Reinhart S, Hannig F, Teich J, Bouzouraa ME (2017) Convoy tracking for ADAS on embedded GPUs. In: 2017 IEEE Intelligent Vehicles Symposium (IV). IEEE, pp 959\u2013965","DOI":"10.1109\/IVS.2017.7995839"},{"key":"5925_CR43","doi-asserted-by":"publisher","first-page":"103143","DOI":"10.1016\/j.micpro.2020.103143","volume":"77","author":"L Kosmidis","year":"2020","unstructured":"Kosmidis L, Rodriguez I, Jover \u00c1, Alcaide S, Lachaize J, Abella J, Notebaert O, Cazorla FJ, Steenari D (2020) GPU4S: embedded GPUs in space-latest project updates. Microprocess Microsyst 77:103143","journal-title":"Microprocess Microsyst"},{"issue":"7","key":"5925_CR44","doi-asserted-by":"publisher","first-page":"1467","DOI":"10.1109\/TNS.2022.3155820","volume":"69","author":"JM Badia","year":"2022","unstructured":"Badia JM, Leon G, Belloch JA, Lindoso A, Garcia-Valderas M, Morilla Y, Entrena L (2022) Reliability evaluation of LU decomposition on GPU-accelerated system-on-chip under proton irradiation. IEEE Trans Nucl Sci 69(7):1467\u20131474","journal-title":"IEEE Trans Nucl Sci"},{"key":"5925_CR45","unstructured":"NVIDIA (2021) CUDA C++ programming guide. PG-02829-001_v11.2. Design Guide"},{"key":"5925_CR46","unstructured":"NVIDIA (2014) NVIDIA Tegra K1. A new era in mobile computing. NVIDIA Whitepaper"},{"key":"5925_CR47","unstructured":"NVIDIA (2017) NVIDIA Tesla V100 GPU Architecture. The World\u2019s Most Advanced Data Center GPU. WP-08608-001_v1.1. NVIDIA,. NVIDIA"},{"issue":"4","key":"5925_CR48","doi-asserted-by":"publisher","first-page":"1642","DOI":"10.1109\/TNS.2015.2450997","volume":"62","author":"J Noh","year":"2015","unstructured":"Noh J, Correas V, Lee S, Jeon J, Nofal I, Cerba J, Belhaddad H, Alexandrescu D, Lee Y, Kwon S (2015) Study of neutron soft error rate (SER) sensitivity: investigation of upset mechanisms by comparative simulation of FinFET and planar MOSFET SRAMs. IEEE Trans Nucl Sci 62(4):1642\u20131649","journal-title":"IEEE Trans Nucl Sci"},{"key":"5925_CR49","doi-asserted-by":"crossref","unstructured":"Li G, Pattabiraman K, Cher C-Y, Bose P (2016) Understanding error propagation in GPGPU applications. In: SC\u201916: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE, pp 240\u2013251","DOI":"10.1109\/SC.2016.20"},{"key":"5925_CR50","doi-asserted-by":"crossref","unstructured":"Oliveira D, Pilla L, DeBardeleben N, Blanchard S, Quinn H, Koren I, Navaux P, Rech P (2017) Experimental and analytical study of xeon phi reliability. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp 1\u201312","DOI":"10.1145\/3126908.3126960"},{"key":"5925_CR51","doi-asserted-by":"crossref","unstructured":"Oliveira D, Frattin V, Navaux P, Koren I, Rech P (2017) Carol-fi: an efficient fault-injection tool for vulnerability evaluation of modern hpc parallel accelerators. In: Proceedings of the Computing Frontiers Conference, pp 295\u2013298","DOI":"10.1145\/3075564.3075598"},{"key":"5925_CR52","unstructured":"NVIDIA (2021) CUDA-GDB. CUDA Debugger. User Manual. DU-05227-042_v11.2"},{"key":"5925_CR53","unstructured":"NVIDIA (2023) Profiler User\u2019s Guide. https:\/\/docs.nvidia.com\/cuda\/profiler-users-guide, v12.1"},{"issue":"5\u20136","key":"5925_CR54","doi-asserted-by":"publisher","first-page":"232","DOI":"10.1016\/j.parco.2009.12.005","volume":"36","author":"S Tomov","year":"2010","unstructured":"Tomov S, Dongarra J, Baboulin M (2010) Towards dense linear algebra for hybrid GPU accelerated manycore systems. Parallel Comput 36(5\u20136):232\u2013240. https:\/\/doi.org\/10.1016\/j.parco.2009.12.005","journal-title":"Parallel Comput"}],"container-title":["The Journal of Supercomputing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-024-05925-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11227-024-05925-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-024-05925-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,4]],"date-time":"2024-06-04T06:47:11Z","timestamp":1717483631000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11227-024-05925-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,22]]},"references-count":54,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2024,6]]}},"alternative-id":["5925"],"URL":"https:\/\/doi.org\/10.1007\/s11227-024-05925-0","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-3372503\/v1","asserted-by":"object"}]},"ISSN":["0920-8542","1573-0484"],"issn-type":[{"value":"0920-8542","type":"print"},{"value":"1573-0484","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,22]]},"assertion":[{"value":"21 January 2024","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 February 2024","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}