{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T22:55:22Z","timestamp":1777676122363,"version":"3.51.4"},"reference-count":80,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2025,5,15]],"date-time":"2025-05-15T00:00:00Z","timestamp":1747267200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"funder":[{"DOI":"10.13039\/100006192","name":"Advanced Scientific Computing Research","doi-asserted-by":"publisher","award":["DE-SC0024387"],"award-info":[{"award-number":["DE-SC0024387"]}],"id":[{"id":"10.13039\/100006192","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2025,9]]},"abstract":"<jats:p>Modern scientific workflows process massive amounts of data from diverse instruments and sensors, leveraging geographically distributed, heterogeneous compute and storage resources\u2014from leadership-class systems to edge devices\u2014connected by high-performance networks. The diversity of resources introduces challenges in harnessing their full potential, with resilience issues arising across applications, system software, networks, storage, and hardware. Today, workflow management systems (WMS) coordinate the execution of computation and data management tasks across target resources. However, WMS\u2019s centralized nature makes them vulnerable to faults and scalability issues that may result in failures of entire computational campaigns. This paper introduces a novel agentic framework for workflow management, fully distributing and decentralizing the WMS functions and modeling them as swarm intelligence agents infused with advanced artificial intelligence solutions and traditional distributed computing algorithms that can make coordinated decisions in the presence of failures of the underlying cyberinfrastructure.<\/jats:p>","DOI":"10.1177\/10943420251339317","type":"journal-article","created":{"date-parts":[[2025,5,16]],"date-time":"2025-05-16T01:07:32Z","timestamp":1747357652000},"page":"692-712","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":3,"title":["SWARM: Reimagining scientific workflow management systems in a distributed world"],"prefix":"10.1177","volume":"39","author":[{"given":"Prasanna","family":"Balaprakash","sequence":"first","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Krishnan","family":"Raghavan","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7890-3934","authenticated-orcid":false,"given":"Franck","family":"Cappello","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5106-503X","authenticated-orcid":false,"given":"Ewa","family":"Deelman","sequence":"additional","affiliation":[{"name":"Information Sciences Institute, University of Southern California, Marina del Rey, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anirban","family":"Mandal","sequence":"additional","affiliation":[{"name":"Renaissance Computing Institute, University of North Carolina Chapel Hill, Chapel Hill, NC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2851-595X","authenticated-orcid":false,"given":"Hongwei","family":"Jin","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Imtiaz","family":"Mahmud","sequence":"additional","affiliation":[{"name":"Lawrence Berkeley National Laboratory, Berkeley, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Komal","family":"Thareja","sequence":"additional","affiliation":[{"name":"Renaissance Computing Institute, University of North Carolina Chapel Hill, Chapel Hill, NC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shixun","family":"Wu","sequence":"additional","affiliation":[{"name":"University of California Riverside, Riverside, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pawel","family":"Zuk","sequence":"additional","affiliation":[{"name":"Information Sciences Institute, University of Southern California, Marina del Rey, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mariam","family":"Kiran","sequence":"additional","affiliation":[{"name":"Oak Ridge National Laboratory, Oak Ridge, TN, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zizhong","family":"Chen","sequence":"additional","affiliation":[{"name":"University of California Riverside, Riverside, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7339-5256","authenticated-orcid":false,"given":"Sheng","family":"Di","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kesheng","family":"Wu","sequence":"additional","affiliation":[{"name":"Lawrence Berkeley National Laboratory, Berkeley, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2025,5,15]]},"reference":[{"key":"e_1_3_3_2_1","doi-asserted-by":"crossref","unstructured":"Adriaens F Gionis A (2022) Diameter minimization by shortcutting with degree constraints. In: 2022 IEEE International Conference on Data Mining (ICDM) Orlando FL USA 28 November\u201301 December 2022 IEEE pp. 843\u2013848.","DOI":"10.1109\/ICDM54844.2022.00095"},{"key":"e_1_3_3_3_1","doi-asserted-by":"crossref","unstructured":"Alsaadi A Ward L Merzky A et al. (2022) RADICAL-pilot and parsl: executing heterogeneous workflows on HPC platforms. In: 2022 IEEE\/ACM Workshop on Workflows in Support of Large-Scale Science (WORKS) Dallas TX USA 13\u201318 November 2022 IEEE pp. 27\u201334.","DOI":"10.1109\/WORKS56498.2022.00009"},{"key":"e_1_3_3_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-021-10097-x"},{"key":"e_1_3_3_5_1","doi-asserted-by":"publisher","DOI":"10.1177\/1550147720913233"},{"key":"e_1_3_3_6_1","unstructured":"AuYoung A Chun BN Snoeren AC et al. (2004) Resource allocation in federated distributed computing infrastructures. In: Proceedings of the First Workshop on Operating System and Architectural Support for the on demand IT InfraStructure San Francisco CA December 6-8 2004."},{"key":"e_1_3_3_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1317379.1317382"},{"key":"e_1_3_3_8_1","doi-asserted-by":"crossref","unstructured":"Bland W Lu H Seo S et al. (2015) Lessons learned implementing user-level failure mitigation in mpich. In: 2015 15th IEEE\/ACM international symposium on cluster cloud and grid computing Shenzhen China 04\u201307 May 2015 IEEE pp. 1123\u20131126.","DOI":"10.1109\/CCGrid.2015.51"},{"key":"e_1_3_3_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0377-2217(99)00486-5"},{"key":"e_1_3_3_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2020.101213"},{"key":"e_1_3_3_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11721-012-0075-2"},{"key":"e_1_3_3_12_1","volume-title":"Scheduling Algorithms","author":"Brucker P","year":"2007","unstructured":"Brucker P (2007) Scheduling Algorithms. Springer Berlin, Heildelberg."},{"issue":"1","key":"e_1_3_3_13_1","first-page":"5","article-title":"Toward exascale resilience: 2014 update","volume":"1","author":"Cappello F","year":"2014","unstructured":"Cappello F, Al G, Gropp W, et al. (2014) Toward exascale resilience: 2014 update. Supercomputing Frontiers and Innovations: An International Journal 1(1): 5\u201328.","journal-title":"Supercomputing Frontiers and Innovations: An International Journal"},{"key":"e_1_3_3_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/571637.571640"},{"key":"e_1_3_3_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCC.2016.2594174"},{"key":"e_1_3_3_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TII.2020.3028963"},{"key":"e_1_3_3_17_1","doi-asserted-by":"crossref","unstructured":"Clifford J Garfield K Towhidnejad M et al. (2017) Multi-layer model of swarm intelligence for resilient autonomous systems. In: 2017 IEEE\/AIAA 36th Digital Avionics Systems Conference (DASC) St. Petersburg FL USA 17\u201321 September 2017 IEEE pp. 1\u20134.","DOI":"10.1109\/DASC.2017.8102147"},{"key":"e_1_3_3_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.automatica.2007.07.022"},{"key":"e_1_3_3_19_1","first-page":"6351","article-title":"Learning combinatorial optimization algorithms over graphs","volume":"30","author":"Dai H","year":"2017","unstructured":"Dai H, Khalil E, Zhang Y, et al. (2017) Learning combinatorial optimization algorithms over graphs. Advances in Neural Information Processing Systems 30: 6351\u20136361.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_3_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2014.10.008"},{"key":"e_1_3_3_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2019.2919690"},{"key":"e_1_3_3_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2020.101200"},{"issue":"1","key":"e_1_3_3_23_1","first-page":"5","article-title":"A white paper on the benefits of chipkill-correct ecc for pc server main memory","volume":"11","author":"Dell TJ","year":"1997","unstructured":"Dell TJ (1997) A white paper on the benefits of chipkill-correct ecc for pc server main memory. IBM Microelectronics division 11(1-23): 5\u20137.","journal-title":"IBM Microelectronics division"},{"key":"e_1_3_3_24_1","unstructured":"DIS (2024) Distri. https:\/\/swarm-workflows.org\/DISTRI\/"},{"key":"e_1_3_3_25_1","doi-asserted-by":"publisher","DOI":"10.1080\/23307706.2024.2388551"},{"key":"e_1_3_3_26_1","doi-asserted-by":"crossref","unstructured":"Floyd S Henderson T Gurtov A (2004) Rfc3782: the newreno modification to tcp\u2019s fast recovery algorithm.","DOI":"10.17487\/rfc3782"},{"key":"e_1_3_3_27_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1015617019423"},{"key":"e_1_3_3_28_1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkac247"},{"key":"e_1_3_3_29_1","doi-asserted-by":"publisher","DOI":"10.1287\/moor.1.2.117"},{"key":"e_1_3_3_30_1","unstructured":"Google (2024) OR-Tools. https:\/\/developers.google.com\/optimization\/"},{"key":"e_1_3_3_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3210284.3210297"},{"key":"e_1_3_3_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1400097.1400105"},{"key":"e_1_3_3_33_1","volume-title":"Data Pipelines with Apache Airflow","author":"Harenslak BP","year":"2021","unstructured":"Harenslak BP, de Ruiter J (2021) Data Pipelines with Apache Airflow. NY: Manning, Shelter Island."},{"key":"e_1_3_3_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csa.2024.100065"},{"key":"e_1_3_3_35_1","doi-asserted-by":"publisher","DOI":"10.1063\/5.0112122"},{"key":"e_1_3_3_36_1","doi-asserted-by":"crossref","unstructured":"Jin H Raghavan K Papadimitriou G et al. (2022) Workflow anomaly detection with graph neural networks. In: 2022 IEEE\/ACM Workshop on Workflows in Support of Large-Scale Science (WORKS). Dallas TX USA 13\u201318 November 2022 pp. 35\u201342.","DOI":"10.1109\/WORKS56498.2022.00010"},{"key":"e_1_3_3_37_1","doi-asserted-by":"crossref","unstructured":"Jin H Papadimitriou G Raghavan K et al. (2024) Large language models for anomaly detection in computational workflows: from supervised fine-tuning to in-context learning. In: SC24: International Conference for High Performance Computing Networking Storage and Analysis Atlanta GA USA 17\u201322 November 2024 IEEE pp. 1\u201317.","DOI":"10.1109\/SC41406.2024.00098"},{"key":"e_1_3_3_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2023.3334396"},{"key":"e_1_3_3_39_1","doi-asserted-by":"crossref","unstructured":"Johnston J Liu XY Wu S et al. (2023b) Downlink beamforming optimization via deep learning. In: 2023 59th Annual Allerton Conference on Communication Control and Computing (Allerton) Monticello IL USA 26\u201329 September 2023 IEEE pp. 1\u20135.","DOI":"10.1109\/Allerton58177.2023.10313512"},{"key":"e_1_3_3_40_1","doi-asserted-by":"publisher","DOI":"10.3389\/frobt.2016.00036"},{"key":"e_1_3_3_41_1","first-page":"187","volume-title":"Swarm intelligence. In: Handbook of Nature-Inspired and Innovative Computing: Integrating Classical Models with Emerging Technologies","author":"Kennedy J","year":"2006","unstructured":"Kennedy J (2006) Swarm intelligence. In: Handbook of Nature-Inspired and Innovative Computing: Integrating Classical Models with Emerging Technologies. New York, NY: Springer, 187\u2013219."},{"key":"e_1_3_3_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ejc.2011.09.030"},{"key":"e_1_3_3_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/1773912.1773922"},{"key":"e_1_3_3_44_1","volume-title":"Proceedings of PFLDnet, Volume 2004","author":"Leith D","year":"2004","unstructured":"Leith D, Shorten R (2004) H-TCP: TCP for high-speed and long-distance networks. Proceedings of PFLDnet, Volume 2004. Argonne, IL: Citeseer."},{"key":"e_1_3_3_45_1","doi-asserted-by":"crossref","unstructured":"Liu N Li Z Xu Z et al. (2017) A hierarchical framework of cloud resource allocation and power management using deep reinforcement learning. In: 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS) Atlanta GA USA 5\u20138 June 2017 pp. 372\u2013382.","DOI":"10.1109\/ICDCS.2017.123"},{"key":"e_1_3_3_46_1","unstructured":"Liu XY Li Z Wu S et al. (2023) Stationary deep reinforcement learning with quantum k-spin Hamiltonian regularization. In: ICLR 2023 Workshop on Physics for Machine Learning Kihali Rwanda May 1-5 2023."},{"key":"e_1_3_3_47_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2020.01.026"},{"key":"e_1_3_3_48_1","doi-asserted-by":"crossref","unstructured":"Mahmud I Zuk P Wang C et al. (2024) DISTRI: development and integration of simulation tools for resilient infrastructure. In: The 5th International Workshop on Big Data & AI Tools Models and Use Cases for Innovative Scientific Discovery (BTSD) Washington DC USA 15\u201318 December 2024.","DOI":"10.1109\/BigData62323.2024.10825783"},{"key":"e_1_3_3_49_1","doi-asserted-by":"crossref","unstructured":"Mao Y Deb S Venkatakrishnan SB et al. (2020) Perigee: efficient peer-to-peer network design for blockchains. In: Proceedings of the 39th Symposium on Principles of Distributed Computing Virtual August 3-7 2020 pp. 428\u2013437.","DOI":"10.1145\/3382734.3405704"},{"key":"e_1_3_3_50_1","doi-asserted-by":"crossref","unstructured":"Mitchell R Pottier L Jacobs S et al. (2019) Exploration of workflow management systems emerging features from users perspectives. In: 2019 IEEE International Conference on Big Data (Big Data) Los Angeles CA USA 09\u201312 December 2019 IEEE pp. 4537\u20134544.","DOI":"10.1109\/BigData47090.2019.9005494"},{"key":"e_1_3_3_51_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bty699"},{"key":"e_1_3_3_52_1","unstructured":"OpenAI (2025) Openai o3-mini. https:\/\/openai.com\/index\/openai-o3-mini\/"},{"key":"e_1_3_3_53_1","doi-asserted-by":"crossref","unstructured":"Park K Lee DH Woo Y et al. (2009) Reliability and performance enhancement technique for ssd array storage system using raid mechanism. In: 2009 9th International Symposium on Communications and Information Technology Icheon Korea (South) 28\u201330 September 2009 IEEE pp. 140\u2013145.","DOI":"10.1109\/ISCIT.2009.5341269"},{"key":"e_1_3_3_54_1","doi-asserted-by":"crossref","unstructured":"Patterson DA Gibson G Katz RH (1988) A case for redundant arrays of inexpensive disks (raid). In: Proceedings of the 1988 ACM SIGMOD International Conference on Management of Data Chicago IL June 1109-3116 1988 pp. \u2013.","DOI":"10.1145\/50202.50214"},{"key":"e_1_3_3_55_1","doi-asserted-by":"publisher","DOI":"10.3390\/drones6110340"},{"key":"e_1_3_3_56_1","doi-asserted-by":"publisher","DOI":"10.1016\/B978-0-12-805467-3.00015-6"},{"key":"e_1_3_3_57_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cie.2020.106649"},{"key":"e_1_3_3_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2961106"},{"key":"e_1_3_3_59_1","unstructured":"SimPy (2024) SimPy. https:\/\/simpy.readthedocs.io\/en\/latest\/"},{"key":"e_1_3_3_60_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2020.05.009"},{"key":"e_1_3_3_61_1","doi-asserted-by":"crossref","unstructured":"Sridharan V Liberty D (2012) A study of dram failures in the field. In: SC\u201912: Proceedings of the International Conference on High Performance Computing Networking Storage and Analysis Salt Lake City UT USA 10\u201316 November 2012 IEEE pp. 1\u201311.","DOI":"10.1109\/SC.2012.13"},{"key":"e_1_3_3_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/964723.383071"},{"key":"e_1_3_3_63_1","unstructured":"Suresh L Malkhi D Gopalan P et al. (2018) Stable and consistent membership at scale with rapid. In: 2018 USENIX Annual Technical Conference (USENIX ATC 18) Boston MA July 11-13 2018."},{"key":"e_1_3_3_64_1","doi-asserted-by":"crossref","unstructured":"Svirin P De K Forti A et al. (2019) BigPanDA: panda workload management system and its applications beyond ATLAS. EPJ Web of Conferences Volume 214. EDP Sciences Les Ulis France. pp. 03050.","DOI":"10.1051\/epjconf\/201921403050"},{"key":"e_1_3_3_65_1","doi-asserted-by":"publisher","DOI":"10.1016\/0377-2217(93)90182-M"},{"key":"e_1_3_3_66_1","doi-asserted-by":"crossref","unstructured":"Taylor IJ Deelman E Gannon DB et al. (2007) Workflows for E-Science: Scientific Workflows for Grids Volume 1. Springer London UK.","DOI":"10.1007\/978-1-84628-757-2"},{"key":"e_1_3_3_67_1","volume-title":"A greedy consensus-based approach to distributed job selection: toward fully-decentralized workload management system","author":"Thareja K","year":"2025","unstructured":"Thareja K, Raghavan K, Mandal A, et al. (2025) A greedy consensus-based approach to distributed job selection: toward fully-decentralized workload management system. In: Proceedings of the 25th IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid\u201925), Tromso, Norway, May 19-22, 2025, IEEE."},{"key":"e_1_3_3_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/71.993206"},{"key":"e_1_3_3_69_1","doi-asserted-by":"crossref","unstructured":"Tyler N Knop R Bard D et al. (2022) Cross-facility workflows: case studies with active experiments. In: 2022 IEEE\/ACM Workshop on Workflows in Support of Large-Scale Science (WORKS) Dallas TX USA 13\u201318 November 2022 IEEE pp. 68\u201375.","DOI":"10.1109\/WORKS56498.2022.00014"},{"key":"e_1_3_3_70_1","doi-asserted-by":"crossref","unstructured":"Van Beek V Oikonomou G Iosup A (2019) Portfolio scheduling for managing operational and disaster-recovery risks in virtualized datacenters hosting business-critical workloads. In: 2019 18th International Symposium on Parallel and Distributed Computing (ISPDC) Amsterdam Netherlands 03\u201307 June 2019 IEEE pp. 94\u2013102.","DOI":"10.1109\/ISPDC.2019.00022"},{"key":"e_1_3_3_71_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00607-020-00861-2"},{"key":"e_1_3_3_72_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41592-021-01254-9"},{"key":"e_1_3_3_73_1","doi-asserted-by":"crossref","unstructured":"Wu S Zhai Y Huang J et al. (2023a) FT-GEMM: a fault tolerant high performance GEMM implementation on x86 CPUs. In: Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing Orlando FL June 20-23 2023 pp. 323\u2013324.","DOI":"10.1145\/3588195.3595947"},{"key":"e_1_3_3_74_1","doi-asserted-by":"crossref","unstructured":"Wu S Ding Y Zhai Y et al. (2024a) Ft k-means: a high-performance k-means on gpu with fault tolerance. In: 2024 IEEE International Conference on Cluster Computing (CLUSTER) Kobe Japan 24-27 September 2024 IEEE pp. 322\u2013334.","DOI":"10.1109\/CLUSTER59578.2024.00035"},{"key":"e_1_3_3_75_1","unstructured":"Wu S Raghavan K Di S et al. (2024b) Dgro: diameter-guided ring optimization for integrated research infrastructure membership. https:\/\/arxiv.org\/abs\/2410.11142"},{"key":"e_1_3_3_76_1","doi-asserted-by":"crossref","unstructured":"Wu S Zhai Y Liu J et al. (2023b) Anatomy of high-performance gemm with online fault tolerance on gpus. In: Proceedings of the 37th International Conference on Supercomputing Orlando FL June 21-23 2023 pp. 360\u2013372.","DOI":"10.1145\/3577193.3593715"},{"key":"e_1_3_3_77_1","article-title":"Turbofft: Co-designed high-performance and fault-tolerant fast fourier transform on gpus","author":"Wu S","year":"2024","unstructured":"Wu S, Zhai Y, Liu J, et al. (2024c) Turbofft: Co-designed high-performance and fault-tolerant fast fourier transform on gpus. arXiv preprint arXiv:2412.05824.","journal-title":"arXiv preprint arXiv:2412.05824"},{"key":"e_1_3_3_78_1","article-title":"Turbofft: a high-performance fast fourier transform with fault tolerance on gpu","author":"Wu S","year":"2024","unstructured":"Wu S, Zhai Y, Liu J, et al. (2024d) Turbofft: a high-performance fast fourier transform with fault tolerance on gpu. arXiv preprint arXiv:2405.02520.","journal-title":"arXiv preprint arXiv:2405.02520"},{"key":"e_1_3_3_79_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2019.03.005"},{"key":"e_1_3_3_80_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1006856"},{"key":"e_1_3_3_81_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.hcc.2024.100202"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420251339317","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/10943420251339317","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420251339317","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T08:17:44Z","timestamp":1777450664000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/10943420251339317"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5,15]]},"references-count":80,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2025,9]]}},"alternative-id":["10.1177\/10943420251339317"],"URL":"https:\/\/doi.org\/10.1177\/10943420251339317","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"value":"1094-3420","type":"print"},{"value":"1741-2846","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,5,15]]}}}