{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T07:55:12Z","timestamp":1768031712833,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":29,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,6,21]],"date-time":"2023-06-21T00:00:00Z","timestamp":1687305600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,6,21]]},"DOI":"10.1145\/3577193.3593720","type":"proceedings-article","created":{"date-parts":[[2023,6,20]],"date-time":"2023-06-20T18:47:05Z","timestamp":1687286825000},"page":"477-487","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Enabling Reconfigurable HPC through MPI-based Inter-FPGA Communication"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0006-1918-397X","authenticated-orcid":false,"given":"Nicholas","family":"Contini","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering \/ Network-Based Computing Laboratory, The Ohio State University, Columbus, Ohio, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6430-8587","authenticated-orcid":false,"given":"Bharath","family":"Ramesh","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering \/ Network-Based Computing Laboratory, The Ohio State University, Columbus, Ohio, United States of America"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3705-2387","authenticated-orcid":false,"given":"Kaushik","family":"Kandadi Suresh","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering \/ Network-Based Computing Laboratory, The Ohio State University, Columbus, Ohio, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0040-8404","authenticated-orcid":false,"given":"Tu","family":"Tran","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering \/ Network-Based Computing Laboratory, The Ohio State University, Columbus, Ohio, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7450-5787","authenticated-orcid":false,"given":"Ben","family":"Michalowicz","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering \/ Network-Based Computing Laboratory, The Ohio State University, Columbus, Ohio, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1280-130X","authenticated-orcid":false,"given":"Mustafa","family":"Abduljabbar","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering \/ Network-Based Computing Laboratory, The Ohio State University, Columbus, Ohio, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1200-2754","authenticated-orcid":false,"given":"Hari","family":"Subramoni","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering \/ Network-Based Computing Laboratory, The Ohio State University, Columbus, Ohio, United States of America"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0356-1781","authenticated-orcid":false,"given":"Dhabaleswar","family":"Panda","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering \/ Network-Based Computing Laboratory, The Ohio State University, Columbus, Ohio, United States of America"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,6,21]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"18","volume-title":"Panda","author":"Bayatpour M.","year":"2021","unstructured":"Bayatpour , M. , Sarkauskas , N. , Subramoni , H. , Maqbool Hashmi , J. , Panda , D.K. : Bluesmpi: Efficient mpi non-blocking alltoall offloading designs on modern blue-field smart nics. In : Chamberlain, B.L., Varbanescu, A.L., Ltaief, H., Luszczek, P. (eds.) High Performance Computing. pp. 18 -- 37 . Springer International Publishing , Cham ( 2021 ) Bayatpour, M., Sarkauskas, N., Subramoni, H., Maqbool Hashmi, J., Panda, D.K.: Bluesmpi: Efficient mpi non-blocking alltoall offloading designs on modern blue-field smart nics. In: Chamberlain, B.L., Varbanescu, A.L., Ltaief, H., Luszczek, P. (eds.) High Performance Computing. pp. 18--37. Springer International Publishing, Cham (2021)"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3289602.3293929"},{"key":"e_1_3_2_1_3_1","volume-title":"IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing, H2RC@ SC 2022","author":"Christgau S.","year":"2022","unstructured":"Christgau , S. , Knaust , M. , Steinke , T. : A first step towards support for mpi partitioned communication on sycl-programmed fpgas . In: IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing, H2RC@ SC 2022 , Dallas, TX, USA , November , 2022 (2022) Christgau, S., Knaust, M., Steinke, T.: A first step towards support for mpi partitioned communication on sycl-programmed fpgas. In: IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing, H2RC@ SC 2022, Dallas, TX, USA, November, 2022 (2022)"},{"key":"e_1_3_2_1_4_1","first-page":"78","volume-title":"Ezzatti","author":"Favaro F.","year":"2022","unstructured":"Favaro , F. , Dufrechou , E. , Oliver , J.P. , Ezzatti , P. : Time-power-energy balance of blas kernels in modern fpgas. In : Navaux, P., Barrios H., C.J., Osthoff, C., Guerrero, G. (eds.) High Performance Computing. pp. 78 -- 89 . Springer International Publishing , Cham ( 2022 ) Favaro, F., Dufrechou, E., Oliver, J.P., Ezzatti, P.: Time-power-energy balance of blas kernels in modern fpgas. In: Navaux, P., Barrios H., C.J., Osthoff, C., Guerrero, G. (eds.) High Performance Computing. pp. 78--89. Springer International Publishing, Cham (2022)"},{"key":"e_1_3_2_1_5_1","volume-title":"Acceleration of an autoencoder using a fpga-soc in a high-performance node of a distributed onboard computer","author":"Freitag T.","year":"2022","unstructured":"Freitag , T. : Acceleration of an autoencoder using a fpga-soc in a high-performance node of a distributed onboard computer ( 2022 ), https:\/\/publica.fraunhofer.de\/handle\/publica\/430107 Freitag, T.: Acceleration of an autoencoder using a fpga-soc in a high-performance node of a distributed onboard computer (2022), https:\/\/publica.fraunhofer.de\/handle\/publica\/430107"},{"key":"e_1_3_2_1_6_1","unstructured":"Graph 500 green list (november 2022). https:\/\/graph500.org\/?page_id$=$1128 (2022) Graph 500 green list (november 2022). https:\/\/graph500.org\/?page_id$=$1128 (2022)"},{"issue":"6","key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","first-page":"e6769","DOI":"10.1002\/cpe.6769","volume":"34","author":"Haghi P.","year":"2022","unstructured":"Haghi , P. , Guo , A. , Xiong , Q. , Yang , C. , Geng , T. , Broaddus , J.T. , Marshall , R. , Schafer , D. , Skjellum , A. , Herbordt , M.C. : Reconfigurable switches for high performance and flexible mpi collectives. Concurrency and Computation: Practice and Experience 34 ( 6 ), e6769 ( 2022 ) Haghi, P., Guo, A., Xiong, Q., Yang, C., Geng, T., Broaddus, J.T., Marshall, R., Schafer, D., Skjellum, A., Herbordt, M.C.: Reconfigurable switches for high performance and flexible mpi collectives. Concurrency and Computation: Practice and Experience 34(6), e6769 (2022)","journal-title":"Experience"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","first-page":"828","DOI":"10.1109\/IPDPS53621.2022.00085","volume-title":"2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS).","author":"de Haro J.M.","year":"2022","unstructured":"de Haro , J.M. , Cano , R. , Alvarez , C. , Jim\u00e9nez-Gonz\u00e1lez , D. , Martorell , X. , Ayguad\u00e9 , E. , Labarta , J. , Abel , F. , Ringlein , B. , Weiss , B. : Ompss@ cloudfpga: An fpga task-based programming model with message passing . In: 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS). pp. 828 -- 838 . IEEE ( 2022 ) de Haro, J.M., Cano, R., Alvarez, C., Jim\u00e9nez-Gonz\u00e1lez, D., Martorell, X., Ayguad\u00e9, E., Labarta, J., Abel, F., Ringlein, B., Weiss, B.: Ompss@ cloudfpga: An fpga task-based programming model with message passing. In: 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS). pp. 828--838. IEEE (2022)"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1109\/H2RC54759.2021.00009","volume-title":"2021 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC).","author":"He Z.","year":"2021","unstructured":"He , Z. , Parravicini , D. , Petrica , L. , O'Brien , K. , Alonso , G. , Blott , M. : Accl: Fpgaa-ccelerated collectives over 100 gbps tcp-ip . In: 2021 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). pp. 33 -- 43 . IEEE ( 2021 ) He, Z., Parravicini, D., Petrica, L., O'Brien, K., Alonso, G., Blott, M.: Accl: Fpgaa-ccelerated collectives over 100 gbps tcp-ip. In: 2021 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). pp. 33--43. IEEE (2021)"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3079856.3080246"},{"key":"e_1_3_2_1_11_1","unstructured":"Laboratory N.B.C.: Mvapich: Mpi over infiniband 10gige\/iwarp and roce. http:\/\/mvapich.cse.ohio-state.edu\/ Laboratory N.B.C.: Mvapich: Mpi over infiniband 10gige\/iwarp and roce. http:\/\/mvapich.cse.ohio-state.edu\/"},{"key":"e_1_3_2_1_12_1","unstructured":"Laboratory N.B.C.: OSU Micro-Benchmarks. http:\/\/mvapich.cse.ohio-state.edu\/benchmarks\/ [Online; accessed May 15 2023] Laboratory N.B.C.: OSU Micro-Benchmarks. http:\/\/mvapich.cse.ohio-state.edu\/benchmarks\/ [Online; accessed May 15 2023]"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2019.2950655"},{"key":"e_1_3_2_1_14_1","first-page":"16","volume-title":"Prasanna","author":"Lin Y.C.","year":"2022","unstructured":"Lin , Y.C. , Zhang , B. , Prasanna , V. : Accelerating gnn training on cpu+multi-fpga heterogeneous platform. In : Navaux, P., Barrios H., C.J., Osthoff, C., Guerrero, G. (eds.) High Performance Computing. pp. 16 -- 30 . Springer International Publishing , Cham ( 2022 ) Lin, Y.C., Zhang, B., Prasanna, V.: Accelerating gnn training on cpu+multi-fpga heterogeneous platform. In: Navaux, P., Barrios H., C.J., Osthoff, C., Guerrero, G. (eds.) High Performance Computing. pp. 16--30. Springer International Publishing, Cham (2022)"},{"key":"e_1_3_2_1_15_1","volume-title":"MPI: A Message-Passing Interface Standard Version 4.0 (Jun","author":"Message Passing Interface Forum","year":"2021","unstructured":"Message Passing Interface Forum : MPI: A Message-Passing Interface Standard Version 4.0 (Jun 2021 ), https:\/\/www.mpi-forum.org\/docs\/mpi-4.0\/mpi40-report.pdf Message Passing Interface Forum: MPI: A Message-Passing Interface Standard Version 4.0 (Jun 2021), https:\/\/www.mpi-forum.org\/docs\/mpi-4.0\/mpi40-report.pdf"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1109\/H2RC51942.2020.00007","volume-title":"2020 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC).","author":"Meyer M.","year":"2020","unstructured":"Meyer , M. , Kenter , T. , Plessl , C. : Evaluating fpga accelerator performance with a parameterized opencl adaptation of selected benchmarks of the hpcchallenge benchmark suite . In: 2020 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). pp. 10 -- 18 . IEEE ( 2020 ) Meyer, M., Kenter, T., Plessl, C.: Evaluating fpga accelerator performance with a parameterized opencl adaptation of selected benchmarks of the hpcchallenge benchmark suite. In: 2020 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). pp. 10--18. IEEE (2020)"},{"key":"e_1_3_2_1_17_1","first-page":"8","volume-title":"Wright","author":"Nguyen T.","year":"2020","unstructured":"Nguyen , T. , Williams , S. , Siracusa , M. , MacLean , C. , Doerfler , D. , Wright , N.J. : The performance and energy efficiency potential of fpgas in scientific computing. In: 2020 IEEE\/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS). pp. 8 -- 19 . IEEE ( 2020) Nguyen, T., Williams, S., Siracusa, M., MacLean, C., Doerfler, D., Wright, N.J.: The performance and energy efficiency potential of fpgas in scientific computing. In: 2020 IEEE\/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS). pp. 8--19. IEEE (2020)"},{"key":"e_1_3_2_1_18_1","unstructured":"OpenCL Specification. https:\/\/registry.khronos.org\/OpenCL\/specs\/3.0-unified\/pdf\/OpenCL_API.pdf (2022) OpenCL Specification. https:\/\/registry.khronos.org\/OpenCL\/specs\/3.0-unified\/pdf\/OpenCL_API.pdf (2022)"},{"key":"e_1_3_2_1_19_1","unstructured":"Open MPI: Open Source High Performance Computing. http:\/\/www.openmpi.org Open MPI: Open Source High Performance Computing. http:\/\/www.openmpi.org"},{"key":"e_1_3_2_1_20_1","volume-title":"Bringing fpgas to hpc production systems and codes (11","author":"Plessl C.","year":"2018","unstructured":"Plessl , C. : Bringing fpgas to hpc production systems and codes (11 2018 ), invited talk at the R2HC'18 workshop at SC '18 Plessl, C.: Bringing fpgas to hpc production systems and codes (11 2018), invited talk at the R2HC'18 workshop at SC'18"},{"key":"e_1_3_2_1_21_1","first-page":"1","volume-title":"2020 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC).","author":"Ringlein B.","year":"2020","unstructured":"Ringlein , B. , Abel , F. , Ditter , A. , Weiss , B. , Hagleitner , C. , Fey , D. : Programming reconfigurable heterogeneous computing clusters using mpi with transpilation . In: 2020 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). pp. 1 -- 9 . IEEE ( 2020 ) Ringlein, B., Abel, F., Ditter, A., Weiss, B., Hagleitner, C., Fey, D.: Programming reconfigurable heterogeneous computing clusters using mpi with transpilation. In: 2020 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC). pp. 1--9. IEEE (2020)"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/FPL.2006.311233"},{"key":"e_1_3_2_1_23_1","volume-title":"Putnam","author":"Salda\u00f1a M.","year":"2010","unstructured":"Salda\u00f1a , M. , Patel , A. , Madill , C. , Nunes , D. , Wang , D. , Chow , P. , Wittig , R. , Styles , H. , Putnam , A. : Mpi as a programming model for high-performance reconfigurable computers. ACM Transactions on Reconfigurable Technology and Systems (TRETS) 3(4), 1--29 ( 2010 ) Salda\u00f1a, M., Patel, A., Madill, C., Nunes, D., Wang, D., Chow, P., Wittig, R., Styles, H., Putnam, A.: Mpi as a programming model for high-performance reconfigurable computers. ACM Transactions on Reconfigurable Technology and Systems (TRETS) 3(4), 1--29 (2010)"},{"key":"e_1_3_2_1_24_1","first-page":"1","volume-title":"Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies.","author":"Sanaullah A.","year":"2018","unstructured":"Sanaullah , A. , Herbordt , M.C. : Fpga hpc using opencl: Case study in 3d fft . In: Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies. pp. 1 -- 6 ( 2018 ) Sanaullah, A., Herbordt, M.C.: Fpga hpc using opencl: Case study in 3d fft. In: Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies. pp. 1--6 (2018)"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/HiPC53243.2021.00054"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2890150"},{"key":"e_1_3_2_1_28_1","volume-title":"Workshop on Exascale MPI (2018)","author":"Stern J.","unstructured":"Stern , J. , Xiong , Q. , Skjellum , A. , Herbordt , M. : A novel approach to supporting communicators for in-switch processing of mpi collectives . In: Workshop on Exascale MPI (2018) Stern, J., Xiong, Q., Skjellum, A., Herbordt, M.: A novel approach to supporting communicators for in-switch processing of mpi collectives. In: Workshop on Exascale MPI (2018)"},{"key":"e_1_3_2_1_29_1","unstructured":"Top500. https:\/\/www.top500.org\/lists\/top500\/2022\/11\/ (2022) Top500. https:\/\/www.top500.org\/lists\/top500\/2022\/11\/ (2022)"},{"issue":"3","key":"e_1_3_2_1_30_1","first-page":"257","volume":"26","author":"Wang H.","year":"2011","unstructured":"Wang , H. , Potluri , S. , Luo , M. , Singh , A.K. , Sur , S. , Panda , D.K. : Mvapich 2-gpu: optimized gpu to gpu communication for infiniband clusters. Computer Science-Research and Development 26 ( 3 ), 257 -- 266 ( 2011 ) Wang, H., Potluri, S., Luo, M., Singh, A.K., Sur, S., Panda, D.K.: Mvapich2-gpu: optimized gpu to gpu communication for infiniband clusters. Computer Science-Research and Development 26(3), 257--266 (2011)","journal-title":"Development"}],"event":{"name":"ICS '23: 37th International Conference on Supercomputing","location":"Orlando FL USA","acronym":"ICS '23","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture"]},"container-title":["Proceedings of the 37th International Conference on Supercomputing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3577193.3593720","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:47:32Z","timestamp":1750178852000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3577193.3593720"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,21]]},"references-count":29,"alternative-id":["10.1145\/3577193.3593720","10.1145\/3577193"],"URL":"https:\/\/doi.org\/10.1145\/3577193.3593720","relation":{},"subject":[],"published":{"date-parts":[[2023,6,21]]},"assertion":[{"value":"2023-06-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}