{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T04:14:14Z","timestamp":1763352854612,"version":"3.45.0"},"reference-count":37,"publisher":"SAGE Publications","issue":"6","license":[{"start":{"date-parts":[[2025,9,1]],"date-time":"2025-09-01T00:00:00Z","timestamp":1756684800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2025,9,1]],"date-time":"2025-09-01T00:00:00Z","timestamp":1756684800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"funder":[{"DOI":"10.13039\/100010669","name":"H2020 LEIT Information and Communication Technologies","doi-asserted-by":"publisher","award":["871738"],"award-info":[{"award-number":["871738"]}],"id":[{"id":"10.13039\/100010669","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100014438","name":"Business Finland","doi-asserted-by":"publisher","award":["AI for Situational Awareness (AISA)"],"award-info":[{"award-number":["AI for Situational Awareness (AISA)"]}],"id":[{"id":"10.13039\/501100014438","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2025,11]]},"abstract":"<jats:p>\n                    We propose\n                    <jats:italic toggle=\"yes\">PoCL-R<\/jats:italic>\n                    , a novel computing runtime that makes remote compute devices available to the client via the cross-vendor heterogeneous\n                    <jats:italic toggle=\"yes\">Open Computing Language (OpenCL)<\/jats:italic>\n                    API standard. 
For robustness in mobile use cases, intermittent connection loss is handled gracefully even if the device\u2019s IP address changes on the way. A major benefit in comparison to the previous state-of-the-art OpenCL distribution layers is that <jats:italic toggle=\"yes\">PoCL-R<\/jats:italic> minimizes network-induced latency by transferring data and signaling command completions between remote devices in a peer-to-peer fashion, outperforming the state-of-the-art baseline by a factor of up to 50x in synthetic benchmarks. <jats:italic toggle=\"yes\">PoCL-R<\/jats:italic>\u2019s streamlined TCP-based protocol has a command latency of only 60 microseconds on top of network round-trip latency in synthetic benchmarks. The application of <jats:italic toggle=\"yes\">PoCL-R<\/jats:italic> to porting of compute-heavy applications to mobile devices is demonstrated with a smartphone-based <jats:italic toggle=\"yes\">Augmented Reality (AR)<\/jats:italic> rendering case study where we measure up to 19x improvements to frame rate and 17x improvements to local energy consumption when using the proposed runtime to remote-offload a part of the AR rendering workload. 
Scalability across multiple GPU server nodes in real-world applications is shown using a computational fluid dynamics simulation, which scales with the number of servers at roughly 80% efficiency which is comparable to a\n                    <jats:italic toggle=\"yes\">Message-Passing Interface (MPI)<\/jats:italic>\n                    port of the same simulation, but with the benefit of not needing a separate API for cross-server distribution.\n                  <\/jats:p>","DOI":"10.1177\/10943420251369350","type":"journal-article","created":{"date-parts":[[2025,9,1]],"date-time":"2025-09-01T14:26:11Z","timestamp":1756736771000},"page":"750-769","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":0,"title":["PoCL-R: An open standard based heterogeneous offloading layer with server side scalability"],"prefix":"10.1177","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8721-7530","authenticated-orcid":false,"given":"Jan","family":"Solanti","sequence":"first","affiliation":[{"name":"Faculty of Information Technology and Communication Sciences"}]},{"given":"Michal","family":"Babej","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology and Communication Sciences"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5373-3190","authenticated-orcid":false,"given":"Julius","family":"Ikkala","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology and Communication Sciences"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5707-8544","authenticated-orcid":false,"given":"Pekka","family":"J\u00e4\u00e4skel\u00e4inen","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology and Communication 
Sciences"}]}],"member":"179","published-online":{"date-parts":[[2025,9]]},"reference":[{"key":"e_1_3_4_2_1","doi-asserted-by":"publisher","DOI":"10.1177\/10943420241277839"},{"key":"e_1_3_4_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/SBAC-PAD49847.2020.00049"},{"key":"e_1_3_4_4_1","doi-asserted-by":"crossref","unstructured":"Alves A Rufino J Pina A et al. (2012) clOpenCL-supporting distributed heterogeneous computing in HPC clusters. In: European Conference on Parallel Processing Dresden Germany 25 \u2013 29 August 2025 112\u2013122.","DOI":"10.1007\/978-3-642-36949-0_14"},{"key":"e_1_3_4_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3388333.3388641"},{"key":"e_1_3_4_6_1","first-page":"196","article-title":"The MOSIX Virtual OpenCL (VCL) Cluster Platform","volume":"8","author":"Barak A","year":"2011","unstructured":"Barak A, Shiloh A (2011) The MOSIX Virtual OpenCL (VCL) Cluster Platform. Proceedings of the Intel European Research & Innovation Conference 8: 196.","journal-title":"Proceedings of the Intel European Research & Innovation Conference"},{"key":"e_1_3_4_7_1","unstructured":"CHIP-SPV (2022) chipStar. https:\/\/github.com\/CHIP-SPV\/chipStar (Accessed 26 06 2023)."},{"key":"e_1_3_4_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/MASCOTS.2013.77"},{"key":"e_1_3_4_9_1","unstructured":"Ferreira PO (2023) RemoteCL. https:\/\/github.com\/silverclaw\/RemoteCL (Accessed 27 06 2023)."},{"key":"e_1_3_4_10_1","article-title":"MPI-based multi-GPU extension of the Lattice Boltzmann Method","author":"H\u00e4usl F","year":"2019","unstructured":"H\u00e4usl F (2019) MPI-based multi-GPU extension of the Lattice Boltzmann Method. Master\u2019s Thesis, Universit\u00e4t Bayreuth. 
https:\/\/epub.uni-bayreuth.de\/id\/eprint\/5689\/","journal-title":"Master\u2019s Thesis, Universit\u00e4t Bayreuth"},{"key":"e_1_3_4_11_1","article-title":"H.265 and ISO\/IEC 23008-2: high efficiency video coding (HEVC)","author":"ITU-T Study Group 21","year":"2013","unstructured":"ITU-T Study Group 21 (2013) H.265 and ISO\/IEC 23008-2: high efficiency video coding (HEVC). ITU-T Recommendations. Available at: https:\/\/handle.itu.int\/11.1002\/1000\/15936","journal-title":"ITU-T Recommendations"},{"key":"e_1_3_4_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-014-0320-y"},{"key":"e_1_3_4_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11265-018-1416-1"},{"key":"e_1_3_4_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2012.16"},{"key":"e_1_3_4_15_1","unstructured":"Kessenich J Ouriel B Krisch R (2023) SPIR-V specification. https:\/\/registry.khronos.org\/SPIR-V\/specs\/unified1\/SPIRV.html (Version 1.6 accessed 17 06 2023)."},{"key":"e_1_3_4_16_1","unstructured":"Khronos OpenCL Working Group (2020) The OpenCLTM specification. https:\/\/www.khronos.org\/registry\/OpenCL\/specs\/3.0-unified\/pdf\/OpenCL_API.pdf (Accessed 16 10 2020)."},{"key":"e_1_3_4_17_1","unstructured":"Khronos OpenVX Working Group (2022) The OpenVXTM specification 1.3.1. https:\/\/www.khronos.org\/registry\/OpenVX\/specs\/1.3.1\/html\/OpenVX_Specification_1_3_1.html (Accessed 27 06 2023)."},{"key":"e_1_3_4_18_1","unstructured":"Khronos SYCL Working Group (2020) SYCLTM specification. https:\/\/www.khronos.org\/registry\/SYCL\/specs\/sycl-1.2.1.pdf (Accessed 16 10 2020)."},{"key":"e_1_3_4_19_1","unstructured":"Khronos Vulkan Working Group (2022) VulkanTM 1.3.214 - a Specification (with all registered Vulkan extensions). 
https:\/\/www.khronos.org\/registry\/vulkan\/specs\/1.3-extensions\/pdf\/vkspec.pdf (Accessed 20 05 2022)."},{"key":"e_1_3_4_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2304576.2304623"},{"key":"e_1_3_4_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980983.2908094"},{"key":"e_1_3_4_22_1","unstructured":"Lehmann M (2022) FluidX3D. https:\/\/github.com\/ProjectPhysX\/FluidX3D (Accessed 26 06 2023)."},{"key":"e_1_3_4_23_1","volume-title":"Computational Study of Microplastic Transport at the Water-Air Interface with a Memory-Optimized Lattice Boltzmann Method","author":"Lehmann M","year":"2023","unstructured":"Lehmann M (2023) Computational Study of Microplastic Transport at the Water-Air Interface with a Memory-Optimized Lattice Boltzmann Method. Bayreuth: Universit\u00e4t Bayreuth. https:\/\/epub.uni-bayreuth.de\/id\/eprint\/6977\/"},{"key":"e_1_3_4_24_1","first-page":"015308","article-title":"Accuracy and performance of the lattice Boltzmann method with 64-bit, 32-bit, and customized 16-bit number formats","volume":"106","author":"Lehmann M","year":"2022","unstructured":"Lehmann M, Krause M, Amati G, et al. (2022) Accuracy and performance of the lattice Boltzmann method with 64-bit, 32-bit, and customized 16-bit number formats. Physical Review 106: 015308.","journal-title":"Physical Review"},{"key":"e_1_3_4_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2019.2928289"},{"key":"e_1_3_4_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-38027-3_7"},{"key":"e_1_3_4_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3452296.3472923"},{"key":"e_1_3_4_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2014.09.011"},{"key":"e_1_3_4_29_1","first-page":"7146","article-title":"A remote direct memory access protocol specification","volume":"5040","author":"Recio R","year":"2007","unstructured":"Recio R, Metzler B, Culley P, et al. (2007) A remote direct memory access protocol specification. 
Rfc 5040: 7146.","journal-title":"Rfc"},{"key":"e_1_3_4_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPDC.2011.25"},{"key":"e_1_3_4_31_1","first-page":"2024","article-title":"Real-time decoding and AR playback of the emerging MPEG video-based point cloud compression standard","volume":"2019","author":"Schwarz S","year":"2019","unstructured":"Schwarz S, Pesonen M (2019) Real-time decoding and AR playback of the emerging MPEG video-based point cloud compression standard. Indian Botanical Contactor 2019: 2024.","journal-title":"Indian Botanical Contactor"},{"key":"e_1_3_4_32_1","first-page":"17487","article-title":"Direct data placement over reliable transports","volume":"5041","author":"Shah H","year":"2007","unstructured":"Shah H, Pinkerton J, Recio R, et al. (2007) Direct data placement over reliable transports. Rfc 5041: 17487.","journal-title":"Rfc"},{"key":"e_1_3_4_33_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.4072"},{"key":"e_1_3_4_34_1","unstructured":"Simpson RJ Baldwin D Rost R (2023) OpenGL ES\u00ae shading language version 3.20.6. https:\/\/www.khronos.org\/registry\/OpenGL\/specs\/es\/3.2\/GLSL_ES_Specification_3.20.pdf (accessed 27 06 2023)."},{"key":"e_1_3_4_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-04580-6_6"},{"key":"e_1_3_4_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2012.2221191"},{"key":"e_1_3_4_37_1","unstructured":"The Khronos Group Inc (2023) OpenGL\u00ae ES version 3.2 (may 5 2022). https:\/\/www.khronos.org\/registry\/OpenGL\/specs\/es\/3.2\/es_spec_3.2.pdf. 
(Accessed: 27 06 2023)."},{"key":"e_1_3_4_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/InPar.2012.6339609"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420251369350","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/10943420251369350","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420251369350","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T04:10:46Z","timestamp":1763352646000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/10943420251369350"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9]]},"references-count":37,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2025,11]]}},"alternative-id":["10.1177\/10943420251369350"],"URL":"https:\/\/doi.org\/10.1177\/10943420251369350","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"type":"print","value":"1094-3420"},{"type":"electronic","value":"1741-2846"}],"subject":[],"published":{"date-parts":[[2025,9]]}}}