PoCL-R: An open standard based heterogeneous offloading layer with server side scalability.
Saved in:
| Title: | PoCL-R: An open standard based heterogeneous offloading layer with server side scalability. |
|---|---|
| Authors: | Solanti, Jan1 (AUTHOR) jan.solanti@tuni.fi, Babej, Michal1 (AUTHOR), Ikkala, Julius1 (AUTHOR), Jääskeläinen, Pekka1 (AUTHOR) pekka.jaaskelainen@tuni.fi |
| Source: | International Journal of High Performance Computing Applications. Nov2025, Vol. 39 Issue 6, p750-769. 20p. |
| Subjects: | OpenCL (Computer program language), Heterogeneous computing, Scalability, Distributed computing, Augmented reality, Mobile apps |
| Abstract: | We propose PoCL-R, a novel computing runtime that makes remote compute devices available to the client via the cross-vendor heterogeneous Open Computing Language (OpenCL) API standard. For robustness in mobile use cases, intermittent connection loss is handled gracefully even if the device's IP address changes on the way. A major benefit in comparison to the previous state-of-the-art OpenCL distribution layers is that PoCL-R minimizes network-induced latency by transferring data and signaling command completions between remote devices in a peer-to-peer fashion, outperforming the state-of-the-art baseline by a factor of up to 50x in synthetic benchmarks. PoCL-R 's streamlined TCP-based protocol has a command latency of only 60 microseconds on top of network round-trip latency in synthetic benchmarks. The application of PoCL-R to porting of compute-heavy applications to mobile devices is demonstrated with a smartphone-based Augmented Reality (AR) rendering case study where we measure up to 19x improvements to frame rate and 17x improvements to local energy consumption when using the proposed runtime to remote-offload a part of the AR rendering workload. Scalability across multiple GPU server nodes in real-world applications is shown using a computational fluid dynamics simulation, which scales with the number of servers at roughly 80% efficiency which is comparable to a Message-Passing Interface (MPI) port of the same simulation, but with the benefit of not needing a separate API for cross-server distribution. [ABSTRACT FROM AUTHOR] |
| Copyright of International Journal of High Performance Computing Applications is the property of Sage Publications Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
| FullText | Text: Availability: 0 |
|---|---|
| Header | DbId: egs DbLabel: Engineering Source An: 188761336 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: PoCL-R: An open standard based heterogeneous offloading layer with server side scalability. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Solanti%2C+Jan%22">Solanti, Jan</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> jan.solanti@tuni.fi</i><br /><searchLink fieldCode="AR" term="%22Babej%2C+Michal%22">Babej, Michal</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Ikkala%2C+Julius%22">Ikkala, Julius</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Jääskeläinen%2C+Pekka%22">Jääskeläinen, Pekka</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> pekka.jaaskelainen@tuni.fi</i> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%22International+Journal+of+High+Performance+Computing+Applications%22">International Journal of High Performance Computing Applications</searchLink>. Nov2025, Vol. 39 Issue 6, p750-769. 20p. – Name: Subject Label: Subjects Group: Su Data: <searchLink fieldCode="DE" term="%22OpenCL+%28Computer+program+language%29%22">OpenCL (Computer program language)</searchLink><br /><searchLink fieldCode="DE" term="%22Heterogeneous+computing%22">Heterogeneous computing</searchLink><br /><searchLink fieldCode="DE" term="%22Scalability%22">Scalability</searchLink><br /><searchLink fieldCode="DE" term="%22Distributed+computing%22">Distributed computing</searchLink><br /><searchLink fieldCode="DE" term="%22Augmented+reality%22">Augmented reality</searchLink><br /><searchLink fieldCode="DE" term="%22Mobile+apps%22">Mobile apps</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: We propose PoCL-R, a novel computing runtime that makes remote compute devices available to the client via the cross-vendor heterogeneous Open Computing Language (OpenCL) API standard. For robustness in mobile use cases, intermittent connection loss is handled gracefully even if the device's IP address changes on the way. A major benefit in comparison to the previous state-of-the-art OpenCL distribution layers is that PoCL-R minimizes network-induced latency by transferring data and signaling command completions between remote devices in a peer-to-peer fashion, outperforming the state-of-the-art baseline by a factor of up to 50x in synthetic benchmarks. PoCL-R 's streamlined TCP-based protocol has a command latency of only 60 microseconds on top of network round-trip latency in synthetic benchmarks. The application of PoCL-R to porting of compute-heavy applications to mobile devices is demonstrated with a smartphone-based Augmented Reality (AR) rendering case study where we measure up to 19x improvements to frame rate and 17x improvements to local energy consumption when using the proposed runtime to remote-offload a part of the AR rendering workload. Scalability across multiple GPU server nodes in real-world applications is shown using a computational fluid dynamics simulation, which scales with the number of servers at roughly 80% efficiency which is comparable to a Message-Passing Interface (MPI) port of the same simulation, but with the benefit of not needing a separate API for cross-server distribution. [ABSTRACT FROM AUTHOR] – Name: AbstractSuppliedCopyright Label: Group: Ab Data: <i>Copyright of International Journal of High Performance Computing Applications is the property of Sage Publications Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=188761336 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1177/10943420251369350 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 20 StartPage: 750 Subjects: – SubjectFull: OpenCL (Computer program language) Type: general – SubjectFull: Heterogeneous computing Type: general – SubjectFull: Scalability Type: general – SubjectFull: Distributed computing Type: general – SubjectFull: Augmented reality Type: general – SubjectFull: Mobile apps Type: general Titles: – TitleFull: PoCL-R: An open standard based heterogeneous offloading layer with server side scalability. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Solanti, Jan – PersonEntity: Name: NameFull: Babej, Michal – PersonEntity: Name: NameFull: Ikkala, Julius – PersonEntity: Name: NameFull: Jääskeläinen, Pekka IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 11 Text: Nov2025 Type: published Y: 2025 Identifiers: – Type: issn-print Value: 10943420 Numbering: – Type: volume Value: 39 – Type: issue Value: 6 Titles: – TitleFull: International Journal of High Performance Computing Applications Type: main |
| ResultId | 1 |