PoCL-R: An open standard based heterogeneous offloading layer with server side scalability.

Saved in:
Bibliographic Details
Title: PoCL-R: An open standard based heterogeneous offloading layer with server side scalability.
Authors: Solanti, Jan1 (AUTHOR) jan.solanti@tuni.fi, Babej, Michal1 (AUTHOR), Ikkala, Julius1 (AUTHOR), Jääskeläinen, Pekka1 (AUTHOR) pekka.jaaskelainen@tuni.fi
Source: International Journal of High Performance Computing Applications. Nov2025, Vol. 39 Issue 6, p750-769. 20p.
Subjects: OpenCL (Computer program language), Heterogeneous computing, Scalability, Distributed computing, Augmented reality, Mobile apps
Abstract: We propose PoCL-R, a novel computing runtime that makes remote compute devices available to the client via the cross-vendor heterogeneous Open Computing Language (OpenCL) API standard. For robustness in mobile use cases, intermittent connection loss is handled gracefully even if the device's IP address changes on the way. A major benefit in comparison to the previous state-of-the-art OpenCL distribution layers is that PoCL-R minimizes network-induced latency by transferring data and signaling command completions between remote devices in a peer-to-peer fashion, outperforming the state-of-the-art baseline by a factor of up to 50x in synthetic benchmarks. PoCL-R 's streamlined TCP-based protocol has a command latency of only 60 microseconds on top of network round-trip latency in synthetic benchmarks. The application of PoCL-R to porting of compute-heavy applications to mobile devices is demonstrated with a smartphone-based Augmented Reality (AR) rendering case study where we measure up to 19x improvements to frame rate and 17x improvements to local energy consumption when using the proposed runtime to remote-offload a part of the AR rendering workload. Scalability across multiple GPU server nodes in real-world applications is shown using a computational fluid dynamics simulation, which scales with the number of servers at roughly 80% efficiency which is comparable to a Message-Passing Interface (MPI) port of the same simulation, but with the benefit of not needing a separate API for cross-server distribution. [ABSTRACT FROM AUTHOR]
Copyright of International Journal of High Performance Computing Applications is the property of Sage Publications Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
FullText Text:
  Availability: 0
Header DbId: egs
DbLabel: Engineering Source
An: 188761336
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: PoCL-R: An open standard based heterogeneous offloading layer with server side scalability.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Solanti%2C+Jan%22">Solanti, Jan</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> jan.solanti@tuni.fi</i><br /><searchLink fieldCode="AR" term="%22Babej%2C+Michal%22">Babej, Michal</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Ikkala%2C+Julius%22">Ikkala, Julius</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Jääskeläinen%2C+Pekka%22">Jääskeläinen, Pekka</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> pekka.jaaskelainen@tuni.fi</i>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22International+Journal+of+High+Performance+Computing+Applications%22">International Journal of High Performance Computing Applications</searchLink>. Nov2025, Vol. 39 Issue 6, p750-769. 20p.
– Name: Subject
  Label: Subjects
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22OpenCL+%28Computer+program+language%29%22">OpenCL (Computer program language)</searchLink><br /><searchLink fieldCode="DE" term="%22Heterogeneous+computing%22">Heterogeneous computing</searchLink><br /><searchLink fieldCode="DE" term="%22Scalability%22">Scalability</searchLink><br /><searchLink fieldCode="DE" term="%22Distributed+computing%22">Distributed computing</searchLink><br /><searchLink fieldCode="DE" term="%22Augmented+reality%22">Augmented reality</searchLink><br /><searchLink fieldCode="DE" term="%22Mobile+apps%22">Mobile apps</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: We propose PoCL-R, a novel computing runtime that makes remote compute devices available to the client via the cross-vendor heterogeneous Open Computing Language (OpenCL) API standard. For robustness in mobile use cases, intermittent connection loss is handled gracefully even if the device's IP address changes on the way. A major benefit in comparison to the previous state-of-the-art OpenCL distribution layers is that PoCL-R minimizes network-induced latency by transferring data and signaling command completions between remote devices in a peer-to-peer fashion, outperforming the state-of-the-art baseline by a factor of up to 50x in synthetic benchmarks. PoCL-R 's streamlined TCP-based protocol has a command latency of only 60 microseconds on top of network round-trip latency in synthetic benchmarks. The application of PoCL-R to porting of compute-heavy applications to mobile devices is demonstrated with a smartphone-based Augmented Reality (AR) rendering case study where we measure up to 19x improvements to frame rate and 17x improvements to local energy consumption when using the proposed runtime to remote-offload a part of the AR rendering workload. Scalability across multiple GPU server nodes in real-world applications is shown using a computational fluid dynamics simulation, which scales with the number of servers at roughly 80% efficiency which is comparable to a Message-Passing Interface (MPI) port of the same simulation, but with the benefit of not needing a separate API for cross-server distribution. [ABSTRACT FROM AUTHOR]
– Name: AbstractSuppliedCopyright
  Label:
  Group: Ab
  Data: <i>Copyright of International Journal of High Performance Computing Applications is the property of Sage Publications Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=188761336
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1177/10943420251369350
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 20
        StartPage: 750
    Subjects:
      – SubjectFull: OpenCL (Computer program language)
        Type: general
      – SubjectFull: Heterogeneous computing
        Type: general
      – SubjectFull: Scalability
        Type: general
      – SubjectFull: Distributed computing
        Type: general
      – SubjectFull: Augmented reality
        Type: general
      – SubjectFull: Mobile apps
        Type: general
    Titles:
      – TitleFull: PoCL-R: An open standard based heterogeneous offloading layer with server side scalability.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Solanti, Jan
      – PersonEntity:
          Name:
            NameFull: Babej, Michal
      – PersonEntity:
          Name:
            NameFull: Ikkala, Julius
      – PersonEntity:
          Name:
            NameFull: Jääskeläinen, Pekka
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 11
              Text: Nov2025
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-print
              Value: 10943420
          Numbering:
            – Type: volume
              Value: 39
            – Type: issue
              Value: 6
          Titles:
            – TitleFull: International Journal of High Performance Computing Applications
              Type: main
ResultId 1