Benchmarking of communication techniques for GPUs
Saved in:
| Title: | Benchmarking of communication techniques for GPUs |
|---|---|
| Authors: | Bernaschi, M.1 m.bernaschi@iac.cnr.it, Bisson, M.1, Rossetti, D.2 |
| Source: | Journal of Parallel & Distributed Computing. Feb2013, Vol. 73 Issue 2, p250-255. 6p. |
| Subjects: | Computer graphics, Performance evaluation, Application software, Message passing (Computer science), Computer storage devices, Computer programming |
| Abstract: | Abstract: We report about the performances obtained, at the application level, by two MPI implementations for Infiniband that allow direct exchange of data stored in the global memory of Graphic Processing Units (GPU) based on the Nvidia CUDA. For the same purpose, we tested also the Application Programming Interface of APEnet, which is a custom, high performance interconnect technology. As a benchmark we consider the time required to update a single spin of the 3D Heisenberg spin glass model by using the over-relaxation algorithm. The results show that CUDA streams are instrumental in achieving the best possible performances. [Copyright &y& Elsevier] |
| Copyright of Journal of Parallel & Distributed Computing is the property of Academic Press Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
| FullText | Text: Availability: 0 |
|---|---|
| Header | DbId: egs DbLabel: Engineering Source An: 84361961 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Benchmarking of communication techniques for GPUs – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Bernaschi%2C+M%2E%22">Bernaschi, M.</searchLink><relatesTo>1</relatesTo><i> m.bernaschi@iac.cnr.it</i><br /><searchLink fieldCode="AR" term="%22Bisson%2C+M%2E%22">Bisson, M.</searchLink><relatesTo>1</relatesTo><br /><searchLink fieldCode="AR" term="%22Rossetti%2C+D%2E%22">Rossetti, D.</searchLink><relatesTo>2</relatesTo> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%22Journal+of+Parallel+%26+Distributed+Computing%22">Journal of Parallel & Distributed Computing</searchLink>. Feb2013, Vol. 73 Issue 2, p250-255. 6p. – Name: Subject Label: Subjects Group: Su Data: <searchLink fieldCode="DE" term="%22Computer+graphics%22">Computer graphics</searchLink><br /><searchLink fieldCode="DE" term="%22Performance+evaluation%22">Performance evaluation</searchLink><br /><searchLink fieldCode="DE" term="%22Application+software%22">Application software</searchLink><br /><searchLink fieldCode="DE" term="%22Message+passing+%28Computer+science%29%22">Message passing (Computer science)</searchLink><br /><searchLink fieldCode="DE" term="%22Computer+storage+devices%22">Computer storage devices</searchLink><br /><searchLink fieldCode="DE" term="%22Computer+programming%22">Computer programming</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: Abstract: We report about the performances obtained, at the application level, by two MPI implementations for Infiniband that allow direct exchange of data stored in the global memory of Graphic Processing Units (GPU) based on the Nvidia CUDA. For the same purpose, we tested also the Application Programming Interface of APEnet, which is a custom, high performance interconnect technology. As a benchmark we consider the time required to update a single spin of the 3D Heisenberg spin glass model by using the over-relaxation algorithm. The results show that CUDA streams are instrumental in achieving the best possible performances. [Copyright &y& Elsevier] – Name: AbstractSuppliedCopyright Label: Group: Ab Data: <i>Copyright of Journal of Parallel & Distributed Computing is the property of Academic Press Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=84361961 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1016/j.jpdc.2012.09.006 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 6 StartPage: 250 Subjects: – SubjectFull: Computer graphics Type: general – SubjectFull: Performance evaluation Type: general – SubjectFull: Application software Type: general – SubjectFull: Message passing (Computer science) Type: general – SubjectFull: Computer storage devices Type: general – SubjectFull: Computer programming Type: general Titles: – TitleFull: Benchmarking of communication techniques for GPUs Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Bernaschi, M. – PersonEntity: Name: NameFull: Bisson, M. – PersonEntity: Name: NameFull: Rossetti, D. IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 02 Text: Feb2013 Type: published Y: 2013 Identifiers: – Type: issn-print Value: 07437315 Numbering: – Type: volume Value: 73 – Type: issue Value: 2 Titles: – TitleFull: Journal of Parallel & Distributed Computing Type: main |
| ResultId | 1 |