Fast development of dense linear algebra codes on graphics processors

M. Jesús Zafont, Alberto Martín, Francisco Igual, Enrique S. Quintana-Ortí

Research output: Chapter in Book/Report/Conference proceeding › Conference Paper › Research › peer-review

1 Citation (Scopus)

Abstract

We present an application programming interface (API) for the C programming language that facilitates the development of dense linear algebra algorithms on graphics processors applying the FLAME methodology. The interface, built on top of the NVIDIA CUBLAS library, implements all the computational functionality of the FLAME/C interface. In addition, the API includes data transfer routines to explicitly handle communication between the CPU and GPU memory spaces. The flexibility and ease of use of this tool are illustrated using a complex dense linear algebra operation: the Cholesky factorization. For this operation, we implement and evaluate all existing variants on an NVIDIA G80 processor, attaining speed-ups of 7× compared with the CPU implementations.

Original language: English
Title of host publication: IPDPS 2009 - Proceedings of the 2009 IEEE International Parallel and Distributed Processing Symposium
DOI: https://doi.org/10.1109/IPDPS.2009.5160940
Publication status: Published - 25 Nov 2009
Externally published: Yes
Event: IEEE International Parallel and Distributed Processing Symposium, 2009 - Rome, Italy
Duration: 23 May 2009 to 29 May 2009
Conference number: 23rd

Publication series

Name: IPDPS 2009 - Proceedings of the 2009 IEEE International Parallel and Distributed Processing Symposium

Conference

Conference: IEEE International Parallel and Distributed Processing Symposium, 2009
Abbreviated title: IPDPS 2009
Country: Italy
City: Rome
Period: 23/05/09 to 29/05/09

Keywords

  • FLAME
  • Graphics processors
  • High performance
  • High-level APIs
  • Linear algebra

Cite this

Zafont, M. J., Martín, A., Igual, F., & Quintana-Ortí, E. S. (2009). Fast development of dense linear algebra codes on graphics processors. In IPDPS 2009 - Proceedings of the 2009 IEEE International Parallel and Distributed Processing Symposium [5160940] (IPDPS 2009 - Proceedings of the 2009 IEEE International Parallel and Distributed Processing Symposium). https://doi.org/10.1109/IPDPS.2009.5160940