Article
OpenCL overview, implementation, and performance comparison
Publication date:
04/2013
Publisher:
Institute of Electrical and Electronics Engineers
Journal:
IEEE Latin America Transactions
ISSN:
1548-0992
Language:
English
Resource type:
Published article
Abstract
Until a few years ago, high-performance parallel computing was exclusive to expensive specialized hardware. Today, powerful parallel processors can be found in many home graphics cards, whose interfaces have recently been opened by many manufacturers for general-purpose computing. OpenCL, created by the world's most important processor manufacturers, goes a step further, aiming for a platform- and manufacturer-independent parallel language. However, understanding this new processing paradigm is challenging, and it is critical for future computation-demanding applications. First, this document provides a deep technical background on the OpenCL architecture. Second, we propose a matrix product OpenCL kernel implemented directly in C++, without wrappers, in order to describe the OpenCL programming flow in detail. Third, this program is run on different platforms and algebraic scenarios, concluding that calculation performance can improve by up to 3 orders of magnitude over the same algorithm in plain C++.
Keywords:
Heterogeneous, Systems, Parallelism, Computing
Collections
Articles (IFEG)
Articles of INST.DE FISICA ENRIQUE GAVIOLA
Citation
Fraire, Juan Andres; Ferreyra, Pablo Alejandro; Marques, Carlos Alberto; OpenCL overview, implementation, and performance comparison; Institute of Electrical and Electronics Engineers; IEEE Latin America Transactions; 11; 1; 4-2013; 274-280