Artículo
Performance analysis and comparison of cellular automata GPU implementations
Millán, Emmanuel Nicolás
; Wolovick, Nicolás; Piccoli, María Fabiana; Garcia Garino, Carlos Gabriel
; Bringa, Eduardo Marcial
Fecha de publicación:
09/2017
Editorial:
Springer
Revista:
Cluster Computing-the Journal Of Networks Software Tools And Applications
ISSN:
1386-7857
Idioma:
Inglés
Tipo de recurso:
Artículo publicado
Clasificación temática:
Resumen
Cellular automata (CA) models are of interest to several scientific areas, and there is a growing interest in exploring large systems which would need high performance computing. In this work a CA implementation is presented which performs well in five different NVIDIA GPU architectures, from Tesla to Maxwell, simulating systems with up to a billion cells. Using the game of life (GoL) and a more complex variation of GoL as examples, a performance of 5.58e6 evaluated cells/s is achieved. The two optimizations most often used in previous studies are the use of shared memory and Multicell algorithms. Here, these optimizations do not improve performance in Fermi or newer architectures. The GoL CA code running in an NVIDIA Titan X obtained a speedup of up to ∼ 85 x and up to ∼ 230 x for a more complex CA, compared to an optimized serial CPU implementation. Finally, the efficiency of each GPU is analyzed in terms of cell performance/transistors and cell performance/bandwidth showing how the architectures improved for this particular problem.
Archivos asociados
Licencia
Identificadores
Colecciones
Articulos(CCT - MENDOZA)
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - MENDOZA
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - MENDOZA
Citación
Millán, Emmanuel Nicolás; Wolovick, Nicolás; Piccoli, María Fabiana; Garcia Garino, Carlos Gabriel; Bringa, Eduardo Marcial; Performance analysis and comparison of cellular automata GPU implementations; Springer; Cluster Computing-the Journal Of Networks Software Tools And Applications; 20; 3; 9-2017; 2763-2777
Compartir
Altmétricas