Improvements in Hardware Transactional Memory for GPU Architectures

Villegas Fernández, Alejandro; Asenjo-Plaza, Rafael; González-Navarro, María Ángeles; Plata-González, Óscar Guillermo

Improvements in Hardware Transactional Memory for GPU Architectures

dc.centro	E.T.S.I. Informática	es_ES
dc.contributor.author	Villegas Fernández, Alejandro
dc.contributor.author	Asenjo-Plaza, Rafael
dc.contributor.author	González-Navarro, María Ángeles
dc.contributor.author	Plata-González, Óscar Guillermo
dc.date.accessioned	2016-07-20T09:39:11Z
dc.date.available	2016-07-20T09:39:11Z
dc.date.created	2016
dc.date.issued	2016-07-20
dc.departamento	Arquitectura de Computadores
dc.description.abstract	In the multi-core CPU world, transactional memory (TM)has emerged as an alternative to lock-based programming for thread synchronization. Recent research proposes the use of TM in GPU architectures, where a high number of computing threads, organized in SIMT fashion, requires an effective synchronization method. In contrast to CPUs, GPUs offer two memory spaces: global memory and local memory. The local memory space serves as a shared scratch-pad for a subset of the computing threads, and it is used by programmers to speed-up their applications thanks to its low latency. Prior work from the authors proposed a lightweight hardware TM (HTM) support based in the local memory, modifying the SIMT execution model and adding a conflict detection mechanism. An efficient implementation of these features is key in order to provide an effective synchronization mechanism at the local memory level. After a quick description of the main features of our HTM design for GPU local memory, in this work we gather together a number of proposals designed with the aim of improving those mechanisms with high impact on performance. Firstly, the SIMT execution model is modified to increase the parallelism of the application when transactions must be serialized in order to make forward progress. Secondly, the conflict detection mechanism is optimized depending on application characteristics, such us the read/write sets, the probability of conflict between transactions and the existence of read-only transactions. As these features can be present in hardware simultaneously, it is a task of the compiler and runtime to determine which ones are more important for a given application. This work includes a discussion on the analysis to be done in order to choose the best configuration solution.	es_ES
dc.description.sponsorship	Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tech.	es_ES
dc.identifier.uri	http://hdl.handle.net/10630/11858
dc.language.iso	eng	es_ES
dc.relation.eventdate	6 de julio de 2016	es_ES
dc.relation.eventplace	Valladolid, España	es_ES
dc.relation.eventtitle	18th International Workshop on Compilers for Parallel Computing (CPC’15)	es_ES
dc.rights	by-nc-nd
dc.rights.accessRights	open access	es_ES
dc.subject	Ordenadores - Equipo de entrada y salida	es_ES
dc.subject.other	Hardware Transactional Memory	es_ES
dc.subject.other	GPU	es_ES
dc.title	Improvements in Hardware Transactional Memory for GPU Architectures	es_ES
dc.type	conference output	es_ES
dspace.entity.type	Publication
relation.isAuthorOfPublication	6ea008bf-69ee-4104-a942-2033b5b07ab8
relation.isAuthorOfPublication	0857b903-5728-47c9-b298-a203bf081d23
relation.isAuthorOfPublication	34b85e22-88ce-4035-a53e-2bafb0c3310b
relation.isAuthorOfPublication.latestForDiscovery	6ea008bf-69ee-4104-a942-2033b5b07ab8

Files

Original bundle

Now showing 1 - 1 of 1

Name:: paper-19.pdf
Size:: 297.91 KB
Format:: Adobe Portable Document Format

Download

Collections

Ponencias, Comunicaciones a congresos y Pósteres