CAVLCU: an efcient GPU‑based implementation of CAVLC

Fuentes-Alventosa, Antonio; Gómez-Luna, Juan; González-Linares, José María; Guil-Mata, Nicolás; Medina-Carnicer, Rafael

doi:10.1007/s11227-021-04183-8

CAVLCU: an efcient GPU‑based implementation of CAVLC

dc.centro	E.T.S.I. Informática	es_ES
dc.contributor.author	Fuentes-Alventosa, Antonio
dc.contributor.author	Gómez-Luna, Juan
dc.contributor.author	González-Linares, José María
dc.contributor.author	Guil-Mata, Nicolás
dc.contributor.author	Medina-Carnicer, Rafael
dc.date.accessioned	2025-02-13T09:43:31Z
dc.date.available	2025-02-13T09:43:31Z
dc.date.created	2025
dc.date.issued	2022
dc.departamento	Arquitectura de Computadores
dc.description.abstract	In this paper, we present CAVLCU, an efficient implementation of CAVLC on GPU, which is based on four key ideas. First, we use only one kernel to avoid the long latency global memory accesses required to transmit intermediate results among different kernels, and the costly launches and terminations of additional kernels. Second, we apply an efficient synchronization mechanism for thread-blocks (In this paper, to prevent confusion, a block of pixels of a frame will be referred to as simply block and a GPU thread block as thread-block.) that process adjacent frame regions (in horizontal and vertical dimensions) to share results in global memory space. Third, we exploit fully the available global memory bandwidth by using vectorized loads to move directly the quantized transform coefficients to registers. Fourth, we use register tiling to implement the zigzag sorting, thus obtaining high instruction-level parallelism. An exhaustive experimental evaluation showed that our approach is between 2.5× and 5.4× faster than the only state-of-the-art GPUbased implementation of CAVLC.	es_ES
dc.description.sponsorship	Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature.	es_ES
dc.identifier.citation	Fuentes-Alventosa, A., Gómez-Luna, J., González-Linares, J.M. et al. CAVLCU: an efficient GPU-based implementation of CAVLC. J Supercomput 78, 7556–7590 (2022). https://doi.org/10.1007/s11227-021-04183-8	es_ES
dc.identifier.doi	10.1007/s11227-021-04183-8
dc.identifier.uri	https://hdl.handle.net/10630/37825
dc.language.iso	eng	es_ES
dc.publisher	Springer Nature	es_ES
dc.rights	Atribución 4.0 Internacional	*
dc.rights.accessRights	open access	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.subject	Compresión de datos (Informática)	es_ES
dc.subject	Imágenes - Compresión	es_ES
dc.subject	Procesado de imágenes - Técnicas digitales	es_ES
dc.subject	Compresión de vídeo	es_ES
dc.subject.other	CAVLC	es_ES
dc.subject.other	GPU	es_ES
dc.subject.other	CUDA	es_ES
dc.subject.other	H.264	es_ES
dc.subject.other	Parallel implementations	es_ES
dc.subject.other	Data compression	es_ES
dc.subject.other	Variable-length encoding	es_ES
dc.title	CAVLCU: an efcient GPU‑based implementation of CAVLC	es_ES
dc.type	journal article	es_ES
dc.type.hasVersion	VoR	es_ES
dspace.entity.type	Publication
relation.isAuthorOfPublication	3388700c-0831-457c-9cf8-ca14cec33a15
relation.isAuthorOfPublication	bed8ca48-652e-4212-8c3c-05bfdc85a378
relation.isAuthorOfPublication.latestForDiscovery	3388700c-0831-457c-9cf8-ca14cec33a15

Files

Original bundle

Now showing 1 - 1 of 1

Name:: s11227-021-04183-8.pdf
Size:: 2.75 MB
Format:: Adobe Portable Document Format
Description:: Artículo principal

Download

Description: Artículo principal

Collections

Artículos