CAVLCU: an efficient GPU-based implementation of CAVLC
| dc.centro | E.T.S.I. Informática | es_ES |
| dc.contributor.author | Fuentes-Alventosa, Antonio | |
| dc.contributor.author | Gómez-Luna, Juan | |
| dc.contributor.author | González-Linares, José María | |
| dc.contributor.author | Guil-Mata, Nicolás | |
| dc.contributor.author | Medina-Carnicer, Rafael | |
| dc.date.accessioned | 2025-02-13T09:43:31Z | |
| dc.date.available | 2025-02-13T09:43:31Z | |
| dc.date.created | 2025 | |
| dc.date.issued | 2022 | |
| dc.departamento | Computer Architecture (Arquitectura de Computadores) | |
| dc.description.abstract | In this paper, we present CAVLCU, an efficient implementation of CAVLC on GPU, which is based on four key ideas. First, we use only one kernel to avoid the long-latency global memory accesses required to transmit intermediate results among different kernels, and the costly launches and terminations of additional kernels. Second, we apply an efficient synchronization mechanism for thread-blocks that process adjacent frame regions (in horizontal and vertical dimensions) to share results in global memory space. (In this paper, to prevent confusion, a block of pixels of a frame is referred to simply as a block, and a GPU thread block as a thread-block.) Third, we fully exploit the available global memory bandwidth by using vectorized loads to move the quantized transform coefficients directly to registers. Fourth, we use register tiling to implement the zigzag sorting, thus obtaining high instruction-level parallelism. An exhaustive experimental evaluation showed that our approach is between 2.5× and 5.4× faster than the only state-of-the-art GPU-based implementation of CAVLC. | es_ES |
| dc.description.sponsorship | Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. | es_ES |
| dc.identifier.citation | Fuentes-Alventosa, A., Gómez-Luna, J., González-Linares, J.M. et al. CAVLCU: an efficient GPU-based implementation of CAVLC. J Supercomput 78, 7556–7590 (2022). https://doi.org/10.1007/s11227-021-04183-8 | es_ES |
| dc.identifier.doi | 10.1007/s11227-021-04183-8 | |
| dc.identifier.uri | https://hdl.handle.net/10630/37825 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Springer Nature | es_ES |
| dc.rights | Attribution 4.0 International | * |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
| dc.subject | Data compression (Computer science) | es_ES |
| dc.subject | Images - Compression | es_ES |
| dc.subject | Image processing - Digital techniques | es_ES |
| dc.subject | Video compression | es_ES |
| dc.subject.other | CAVLC | es_ES |
| dc.subject.other | GPU | es_ES |
| dc.subject.other | CUDA | es_ES |
| dc.subject.other | H.264 | es_ES |
| dc.subject.other | Parallel implementations | es_ES |
| dc.subject.other | Data compression | es_ES |
| dc.subject.other | Variable-length encoding | es_ES |
| dc.title | CAVLCU: an efficient GPU-based implementation of CAVLC | es_ES |
| dc.type | journal article | es_ES |
| dc.type.hasVersion | VoR | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 3388700c-0831-457c-9cf8-ca14cec33a15 | |
| relation.isAuthorOfPublication | bed8ca48-652e-4212-8c3c-05bfdc85a378 | |
| relation.isAuthorOfPublication.latestForDiscovery | 3388700c-0831-457c-9cf8-ca14cec33a15 |
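
The abstract mentions zigzag sorting of quantized transform coefficients as one step of CAVLC. As a rough illustration only, the following is a minimal CPU sketch in Python of that reordering for a single 4×4 block, together with two of the quantities CAVLC derives from the reordered coefficients (the number of nonzero coefficients and the number of trailing ±1 values, capped at three). It is not the paper's GPU kernel and does not model register tiling, vectorized loads, or thread-block synchronization; the function names and the sample block are assumptions made for this example.

```python
# Zigzag scan order for a 4x4 block (H.264 frame coding), given as
# raster-order indices into the block.
ZIGZAG_4X4 = (0, 1, 4, 8, 5, 2, 3, 6, 9, 12, 13, 10, 7, 11, 14, 15)


def zigzag_scan(block):
    """Reorder 16 raster-order coefficients into zigzag order."""
    if len(block) != 16:
        raise ValueError("expected a 4x4 block given as 16 coefficients")
    return [block[i] for i in ZIGZAG_4X4]


def total_coeff(scanned):
    """Count the nonzero coefficients in a zigzag-ordered block."""
    return sum(1 for c in scanned if c != 0)


def trailing_ones(scanned):
    """Count consecutive +/-1 values at the high-frequency end.

    Zeros are skipped, counting stops at the first nonzero coefficient
    whose magnitude is not 1, and the count is capped at three.
    """
    count = 0
    for c in reversed(scanned):
        if c == 0:
            continue
        if abs(c) == 1 and count < 3:
            count += 1
        else:
            break
    return count


# Hypothetical sample block in raster order (rows of four coefficients).
sample_block = [
    0, 3, -1, 0,
    0, -1, 1, 0,
    1, 0, 0, 0,
    0, 0, 0, 0,
]
scanned = zigzag_scan(sample_block)
print(scanned, total_coeff(scanned), trailing_ones(scanned))
```

The index table keeps the reordering branch-free; on a GPU the same fixed permutation could presumably be expressed with compile-time indices so that coefficients stay in registers, but that mapping is not shown here.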
Files
Original bundle
- Name: s11227-021-04183-8.pdf
- Size: 2.75 MB
- Format: Adobe Portable Document Format
- Description: Main article

