RT Conference Proceedings
T1 A Hybrid Piece-Wise Slowdown Model for Concurrent Kernel Execution on GPU
A1 López Albelda, Bernabé
A1 Castro, Francisco M.
A1 González-Linares, José María
A1 Guil-Mata, Nicolás
K1 GKS (Sistema de ordenador)
K1 Kernel, Funciones de
AB Current execution of kernels on GPUs allows improving the use of hardware resources and reducing the execution time of co-executed kernels. In addition, efficient kernel-oriented scheduling policies pursuing criteria based on fairness or Quality of Service can be implemented. However, achieved co-executing performance strongly depends on how GPU resources are partitioned between kernels. Thus, precise slowdown models that predict accurate co-execution performance must be used to fulfill scheduling policy requirements. Most recent slowdown models work with Spatial Multitask (SMT) partitioning, where Stream Multiprocessors (SMs) are distributed among tasks. In this work, we show that Simultaneous Multikernel (SMK) partitioning, where kernels share the SMs, obtains better performance. However, kernel interference in SMK occurs not only in global memory, as in the SMT case, but also within the SM, leading to high prediction errors. Here, we propose a modification of a previous state-of-the-art slowdown model to reduce median prediction error from 27.92% to 9.50%. Moreover, this new slowdown model is used to implement a scheduling policy that improves fairness by 1.41x on average compared to even partitioning, whereas previous models reach only 1.21x on average.
PB José Cano, Phil Trinder
YR 2022
FD 2022-08
LK https://hdl.handle.net/10630/24921
UL https://hdl.handle.net/10630/24921
LA eng
NO Universidad de Málaga. Campus de Excelencia Internacional Andalucía TechP18-FR-3130UMA20-FEDERJA-059PID2019-105396RB-I00
DS RIUMA. Repositorio Institucional de la Universidad de Málaga
RD 13 abr 2026