<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-06-03T01:31:21Z</responseDate><request verb="GetRecord" identifier="oai:riuma.uma.es:10630/16279" metadataPrefix="mods">https://riuma.uma.es/rest/oai/request</request><GetRecord><record><header><identifier>oai:riuma.uma.es:10630/16279</identifier><datestamp>2026-02-03T12:25:29Z</datestamp><setSpec>com_10630_2254</setSpec><setSpec>col_10630_37959</setSpec></header><metadata><mods:mods xmlns:doc="http://www.lyncode.com/xoai" xmlns:mods="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
   <mods:name>
      <mods:namePart>Lázaro Muñoz, Antonio José</mods:namePart>
   </mods:name>
   <mods:name>
      <mods:namePart>López Albelda, Bernabé</mods:namePart>
   </mods:name>
   <mods:name>
      <mods:namePart>González-Linares, José María</mods:namePart>
   </mods:name>
   <mods:name>
      <mods:namePart>Guil-Mata, Nicolás</mods:namePart>
   </mods:name>
   <mods:extension>
      <mods:dateAvailable encoding="iso8601">2018-07-16T11:34:20Z</mods:dateAvailable>
   </mods:extension>
   <mods:extension>
      <mods:dateAccessioned encoding="iso8601">2018-07-16T11:34:20Z</mods:dateAccessioned>
   </mods:extension>
   <mods:originInfo>
      <mods:dateIssued encoding="iso8601">2018-07-16</mods:dateIssued>
   </mods:originInfo>
   <mods:identifier type="uri">https://hdl.handle.net/10630/16279</mods:identifier>
   <mods:abstract>Concurrent  execution  of  tasks  in  GPUs can  reduce  the  computation  time  of  a  workload  by&#xd;
overlapping  data  transfer  and  execution  commands.&#xd;
However it is diﬃcult to implement an eﬃcient run-&#xd;
time scheduler that minimizes the workload makespan&#xd;
as many execution orderings should be evaluated.  In&#xd;
this  paper,  we  employ  scheduling  theory  to  build  a&#xd;
model  that  takes  into  account  the  device  capabili-&#xd;
ties, workload characteristics, constraints and objec-&#xd;
tive  functions.    In  our  model,  GPU  tasks  schedul-&#xd;
ing  is  reformulated  as  a  ﬂow  shop  scheduling  prob-&#xd;
lem, which allow us to apply and compare well known&#xd;
methods already developed in the operations research&#xd;
ﬁeld.  In addition we develop a new heuristic, specif-&#xd;
ically  focused  on  executing  GPU  commands,   that&#xd;
achieves better scheduling results than previous tech-&#xd;
niques.  Finally, a comprehensive evaluation, showing&#xd;
the  suitability  and  robustness  of  this  new  approach,&#xd;
is conducted in three diﬀerent NVIDIA architectures&#xd;
(Kepler, Maxwell and Pascal).</mods:abstract>
   <mods:language>
      <mods:languageTerm>eng</mods:languageTerm>
   </mods:language>
   <mods:accessCondition type="useAndReproduction">http://creativecommons.org/licenses/by-nc-nd/4.0/</mods:accessCondition>
   <mods:accessCondition type="useAndReproduction">open access</mods:accessCondition>
   <mods:accessCondition type="useAndReproduction">Attribution-NonCommercial-NoDerivatives 4.0 Internacional</mods:accessCondition>
   <mods:subject>
      <mods:topic>Arquitectura de redes informáticas - Congresos</mods:topic>
   </mods:subject>
   <mods:titleInfo>
      <mods:title>A scheduling theory framework for GPU tasks eﬃcient execution</mods:title>
   </mods:titleInfo>
   <mods:genre>conference output</mods:genre>
</mods:mods>
</metadata></record></GetRecord></OAI-PMH>