<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-05-27T21:55:27Z</responseDate><request verb="GetRecord" identifier="oai:riuma.uma.es:10630/7956" metadataPrefix="mods">https://riuma.uma.es/rest/oai/request</request><GetRecord><record><header><identifier>oai:riuma.uma.es:10630/7956</identifier><datestamp>2026-02-03T12:12:36Z</datestamp><setSpec>com_10630_2254</setSpec><setSpec>col_10630_37959</setSpec></header><metadata><mods:mods xmlns:doc="http://www.lyncode.com/xoai" xmlns:mods="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
   <mods:name>
      <mods:namePart>Vilches Reina, Antonio</mods:namePart>
   </mods:name>
   <mods:name>
      <mods:namePart>Asenjo-Plaza, Rafael</mods:namePart>
   </mods:name>
   <mods:name>
      <mods:namePart>Corbera-Peña, Francisco Javier</mods:namePart>
   </mods:name>
   <mods:name>
      <mods:namePart>González-Navarro, María Ángeles</mods:namePart>
   </mods:name>
   <mods:extension>
      <mods:dateAvailable encoding="iso8601">2014-07-30T10:55:31Z</mods:dateAvailable>
   </mods:extension>
   <mods:extension>
      <mods:dateAccessioned encoding="iso8601">2014-07-30T10:55:31Z</mods:dateAccessioned>
   </mods:extension>
   <mods:originInfo>
      <mods:dateIssued encoding="iso8601">2014-07-30</mods:dateIssued>
   </mods:originInfo>
   <mods:identifier type="uri">http://hdl.handle.net/10630/7956</mods:identifier>
   <mods:abstract>This paper explores the possibility of efficiently using multicores&#xd;
in conjunction with multiple GPU accelerators under a parallel task&#xd;
programming paradigm. In particular, we address the challenge of&#xd;
extending a parallel_for template to allow its&#xd;
exploitation on heterogeneous systems. The extension is based on a&#xd;
two-stages pipeline engine which is responsible for partitioning and&#xd;
scheduling the chunks into the computational resources. Under this&#xd;
engine, we propose a dynamic scheduling strategy coupled with an&#xd;
adaptive partitioning heuristic that resizes chunks to prevent&#xd;
underutilization and load unbalance of CPUs and GPUs. In this paper&#xd;
we introduce the adaptive&#xd;
partitioning heuristic which is derived from an analytical model that&#xd;
minimizes the load unbalance while maximizes the throughput in the&#xd;
system. Using two benchmarks we evaluate the&#xd;
overhead introduced by our template extensions finding that it is&#xd;
negligible. We also evaluate the efficiency of our adaptive&#xd;
partitioning strategies and compared them with related work.</mods:abstract>
   <mods:language>
      <mods:languageTerm>eng</mods:languageTerm>
   </mods:language>
   <mods:accessCondition type="useAndReproduction">open access</mods:accessCondition>
   <mods:subject>
      <mods:topic>Computación heterogénea</mods:topic>
   </mods:subject>
   <mods:subject>
      <mods:topic>Procesos en paralelo (Informática)</mods:topic>
   </mods:subject>
   <mods:titleInfo>
      <mods:title>Adaptive Partition Strategies for Loop Parallelism in Heterogeneous Architectures</mods:title>
   </mods:titleInfo>
   <mods:genre>conference output</mods:genre>
</mods:mods>
</metadata></record></GetRecord></OAI-PMH>