Mostrar el registro sencillo del ítem

dc.contributor.authorRuiz-Montiel, Manuela
dc.contributor.authorMandow-Andaluz, Lorenzo 
dc.contributor.authorPérez-de-la-Cruz-Molina, José Luis 
dc.date.accessioned2019-10-17T11:54:09Z
dc.date.available2019-10-17T11:54:09Z
dc.date.created2017
dc.date.issued2019-10-17
dc.identifier.urihttps://hdl.handle.net/10630/18596
dc.description.abstractThis work describes MPQ-learning, an temporal-difference method that approximates the set of all non-dominated policies in multi-objective Markov decision problems, where rewards are vectors and each component stands for an objective to maximize. Unlike other approximations to Multi-objective Reinforcement Learning, MPQ-learning does not require additional parameters or preference information, and can be applied to non-convex Pareto frontiers. We also present the results of the application of MPQ-learning to some benchmark problems and compare it to a linearization procedure.en_US
dc.description.sponsorshipThis work is partially funded by grants TIN2009-14179 (Spanish Government, Plan Nacional de I+D+i) and TIN2016-80774-R (AEI/FEDER, UE) (Spanish Government, Agencia Estatal de Investigación; and European Union, Fondo Europeo de Desarrollo Regional). Manuela Ruiz-Montiel is funded by the Spanish Ministry of Education through the National F.P.U. Program.en_US
dc.language.isoengen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectAprendizaje automático (Inteligencia artificial)en_US
dc.subject.otherReinforcement learningen_US
dc.subject.otherMulti-objective optimizationen_US
dc.subject.otherMOMDPsen_US
dc.subject.otherQ-learningen_US
dc.titleA temporal difference method for multi-objective reinforcement learningen_US
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.centroE.T.S.I. Informáticaen_US
dc.identifier.doihttps://doi.org/10.1016/j.neucom.2016.10.100
dc.type.hasVersioninfo:eu-repo/semantics/submittedVersiones_ES


Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem