Novel Distributional Reinforcement and Ensemble Learning Algorithms.

Aziz, Vanya

Novel Distributional Reinforcement and Ensemble Learning Algorithms.

dc.centro	Escuela de Ingenierías Industriales	es_ES
dc.contributor.advisor	Hendrix, Eligius María Theodorus
dc.contributor.advisor	Nowak, Ivo
dc.contributor.author	Aziz, Vanya
dc.date.accessioned	2025-07-10T11:15:01Z
dc.date.available	2025-07-10T11:15:01Z
dc.date.created	2025
dc.date.issued	2025
dc.date.submitted	2025-06-11
dc.departamento	Ingeniería Mecánica, Térmica y de Fluidos	es_ES
dc.description.abstract	This dissertation focuses on Deep Reinforcement Learning (DRL), a neural network-based approach for solving Markov Decision Processes in high-dimensional spaces with unknown transition dynamics. The main contribution of this thesis is the development of a novel state-of-the-art distributional reinforcement learning algorithm within the maximum-entropy Actor-Critic framework. This algorithm, termed ”Cram´er-based Soft Distributional Soft Actor-critic” (C-DSAC), demonstrates superior performance to other RL algorithms, especially in environments with high-dimensional spaces and complex dynamics. Its performance is shown to be partly rooted in a phenomenon arising in Cram´er-metric-based Distributional Reinforcement Learning, referred to as confidence-driven model updates. This mechanism ensures that the value function approximator is updated more conservatively when confidence in its estimates is low. Theoretical justifications for the algorithm are provided, demonstrating its convergence in the policy evaluation setting and, under widely accepted mild assumptions, in the control setting as well. Beyond foundational algorithmic research, this thesis contributes to the practical application of RL in robotics. Given the crucial role of multi-joint robotic systems in modern production technology, a RL meta-algorithm called ”Reinforcement Learning - Inverse Kinematics” (RL-IK) is devised. This approach enhances the applicability of reinforcement learning to robotic control tasks by significantly accelerating convergence to near-optimal policies compared to standard RL methods. An essential prerequisite for real-world RL applications in control systems is machine perception for state identification. To address challenges in this field, this thesis explores novel Supervised Learning (SL) approaches, validated on image classification tasks, with a focus on ensemble learning strategies.	es_ES
dc.identifier.uri	https://hdl.handle.net/10630/39287
dc.language.iso	eng	es_ES
dc.publisher	UMA Editorial	es_ES
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 Internacional	*
dc.rights.accessRights	open access	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Robótica - Tesis doctorales	es_ES
dc.subject	Programación lineal	es_ES
dc.subject	Aprendizaje automático (Inteligencia artificial)	es_ES
dc.subject	Redes neuronales (Informática)	es_ES
dc.subject.other	Distributional Reinforcement Learning	es_ES
dc.subject.other	Soft Actor-Critic	es_ES
dc.subject.other	Robotics	es_ES
dc.subject.other	Linear Programming	es_ES
dc.subject.other	Ensemble	es_ES
dc.title	Novel Distributional Reinforcement and Ensemble Learning Algorithms.	es_ES
dc.type	doctoral thesis	es_ES
dspace.entity.type	Publication
relation.isAdvisorOfPublication	0c3992b1-f2f1-4f53-a186-1dbf6d6cef5a
relation.isAdvisorOfPublication.latestForDiscovery	0c3992b1-f2f1-4f53-a186-1dbf6d6cef5a

Files

Original bundle

Now showing 1 - 1 of 1

Name:: TD_AZIZ_Vanya.pdf
Size:: 3.47 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Tesis doctorales