Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

dc.centroEscuela de Ingenierías Industrialeses_ES
dc.contributor.authorIsrailov, Sardor
dc.contributor.authorFu, Li
dc.contributor.authorSánchez-Rodríguez, Jesús
dc.contributor.authorFusco, Franco
dc.contributor.authorAllibert, Guillaume
dc.contributor.authorRaufaste, Christophe
dc.contributor.authorArgentina, Médéric
dc.date.accessioned2025-11-07T10:04:33Z
dc.date.available2025-11-07T10:04:33Z
dc.date.issued2023-02-13
dc.departamentoFísica Aplicada IIes_ES
dc.description.abstractMachine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approaches_ES
dc.description.sponsorshipFrench National Research Agencyes_ES
dc.identifier.citationIsrailov, S., Fu, L., Sánchez-Rodríguez, J., Fusco, F., Allibert, G., Raufaste, C., & Argentina, M. (2023). Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes. Plos one, 18(2), e0280071.es_ES
dc.identifier.doi10.1371/journal.pone.0280071
dc.identifier.urihttps://hdl.handle.net/10630/40639
dc.language.isoenges_ES
dc.publisherPlos Onees_ES
dc.rightsAtribución 4.0 Internacional*
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectRedes neuronales artificialeses_ES
dc.subjectAprendizaje automáticoes_ES
dc.subjectPénduloes_ES
dc.subject.otherPendulumses_ES
dc.subject.otherLearning curveses_ES
dc.subject.otherArtificial neural networkses_ES
dc.subject.otherMachine learninges_ES
dc.subject.otherAlgorithmses_ES
dc.subject.otherGaussian noisees_ES
dc.titleReinforcement learning approach to control an inverted pendulum: A general framework for educational purposeses_ES
dc.typejournal articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
journal.pone.0280071.pdf
Size:
2.26 MB
Format:
Adobe Portable Document Format
Description:

Collections