An Online Q-Learning Algorithm for a Model-Free Infinite Horizon System

DAI Xiaoqing; ZHAO Xu

doi:10.3969/j.issn.1671-637x.2022.02.012

Journals >Electronics Optics & Control >Volume 29 >Issue 2 >Page 53 > Article

Electronics Optics & Control
Vol. 29, Issue 2, 53 (2022)

An Online Q-Learning Algorithm for a Model-Free Infinite Horizon System

DAI Xiaoqing¹ and ZHAO Xu²

Author Affiliations

¹[in Chinese]

²[in Chinese]

show less

DOI: 10.3969/j.issn.1671-637x.2022.02.012 Cite this Article

DAI Xiaoqing, ZHAO Xu. An Online Q-Learning Algorithm for a Model-Free Infinite Horizon System[J]. Electronics Optics & Control, 2022, 29(2): 53 Copy Citation Text

show less

Abstract

For the online implementation of infinite horizon optimal control for continuous linear systems, an online Q-learning algorithm is designed under the condition that the system dynamics are completely unknown.Based on the Hamiltonian function and the optimal cost function in the infinite horizon optimal control theory, the Q function of the continuous linear system is constructed.An Actor/Critic approximator structure is designed by using the integral reinforcement learning method.With asymptotic stability of the closed-loop system and convergence to the optimal solution, the parameters of the Q function are estimated online.The 6th-order linear system model of the turbocharged engine is numerically simulated, and the results show that both the Critic weight and the Actor weight asymptotically converge to the optimal value, and the model-free optimal control is realized.

Keywords

Actor/Critic approximator Hamiltonian function optimal control Q-learning

DAI Xiaoqing, ZHAO Xu. An Online Q-Learning Algorithm for a Model-Free Infinite Horizon System[J]. Electronics Optics & Control, 2022, 29(2): 53

Download Citation

Tools

Save the article for my favorites

Paper Information