• Electronics Optics & Control
  • Vol. 29, Issue 2, 53 (2022)
DAI Xiaoqing1 and ZHAO Xu2
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • show less
    DOI: 10.3969/j.issn.1671-637x.2022.02.012 Cite this Article
    DAI Xiaoqing, ZHAO Xu. An Online Q-Learning Algorithm for a Model-Free Infinite Horizon System[J]. Electronics Optics & Control, 2022, 29(2): 53 Copy Citation Text show less

    Abstract

    For the online implementation of infinite horizon optimal control for continuous linear systems, an online Q-learning algorithm is designed under the condition that the system dynamics are completely unknown.Based on the Hamiltonian function and the optimal cost function in the infinite horizon optimal control theory, the Q function of the continuous linear system is constructed.An Actor/Critic approximator structure is designed by using the integral reinforcement learning method.With asymptotic stability of the closed-loop system and convergence to the optimal solution, the parameters of the Q function are estimated online.The 6th-order linear system model of the turbocharged engine is numerically simulated, and the results show that both the Critic weight and the Actor weight asymptotically converge to the optimal value, and the model-free optimal control is realized.
    DAI Xiaoqing, ZHAO Xu. An Online Q-Learning Algorithm for a Model-Free Infinite Horizon System[J]. Electronics Optics & Control, 2022, 29(2): 53
    Download Citation