Reinforcement Learning-based Optimizing Dynamic Pricing algorithm in smart grid

CAO Jun; SUN Yingying; ZHAO Hang

doi:10.11805/tkyda2020178

Journals >Journal of Terahertz Science and Electronic Information Technology >Volume 21 >Issue 1 >Page 112 > Article

Journal of Terahertz Science and Electronic Information Technology
Vol. 21, Issue 1, 112 (2023)

Reinforcement Learning-based Optimizing Dynamic Pricing algorithm in smart grid

CAO Jun^*, SUN Yingying, and ZHAO Hang

Author Affiliations

[in Chinese]

show less

DOI: 10.11805/tkyda2020178 Cite this Article

CAO Jun, SUN Yingying, ZHAO Hang. Reinforcement Learning-based Optimizing Dynamic Pricing algorithm in smart grid[J]. Journal of Terahertz Science and Electronic Information Technology , 2023, 21(1): 112 Copy Citation Text

show less

Abstract

Dynamic pricing is one of the most effective ways to encourage customers to change their consumption pattern. Therefore, Reinforcement Learning-based Optimizing Dynamic Pricing(RLODP) algorithm is proposed for energy management in a hierarchical electricity market by considering both service provider's profit and customers' costs. Using Reinforcement Learning, the SP can adaptively determine the retail electricity price. Dynamic pricing problem is formulated as a discrete finite Markov Decision Process(MDP), and Q-learning is adopted to solve this decision-making problem. Simulation results show that the RLODP algorithm can reduce energy costs for customers, balance the energy supply and the demands in the electricity market.

Keywords

demand response discrete finite Markov Decision Process electricity price Reinforcement Learning smart grid

Download Citation

Save the article for my favorites

Paper Information