Jian-Dong Yao, Wen-Bin Hao, Zhi-Gao Meng, Bo Xie, Jian-Hua Chen, Jia-Qi Wei. Adaptive multi-agent reinforcement learning for dynamic pricing and distributed energy management in virtual power plant networks[J]. Journal of Electronic Science and Technology, 2025, 23(1): 100290

Search by keywords or author
- Journal of Electronic Science and Technology
- Vol. 23, Issue 1, 100290 (2025)

Fig. 1. Interaction dynamics between the DSO and VPPs within the MAMDP framework.

Fig. 2. Average cumulative reward for all agents across training episodes: MARL learning curve (above) and zoomed-in view of final 5000 episodes (below).

Fig. 3. DSO’s pricing and net demand over a week.

Fig. 4. Temporal dynamics of key state variables in the VPP network over a representative week.

Fig. 5. Computational time and solution quality as the number of VPPs increases from 10 to 200.

Fig. 6. System’s performance over a 30-day period following a permanent 15% reduction in average renewable generation capacity.
|
Table 1. MARL and system parameters.
|
Table 2. Economic efficiency comparison.
|
Table 3. Computational performance comparison.
|
Table 4. Adaptability score (0–100).
|
Table 5. Sensitivity analysis results (percentage change in system performance).
|
Table 6. VPP resource utilization (percentage of capacity).
|
Table 7. System performance under unexpected events (percentage deviation from normal operations).
|
Table 8. System response to renewable generation forecast errors.

Set citation alerts for the article
Please enter your email address