Reinforcement Learning for Free Electron Laser Online Optimization

Jiacheng Wu; Meng Cai; Yujie Lu; Nanshun Huang; Chao Feng; Zhentang Zhao

doi:10.3788/AOS230893


Acta Optica Sinica
Vol. 43, Issue 21, 2114002 (2023)


Jiacheng Wu¹,², Meng Cai³, Yujie Lu¹,³, Nanshun Huang⁴,*, Chao Feng²,³, and Zhentang Zhao¹,²,³

Author Affiliations

¹School of Physical Science and Technology, ShanghaiTech University, Shanghai 201210, China

²Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai 201210, China

³Shanghai Institute of Applied Physics, Chinese Academy of Sciences, Shanghai 201800, China

⁴Zhangjiang Laboratory, Shanghai 201210, China



Jiacheng Wu, Meng Cai, Yujie Lu, Nanshun Huang, Chao Feng, Zhentang Zhao. Reinforcement Learning for Free Electron Laser Online Optimization[J]. Acta Optica Sinica, 2023, 43(21): 2114002


Fig. 1. Layout of the undulator system


Fig. 2. Comparison of power curves among SAC, TD3, and DDPG algorithms for FEL tuning tasks


Fig. 3. Comparison of gain curves and optimized orbits among SAC, TD3, and DDPG algorithms in the FEL optimization. (a) Gain curves; optimized orbits along (b) x and (c) y directions


Fig. 4. Comparison between the optimized spots of DDPG, TD3, and SAC algorithms and the initial spot. (a) Initial spot; (b) DDPG optimized spot; (c) TD3 optimized spot; (d) SAC optimized spot


Parameter	Value
Beam average energy /GeV	1.5
Peak current /A	800
Energy spread /%	0.014
FEL wavelength /nm	3.72
Average beam radius /μm	50
Undulator length /m	3
Period length /cm	2.35

Table 1. Main parameters of the simulation
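As a rough consistency check on Table 1, the beam energy, FEL wavelength, and undulator period can be related through the standard FEL resonance condition, λ = λ_u (1 + K²/2) / (2γ²). The sketch below solves this for the undulator parameter K. It assumes a planar undulator and treats the listed beam energy as the total electron energy; neither assumption is stated on this page, and the function name `undulator_k` is hypothetical.

```python
import math

# Electron rest energy in MeV (CODATA value, rounded).
ELECTRON_REST_ENERGY_MEV = 0.51099895


def undulator_k(beam_energy_gev: float, wavelength_nm: float, period_cm: float) -> float:
    """Solve the planar-undulator resonance condition for K.

    Resonance condition (assumed planar undulator):
        wavelength = period / (2 * gamma**2) * (1 + K**2 / 2)
    """
    # Lorentz factor, treating the beam energy as total energy (assumption).
    gamma = beam_energy_gev * 1e3 / ELECTRON_REST_ENERGY_MEV
    # Both lengths converted to metres before taking the ratio.
    ratio = 2.0 * gamma**2 * (wavelength_nm * 1e-9) / (period_cm * 1e-2)
    if ratio <= 1.0:
        raise ValueError("No real K satisfies the resonance condition for these inputs")
    return math.sqrt(2.0 * (ratio - 1.0))


# Values taken from Table 1.
k = undulator_k(beam_energy_gev=1.5, wavelength_nm=3.72, period_cm=2.35)
print(f"Implied undulator parameter K = {k:.2f}")
```

Under these assumptions the table values imply K of roughly 1.86, a plausible figure for a soft X-ray undulator line.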

Parameter	Value
Actor learning rate	0.0003
Critic learning rate	0.0003
Neural network size	256×512
Batch size	64
Optimizer	Adam
Discount factor	0.99
Alpha learning rate	0.0003
Policy noise	0.2

Table 2. Network parameter settings of SAC, DDPG, and TD3 algorithms
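The settings in Table 2 could be collected into a per-algorithm configuration along the following lines. This is a minimal sketch, not the authors' code: the table does not say which rows apply to which algorithm, so assigning the alpha learning rate to SAC (entropy temperature) and the policy noise to TD3 (target policy smoothing) is an assumption based on how those algorithms are usually defined. Reading "256×512" as two hidden layers of 256 and 512 units is also an assumption, and all names (`SharedHyperparams`, `build_config`) are hypothetical.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class SharedHyperparams:
    """Rows of Table 2 assumed to be common to SAC, TD3, and DDPG."""
    actor_lr: float = 3e-4
    critic_lr: float = 3e-4
    hidden_sizes: tuple = (256, 512)  # assumption: two hidden layers
    batch_size: int = 64
    optimizer: str = "Adam"
    discount_factor: float = 0.99


# Assumed mapping of the remaining rows to individual algorithms.
ALGO_SPECIFIC = {
    "SAC": {"alpha_lr": 3e-4},      # learning rate of the entropy temperature
    "TD3": {"policy_noise": 0.2},   # noise added to target actions
    "DDPG": {},
}


def build_config(algo: str) -> dict:
    """Merge the shared settings with the algorithm-specific ones."""
    if algo not in ALGO_SPECIFIC:
        raise ValueError(f"Unknown algorithm: {algo!r}")
    config = asdict(SharedHyperparams())
    config.update(ALGO_SPECIFIC[algo])
    return config


for name in ("SAC", "TD3", "DDPG"):
    print(name, build_config(name))
```

Keeping the shared values in one place makes it easier to confirm that the three agents are compared under matching network and optimizer settings.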
