Abstract: This study proposes a low-level radio frequency (LLRF) feedback control algorithm based on reinforcement learning (RL) using the soft actor–critic (SAC) and proximal policy optimization (PPO ...