Abstract: In recent years, reinforcement learning has made significant progress in a wide range of domains, including autonomous driving, behavior analysis in electricity markets, and robotic control.