Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Published in Arxiv, 2020
Recommended citation: Jiajun Fan, He Ba, Xian Guo, Jianye Hao, "Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning." Arxiv, 2020. https://arxiv.org/abs/2011.06752