H7-Unit_4-Policy_Gradient_with_PyTorch-I8-Additional_Readings
中英文对照学习,效果更佳!
原课程链接:https://huggingface.co/deep-rl-course/unit8/introduction-sf?fw=pt
Additional Readings
附加读数
These are optional readings if you want to go deeper.
如果你想更深入,这些都是可选的读物。
Introduction to Policy Optimization
策略优化简介
Policy Gradient
第3部分:策略优化简介-快速编写文档策略梯度
- https://johnwlambert.github.io/policy-gradients/
- RL - Policy Gradient Explained
- Chapter 13, Policy Gradient Methods; Reinforcement Learning, an introduction by Richard Sutton and Andrew G. Barto