Original course link: https://huggingface.co/deep-rl-course/unit8/introduction-sf?fw=pt

Additional Readings

These are optional readings if you want to go deeper.

Introduction to Policy Optimization

Part 3: Intro to Policy Optimization - Spinning Up documentation

Policy Gradient

https://johnwlambert.github.io/policy-gradients/

RL - Policy Gradient Explained

Chapter 13, Policy Gradient Methods; Reinforcement Learning, an introduction by Richard Sutton and Andrew G. Barto

Implementation
