J9-Unit_6-Actor_Critic_methods_with_Robotics_environments-F5-Additional_Readings

中英文对照学习,效果更佳!
原课程链接:https://huggingface.co/deep-rl-course/communication/certification?fw=pt

Additional Readings

附加读数

Bias-variance tradeoff in Reinforcement Learning

强化学习中的偏差-方差权衡

If you want to dive deeper into the question of variance and bias tradeoff in Deep Reinforcement Learning, you can check these two articles:

如果您想更深入地研究深度强化学习中的方差和偏差权衡问题,您可以查看这两篇文章:

Advantage Functions

理解(深度)强化学习中的偏差/方差权衡强化学习优势函数中的偏差/方差权衡

Actor Critic

Advantage Functions,SpinningUp RLActor批评家