M12-Unit_8-Part_2_Proximal_Policy_Optimization_(PPO)_with_Doom-C2-Conclusion

Original course link: https://huggingface.co/deep-rl-course/unit2/glossary?fw=pt

Conclusion

That’s all for today. Congrats on finishing this Unit and the tutorial! ⭐️

Now that you’ve successfully trained your Doom agent, why not try deathmatch? Keep in mind that it’s a much more complex level than the one you’ve just trained on, but it’s a good experiment and I recommend trying it.

If you do, don’t hesitate to share your model in the #rl-i-made-this channel on our Discord server.

This concludes the last unit, but we are not finished yet! 🤗 The following bonus unit covers some of the most interesting, advanced, and cutting-edge work in Deep Reinforcement Learning.

See you next time 🔥,

Keep Learning, Stay awesome 🤗
