M12-Unit_8-Part_2_Proximal_Policy_Optimization_(PPO)_with_Doom-C2-Conclusion
Original course link: https://huggingface.co/deep-rl-course/unit2/glossary?fw=pt
Conclusion
That’s all for today. Congrats on finishing this Unit and the tutorial! ⭐️
Now that you’ve successfully trained your Doom agent, why not try deathmatch? Keep in mind that it’s a much more complex level than the one you’ve just trained on, but it’s a worthwhile experiment and I encourage you to try it.
If you do, don’t hesitate to share your model in the #rl-i-made-this channel on our Discord server.
This concludes the last unit, but we are not finished yet! 🤗 The following bonus unit covers some of the most interesting, advanced, and cutting-edge work in Deep Reinforcement Learning.
See you next time 🔥,