[1706.03762 Attention Is All You Need.pdf](file:///E:/%E6%90%9C%E7%8B%97%E9%AB%98%E9%80%9F%E4%B8%8B%E8%BD%BD2%202017-10-24/1706.03762%20Attention%20Is%20All%20You%20Need.pdf)
[1803.07055 Simple random search provides a competitive approach to reinforcement learning.pdf](file:///E:/%E6%90%9C%E7%8B%97%E9%AB%98%E9%80%9F%E4%B8%8B%E8%BD%BD2%202017-10-24/1803.07055%20Simple%20random%20search%20provides%20a%20competitive%20approach%20%20to%20reinforcement%20learning.pdf)
[Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments 2018 ICLR.pdf](file:///E:/%E6%90%9C%E7%8B%97%E9%AB%98%E9%80%9F%E4%B8%8B%E8%BD%BD2%202017-10-24/Continuous%20Adaptation%20via%20Meta-Learning%20in%20Nonstationary%20and%20Competitive%20Environments%202018%20ICLR.pdf)
[Reinforcement Learning with Deep Energy-Based Policies](file:///E:/%E6%90%9C%E7%8B%97%E9%AB%98%E9%80%9F%E4%B8%8B%E8%BD%BD2%202017-10-24/1702.08165%20[[1702.08165]%20Reinforcement%20Learning%20with%20Deep%20Energy-Based%20Policies].pdf)
[1804.03782 CoT Cooperative Training for Generative Modeling (2).pdf](file:///E:/%E6%90%9C%E7%8B%97%E9%AB%98%E9%80%9F%E4%B8%8B%E8%BD%BD2%202017-10-24/1804.03782%20CoT%20Cooperative%20Training%20for%20Generative%20Modeling%20(2).pdf)
Sequence generative adversarial nets with policy gradient的相关微信公众号文章 – 搜狗微信搜索
[1609.05473] SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
How to Train your Generative Models? And why does Adversarial Training work so well?
[SeqGAN Sequence Generative Adversarial Nets with Policy Gradient 1609.05473v5.pdf](file:///D:/[%E6%B7%B1%E5%BA%A6%E5%AD%A6%E4%B9%A0%E8%B5%84%E6%96%99][2016-2017]/SeqGAN%20Sequence%20Generative%20Adversarial%20Nets%20with%20Policy%20Gradient%201609.05473v5.pdf)
2018-04-21
©著作权归作者所有,转载或内容合作请联系作者
【社区内容提示】社区部分内容疑似由AI辅助生成,浏览时请结合常识与多方信息审慎甄别。
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。
【社区内容提示】社区部分内容疑似由AI辅助生成,浏览时请结合常识与多方信息审慎甄别。
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。