{"published":"2018-07-08 16:00:59","blog_url":"https://paper.hatenadiary.jp/","version":"1.0","url":"https://paper.hatenadiary.jp/entry/2018/07/08/160059","type":"rich","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fpaper.hatenadiary.jp%2Fentry%2F2018%2F07%2F08%2F160059\" title=\"\u5f37\u5316\u5b66\u7fd2\u306e\u52c9\u5f37\u3092\u59cb\u3081\u308b\u3068\u304d\u5f79\u306b\u7acb\u3063\u305f\u8cc7\u6599\u306a\u3069 - \u3081\u3082\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","author_url":"https://blog.hatena.ne.jp/misos/","categories":["\u5f37\u5316\u5b66\u7fd2"],"image_url":null,"width":"100%","blog_title":"\u3081\u3082","description":"\u8b1b\u7fa9\u8cc7\u6599 CS 294: Deep Reinforcement Learning, Fall 2018 @ UC Berkeley CS234: Reinforcement Learning @ Stanford University MS&E338 Reinforcement Learning @ Stanford University \u5b9f\u88c5 Gym RL-Adventure RL-Adventure-2: Policy Gradients \u30bf\u30a4\u30c8\u30eb\u901a\u308a\u3067\u3059\u3002 sutton\u306e\u672c\u306e\u306f\u3058\u3081\u306e\u3053\u308d\u3092\u8aad\u307f\u59cb\u3081\u305f\u6642\u306b\u53c2\u8003\u306b\u3057\u305f\u30e1\u30e2\u3067\u3059\u3002 \u3060\u3044\u3076\u524d\u306e\u4e0b\u66f8\u304d\u306a\u306e\u3067\u4eca(2018/7/7\u73fe\u5728)\u306f\u3082\u3063\u3068\u3044\u3044\u3082\u306e\u2026","provider_name":"Hatena Blog","title":"\u5f37\u5316\u5b66\u7fd2\u306e\u52c9\u5f37\u3092\u59cb\u3081\u308b\u3068\u304d\u5f79\u306b\u7acb\u3063\u305f\u8cc7\u6599\u306a\u3069","height":"190","author_name":"misos","provider_url":"https://hatena.blog"}