{"version":"1.0","url":"https://paper.hatenadiary.jp/entry/2016/12/02/051114","author_url":"https://blog.hatena.ne.jp/misos/","blog_title":"\u3081\u3082","author_name":"misos","blog_url":"https://paper.hatenadiary.jp/","provider_name":"Hatena Blog","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fpaper.hatenadiary.jp%2Fentry%2F2016%2F12%2F02%2F051114\" title=\"\u5f37\u5316\u5b66\u7fd2\u306e\u8cc7\u6599\u30e1\u30e2\uff13\uff1a\u30de\u30eb\u30b3\u30d5\u6c7a\u5b9a\u904e\u7a0b - \u3081\u3082\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","description":"Agent\u2013Environment Interface \u5f37\u5316\u5b66\u7fd2\u306b\u304a\u3051\u308bagent-environment\u306e\u76f8\u4e92\u4f5c\u7528 Markov Decision Process \u5b9a\u7fa9 \u8b1b\u7fa9\u52d5\u753b Markov Decision Processes I Markov Decision Process II RL Course by David Silver(Deepmind) \u6709\u9650\u30de\u30eb\u30b3\u30d5\u6c7a\u5b9a\u904e\u7a0b\uff08Finite Markov Decision Processes\uff09\u5468\u8fba\u306b\u95a2\u3057\u3066\u3002 \u3044\u308d\u3044\u308d\u30e1\u30e2\u3057\u3088\u3046\u3068\u601d\u3063\u305f\u3051\u3069\u3001\u56f3\u304c\u591a\u304f\u3066\u9762\u5012\u304f\u3055\u304f\u306a\u3063\u305f\u306e\u3067\u8b1b\u7fa9\u52d5\u753b\u3060\u3051\u30e1\u30e2\u3002 Sutton\u6c0f\u306e\u672c\u3067\u306f\u3053\u306e\u7ae0\u3067\u521d\u3081\u3066\u3053\u308c\u4ee5\u964d\u306e\u30da\u30fc\u2026","width":"100%","type":"rich","title":"\u5f37\u5316\u5b66\u7fd2\u306e\u8cc7\u6599\u30e1\u30e2\uff13\uff1a\u30de\u30eb\u30b3\u30d5\u6c7a\u5b9a\u904e\u7a0b","height":"190","image_url":"https://cdn-ak.f.st-hatena.com/images/fotolife/m/misos/20161202/20161202045441.png","published":"2016-12-02 05:11:14","provider_url":"https://hatena.blog","categories":["\u8ad6\u6587\u30fb\u8cc7\u6599\u30fb\u30b9\u30e9\u30a4\u30c9\u96c6","\u6a5f\u68b0\u5b66\u7fd2","\u5f37\u5316\u5b66\u7fd2"]}