{"author_url":"https://blog.hatena.ne.jp/thr3a/","published":"2015-03-08 10:57:37","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fblog.turai.work%2Fentry%2F20150308%2F1425779857\" title=\"NLTK\u3092\u4f7f\u3063\u3066TF-IDF\u6cd5\u3092\u8a66\u3057\u3066\u307f\u308b - \u52d5\u304b\u3056\u308b\u3053\u3068\u30d0\u30b0\u306e\u5982\u3057\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","blog_title":"\u52d5\u304b\u3056\u308b\u3053\u3068\u30d0\u30b0\u306e\u5982\u3057","author_name":"thr3a","url":"https://blog.turai.work/entry/20150308/1425779857","blog_url":"https://blog.turai.work/","height":"190","provider_url":"https://hatena.blog","type":"rich","provider_name":"Hatena Blog","version":"1.0","image_url":null,"title":"Trying out the TF-IDF method with NLTK","description":"What is NLTK? NLTK is reportedly a natural language processing toolkit that runs on Python. Here I try out TF-IDF, one of the tools it provides. Installation: just run the commands listed on the official site. sudo easy_install pip sudo pip install -U numpy sudo pip install -U nltk python >>> import nltk It looks like this: #coding: utf-8 import nltk docs = [ ['\u4eca\u65e5', '\u30ab\u30ec\u30fc', '\u96e8'], ['\u4eca\u65e5', '\u30c6\u30cb\u30b9', '\u6674\u308c'], ['\u4eca\u65e5', '\u96e8']] collection = nltk.T\u2026","width":"100%","categories":[]}