{"url":"https://odz.hatenablog.com/entry/20080220/1203520572","description":"\u306a\u3093\u304b Wikipedia \u306e TF-IDF \u306e\u9805\u76ee\u304c\u3061\u3087\u3063\u3068\u3072\u3069\u3044\u306a\u3001\u3053\u308c\u306f\u3002 \u666e\u901a\u3001tf \u306f\u300c\u3042\u308b\u30c9\u30ad\u30e5\u30e1\u30f3\u30c8\u4e2d\u306b\u304a\u3051\u308b\u300d\u3042\u308b\u5358\u8a9e\u306e\u51fa\u73fe\u983b\u5ea6\u3068\u3044\u3046\u610f\u5473\u3067\u4f7f\u3046\u3093\u3058\u3083\u306a\u3044\u304b\u306a\u3041\u3002 \u3042\u3068\u307e\u3041\u3001\u4e00\u53e3\u306b TF-IDF \u3068\u3044\u3063\u3066\u3082\u3001idf \u304c 1 + log(N/df) \u3060\u3063\u305f\u308a\u3001tf \u306e square root \u3092\u53d6\u3063\u305f\u308a idf \u306e\u4e8c\u4e57\u3092\u53d6\u3063\u305f\u308a\u3068\u304b\u7d50\u69cb\u30d0\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u304c\u3042\u3063\u305f\u308a\u3059\u308b\u3082\u3093\u3067\u3059\u3002","author_name":"odz","type":"rich","image_url":null,"provider_name":"Hatena Blog","version":"1.0","blog_url":"https://odz.hatenablog.com/","author_url":"https://blog.hatena.ne.jp/odz/","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fodz.hatenablog.com%2Fentry%2F20080220%2F1203520572\" title=\"TF-IDF - odz buffer\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","width":"100%","title":"TF-IDF","height":"190","blog_title":"odz buffer","categories":["NLP"],"provider_url":"https://hatena.blog","published":"2008-02-20 00:16:12"}