{"provider_url":"https://hatena.blog","categories":["NLP (natural language processing)"],"author_url":"https://blog.hatena.ne.jp/nikkie-ftnext/","height":"190","blog_title":"nikkie-ftnext\u306e\u65e5\u8a18","title":"Rewriting the RoBERTa pretraining code from \u300eTransformer\u306b\u3088\u308b\u81ea\u7136\u8a00\u8a9e\u51e6\u7406\u300f to load its data with huggingface/datasets","width":"100%","blog_url":"https://nikkie-ftnext.hatenablog.com/","url":"https://nikkie-ftnext.hatenablog.com/entry/replace-linebylinetextdataset-datasets-library","image_url":"https://m.media-amazon.com/images/I/41SmuREMw8L._SL500_.jpg","version":"1.0","author_name":"nikkie-ftnext","published":"2022-05-06 15:33:37","provider_name":"Hatena Blog","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fnikkie-ftnext.hatenablog.com%2Fentry%2Freplace-linebylinetextdataset-datasets-library\" title=\"\u300eTransformer\u306b\u3088\u308b\u81ea\u7136\u8a00\u8a9e\u51e6\u7406\u300f\u306eRoBERTa\u4e8b\u524d\u8a13\u7df4\u306e\u30b3\u30fc\u30c9\u3092\u3001\u30c7\u30fc\u30bf\u3092huggingface/datasets\u3067\u8aad\u307f\u8fbc\u3080\u3088\u3046\u306b\u66f8\u304d\u76f4\u3059 - nikkie-ftnext\u306e\u65e5\u8a18\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","description":"Introduction: Working on my practice swings again today! This is nikkie! The other day I wrote a post about typing out, by hand, the RoBERTa pretraining code from \u300eTransformer\u306b\u3088\u308b\u81ea\u7136\u8a00\u8a9e\u51e6\u7406\u300f: 
While \"typing it out and thinking as I went\", several points came up that I want to dig into further. This time I focus on loading the data. Table of contents: Introduction; Table of contents; The leftover item resolved this time; Reference example: language-modeling/run_mlm.py in examples; Environment; Rewriting with the datasets library; Explanation of the rewrite; Chapter 3, rewritten; Verifying the rewrite; Conclusion. The leftover item resolved this time: for the dataset, the \ud83e\udd17 way is to use datasets\u2026","type":"rich"}