{"published":"2021-02-15 01:11:36","description":"\u76ee\u6b21 \u6982\u8981 \u63d0\u6848\u624b\u6cd5\u306e\u5185\u5bb9 Digit Tokenization(DT) Random Shift(RS) pre-training\u7528\u306eNumerical Data (ND)\u306e\u751f\u6210 pre-training\u7528\u306eTextual Data (TD)\u306e\u751f\u6210 \u8ffd\u52a0pre-training\u306e\u65b9\u6cd5 \u6570\u5024\u5b9f\u9a13 ND\u30fbTD\u306b\u3088\u308b\u8ffd\u52a0pre-train\u306e\u52b9\u679c Digit Tokenization\u306e\u52b9\u679c \u8a00\u8a9e\u7406\u89e3\u80fd\u529b\u3092\u5931\u3063\u3066\u3044\u306a\u3044\u304b\u306e\u78ba\u8a8d GENBERT\u306e\u91cd\u307f\u306f\u3001GENBERT\u4ee5\u5916\u306e\u30a2\u30fc\u30ad\u30c6\u30af\u30c1\u30e3\u3067\u3082\u5229\u7528\u3067\u304d\u308b\u304b\uff1f \u88dc\u8db3 \u30ea\u30f3\u30af \u8ad6\u6587\uff1a[2004.04487] Injecting Numerical Reason\u2026","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fwwacky.hateblo.jp%2Fentry%2Finjecting-numerical-reasoning-skills-into-language-models\" title=\"Injecting Numerical Reasoning Skills into Language Models \u3092\u8aad\u3093\u3060 - wwacky\u306e\u5099\u5fd8\u9332\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","version":"1.0","author_url":"https://blog.hatena.ne.jp/wwacky/","width":"100%","url":"https://wwacky.hateblo.jp/entry/injecting-numerical-reasoning-skills-into-language-models","provider_url":"https://hatena.blog","author_name":"wwacky","height":"190","blog_url":"https://wwacky.hateblo.jp/","blog_title":"wwacky\u306e\u5099\u5fd8\u9332","image_url":"https://cdn-ak.f.st-hatena.com/images/fotolife/w/wwacky/20210214/20210214220713.png","title":"Injecting Numerical Reasoning Skills into Language Models \u3092\u8aad\u3093\u3060","provider_name":"Hatena Blog","type":"rich","categories":["\u8ad6\u6587\u8aad\u307f"]}