{"html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fen.bioerrorlog.work%2Fentry%2Fattention-is-all-you-need-paper\" title=\"Reading the Transformer Paper: Attention Is All You Need - BioErrorLog Tech Blog\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","provider_name":"Hatena Blog","description":"This is a summary of the seminal paper \"Attention Is All You Need,\" which introduced the Transformer architecture. Introduction Attention Is All You Need Overview Method Model Architecture Training Method Results Translation Tasks Transformer Model Variations English Constituency Parsing Conclusion/\u2026","image_url":"https://cdn-ak.f.st-hatena.com/images/fotolife/B/BioErrorLog/20240417/20240417112508.png","author_url":"https://blog.hatena.ne.jp/BioErrorLog/","blog_url":"https://en.bioerrorlog.work/","blog_title":"BioErrorLog Tech Blog","type":"rich","url":"https://en.bioerrorlog.work/entry/attention-is-all-you-need-paper","height":"190","title":"Reading the Transformer Paper: Attention Is All You Need","version":"1.0","author_name":"BioErrorLog","categories":["AI","LLM","Papers"],"width":"100%","provider_url":"https://hatena.blog","published":"2025-12-23 22:43:12"}