{"blog_title":"V\u30dd\u30a4\u30f3\u30c8\u30de\u30fc\u30b1\u30c6\u30a3\u30f3\u30b0\uff5cTECH LAB\u306e Tech Blog","author_url":"https://blog.hatena.ne.jp/miu4930/","provider_url":"https://hatena.blog","version":"1.0","blog_url":"https://techblog.vpoint.co.jp/","image_url":"https://cdn-ak.f.st-hatena.com/images/fotolife/m/miu4930/20250304/20250304110847.jpg","provider_name":"Hatena Blog","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Ftechblog.vpoint.co.jp%2Fentry%2F2025%2F03%2F04%2F143548\" title=\"2\u3064\u306e\u753b\u50cf\u3092\u878d\u5408\u3059\u308b&quot;Image Fusion via Vision-Language Model&quot;\u3068\u3044\u3046\u8ad6\u6587\u3092\u8aad\u3093\u3060\u306e\u3067\u5185\u5bb9\u3092\u307e\u3068\u3081\u3066\u307f\u307e\u3057\u305f\u3002 - V\u30dd\u30a4\u30f3\u30c8\u30de\u30fc\u30b1\u30c6\u30a3\u30f3\u30b0\uff5cTECH LAB\u306e Tech Blog\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","height":"190","author_name":"miu4930","url":"https://techblog.vpoint.co.jp/entry/2025/03/04/143548","published":"2025-03-04 14:35:48","width":"100%","categories":["AI","\u751f\u6210AI","\u753b\u50cf\u89e3\u6790"],"description":"\u306f\u3058\u3081\u306b \u8a72\u5f53\u3059\u308b\u30bf\u30b9\u30af FILM\u3068\u306f Text Feature Fusion Text-Guided Vision Feature Fusion Vision Feature Decoding Fine-Tuning\u306f\u3069\u3046\u3059\u308b\u306e\u304b\uff1f \u751f\u6210\u3055\u308c\u308b\u878d\u5408\u753b\u50cf \u8d64\u5916\u7dda-\u53ef\u8996\u5149\u753b\u50cf\u878d\u5408 \u30de\u30eb\u30c1\u9732\u5149\u753b\u50cf\u878d\u5408 \u307e\u3068\u3081 \u306f\u3058\u3081\u306b \u3053\u3093\u306b\u3061\u306f\u3001CCCMK\u30db\u30fc\u30eb\u30c7\u30a3\u30f3\u30b0\u30b9\u4e09\u6d66\u3067\u3059\u3002 \u6628\u5e74\u672b\u306b\u53c2\u52a0\u3057\u305fAI\u30fb\u6a5f\u68b0\u5b66\u7fd2\u306e\u5b66\u4f1a\"NeurIPS 2024\"\u30672\u3064\u306e\u753b\u50cf\u3092\u5408\u6210\u3057\u30012\u3064\u306e\u753b\u50cf\u306e\u7279\u5fb4\u3092\u6301\u3063\u305f1\u3064\u306e\u753b\u50cf\u3092\u751f\u6210\u3059\u308b\u3001\u3068\u3044\u3046\u6280\u8853\u306e\u5b58\u5728\u3092\u77e5\u308a\u3001\u753b\u50cf\u5408\u6210\u306b\u3064\u3044\u3066\u8abf\u3079\u3066\u307f\u305f\u3044\u306a\u3001\u3068\u8003\u3048\u3066\u3044\u307e\u3057\u305f\u3002 \u8abf\u3079\u3066\u307f\u308b\u3068\u30012\u3064\u306e\u753b\u50cf\u2026","type":"rich","title":"2\u3064\u306e\u753b\u50cf\u3092\u878d\u5408\u3059\u308b\"Image Fusion via Vision-Language Model\"\u3068\u3044\u3046\u8ad6\u6587\u3092\u8aad\u3093\u3060\u306e\u3067\u5185\u5bb9\u3092\u307e\u3068\u3081\u3066\u307f\u307e\u3057\u305f\u3002"}