{"blog_title":"fltech - Technology Blog of Fujitsu Research","type":"rich","title":"Fujitsu's Corporate Benchmarking Proposal: To Unlock the True Value of AI Agent Models #1 When AI 'Sees' What Isn't There: Introducing a Benchmark for Diagnosing Hallucinations in Multimodal Large Language Models (MLLMs)","image_url":"https://cdn.blog.st-hatena.com/images/theme/og-image-1500.png","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fblog-en.fltech.dev%2Fentry%2F2026%2F03%2F11%2Ffujitsu-hallucination-benchmark-en\" title=\"Fujitsu&#39;s Corporate Benchmarking Proposal: To Unlock the True Value of AI Agent Models #1 When AI &#39;Sees&#39; What Isn&#39;t There: Introducing a Benchmark for Diagnosing Hallucinations in Multimodal Large Language Models (MLLMs) - fltech - Technology Blog of Fujitsu Research\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","width":"100%","author_name":"shiziqiang","provider_name":"Hatena Blog","published":"2026-03-11 01:00:00","height":"190","provider_url":"https://hatena.blog","categories":["AI"],"blog_url":"https://blog-en.fltech.dev/","author_url":"https://blog.hatena.ne.jp/shiziqiang/","version":"1.0","url":"https://blog-en.fltech.dev/entry/2026/03/11/fujitsu-hallucination-benchmark-en","description":"This article is the first in a TechBlog series entitled 'Fujitsu's Corporate Benchmarking Proposal: To Unlock the True Value of AI Agent Models.' The series consists of three posts, published on the following schedule: Part 1: When AI 'Sees' What Isn't There: Introducing a Benchmark for Diagnosing Hallucinations in \u2026"}