OpenAI’s o3 and o4-mini: Hallucination Rates Surge 2–3x

In April 2025, OpenAI’s release of its latest reasoning models, o3 and o4-mini, was met with both excitement and concern. While the models promised state-of-the-art performance on reasoning, coding, and multimodal tasks, internal and third-party evaluations revealed a startling issue: hallucination rates were two to three times those of previous models. OpenAI has openly acknowledged that it cannot fully explain this regression, raising critical questions about the reliability, safety, and future direction of advanced AI reasoning systems.
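
To make the "2–3x" comparison concrete, hallucination rates in QA-style evaluations are typically reported as the fraction of answers a grader flags as containing a fabricated claim. The sketch below illustrates that calculation; the data format, field names, and toy numbers are assumptions for illustration and do not reflect OpenAI's actual evaluation pipeline.

```python
# Minimal sketch of a hallucination-rate calculation for a QA-style eval.
# The record format and grading signal are assumed for illustration only.
from dataclasses import dataclass


@dataclass
class GradedAnswer:
    question: str
    model_answer: str
    contains_fabrication: bool  # judged by a human or automated grader


def hallucination_rate(graded: list[GradedAnswer]) -> float:
    """Fraction of graded answers flagged as containing a fabricated claim."""
    if not graded:
        return 0.0
    return sum(a.contains_fabrication for a in graded) / len(graded)


# Hypothetical comparison of two models on the same question set:
older_model = [GradedAnswer("q1", "...", False), GradedAnswer("q2", "...", True)]
newer_model = [GradedAnswer("q1", "...", True), GradedAnswer("q2", "...", True)]
print(f"older: {hallucination_rate(older_model):.0%}, newer: {hallucination_rate(newer_model):.0%}")
```

A doubling or tripling of this metric between model generations, on the same question set, is the kind of regression the evaluations described above reported.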