
The reason we're benchmaxxing is that there's now a huge monetary incentive to have the best-performing model on these synthetic benchmarks, and that status is worth a lot of money.

Literally every new release of a point-X model from every major player includes benchmark graphs to show it off.





Benchmaxxing has also been identified as one of the causes of hallucination.

Hallucination is just built in; what am I missing?

That LLMs have some basic metaknowledge and metacognitive skills that they can use to reduce the hallucination rate.

Which is what humans do too; it's not magic. Humans just get more metacognitive juice for free, resulting in a hallucination rate significantly lower than that of LLMs, but still significantly higher than zero.

Now, having the skills needed to avoid hallucinations is good, even if they're weak and basic. But is an LLM actually willing to put them to use?
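
For concreteness, here's a minimal sketch of what putting that metaknowledge to use could look like at inference time: elicit a self-reported confidence alongside the answer, and abstain below a threshold. `query_model` is a hypothetical stub standing in for a real LLM call, and the 0.7 threshold is an arbitrary choice for illustration:

    # Hypothetical sketch: confidence-gated answering. Not any real API.
    def query_model(question: str) -> tuple[str, float]:
        """Stand-in for an LLM call that returns (answer, self_reported_confidence).
        A real implementation would prompt the model to rate its own certainty."""
        return "Paris", 0.95  # dummy output for illustration

    def answer_or_abstain(question: str, threshold: float = 0.7) -> str:
        answer, confidence = query_model(question)
        if confidence < threshold:
            return "I don't know."  # prefer abstention over a likely hallucination
        return answer

    print(answer_or_abstain("What is the capital of France?"))
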

OpenAI cooked o3 with reckless RL that used a hallucination-unaware reward calculation, one that punished reluctance to answer and rewarded overconfident guesses. And their benchmark suite didn't catch it, because the benchmarks were hallucination-unaware too.
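
To make that incentive concrete, here's a toy expected-value sketch (not OpenAI's actual reward code) contrasting a hallucination-unaware reward, where abstaining scores the same as being wrong, with one that penalises confident wrong answers:

    # Toy sketch of reward design, assuming the model can either guess
    # (correct with probability p_correct) or abstain with "I don't know".

    def expected_reward_unaware(p_correct: float, action: str) -> float:
        """Binary grading: 1 for a correct answer, 0 for anything else.
        Abstaining scores exactly like a wrong answer."""
        if action == "abstain":
            return 0.0
        return p_correct * 1.0

    def expected_reward_aware(p_correct: float, action: str,
                              wrong_penalty: float = -1.0) -> float:
        """Grading that penalises wrong answers, so guessing only pays
        when confidence exceeds the break-even point."""
        if action == "abstain":
            return 0.0
        return p_correct * 1.0 + (1 - p_correct) * wrong_penalty

    p = 0.3  # model is only 30% sure
    print(expected_reward_unaware(p, "guess"), expected_reward_unaware(p, "abstain"))
    # 0.3 vs 0.0 -> guessing is always (weakly) better, so RL learns to guess
    print(expected_reward_aware(p, "guess"), expected_reward_aware(p, "abstain"))
    # -0.4 vs 0.0 -> abstaining wins whenever confidence is below 50%

Under the unaware scheme, guessing never has lower expected reward than abstaining, so training pushes the model toward overconfident answers; add any penalty for wrong answers and abstention becomes the better policy below the break-even confidence.
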



