CoinFeed
Time 19:26

There's a Benchmark Test That Measures AI 'Bullshit'—Most Models Fail

March 10, 2026
CoinFeed News

BullshitBench tests whether AI models can detect nonsensical questions or whether they will confidently answer them anyway. The results are dire.
