ImpossibleBench: New Benchmark Tests LLMs' Tendency to Cheat on Test Cases | AI Digest