The reward hacking we keep seeing.

Patterns observed across hundreds of LLM trajectory annotations on Terminal-Bench.

Draft.