The reward hacking we keep seeing.
Patterns observed across hundreds of LLM trajectory annotations on Terminal-Bench.
Draft. Patterns observed across hundreds of LLM trajectory annotations on Terminal-Bench.
Patterns observed across hundreds of LLM trajectory annotations on Terminal-Bench.
Draft. Patterns observed across hundreds of LLM trajectory annotations on Terminal-Bench.