Super-hard

Build a toy agent environment where the model can solve a task, call tools, fail safely, o

Agents, Tool Use & Product Post-Training · Problem 7 of 7

Chapter 14Agents, Tool Use & Product Post-Training

Build a toy agent environment where the model can solve a task, call tools, fail safely, o

Super-hardProblem 7 / 7

Build a toy agent environment where the model can solve a task, call tools, fail safely, or reward-hack the evaluator; then add mitigations and show they work.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints

solution.pypython

local draft

import types

def evaluate_naive(submission):
    raise NotImplementedError

def evaluate_hardened(submission):
    raise NotImplementedError

def _safe_run(t):
    raise NotImplementedError

⌘/Ctrl + ↵ to submit

AI review

Ready when you are

Submit your solution and a structured review appears here — verdict, score, and concrete feedback. Any correct approach passes.

Chapter 14Agents, Tool Use & Product Post-Training

Build a toy agent environment where the model can solve a task, call tools, fail safely, o

Super-hardProblem 7 / 7

Build a toy agent environment where the model can solve a task, call tools, fail safely, or reward-hack the evaluator; then add mitigations and show they work.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints