Agents, Tool Use & Product Post-Training · Problem 4 of 7
Implement a code-evaluation harness that runs unit tests against generated solutions and returns pass/fail.
Implement the function/class skeleton in the editor. Any correct approach is accepted.
import subprocess, tempfile, os, json
def evaluate_solution(solution_code, test_code, timeout_s=5.0):
raise NotImplementedErrorReady when you are
Submit your solution and a structured review appears here — verdict, score, and concrete feedback. Any correct approach passes.
Implement a code-evaluation harness that runs unit tests against generated solutions and returns pass/fail.
Implement the function/class skeleton in the editor. Any correct approach is accepted.