Medium

Implement a pairwise LLM-as-judge harness with position-swap debiasing (run both orders, a

Evaluation, Reward Hacking & Alignment Methodology · Problem 2 of 4

Chapter 12Evaluation, Reward Hacking & Alignment Methodology

Implement a pairwise LLM-as-judge harness with position-swap debiasing (run both orders, a

MediumProblem 2 / 4

Implement a pairwise LLM-as-judge harness with position-swap debiasing (run both orders, average).

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints