Hard

Implement a bootstrap confidence interval for win-rate from paired preference judgments

Evaluation, Reward Hacking & Alignment Methodology · Problem 3 of 4

Chapter 12Evaluation, Reward Hacking & Alignment Methodology

Implement a bootstrap confidence interval for win-rate from paired preference judgments

HardProblem 3 / 4

Implement a bootstrap confidence interval for win-rate from paired preference judgments.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints