Super-hard

Chapter 12: Evaluation, Reward Hacking & Alignment Methodology

Problem 4 / 4

Implement an n-gram/embedding contamination detector that flags eval items overlapping a training corpus and reports a contamination-adjusted score.

Implement the function/class skeleton in the editor. Any correct approach is accepted.
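
One possible shape for a solution is sketched below. It is not a reference answer, and everything in it is an assumption made for illustration: the class name ContaminationDetector, the helper names, the 5-token n-gram window, and both thresholds are hypothetical. To stay self-contained it substitutes a character-trigram count vector for a learned embedding; a real detector would likely swap in a sentence-embedding model at that point. The contamination-adjusted score here is taken to mean accuracy over the items that were not flagged, which is one common reading of the term.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphanumeric runs so punctuation and case do not hide overlap.
    return re.findall(r"[a-z0-9]+", text.lower())


def ngrams(tokens, n):
    # Set of all contiguous token n-grams; empty if the text is shorter than n tokens.
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def char_vector(text, k=3):
    # Stand-in "embedding" (an assumption): counts of character k-grams.
    s = " ".join(tokenize(text))
    return Counter(s[i:i + k] for i in range(len(s) - k + 1))


def cosine(a, b):
    dot = sum(v * b[key] for key, v in a.items() if key in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


class ContaminationDetector:
    def __init__(self, corpus, n=5, ngram_threshold=0.5, sim_threshold=0.8):
        self.n = n
        self.ngram_threshold = ngram_threshold  # hypothetical default
        self.sim_threshold = sim_threshold      # hypothetical default
        self.corpus_ngrams = set()
        self.corpus_vectors = []
        for doc in corpus:
            self.corpus_ngrams |= ngrams(tokenize(doc), n)
            self.corpus_vectors.append(char_vector(doc))

    def ngram_overlap(self, item):
        # Fraction of the item's n-grams that also occur somewhere in the corpus.
        grams = ngrams(tokenize(item), self.n)
        if not grams:
            return 0.0
        return len(grams & self.corpus_ngrams) / len(grams)

    def max_similarity(self, item):
        # Highest cosine similarity between the item and any single corpus document.
        vec = char_vector(item)
        return max((cosine(vec, c) for c in self.corpus_vectors), default=0.0)

    def is_contaminated(self, item):
        # Flag the item if either signal crosses its threshold.
        return (self.ngram_overlap(item) >= self.ngram_threshold
                or self.max_similarity(item) >= self.sim_threshold)

    def report(self, items, correct):
        # `correct` holds one boolean per item: did the model answer it correctly?
        flags = [self.is_contaminated(it) for it in items]
        clean = [c for c, f in zip(correct, flags) if not f]
        return {
            "flags": flags,
            "raw_score": sum(correct) / len(correct) if correct else None,
            "adjusted_score": sum(clean) / len(clean) if clean else None,
            "contamination_rate": sum(flags) / len(flags) if flags else None,
        }
```

The n-gram check catches verbatim or near-verbatim copies, while the similarity check is meant to catch lightly paraphrased items. The adjusted score is returned as None when every item is flagged, since no clean subset remains to score. Comparing each item against every corpus document is linear in corpus size; at scale, an inverted index or approximate nearest-neighbor search would be needed.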
