Hard

Implement a preference-dataset builder from accepted/rejected suggestions, including a pos

Agents, Tool Use & Product Post-Training · Problem 5 of 7

Chapter 14: Agents, Tool Use & Product Post-Training



Implement a preference-dataset builder from accepted/rejected suggestions, including a position-bias correction.

Implement the function/class skeleton in the editor. Any correct approach is accepted.
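
One possible approach, sketched under stated assumptions rather than given as the reference solution: each logged impression becomes (chosen, rejected) pairs, and each pair is weighted by the inverse of an estimated position propensity so that accepts in low-visibility slots count for more. The names `Impression`, `PreferencePair`, `estimate_position_propensity`, and `build_preference_dataset` are hypothetical and do not come from the problem skeleton. The propensity estimate here is a crude proxy (smoothed per-slot acceptance rate, normalized to the top slot) that is only reasonable if suggestion quality is roughly independent of slot, for example under randomized ordering.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Impression:
    """One logged event: suggestions in display order and the accepted slot."""
    prompt: str
    suggestions: Tuple[str, ...]
    accepted_index: Optional[int]  # None means every suggestion was rejected


@dataclass(frozen=True)
class PreferencePair:
    prompt: str
    chosen: str
    rejected: str
    weight: float  # inverse-propensity weight of the chosen slot


def estimate_position_propensity(events: List[Impression]) -> List[float]:
    """Smoothed acceptance rate per slot, normalized so slot 0 has propensity 1.

    Assumption: relevance does not depend on slot, so differences in
    acceptance rate across slots reflect position bias only.
    """
    shown, accepted = Counter(), Counter()
    for ev in events:
        for pos in range(len(ev.suggestions)):
            shown[pos] += 1
        if ev.accepted_index is not None:
            accepted[ev.accepted_index] += 1
    if not shown:
        return []
    # Laplace smoothing keeps rarely shown slots away from 0 and 1.
    rates = [(accepted[p] + 1) / (shown[p] + 2) for p in range(max(shown) + 1)]
    return [r / rates[0] for r in rates]


def build_preference_dataset(
    events: List[Impression],
    propensity: List[float],
    max_weight: float = 10.0,
) -> List[PreferencePair]:
    """Pair the accepted suggestion against every other suggestion shown."""
    pairs = []
    for ev in events:
        if ev.accepted_index is None:
            continue  # no accepted item, so no chosen side for a pair
        chosen = ev.suggestions[ev.accepted_index]
        # Clip the weight so a tiny propensity cannot dominate training.
        weight = min(1.0 / propensity[ev.accepted_index], max_weight)
        for pos, text in enumerate(ev.suggestions):
            if pos != ev.accepted_index and text != chosen:
                pairs.append(PreferencePair(ev.prompt, chosen, text, weight))
    return pairs
```

Impressions with no accepted suggestion are skipped in this sketch because they carry no chosen side; a fuller solution might instead pair them against a later accepted edit or a reference completion. The clipping threshold `max_weight` is likewise an illustrative default, not a value from the problem.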
