Hard

Implement beam-search-over-reasoning-steps that expands/prunes partial CoTs using a PRM sc

Reasoning & Test-Time Compute · Problem 3 of 4

Chapter 11Reasoning & Test-Time Compute

Implement beam-search-over-reasoning-steps that expands/prunes partial CoTs using a PRM sc

HardProblem 3 / 4

Implement beam-search-over-reasoning-steps that expands/prunes partial CoTs using a PRM score.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints