Research Engineering & Debugging · Problem 5 of 8
Debug a broken GPT training notebook containing NaNs, a mask bug, and shifted labels.
Implement the function/class skeleton in the editor. Any correct approach is accepted.
import torch, torch.nn.functional as F
def training_step(model, input_ids, attn_mask_bias):
raise NotImplementedErrorReady when you are
Submit your solution and a structured review appears here — verdict, score, and concrete feedback. Any correct approach passes.
Debug a broken GPT training notebook containing NaNs, a mask bug, and shifted labels.
Implement the function/class skeleton in the editor. Any correct approach is accepted.