Hard

Implement a synthetic-data pipeline: generate candidate examples, validate them with a rul

SFT, Instruction Tuning, Data & PEFT · Problem 4 of 6

Chapter 08SFT, Instruction Tuning, Data & PEFT

Implement a synthetic-data pipeline: generate candidate examples, validate them with a rul

HardProblem 4 / 6

Implement a synthetic-data pipeline: generate candidate examples, validate them with a rule-based checker, deduplicate, and report the resulting source mixture.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints