Super-hard

Implement expert-parallel dispatch/combine with a simulated all-to-all and verify outputs

Mixture-Of-Experts · Problem 4 of 4

Chapter 07Mixture-Of-Experts

Implement expert-parallel dispatch/combine with a simulated all-to-all and verify outputs

Super-hardProblem 4 / 4

Implement expert-parallel dispatch/combine with a simulated all-to-all and verify outputs match a dense reference for the same routing.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints