Easy

Implement top-k routing (softmax → top-k → renormalized gates) in PyTorch

Mixture-Of-Experts · Problem 1 of 4

Chapter 07: Mixture-Of-Experts

Implement top-k routing (softmax → top-k → renormalized gates) in PyTorch.
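One way to read the three stages, written out per token. The symbols here are assumptions for illustration and do not come from the problem statement: z is the vector of router logits over E experts, p the softmax probabilities, T the set of the k selected experts, and g the final gates.

$$
p_i = \frac{\exp(z_i)}{\sum_{j=1}^{E} \exp(z_j)}, \qquad
T = \text{indices of the } k \text{ largest } p_i, \qquad
g_i =
\begin{cases}
\dfrac{p_i}{\sum_{j \in T} p_j} & i \in T \\
0 & \text{otherwise}
\end{cases}
$$

Because the softmax denominator cancels in the last step, the renormalized gates equal a softmax taken over only the k selected logits, so both orderings give the same result.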

Implement the function/class skeleton in the editor. Any correct approach is accepted.
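A minimal sketch of one possible solution, not the reference answer. The function name `top_k_routing`, its signature, and the `(num_tokens, num_experts)` input shape are assumptions made for illustration; the skeleton in the editor may use different names or return values.

```python
import torch
import torch.nn.functional as F


def top_k_routing(logits: torch.Tensor, k: int):
    """Hypothetical helper: softmax -> top-k -> renormalized gates.

    logits: router scores, assumed shape (num_tokens, num_experts).
    Returns (gates, indices), each of shape (num_tokens, k).
    """
    num_experts = logits.shape[-1]
    if not 1 <= k <= num_experts:
        raise ValueError(f"k must be in [1, {num_experts}], got {k}")

    # 1) Softmax over the expert dimension: one probability per expert.
    probs = F.softmax(logits, dim=-1)

    # 2) Keep the k largest probabilities and the experts they belong to.
    top_probs, top_idx = torch.topk(probs, k, dim=-1)

    # 3) Renormalize so the kept gates sum to 1 for each token.
    gates = top_probs / top_probs.sum(dim=-1, keepdim=True)
    return gates, top_idx


# Usage: 4 tokens routed across 8 experts, 2 experts per token.
torch.manual_seed(0)
example_logits = torch.randn(4, 8)
example_gates, example_idx = top_k_routing(example_logits, k=2)
print(example_gates.shape, example_idx.shape)
print(example_gates.sum(dim=-1))  # each row should sum to 1 (up to rounding)
```

This version returns the compact `(gates, indices)` pair. If a dense `(num_tokens, num_experts)` gate matrix is needed instead, the gates could be scattered into a zero tensor with `torch.Tensor.scatter`.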
