Easy

Implement scaled dot-product attention with a causal mask in numpy

Transformer Architecture Internals · Problem 1 of 7

Chapter 01Transformer Architecture Internals

EasyProblem 1 / 7

Implement scaled dot-product attention with a causal mask in numpy.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints

solution.pypython

local draft

import numpy as np

def scaled_dot_product_attention(Q, K, V, causal=True):
    raise NotImplementedError

⌘/Ctrl + ↵ to submit

AI review

Ready when you are

Submit your solution and a structured review appears here — verdict, score, and concrete feedback. Any correct approach passes.

Chapter 01Transformer Architecture Internals

EasyProblem 1 / 7

Implement scaled dot-product attention with a causal mask in numpy.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints