Super-hard

Implement a continuous-batching scheduler simulator with a paged KV cache that admits/evic

Inference & Serving · Problem 5 of 5

Chapter 13Inference & Serving

Implement a continuous-batching scheduler simulator with a paged KV cache that admits/evic

Super-hardProblem 5 / 5

Implement a continuous-batching scheduler simulator with a paged KV cache that admits/evicts requests and reports throughput and P99.

Implement the function/class skeleton in the editor. Any correct approach is accepted.

Hints