[Feature] Memory Cache System Refactoring Road Map (Mem Cache V2) #12587

@hnyls2002

Here is the roadmap for refactoring SGLang's memory caching system (mem cache v2).

Design Goals

Support arbitrary combinations of the following features in the KV cache management subsystem:

  • Prefix caching
  • Hierarchical caching
  • PD disaggregation
  • Request retraction/abortion
  • Hybrid and sparse attention
  • Speculative decoding
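
To make the size of the "arbitrary combinations" goal concrete, here is a minimal sketch that enumerates every on/off combination of the six features listed above. The `CacheFeature` enum and `all_feature_combinations` helper are hypothetical names made up for illustration; they are not SGLang identifiers and say nothing about how mem cache v2 will actually be structured.

```python
from enum import Flag, auto
from itertools import combinations


class CacheFeature(Flag):
    # Hypothetical flags mirroring the design-goal list; not SGLang identifiers.
    PREFIX_CACHING = auto()
    HIERARCHICAL_CACHING = auto()
    PD_DISAGGREGATION = auto()
    REQUEST_RETRACTION = auto()
    HYBRID_SPARSE_ATTENTION = auto()
    SPECULATIVE_DECODING = auto()


def all_feature_combinations():
    """Yield every subset of features, from none enabled to all enabled."""
    members = list(CacheFeature)
    for size in range(len(members) + 1):
        for combo in combinations(members, size):
            flags = CacheFeature(0)  # start with no feature enabled
            for feature in combo:
                flags |= feature
            yield flags


combos = list(all_feature_combinations())
print(len(combos))  # 2**6 = 64 distinct configurations
```

With six independent features there are 64 configurations, so a test matrix built from something like `all_feature_combinations()` could, for example, be used to parametrize a cache-manager test over each one. Whether every combination is meaningful in practice is a separate question that this sketch does not address.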

Road Map