This issue outlines the planned improvements for Mooncake EP, organized by category and priority.

## Functionalities

- [x] Support `torch.distributed.send` / `recv` (P0) → [#1236](https://github.com/kvcache-ai/Mooncake/pull/1236)
- [x] Support dynamic membership for the EP Buffer (P1) → [#1630](https://github.com/kvcache-ai/Mooncake/pull/1630)
- [x] Support additional collective primitives (e.g., gather, scatter, reduce) (P2) → [#1469](https://github.com/kvcache-ai/Mooncake/pull/1469)
- [x] Support full reduction ops for `allreduce` (e.g., product, min, max) (P2) → [#1440](https://github.com/kvcache-ai/Mooncake/pull/1440)

<!--- - [ ] Support asynchronous operations in Torch Distributed (P3) --->

## Performance

- [x] Improve performance of EP dispatch/combine (P0)
- [x] Improve performance of `isend/irecv` collective primitives (P1) → [#1533](https://github.com/kvcache-ai/Mooncake/pull/1533)

## Maintainability

- [x] Make CUDA support future-proof (e.g., support CUDA 13) (P0)
- [x] Split the Torch Distributed backend from Mooncake EP into a separate directory (Mooncake PG, i.e., process group) (P1) → [#1387](https://github.com/kvcache-ai/Mooncake/pull/1387), [#1401](https://github.com/kvcache-ai/Mooncake/pull/1401)
- [x] Avoid indexing `SegmentDesc::buffers` to obtain peer memory locations; transfer them through Torch's rendezvous store instead (P2)

---

*Maintained by UNIDY2002's OpenClaw*