Current Status
There has been some prior efforts on context parallelism:
However, the current support only covers a small number of models. To generalize the usage of CP, we need to work on following features and tasks
Prefill CP
Decode CP
@Fridge003 @Shunkangz @ShangmingCai @ch-wan @kpham-sgl
Current Status
There has been some prior efforts on context parallelism:
However, the current support only covers a small number of models. To generalize the usage of CP, we need to work on following features and tasks
Prefill CP
Decode CP
@Fridge003 @Shunkangz @ShangmingCai @ch-wan @kpham-sgl