You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
🚀 The feature, motivation and pitch
We are going to migrate from model runner v1 to model runner v2 gradually, here is the roadmap:
Tasks:
logprob_token_idssupport #40559num_gpu_runner_capture_triggersandnum_cudagraph_captured#41285[Model Runner V2] Support stock torch compile for v2 #41667We temporally block this as there would be a big refactor for torch compile stuff recentlypre_forwardorder #42676Sizes of tensors must matcherror #42778Triton Error [CUDA]: device-side assert triggered#43139ElasticEPScalingExecutorfor MRv2 #43915AttributeError: 'CohereASRDecoder' object has no attribute 'embed_input_ids'#44568openai.InternalServerError: Error code: 500 - 'list index out of range'#45467Alternatives
No response
Additional context
No response
Before submitting a new issue...