Add more tests for vllm and clean out the old vllm test#162292
Add more tests for vllm and clean out the old vllm test#162292
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/162292
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New FailureAs of commit b63be49 with merge base d8b6622 ( NEW FAILURE - The following job has failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
to myself, |
|
@pytorchbot merge -f 'All vLLM tests are ok, lint failures are from trunk' |
Merge startedYour change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Test failure coverage from pytorch 2.8 release issues [internal access only](https://docs.google.com/document/d/1zvK1eUAHubHGGHg9jKxd-QlP89fzgfqOBvE2m9mUs90/edit?tab=t.0 ) See coverage mapping | Given test / pattern | Suite ID (from config) | |---|---| | pytest -v -s basic_correctness/test_cumem.py | vllm_basic_correctness_test | | pytest -v -s entrypoints/openai/test_sleep.py | vllm_entrypoints_test | | pytest -v -s entrypoints/openai/test_translation_validation.py::test_long_audio_request | vllm_entrypoints_test | | pytest -v -s lora/test_quant_model.py | vllm_lora_28_failure_test | | pytest -v -s -x tests/lora/test_llama_tp.py | vllm_lora_tp_test_distributed | | pytest -v -s distributed/test_sequence_parallel.py -k test_tp_sp_generation |vllm_distributed_test_28_failure_test | | pytest -v -s distributed/test_sequence_parallel.py::test_tp_sp_generation[...] | vllm_distributed_test_28_failure_test | | pytest models/language/generation/test_mistral.py::test_models[...] | vllm_languagde_model_test_extended_generation_28_failure_test | | pytest models/multimodal/pooling/test_jinavl_reranker.py::test_model_text_image[...] | vllm_multi_model_test_28_failure_test | | tests/lora/test_qwen2vl.py::test_qwen2vl_lora | vllm_lora_test | | tests/lora/test_qwen2vl.py::test_qwen25vl_lora | vllm_lora_test | | tests/lora/test_qwen2vl.py::test_qwen2vl_lora_beam_search | vllm_lora_test | | tests/lora/test_phi.py::test_phi2_lora | DIDN'T FIND IT IT IN VLLM | | models/multimodal/generation/test_voxtral.py::test_models_with_multiple_audios[5-128-half] | vllm_multi_model_test_28_failure_test | | models/test_initialization.py::test_can_initialize[VoxtralForConditionalGeneration] | vllm_basic_models_test | | pytest -v -s -x lora/test_chatglm3_tp.py -k test_chatglm3_lora_tp4_fully_sharded_loras | vllm_lora_tp_test_distributed | Pull Request resolved: pytorch#162292 Approved by: https://github.com/atalman, https://github.com/huydhn
Test failure coverage from pytorch 2.8 release issues [internal access only](https://docs.google.com/document/d/1zvK1eUAHubHGGHg9jKxd-QlP89fzgfqOBvE2m9mUs90/edit?tab=t.0 ) See coverage mapping | Given test / pattern | Suite ID (from config) | |---|---| | pytest -v -s basic_correctness/test_cumem.py | vllm_basic_correctness_test | | pytest -v -s entrypoints/openai/test_sleep.py | vllm_entrypoints_test | | pytest -v -s entrypoints/openai/test_translation_validation.py::test_long_audio_request | vllm_entrypoints_test | | pytest -v -s lora/test_quant_model.py | vllm_lora_28_failure_test | | pytest -v -s -x tests/lora/test_llama_tp.py | vllm_lora_tp_test_distributed | | pytest -v -s distributed/test_sequence_parallel.py -k test_tp_sp_generation |vllm_distributed_test_28_failure_test | | pytest -v -s distributed/test_sequence_parallel.py::test_tp_sp_generation[...] | vllm_distributed_test_28_failure_test | | pytest models/language/generation/test_mistral.py::test_models[...] | vllm_languagde_model_test_extended_generation_28_failure_test | | pytest models/multimodal/pooling/test_jinavl_reranker.py::test_model_text_image[...] | vllm_multi_model_test_28_failure_test | | tests/lora/test_qwen2vl.py::test_qwen2vl_lora | vllm_lora_test | | tests/lora/test_qwen2vl.py::test_qwen25vl_lora | vllm_lora_test | | tests/lora/test_qwen2vl.py::test_qwen2vl_lora_beam_search | vllm_lora_test | | tests/lora/test_phi.py::test_phi2_lora | DIDN'T FIND IT IT IN VLLM | | models/multimodal/generation/test_voxtral.py::test_models_with_multiple_audios[5-128-half] | vllm_multi_model_test_28_failure_test | | models/test_initialization.py::test_can_initialize[VoxtralForConditionalGeneration] | vllm_basic_models_test | | pytest -v -s -x lora/test_chatglm3_tp.py -k test_chatglm3_lora_tp4_fully_sharded_loras | vllm_lora_tp_test_distributed | Pull Request resolved: pytorch#162292 Approved by: https://github.com/atalman, https://github.com/huydhn
Test failure coverage from pytorch 2.8 release issues [internal access only](https://docs.google.com/document/d/1zvK1eUAHubHGGHg9jKxd-QlP89fzgfqOBvE2m9mUs90/edit?tab=t.0 ) See coverage mapping | Given test / pattern | Suite ID (from config) | |---|---| | pytest -v -s basic_correctness/test_cumem.py | vllm_basic_correctness_test | | pytest -v -s entrypoints/openai/test_sleep.py | vllm_entrypoints_test | | pytest -v -s entrypoints/openai/test_translation_validation.py::test_long_audio_request | vllm_entrypoints_test | | pytest -v -s lora/test_quant_model.py | vllm_lora_28_failure_test | | pytest -v -s -x tests/lora/test_llama_tp.py | vllm_lora_tp_test_distributed | | pytest -v -s distributed/test_sequence_parallel.py -k test_tp_sp_generation |vllm_distributed_test_28_failure_test | | pytest -v -s distributed/test_sequence_parallel.py::test_tp_sp_generation[...] | vllm_distributed_test_28_failure_test | | pytest models/language/generation/test_mistral.py::test_models[...] | vllm_languagde_model_test_extended_generation_28_failure_test | | pytest models/multimodal/pooling/test_jinavl_reranker.py::test_model_text_image[...] | vllm_multi_model_test_28_failure_test | | tests/lora/test_qwen2vl.py::test_qwen2vl_lora | vllm_lora_test | | tests/lora/test_qwen2vl.py::test_qwen25vl_lora | vllm_lora_test | | tests/lora/test_qwen2vl.py::test_qwen2vl_lora_beam_search | vllm_lora_test | | tests/lora/test_phi.py::test_phi2_lora | DIDN'T FIND IT IT IN VLLM | | models/multimodal/generation/test_voxtral.py::test_models_with_multiple_audios[5-128-half] | vllm_multi_model_test_28_failure_test | | models/test_initialization.py::test_can_initialize[VoxtralForConditionalGeneration] | vllm_basic_models_test | | pytest -v -s -x lora/test_chatglm3_tp.py -k test_chatglm3_lora_tp4_fully_sharded_loras | vllm_lora_tp_test_distributed | Pull Request resolved: pytorch#162292 Approved by: https://github.com/atalman, https://github.com/huydhn
Test failure coverage from pytorch 2.8 release issues [internal access only](https://docs.google.com/document/d/1zvK1eUAHubHGGHg9jKxd-QlP89fzgfqOBvE2m9mUs90/edit?tab=t.0 ) See coverage mapping | Given test / pattern | Suite ID (from config) | |---|---| | pytest -v -s basic_correctness/test_cumem.py | vllm_basic_correctness_test | | pytest -v -s entrypoints/openai/test_sleep.py | vllm_entrypoints_test | | pytest -v -s entrypoints/openai/test_translation_validation.py::test_long_audio_request | vllm_entrypoints_test | | pytest -v -s lora/test_quant_model.py | vllm_lora_28_failure_test | | pytest -v -s -x tests/lora/test_llama_tp.py | vllm_lora_tp_test_distributed | | pytest -v -s distributed/test_sequence_parallel.py -k test_tp_sp_generation |vllm_distributed_test_28_failure_test | | pytest -v -s distributed/test_sequence_parallel.py::test_tp_sp_generation[...] | vllm_distributed_test_28_failure_test | | pytest models/language/generation/test_mistral.py::test_models[...] | vllm_languagde_model_test_extended_generation_28_failure_test | | pytest models/multimodal/pooling/test_jinavl_reranker.py::test_model_text_image[...] | vllm_multi_model_test_28_failure_test | | tests/lora/test_qwen2vl.py::test_qwen2vl_lora | vllm_lora_test | | tests/lora/test_qwen2vl.py::test_qwen25vl_lora | vllm_lora_test | | tests/lora/test_qwen2vl.py::test_qwen2vl_lora_beam_search | vllm_lora_test | | tests/lora/test_phi.py::test_phi2_lora | DIDN'T FIND IT IT IN VLLM | | models/multimodal/generation/test_voxtral.py::test_models_with_multiple_audios[5-128-half] | vllm_multi_model_test_28_failure_test | | models/test_initialization.py::test_can_initialize[VoxtralForConditionalGeneration] | vllm_basic_models_test | | pytest -v -s -x lora/test_chatglm3_tp.py -k test_chatglm3_lora_tp4_fully_sharded_loras | vllm_lora_tp_test_distributed | Pull Request resolved: pytorch#162292 Approved by: https://github.com/atalman, https://github.com/huydhn
Test failure coverage from pytorch 2.8 release issues
internal access only
See coverage mapping