Conversation
Summary of ChangesHello @lifuhuang, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request updates the system's default LoRA backend to 'csgmv', moving away from 'triton'. This change is reflected in the server's configuration and command-line arguments. Additionally, the test infrastructure has been adjusted to accommodate this new default, ensuring that tests either implicitly use 'csgmv' or allow for flexible backend selection. Highlights
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
WalkthroughDefault LoRA backend changed from "triton" to "csgmv" in server defaults and CLI. Runner and test utilities now accept an optional lora_backend (None). Tests stop passing explicit backends and adjust helpers; backend list expanded to include "csgmv". One test retains a print referencing a removed backend variable. Changes
Estimated code review effort🎯 2 (Simple) | ⏱️ ~10 minutes Poem
Pre-merge checks and finishing touches❌ Failed checks (2 warnings)
✅ Passed checks (1 passed)
✨ Finishing touches
🧪 Generate unit tests (beta)
📜 Recent review detailsConfiguration used: CodeRabbit UI Review profile: CHILL Plan: Pro 📥 CommitsReviewing files that changed from the base of the PR and between 4ac8e09 and 353e5534e5e8dc6e5b12bb01f91c6d41d880490e. 📒 Files selected for processing (11)
💤 Files with no reviewable changes (5)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (21)
Comment |
There was a problem hiding this comment.
Code Review
This pull request updates the default LoRA backend from 'triton' to 'csgmv'. The changes are applied to server arguments, benchmarks, and test runners. Most tests are updated to remove the hardcoded 'triton' backend and rely on the new default.
I've found a critical issue in test/srt/lora/test_lora_qwen3.py where removing a variable without removing its usages will lead to a NameError, breaking the test.
Additionally, I've pointed out that with these changes, the 'triton' backend is no longer being tested. I've asked for clarification on whether this is intended and suggested either removing unused code or updating tests to cover both backends if needed.
Overall, the changes are aligned with the PR's goal, but the identified issues should be addressed.
| @@ -50,7 +50,7 @@ def __post_init__(self): | |||
|
|
|||
|
|
|||
| TORCH_DTYPES = [torch.float16] | |||
| BACKENDS = ["triton"] | |||
| BACKENDS = ["triton", "csgmv"] | |||
There was a problem hiding this comment.
This BACKENDS constant seems to be unused across the test files. With the removal of the hardcoded backend variable in the tests, they will now only run with the new default LoRA backend (csgmv). This means testing for the 'triton' backend is effectively removed.
If this is intended, this constant could be removed to avoid confusion. If both backends should be tested, the tests should be updated to iterate over this BACKENDS list.
353e553 to
269174b
Compare
b0c6c74 to
91fb881
Compare
|
@lifuhuang @Fridge003 this pr breaks lora unit tests on AMD May you fix this in the follow up? Thanks! |
Motivation
Modifications
Accuracy Tests
Benchmarking and Profiling
Checklist
Summary by CodeRabbit
New Features
Refactor
Tests
Chores