feat(auto): replace kilo-auto/small backing with Gemma 4 #2502
Merged
Switch kilo-auto/small from GPT-5 Nano / gpt-oss-20b:free to google/gemma-4-31b-it (paid) and google/gemma-4-26b-a4b-it (free).
- Context window increased from 131K to 262K tokens
- Image support enabled
- Preferred provider set to Novita for both models
- gpt-oss-20b:free model status set to disabled
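For readers unfamiliar with the setup, the switch described above could be pictured roughly as the following configuration. This is only a sketch: the `AutoModelBacking` interface, its field names, the `resolveBackingModel` helper, the exact token count behind "262K", and the `"novita"` provider slug are all assumptions for illustration, not the repository's actual definitions.

```typescript
// Illustrative only: the type, field names, and helper below are assumptions,
// not the repository's actual code.
interface AutoModelBacking {
  paidModel: string;
  freeModel: string;
  contextWindowTokens: number;
  supportsImages: boolean;
  preferredProvider: string;
}

const kiloAutoSmall: AutoModelBacking = {
  paidModel: "google/gemma-4-31b-it",
  freeModel: "google/gemma-4-26b-a4b-it:free",
  contextWindowTokens: 262144, // "262K" in the commit message; exact value assumed
  supportsImages: true,
  preferredProvider: "novita", // assumed provider slug
};

// Hypothetical helper: choose the backing model depending on whether
// the caller is allowed to use the paid model.
function resolveBackingModel(backing: AutoModelBacking, canUsePaid: boolean): string {
  return canUsePaid ? backing.paidModel : backing.freeModel;
}
```

Keeping the paid and free backings in one record makes it harder for the two to drift apart on context window or provider settings, which is the kind of mismatch this change is meant to avoid.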
Code Review Summary
Status: 1 Issue Found | Recommendation: Address before merge
Issue Details: WARNING
Files Reviewed (4 files)
Reviewed by gpt-5.4-20260305 · 913,754 tokens
Novita has no Vercel inference provider mapping, so Gemma requests routed to Vercel would fail. Exclude Gemma from the Vercel randomization to keep all requests on OpenRouter.
…date Gemma description
- Revert Gemma exclusion from Vercel routing (unnecessary)
- Remove gpt_oss_20b_free_model definition and all imports
- Update Gemma 4 26B description from OpenRouter API metadata
lambertjosh approved these changes on Apr 17, 2026
Summary
Replace the backing models for `kilo-auto/small`:
- `openai/gpt-5-nano` → `google/gemma-4-31b-it`
- `openai/gpt-oss-20b:free` → `google/gemma-4-26b-a4b-it:free`

Key changes:
- `gpt-oss-20b:free` status set to `disabled` (retained in kiloExclusiveModels per deprecation policy)

This PR was authored by anthropic/claude-opus-4.6, a premium model.

Verification
- `pnpm typecheck` passes
- `pnpm format` applied

Visual Changes
N/A

Reviewer Notes
- `gpt_oss_20b_free_model` is set to `disabled` rather than removed, following the existing deprecation policy in `models.ts`
- `KiloExclusiveModel` with `status: 'hidden'` (same pattern as the old gpt-oss-20b:free)
- `isGemmaModel()` check added to `getPreferredProviderOrder()` to route all Gemma models through Novita
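The routing note above can be sketched as follows. This is a minimal illustration, not the code from this PR: only the names `isGemmaModel` and `getPreferredProviderOrder` come from the description, while the prefix-based detection, the return type, and the `"novita"` provider slug are assumptions.

```typescript
// Hypothetical sketch; the bodies and signatures are assumptions,
// only the two function names appear in the PR description.

// Assumed convention: Gemma model ids start with "google/gemma-".
function isGemmaModel(modelId: string): boolean {
  return modelId.startsWith("google/gemma-");
}

// Returns the providers to try first, in order.
// An empty list means "no preference" in this sketch.
function getPreferredProviderOrder(modelId: string): string[] {
  if (isGemmaModel(modelId)) {
    return ["novita"]; // assumed provider slug for Novita
  }
  return [];
}
```

With these assumptions, both `google/gemma-4-31b-it` and `google/gemma-4-26b-a4b-it:free` resolve to Novita, while a non-Gemma id such as `openai/gpt-5-nano` gets no preferred provider.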