Refactor the 'getModel' callbacks into their own file by chrsmith · Pull Request #63359 · sourcegraph/sourcegraph-public-snapshot

chrsmith · 2024-06-20T00:20:25Z

Minor refactoring to the cmd/frontend/internal/completions package.

When we call newCompletionsHandler one of the parameters is a function named getModel. This is called to determine which LLM model should be use to respond to the incoming completion request. And we have two implementations of this function, one for code completions and another for chats.

This PR just moves the logic for those two implementations into their own file (get_model.go), along with a couple of functions for which they were the only caller.

There were two minor functionality changes I made, which I'll call out in comments on this PR.

Why?

As we rework how LLM models and associated configuration flows throughout the backend, updating these functions will be a bit easier if they are pulled out like this rather than being defined inline like they are today.

Also, pretty much any place in the codebase where we have hard-coded an LLM model is "on notice" and should instead be driven entirely by some sort of global configuration file. (Since this is one of the key problems server-side LLM config is trying to solve.)

Test plan

NA, no functional changes. Relying on CI/CD and linter.

Changelog

NA

chrsmith · 2024-06-20T00:26:28Z

-		func(_ context.Context, requestParams types.CodyCompletionRequestParameters, c *conftypes.CompletionsConfig) (string, error) {
-			customModel := allowedCustomModel(requestParams.Model)
-			if customModel != "" {
-				return customModel, nil
-			}
-			if requestParams.Model != "" {
-				return "", errors.Newf("Unsupported code completion model %q", requestParams.Model)
-			}
-			return c.CompletionModel, nil
-		},


⚠️ When I teased this out into a separate function, I rewrote the logic and the signature for allowedCustomModel(string) string because as it is today, it's crazy-confusing.

allowedCustomModel(string) string was refactored into isAllowedCodeCompletionModel(string) bool. So rather than returning the input string or "", it just returns a boolean.

So previously, the (confusing) behavior was:

If it is an allowed custom model, return the requestParams.Model as-is.

If it is not an allowed custom model (or not found) return "".

But that could be distilled down into:

if requestParams.Model != "" { if isAllowedCodeCompletionModel(requestParams.Model) { return requestParams.Model, nil } return "", errors.Newf("unsupported code completion model %q", requestParams.Model) }

So please double check my thinking here. But it should be identical and, IMHO, much easier to read.

Confirmed this matches my understanding as well, the behavior is identical.

…ore models) (#63797) This PR if what the past dozen or so [cleanup](https://github.com/sourcegraph/sourcegraph/pull/63359), [refactoring](https://github.com/sourcegraph/sourcegraph/pull/63731), and [test](https://github.com/sourcegraph/sourcegraph/pull/63761) PRs were all about: using the new `modelconfig` system for the completion APIs. This will enable users to: - Use the new site config schema for specifying LLM configuration, added in https://github.com/sourcegraph/sourcegraph/pull/63654. Sourcegraph admins who use these new site config options will be able to support many more LLM models and providers than is possible using the older "completions" site config. - For Cody Enterprise users, we no longer ignore the `CodyCompletionRequest.Model` field. And now support users specifying any LLM model (provided it is "supported" by the Sourcegraph instance). Beyond those two things, everything should continue to work like before. With any existing "completions" configuration data being converted into the `modelconfig` system (see https://github.com/sourcegraph/sourcegraph/pull/63533). ## Overview In order to understand how this all fits together, I'd suggest reviewing this PR commit-by-commit. ### [Update internal/completions to use modelconfig](https://github.com/sourcegraph/sourcegraph/commit/e6b7eb171eea6bd6a512f0e61457170a86128eae) The first change was to update the code we use to serve LLM completions. (Various implementations of the `types.CompletionsProvider` interface.) The key changes here were as follows: 1. Update the `CompletionRequest` type to include the `ModelConfigInfo` field (to make the new Provider and Model-specific configuration data available.) 2. Rename the `CompletionRequest.Model` field to `CompletionRequest.RequestedModel`. (But with a JSON annotation to maintain compatibility with existing callers.) This is to catch any bugs related to using the field directly, since that is now almost guaranteed to be a mistake. (See below.) With these changes, all of the `CompletionProvider`s were updated to reflect these changes. - Any situation where we used the `CompletionRequest.Parameters.RequestedModel` should now refer to `CompletionRequest.ModelConfigInfo.Model.ModelName`. The "model name" being the thing that should be passed to the API provider, e.g. `gpt-3.5-turbo`. - In some situations (`azureopenai`) we needed to rely on the Model ID as a more human-friendly identifier. This isn't 100% accurate, but will match the behavior we have today. A long doc comment calls out the details of what is wrong with that. - In other situations (`awsbedrock`, `azureopenai`) we read the new `modelconfig` data to configure the API provider (e.g. `Azure.UseDeprecatedAPI`), or surface model-specific metadata (e.g. AWS Provisioned Throughput ARNs). While the code is a little clunky to avoid larger refactoring, this is the heart and soul of how we will be writing new completion providers in the future. That is, taking specific configuration bags with whatever data that is required. ### [Fix bugs in modelconfig](https://github.com/sourcegraph/sourcegraph/commit/75a51d8cb520e35918bd3a67a090a36d456b1797) While we had lots of tests for converting the existing "completions" site config data into the `modelconfig.ModelConfiguration` structure, there were a couple of subtle bugs that I found while testing the larger change. The updated unit tests and comments should make that clear. ### [Update frontend/internal/httpapi/completions to use modelconfig](https://github.com/sourcegraph/sourcegraph/commit/084793e08fca51a5ab84a7d73421d575caeebaa1) The final step was to update the HTTP endpoints that serve the completion requests. There weren't any logic changes here, just refactoring how we lookup the required data. (e.g. converting the user's requested model into an actual model found in the site configuration.) We support Cody clients sending either "legacy mrefs" of the form `provider/model` like before, or the newer mref `provider::apiversion::model`. Although it will likely be a while before Cody clients are updated to only use the newer-style model references. The existing unit tests for the competitions APIs just worked, which was the plan. But for the few changes that were required I've added comments to explain the situation. ### [Fix: Support requesting models just by their ID](https://github.com/sourcegraph/sourcegraph/pull/63797/commits/99715feba614230aa84cf94aae571adb96768035) > ... We support Cody clients sending either "legacy mrefs" of the form `provider/model` like before ... Yeah, so apparently I lied 😅 . After doing more testing, the extension _also_ sends requests where the requested model is just `"model"`. (Without the provider prefix.) So that now works too. And we just blindly match "gtp-3.5-turbo" to the first mref with the matching model ID, such as "anthropic::unknown::gtp-3.5-turbo". ## Test plan Existing unit tests pass, added a few tests. And manually tested my Sg instance configured to act as both "dotcom" mode and a prototypical Cody Enterprise instance. ## Changelog Update the Cody APIs for chat or code completions to use the "new style" model configuration. This allows for great flexibility in configuring LLM providers and exposing new models, but also allows Cody Enterprise users to select different models for chats. This will warrant a longer, more detailed changelog entry for the patch release next week. As this unlocks many other exciting features.

chrsmith added 5 commits June 18, 2024 19:05

Refactor interface CompletionsClient

caf0483

Update missed callsites

f9174a5

More callsites, regenerate BUILD files

eefdf68

Clearly, I need to run 'go test ./cmd/...'

a46e4e0

Refactor the 'getModel' callbacks into their own file

68c2ee6

chrsmith requested review from a team and emidoots June 20, 2024 00:20

cla-bot Bot added the cla-signed label Jun 20, 2024

chrsmith commented Jun 20, 2024

View reviewed changes

Surface getModel error to the caller

5298f58

Base automatically changed from chrsmith/refactor-completionsclient to main June 20, 2024 02:17

emidoots approved these changes Jun 20, 2024

View reviewed changes

emidoots merged commit 4984274 into main Jun 20, 2024

emidoots deleted the chrsmith/refactor-completions-api branch June 20, 2024 02:27

chrsmith mentioned this pull request Jul 12, 2024

feat/cody: Refactor completions API to use new modelconfig (support more models) #63797

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor the 'getModel' callbacks into their own file#63359

Refactor the 'getModel' callbacks into their own file#63359
emidoots merged 6 commits into
mainfrom
chrsmith/refactor-completions-api

chrsmith commented Jun 20, 2024 •

edited

Loading

Uh oh!

chrsmith Jun 20, 2024

Uh oh!

emidoots Jun 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

chrsmith commented Jun 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why?

Test plan

Changelog

Uh oh!

chrsmith Jun 20, 2024

Choose a reason for hiding this comment

Uh oh!

emidoots Jun 20, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

chrsmith commented Jun 20, 2024 •

edited

Loading