feat(cody): Expose Sg modelconfig data via HTTP REST API by chrsmith · Pull Request #63604 · sourcegraph/sourcegraph-public-snapshot

chrsmith · 2024-07-02T21:23:57Z

This PR exposes a new HTTP endpoint from the frontend that will return the LLM models currently supported by the Sourcegraph instance. (Or a 500 if Cody isn't enabled or not configured correctly.)

This endpoint will be used by Cody clients in order to fetch LLM models dynamically, so that new LLM models can automatically be picked up as they are available from Cody Gateway. The underlying data structures will also make it easier for the Sourcegraph instance to configure arbitrary LLM providers, customize model settings, etc.

curl \
    -H "Authorization: token ${SG_LOCALHOST_TOKEN}" \
    https://sourcegraph.test:3443/.api/modelconfig/supported-models.json

Overview

This PR is the latest (long) series of steps for enabling arbitrary LLM model configuration in the Sourcegraph backend. This PR introduces three new components into the cmd/frontend/internal/modelconfig package:

`builder.go`, `builder_test.go`

The builder struct is the logic for how we generate the LLM model configuration data based on the current site configuration, what is embedded in the Sourcegraph binary, and later, what we obtain from Cody Gateway.

This is essentially how we apply things like the "model allow/deny list" or "provider/model overrides" that will later be available in the Sourcegraph site configuration.

For now, some of this functionality is inert. As there is no way for you to actually specify the type of configuration data that the builder supports. (e.g. you cannot specify the "LLM model allow or deny list", or fetch models from Cody Gateway.)

So realistically, with this PR the only thing it is doing is just converting a (internal/modelconfig/types) SiteModelConfiguration into a ModelConfiguration object. (Which was added in https://github.com/sourcegraph/sourcegraph/pull/63533.)

`init.go`, `service.go`

Next is the Service interface, which is a new global service that can be used anywhere within the frontend binary to fetch the current LLM configuration data.

This will be used later to allow the Completions API endpoints to look up various provider and LLM configuration data.

init.go supplies an initialization function to the Sourcegraph machinery, and then it goes about loading the LLM model configuration data and exposing it in a singleton object. Part of this is also registering a "site configuration watcher", so that we will automatically pick up on any changes to the site configuration that may impact the LLM model configuration.

`httpapi.go`

Finally, we expose a HTTPHandlers struct that contains a single http.HandlerFunc that will serve the HTTP response.

My intention is to later add more endpoints, such as a way for Site admins to easily curl the full configuration information. Or surface diagnostic-type information about available models.

Next Steps

With this piece, we now expose a view of the Sourcegraph instance's currently supported LLM models. However, the existing completions API stilly reads the site configuration directly. (And does not use the newer, and more flexible modelconfig package.)

So the next step to continue wiring this up is to refactor the frontend/internal/completions and internal/completions packages to rely on the "newer style" configuration data. (Which 🤞 is faithfully reproducing the current site configuration data.)

The end result should be a no-op, but will allow us to no longer rely on hard-coded LLM model names anywhere in the codebase. And instead we can just rely on the model configuration data.

Test plan

Added new unit tests. Verified things worked manually. And looked through existing unit tests for other components to sanity check some of the assumptions made in newer code.

Changelog

Sourcegraph instances how expose an HTTP endpoint that authenticated users can call to get a list of LLM models supported by the Sourcegraph instance. In the future this will be used to allow Cody users to select the LLM model dynamically, based on what is currently available.

+	// SECURITY: It's critical that we do not serve any server-side configuration data
+	// to the client, as it contains sensitive data like access tokens.
+	modelconfigSDK.RedactServerSideConfig(currentConfig)
+
+	rawJSON, err := json.MarshalIndent(currentConfig, "", "    ")
+	if err != nil {
+		h.logger.Error("marshalling configuration", log.Error(err))
+		http.Error(w, http.StatusText(http.StatusInternalServerError), http.StatusInternalServerError)
+		return
+	}
+	http.Error(w, string(rawJSON), http.StatusOK)


emidoots · 2024-07-03T00:22:58Z

+		var providerToOverride *types.Provider
+		for i := range mergedConfig.Providers {
+			if mergedConfig.Providers[i].ID == providerOverride.ID {
+				providerToOverride = &mergedConfig.Providers[i]


Just to disambiguate, can we use a different variable for i in this inner for loop?

🫨 OMFG yes, sorry. I could have sworn this was using j when I first wrote it. I'm kinda shocked that you can alias i like this and have it not break things.

I'll just rename this inner loop to mergedProvIdx since even j is too similar to be honest.

emidoots · 2024-07-03T00:38:35Z

+			mergedConfig.DefaultModels.Chat = *validModel
+		}
+		if mergedConfig.DefaultModels.FastChat == "" {
+			validModel := getModelMatchingCategory(types.ModelCategorySpeed, types.ModelCategoryBalanced)


Can you remind me: did we end up getting a way to distinguish 'models that can ONLY be used for code completion and literally cannot produce chat results'? Is that handled appropriately here?

Good catch! I'll fix this.

We do. The Model type has a Capabilities []ModelCapability json:"capabilities"`` field, with the values of "chat" or "autocomplete".

So it totally makes sense to add that sort of check here, since it's something we would want to rely on in the future. (e.g. a site admin specifying a ModelOverride that gpt-4o can only be used for chat, and therefore requests to use it for autocompletions will fail.)

Updated to:

if mergedConfig.DefaultModels.Chat == "" { validModel := getModelWithRequirements(types.ModelCapabilityAutocomplete, accuracy, balanced) if validModel == nil { return nil, errors.New("no suitable model found for Chat") } mergedConfig.DefaultModels.Chat = *validModel }

emidoots · 2024-07-03T00:40:25Z

+	// The base model configuration data is what is embedded in this binary.
+	staticModelConfig, err := embedded.GetCodyGatewayModelConfig()
+	if err != nil {
+		return errors.New("embedded model data is missing or corrupt")


Are we embedding this correctly today? i.e. if we merge this will it prevent servers from starting or do we already embed model data

We are embedding the data today, by virtue of simply importing the Go package and building it. So I do not believe it is possible -- outside of maybe build flag shenanigans? Or using CGO and messing with the linker? -- to reference the embedded package and somehow have it built without the go:embed.

So while I'm not confident enough to have this function just "return or panic", I am unaware of any way to actually trigger this error condition.

Sounds good!

emidoots

looks good

emidoots

chrsmith added 2 commits July 2, 2024 12:37

Expose Sg modelconfig data via HTTP REST API

7b43be0

Fix lint warning

f151ba4

chrsmith requested review from a team and emidoots July 2, 2024 21:23

github-advanced-security AI found potential problems Jul 2, 2024

View reviewed changes

cla-bot Bot added the cla-signed label Jul 2, 2024

emidoots reviewed Jul 2, 2024

View reviewed changes

Comment thread cmd/frontend/internal/modelconfig/builder.go Outdated

emidoots reviewed Jul 3, 2024

View reviewed changes

Comment thread cmd/frontend/internal/modelconfig/builder.go

emidoots reviewed Jul 3, 2024

View reviewed changes

emidoots approved these changes Jul 3, 2024

View reviewed changes

chrsmith added 2 commits July 2, 2024 21:18

Address PR feedback

e32d0f5

Fix problem

ea126f4

emidoots approved these changes Jul 3, 2024

View reviewed changes

chrsmith merged commit 188424d into main Jul 3, 2024

chrsmith deleted the chrsmith/modelconfig-svc_ii branch July 3, 2024 17:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(cody): Expose Sg modelconfig data via HTTP REST API#63604

feat(cody): Expose Sg modelconfig data via HTTP REST API#63604
chrsmith merged 4 commits into
mainfrom
chrsmith/modelconfig-svc_ii

chrsmith commented Jul 2, 2024

Uh oh!

Check notice

Uh oh!

emidoots Jul 3, 2024

Uh oh!

chrsmith Jul 3, 2024 •

edited

Loading

Uh oh!

Uh oh!

emidoots Jul 3, 2024

Uh oh!

chrsmith Jul 3, 2024 •

edited

Loading

Uh oh!

chrsmith Jul 3, 2024 •

edited

Loading

Uh oh!

emidoots Jul 3, 2024

Uh oh!

emidoots Jul 3, 2024

Uh oh!

chrsmith Jul 3, 2024

Uh oh!

emidoots Jul 3, 2024

Uh oh!

emidoots left a comment

Uh oh!

emidoots left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chrsmith commented Jul 2, 2024

Overview

builder.go, builder_test.go

init.go, service.go

httpapi.go

Next Steps

Test plan

Changelog

Uh oh!

Check notice

Uh oh!

emidoots Jul 3, 2024

Choose a reason for hiding this comment

Uh oh!

chrsmith Jul 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

emidoots Jul 3, 2024

Choose a reason for hiding this comment

Uh oh!

chrsmith Jul 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

chrsmith Jul 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

emidoots Jul 3, 2024

Choose a reason for hiding this comment

Uh oh!

emidoots Jul 3, 2024

Choose a reason for hiding this comment

Uh oh!

chrsmith Jul 3, 2024

Choose a reason for hiding this comment

Uh oh!

emidoots Jul 3, 2024

Choose a reason for hiding this comment

Uh oh!

emidoots left a comment

Choose a reason for hiding this comment

Uh oh!

emidoots left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

`builder.go`, `builder_test.go`

`init.go`, `service.go`

`httpapi.go`

chrsmith Jul 3, 2024 •

edited

Loading

chrsmith Jul 3, 2024 •

edited

Loading

chrsmith Jul 3, 2024 •

edited

Loading