wip(cody): Introduce `cody-gateway-config` tool for generating model configuration by chrsmith · Pull Request #63324 · sourcegraph/sourcegraph-public-snapshot

chrsmith · 2024-06-18T22:05:48Z

This PR adds a couple of foundational components of the server-side LLM configuration project. (Server-side Cody Model Selection)

A small command-line tool named cody-gateway-config, which will write a JSON file to disk.
The internal/modelconfig package for defining LLM Model Configuration data.

What this enables

The cody-gateway-config contains logic for building a ModelConfiguration object in-memory, and saving it to disk. The resulting JSON file is intended to be persisted in the repository with the name cody-gateway-model-configuration.json.

The purpose of this tool is to track the current list of LLM models that Cody Gateway supports in-code, so it can be versioned as part of the regular code-review and PR process. (Generally, this is just a fancier way to generate the same data that is in the sourcegraph/cody repo's dotcom.ts file.)

With the file being checked into the repo, we can then statically embed it into any Golang binaries that reference the internal/modelconfig/embedded package. This is to make it super-easy to distribute the latest Cody Gateway configuration data, since we do not need to copy the file and/or figure out how to load it from the local disk. And instead, just have the read-only contents available in-memory. (If size ever becomes a concern, we could embed a compressed version, etc.)

Why that is needed

After this PR gets merged, both Cody Gateway and the Sourcegraph instance codebases will be able to take a dependency on this modelconfig/embedded package and use this data as part of the larger "server-side LLM model configuration" story.

Cody Gateway will simply add a new HTTP endpoint like https://cody-gateway.sourcegraph.com/llm-models.json and just serve the raw data.
Sourcegraph instances will have static knowledge about available LLM models (based on when they were built/released), and be able to poll Cody Gateway to pick up on any updates.
Sourcegraph instances will then be able to customize/overwrite/extend this configuration data in arbitrary ways. (e.g. to support BYOK or BYOLLM.)
Sourcegraph instances will also expose their own "view" of this data to Cody clients. So that Cody clients can fetch a list of "known models", to allow users to select between various options, etc.

There's a lot that I'm glossing over in that list. But the big picture is that we will use these data types, and static .json file to power how this information will flow between various Sourcegraph components.

Test plan

Added unit tests for validating the generated configuration data is valid.

Changelog

NA, this is an initial step. A later PR will elaborate on the exciting new feature this is enabling.

chrsmith · 2024-06-18T22:08:16Z

+The output of this tool is a `.json` file which we check into this repo. It will
+be embedded in Go binaries as a simple way of deploying the configuration blob.
+
+## Usage


Yes, this is kinda clunky and can probably be replaced by a sufficiently awesome bezel command or sg plugin. If there's a good example for how to do something like that, I'd be happy to update it in this PR.

But otherwise I'd move that, and adding a CI/CD check to verify that there is "no diff" if rerunning the tool, to a follow-up PR.

can probably be replaced by a sufficiently awesome bezel command or sg plugin.

Fly-by comment: IMO no need for sg plugin, just putting it directly in sg is the way to go, per the sg mission of being "the one tool you need to install and run"

chrsmith · 2024-06-18T22:13:18Z

+		anthropic_06_2023 = "anthropic::2023-06-01"
+	)
+
+	return []types.Model{


This is unfortunately clunky, and the initial version of this tool (written in TypeScript) was much more elegant.

However, I rewrote it in Go because:

While it would be nice to have the TypeScript and Golang projections of the data types exist side-by-side, having a dependency (in either direction) between the sourcegraph/sourcegraph and sourcegraph/cody repos would be kind of a pain.

Also, we would never be in a situation where we would update the TypeScript types (used by Cody clients to read the configuration data fetched from the backend) before the Golang types (which will persist/read/write the configuration data on the backend).

Similarly, there would be no need for the actual logic (such as validating the document) in the TypeScript side because for Cody clients it would only need to read the data. (And could trust that it was well-formed when sent by the server.)

So it seemed like the net best approach would be to put the Golang-types in the monorepo, along with the cody-gateway-config tool to simplify "deployment" of its output. And then put the TypeScript types directly in the sourcegraph/cody repo. (Similar to how it defines types for things it receives from the Sourcegraph GraphQL API, etc.)

chrsmith · 2024-06-18T22:16:40Z

+	ModelRef ModelRef `json:"modelRef"`
+
+	DisplayName string `json:"displayName"`
+	ModelName   string `json:"string"`


All of these fields should have a big doc comment explaining what they mean and are used for. (e.g. the distinction between Model.ID and Model.ModelName.)

But it didn't seem like we'd want to have the super-verbose names in both the Golang and TypeScript types. And they would be much more useful on the Cody Client-side. (Since that's where we would actually use/care about the values.)

So I could easily be convinced to add all of the missing comments in this PR. But that was generally the thinking to keeping it spartan. (Except for the configuration data, which will only ever be used within the Sourcegraph instance. So definitely we need to have a descriptive comment on top of AWSBedrockProviderConfig .

emidoots · 2024-06-18T22:17:21Z

The resulting JSON file is intended to be persisted in the repository with the name cody-gateway-model-configuration.json.

Since it's not specific to gateway (also served embedded in the server binary); it might make sense to remove gateway from the name.

related.. https://cody-gateway.sourcegraph.com/llm-models.json

I would suggest just calling this models.json for simplicity sake. I suspect people will need to talk about this file internally and to customers, so calling it just 'models.json' might be nice/simple

emidoots · 2024-06-18T22:23:52Z

+      "modelRef": "anthropic::2023-06-01::claude-3-sonnet",
+      "displayName": "Claude 3 Sonnet",
+      "string": "claude-3-sonnet-20240229",
+      "capabilities": ["chat", "edit"],


It'd be really nice if we can move away from 'edit' in favor of 'codeCompletion' or similar

Sure thing, done.

emidoots

Looks great to me

chrsmith added 3 commits June 18, 2024 14:26

Add ModelConfiguration data types

98a547c

Add 'cody-gateway-config' tool for generating config

3028b7d

Output JSON that is lint-free

8bf5874

chrsmith assigned emidoots Jun 18, 2024

cla-bot Bot added the cla-signed label Jun 18, 2024

chrsmith commented Jun 18, 2024

View reviewed changes

emidoots reviewed Jun 18, 2024

View reviewed changes

Lint

43f9738

emidoots approved these changes Jun 18, 2024

View reviewed changes

chrsmith added 3 commits June 18, 2024 15:34

Rerun 'bazel run //:configure'

61a29c3

Rename JSON file to just 'models.json'

b50224b

Rename capability 'edit' to 'autocomplete'

7be972d

chrsmith enabled auto-merge (squash) June 18, 2024 22:47

chrsmith merged commit 34a1f17 into main Jun 18, 2024

chrsmith deleted the chrsmith/cody-gateway-config-tool branch June 18, 2024 22:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wip(cody): Introduce `cody-gateway-config` tool for generating model configuration#63324

wip(cody): Introduce `cody-gateway-config` tool for generating model configuration#63324
chrsmith merged 7 commits into
mainfrom
chrsmith/cody-gateway-config-tool

chrsmith commented Jun 18, 2024

Uh oh!

chrsmith Jun 18, 2024

Uh oh!

bobheadxi Jun 19, 2024

Uh oh!

chrsmith Jun 18, 2024

Uh oh!

chrsmith Jun 18, 2024

Uh oh!

emidoots commented Jun 18, 2024

Uh oh!

emidoots Jun 18, 2024 •

edited

Loading

Uh oh!

chrsmith Jun 18, 2024

Uh oh!

emidoots left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chrsmith commented Jun 18, 2024

What this enables

Why that is needed

Test plan

Changelog

Uh oh!

chrsmith Jun 18, 2024

Choose a reason for hiding this comment

Uh oh!

bobheadxi Jun 19, 2024

Choose a reason for hiding this comment

Uh oh!

chrsmith Jun 18, 2024

Choose a reason for hiding this comment

Uh oh!

chrsmith Jun 18, 2024

Choose a reason for hiding this comment

Uh oh!

emidoots commented Jun 18, 2024

Uh oh!

emidoots Jun 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

chrsmith Jun 18, 2024

Choose a reason for hiding this comment

Uh oh!

emidoots left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

emidoots Jun 18, 2024 •

edited

Loading