OpenAI compat API adapter by lofcz · Pull Request #466 · LostRuins/koboldcpp

lofcz · 2023-10-08T18:08:41Z

The current OpenAI-like API uses hardcoded chat templates. This PR implements a non-breaking adapter users can exploit to use models requiring various chat templates. Testing request against Mistral7B Dolphin:

{
    "temperature": 0.5,
    "max_tokens": 1024,
    "messages": [
        {
            "role": "system",
            "content": "You roleplay as a dungeon master engaged in a session of Dungeons and Dragons with the user. Write in an immersive way to avoid spoiling the user's experience."
        },
        {
            "role": "user",
            "content": "I am a kobold named Nico, what should I do?"
        }
    ],
    "adapter": {
        "templates": {
          "system": {
              "start": "<|im_start|>system\n",
              "end": "<|im_end|>\n"
          },
          "user": {
              "start": "<|im_start|>user\n",
              "end": "<|im_end|>\n"
          },
          "assistent": {
              "start": "",
              "end": ""
          },
          "after_last_message": ""
        }
    }
}

This PR proposes the following non-breaking addition to /v1/chat/completions endpoint:

+"adapter": {
+        "templates": {
+        "system": {
+            "start": " String | None,
+            "end": String | None
+        },
+        "user": {
+            "start": String | None,
+            "end": String | None
+        },
+        "assistent": {
+            "start": String | None,
+            "end": String | None
+        },
+        "after_last_message": String | None
+        }
+    }

If users omit the adapter object in the request, we fall back to the default Vicuna-style template.

Response with this patch:

Response with stock 1.46.1 build:

LostRuins · 2023-10-09T04:05:52Z

This is a great idea although I would probably simplify the syntax a bit into a single object. Is there any other project that does this currently? If theres an establish spec I could follow it.

LostRuins · 2023-10-09T04:11:29Z

Without any other spec, what about something like:

"adapter": {
"system_start":"str",
"system_end":"str",
"user_start":"str",
"user_end":"str",
"assistant_start":"str",
"assistant_end":"str"
}

With any missing or null field replaced with the default value for it.

What's a good use case for after_last_mes? Seems like it would break most bot responses

lofcz · 2023-10-09T13:05:16Z

@LostRuins thanks, I've implemented preliminary support in my lib OpenAiNg, however the format can be changed, I'm open to the one you've proposed.

As for after_last_mes it's used to support the old behaviour:
https://github.com/LostRuins/koboldcpp/pull/466/files#diff-885e6237f0dc0cc77c7b4a47ef801248f4d2e6a7743b37b85a451c3ac446cbd2L424

LostRuins · 2023-10-09T13:23:45Z

I see. I think the after_last_mes should not really be needed as the tag is intended to be the AI's assistant_start response tag, keeping in consistency with the earlier user/AI dialog. Most instruct formats will only use 2 (user/assistant) or 3 (user/system/assistant) tags, so this should align with them.

LostRuins · 2023-10-09T13:42:56Z

Hi @lofcz please take a look, I have simplified the API as mentioned, let me know if it works adequately with your frontend (try both with and without the adapter to see if everything is ok)

lofcz · 2023-10-09T14:36:57Z

@LostRuins thanks for the edit, I've tried it both with and without and it works great. I'd share a video but my frontend is not in English so it wouldn't be legible for most.

LostRuins · 2023-10-09T15:14:29Z

okay then looks good to me. will merge this :) cheers

lofcz · 2023-10-09T15:23:21Z

thanks!

teddybear082 · 2023-10-09T21:32:22Z

Nice addition lofcz and LostRuins!!! Very cool.

aseichter2007 · 2024-02-09T14:38:08Z

What is the final format expected?

LostRuins · 2024-02-10T12:20:57Z

As shown above. Just add the adapter to the regular json request body.

feat: oai-adapter

3993133

lofcz mentioned this pull request Oct 8, 2023

Implement basic chat/completions openai endpoint #461

Merged

LostRuins added the enhancement New feature or request label Oct 9, 2023

simplify optional adapter for instruct start and end tags

fb6b9a8

LostRuins approved these changes Oct 9, 2023

View reviewed changes

LostRuins added the completed completed label Oct 9, 2023

LostRuins changed the base branch from concedo to concedo_experimental October 9, 2023 15:24

LostRuins merged commit 96e9539 into LostRuins:concedo_experimental Oct 9, 2023

AMDBartek mentioned this pull request Oct 21, 2023

[Feature Request] Additions to the OpenAI-compatible API #486

Closed

kadogo mentioned this pull request Nov 14, 2023

Fix openai module version to 0.28 atisharma/chasm_engine#3

Merged

LostRuins mentioned this pull request Dec 13, 2023

Can I specify a prompt template for the openai drop in api? [to the devs] #552

Closed

ewired mentioned this pull request Feb 1, 2024

Changing the template for OpenAI-compatible chat completion #654

Closed

This was referenced Jun 15, 2024

Add support to use custom chat template for usage with koboldcpp open-webui/open-webui#3183

Closed

Auto apply correct chat template when using OpenAI compatible API #925

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenAI compat API adapter#466

OpenAI compat API adapter#466
LostRuins merged 2 commits intoLostRuins:concedo_experimentalfrom
lofcz:patch-1

lofcz commented Oct 8, 2023 •

edited

Loading

Uh oh!

LostRuins commented Oct 9, 2023

Uh oh!

LostRuins commented Oct 9, 2023 •

edited

Loading

Uh oh!

lofcz commented Oct 9, 2023

Uh oh!

LostRuins commented Oct 9, 2023

Uh oh!

LostRuins commented Oct 9, 2023

Uh oh!

lofcz commented Oct 9, 2023

Uh oh!

LostRuins commented Oct 9, 2023

Uh oh!

lofcz commented Oct 9, 2023

Uh oh!

teddybear082 commented Oct 9, 2023

Uh oh!

aseichter2007 commented Feb 9, 2024

Uh oh!

LostRuins commented Feb 10, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

lofcz commented Oct 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LostRuins commented Oct 9, 2023

Uh oh!

LostRuins commented Oct 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lofcz commented Oct 9, 2023

Uh oh!

LostRuins commented Oct 9, 2023

Uh oh!

LostRuins commented Oct 9, 2023

Uh oh!

lofcz commented Oct 9, 2023

Uh oh!

LostRuins commented Oct 9, 2023

Uh oh!

lofcz commented Oct 9, 2023

Uh oh!

teddybear082 commented Oct 9, 2023

Uh oh!

aseichter2007 commented Feb 9, 2024

Uh oh!

LostRuins commented Feb 10, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

lofcz commented Oct 8, 2023 •

edited

Loading

LostRuins commented Oct 9, 2023 •

edited

Loading