server : (embeddings) using same format for "input" and "content" #10872

Merged: ggerganov merged 4 commits into ggml-org:master from ngxson:xsn/embedding_input, Dec 18, 2024

Conversation

@ngxson (Contributor) commented Dec 17, 2024

Supersedes #10866

"input" and "content" now use the same format. Test cases have also been added.
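For context, a minimal sketch (in Python, with an illustrative helper name, not the actual llama.cpp code) of the unified prompt handling that both fields now share: a single string, a list of strings, or a list of token ids are all accepted and normalized into a list of prompts.

```python
# Hypothetical sketch of the unified prompt normalization the server
# applies to both "input" (OpenAI-style) and "content" (legacy) fields.
# The function name and structure are illustrative assumptions.

def normalize_prompts(value):
    """Accept a single string, a list of token ids, or a list of
    prompts (strings / token lists); always return a list of prompts."""
    if isinstance(value, str):
        return [value]                      # "hello" -> ["hello"]
    if isinstance(value, list):
        if value and all(isinstance(v, int) for v in value):
            return [value]                  # [1, 2, 3] -> [[1, 2, 3]]
        return value                        # already a list of prompts
    raise ValueError("prompt must be a string or a list")
```

With this shape, a request body of `{"input": "hello"}` and one of `{"content": "hello"}` go through the same code path, which is what the test cases exercise.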

@ngxson ngxson requested a review from ggerganov December 17, 2024 20:37
return;
}

std::vector<llama_tokens> tokenized_prompts = tokenize_input_prompts(ctx_server.ctx, prompt, true, true);
@ngxson (Contributor, Author) commented on this diff:

@ggerganov I changed add_special to true here, because I remember that embedding models need the BOS token. I'm not sure why it was removed at some point; maybe due to human error (my error 👀?) during recent refactoring.
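To illustrate what the add_special flag controls, here is a toy sketch (hypothetical token ids and a whitespace tokenizer, not any real vocabulary or the llama.cpp tokenizer): with add_special enabled, a BOS token is prepended to the token sequence, which many embedding models expect in order to produce good embeddings.

```python
BOS_ID = 1  # hypothetical BOS token id; model-dependent in practice

def tokenize(text, vocab, add_special=True):
    """Toy tokenizer: whitespace split, then vocab lookup.
    add_special=True prepends the BOS token, mirroring the effect of
    passing true to tokenize_input_prompts in the diff above."""
    ids = [vocab[w] for w in text.split()]
    return ([BOS_ID] + ids) if add_special else ids
```

If the flag is accidentally dropped during a refactor, the model silently receives sequences without BOS, which degrades embedding quality without raising any error.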

@ggerganov ggerganov mentioned this pull request Dec 17, 2024
@github-actions github-actions bot added the examples, python (python script changes), and server labels Dec 17, 2024
@ggerganov ggerganov merged commit 4682887 into ggml-org:master Dec 18, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Dec 20, 2024
server : (embeddings) using same format for "input" and "content" (ggml-org#10872)

* server : (embeddings) using same format for "input" and "content"

* fix test case

* handle empty input case

* fix test