fix(llama-cpp): correctly calculate embeddings #6259

mudler · 2025-09-12T13:08:16Z

Description

This PR is to add a test to validate #6257 specifically for the llama.cpp backend. We have this test for sentencetransformers just a couple of lines below, but somehow seems we missed applying it to llama.cpp

Seems the bug is confirmed. It looks like we did had a regression while we migrated to the new mtmd/context server. We were incorrectly returning always a static embedding vector in the embedding calls with the llama.cpp backend.

Notes for Reviewers

Fixes #6257

Signed commits

Yes, I signed my commits.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

netlify · 2025-09-12T13:08:22Z

✅ Deploy Preview for localai ready!

Name	Link
🔨 Latest commit	`efd98bc`
🔍 Latest deploy log	https://app.netlify.com/projects/localai/deploys/68c5c14a92baf200079c2685
😎 Deploy Preview	https://deploy-preview-6259--localai.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

chore(tests): check embeddings differs in llama.cpp

17b51e1

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

fix(llama.cpp): use the correct field for embedding

f70b520

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

mudler mentioned this pull request Sep 12, 2025

llama.cpp: Embeddings always identical #6257

Closed

github-actions bot added the dependencies label Sep 12, 2025

mudler changed the title ~~[TEST]~~ fix(llama-cpp): correctly calculate embeddings Sep 12, 2025

mudler force-pushed the fix/llama-cpp-embeddings branch 4 times, most recently from c1da49d to a22cf73 Compare September 12, 2025 19:29

mudler added bug Something isn't working and removed dependencies labels Sep 12, 2025

fix(llama.cpp): use embedding type none

a7c7757

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

mudler force-pushed the fix/llama-cpp-embeddings branch from a22cf73 to a7c7757 Compare September 13, 2025 16:33

github-actions bot added the dependencies label Sep 13, 2025

mudler force-pushed the fix/llama-cpp-embeddings branch from 2a7d709 to 4d15aa3 Compare September 13, 2025 19:08

chore(tests): add test-cases in aio-e2e suite

efd98bc

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

mudler force-pushed the fix/llama-cpp-embeddings branch from 4d15aa3 to efd98bc Compare September 13, 2025 19:08

mudler merged commit 6410c99 into master Sep 13, 2025
37 checks passed

mudler deleted the fix/llama-cpp-embeddings branch September 13, 2025 21:11

BrewTestBot mentioned this pull request Sep 17, 2025

localai 3.5.1 Homebrew/homebrew-core#244480

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix(llama-cpp): correctly calculate embeddings #6259

fix(llama-cpp): correctly calculate embeddings #6259

Uh oh!

mudler commented Sep 12, 2025 •

edited

Loading

Uh oh!

netlify bot commented Sep 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

fix(llama-cpp): correctly calculate embeddings #6259

fix(llama-cpp): correctly calculate embeddings #6259

Uh oh!

Conversation

mudler commented Sep 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

netlify bot commented Sep 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for localai ready!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mudler commented Sep 12, 2025 •

edited

Loading

netlify bot commented Sep 12, 2025 •

edited

Loading