Skip to content

Conversation

@sredman
Copy link
Contributor

@sredman sredman commented Dec 14, 2025

Description

This PR fixes #7355

Notes for Reviewers

Tested locally with two CPU-based workers. I haven't yet figured out how to deploy this version to my lab to test vulkan. Merge this soon so I don't need to figure it out :)

Tested with the current version of llama.cpp gRPC which as of this writing is v3.6.0 (llama.cpp commit a81a569577cc38b32558958b048228150be63eae)

Signed commits

  • Yes, I signed my commits.

@netlify
Copy link

netlify bot commented Dec 14, 2025

Deploy Preview for localai ready!

Name Link
🔨 Latest commit e06d13a
🔍 Latest deploy log https://app.netlify.com/projects/localai/deploys/693e372dd089960008986391
😎 Deploy Preview https://deploy-preview-7566--localai.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@sredman sredman changed the title Work/sredman/grpc server v3 fix(7355): Update llama-cpp grpc for v3 interface Dec 14, 2025
Signed-off-by: Simon Redman <simon@ergotech.com>
Signed-off-by: Simon Redman <simon@ergotech.com>
Signed-off-by: Simon Redman <simon@ergotech.com>
@sredman sredman force-pushed the work/sredman/grpc-server-v3 branch from 2afc0d5 to e06d13a Compare December 14, 2025 04:03
Copy link
Owner

@mudler mudler left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good catch, thank you!

@mudler mudler merged commit 5de539a into mudler:master Dec 14, 2025
38 of 39 checks passed
@sredman sredman deleted the work/sredman/grpc-server-v3 branch December 14, 2025 16:14
@mudler mudler added the bug Something isn't working label Dec 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working

Projects

None yet

Development

Successfully merging this pull request may close these issues.

RPC Distributed Inference: Environment Variable Not Being Passed to Backend

2 participants