Releases: defilantech/LLMKube
Releases · defilantech/LLMKube
llmkube-0.5.0
Helm chart for LLMKube v0.5.0 — fixes appVersion to match published controller image
v0.5.0
0.5.0 (2026-03-04)
Features
- add pre-flight memory validation for Metal agent (#204) (ba252ef)
- add health checks, metrics, and continuous monitoring to Metal agent (#205) (a113fd1)
- add per-model memoryBudget and memoryFraction CRD fields (#206) (e632369)
Bug Fixes
- agent: unregister service endpoints on metal process delete (#168) (147b9bc)
- enable controller metrics endpoint in Helm chart (#195) (70940af)
- prevent model re-download of cached models after helm upgrade (#203) (a8f9a88)
- use Recreate strategy for GPU workloads to prevent rolling update deadlock (#196) (2e45181)
Documentation
v0.4.20
0.4.20 (2026-02-28)
Features
- add license compliance scanning for GGUF models (#188) (c26400a)
- add Prometheus metrics, OpenTelemetry tracing, and inference observability (#189) (c653ff1)
- add PVC inspection to cache list for orphaned entry detection (#183) (2723d92)
- agent: add structured zap logging to metal agent (#164) (e9d143c)
- deps: upgrade to Kubernetes 1.35 and controller-runtime v0.23.1 (#175) (3c323f4)
Bug Fixes
- correct Metal quickstart docs for selectorless services (#173) (89471ec)
- prevent command injection in init container shell commands (#172) (3aa9cc3)
- remove mutable latest tags and pin container images (#174) (3c4569a)
Documentation
llmkube-0.4.20
A Helm chart for LLMKube - Kubernetes operator for GPU-accelerated LLM inference
v0.4.19
llmkube-0.4.19
A Helm chart for LLMKube - Kubernetes operator for GPU-accelerated LLM inference
v0.4.18
v0.4.17
v0.4.16
llmkube-0.4.18
A Helm chart for LLMKube - Kubernetes operator for GPU-accelerated LLM inference