Clarify Kubernetes version requirement and fallback plan in Key Features #380
Conversation
  - **Accelerator Fungibility**: llmaz supports serving the same LLM with various accelerators to optimize cost and performance.
  - **Various Model Providers**: llmaz supports a wide range of model providers, such as [HuggingFace](https://huggingface.co/), [ModelScope](https://www.modelscope.cn), ObjectStores. llmaz will automatically handle the model loading, requiring no effort from users.
- - **Multi-Host Support**: llmaz supports both single-host and multi-host scenarios with [LWS](https://github.com/kubernetes-sigs/lws) from day 0.
+ - **Multi-Host Support**: llmaz supports both single-host and multi-host scenarios with [LWS](https://github.com/kubernetes-sigs/lws) from day 0. **Important**: LWS requires Kubernetes version **v1.26 or higher**. If you are using a lower Kubernetes version and most of your workloads rely on single-node inference, we may consider replacing LWS with a deployment-based approach. This fallback plan would involve using Kubernetes Deployments to manage single-node inference workloads efficiently. See [#32](https://github.com/InftyAI/llmaz/issues/32) for more details and updates.
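For context on the fallback described in the added line, here is a minimal sketch of what a Deployment-managed single-node inference workload could look like. All names, the image, and the port below are hypothetical placeholders chosen for illustration; they are not actual llmaz resources or APIs, and this shows the general approach under discussion, not the project's implementation.

```yaml
# The LWS path assumes the cluster reports v1.26 or higher, e.g.:
#   kubectl version --output=json | jq -r '.serverVersion.gitVersion'
#
# Hypothetical fallback: manage a single-node inference server with a
# plain apps/v1 Deployment instead of a LeaderWorkerSet.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: llm-inference            # placeholder name
spec:
  replicas: 1                    # single-node inference: one self-contained pod
  selector:
    matchLabels:
      app: llm-inference
  template:
    metadata:
      labels:
        app: llm-inference
    spec:
      containers:
        - name: server
          image: vllm/vllm-openai:latest   # example inference server image
          ports:
            - containerPort: 8000          # OpenAI-compatible API port
          resources:
            limits:
              nvidia.com/gpu: 1            # one accelerator per replica
```

A Deployment only covers workloads that fit on one node; the cross-node leader/worker topology that multi-host serving needs is exactly what LWS provides, which is why the note frames Deployments as a fallback for older clusters rather than a general replacement.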
Could you update this in https://github.com/InftyAI/llmaz/blob/main/docs/installation.md#prerequisites? Thanks
Sure. I will complete it and make a commit.
Please revert this change; the note within installation.md is enough. Thanks
/kind documentation
I've updated the Installation file. Can you please review it?
/lgtm
Thanks for your patience. Welcome onboard!
Thank you! I appreciate the review and approval. Excited to contribute further!
/lgtm
Seems we need to rebase.
Co-authored-by: Kante Yin <kerthcet@gmail.com>
Force-pushed from 971cf5e to 15fee24.
Are there any final changes required? Please clarify, and I will make the necessary updates.
/lgtm
Clarify Kubernetes version requirement and fallback plan in Key Features section
What this PR does / why we need it
This PR updates the README file to improve clarity on multi-host support and installation guidelines by specifying the Kubernetes version requirement, helping to prevent setup issues.
Which issue(s) this PR fixes
Fixes #379
Special notes for your reviewer
Does this PR introduce a user-facing change?