Skip to content

Add initial support for intel Gaudi accelerators#2121

Merged
merrymercy merged 3 commits intosgl-project:mainfrom
ankurneog:intel_gaudi_support_1
Nov 23, 2024
Merged

Add initial support for intel Gaudi accelerators#2121
merrymercy merged 3 commits intosgl-project:mainfrom
ankurneog:intel_gaudi_support_1

Conversation

@ankurneog
Copy link
Copy Markdown

Motivation

Intel gaudi ( device name : hpu) has PyTorch support as an Out-of-tree device. In addition support for Triton is also available . With this initial PR we introduce intel gaudi ( device : hpu) to SGLang
More details on Intel Gaudi SW installation can be found here :
https://docs.habana.ai/en/latest/PyTorch/Getting_Started_with_PyTorch_and_Gaudi/Getting_Started_with_PyTorch.html#getting-started-pyt-model
Triton support
https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Triton_Inference.html

@ankurneog
Copy link
Copy Markdown
Author

@liangan1 : can you have a look at the initial changes. thanks

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Refine other device path with "torch.get_device_module(self.device).set_device(self.gpu_id)" may be a better choice to make the code simplifier.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done , this will be further simplified once pytorch/pytorch#140536 is available, that way we dont have any device name references in the file.

@ankurneog ankurneog force-pushed the intel_gaudi_support_1 branch from ba8495f to 91c7594 Compare November 22, 2024 09:05
@ankurneog ankurneog changed the title Add intial support for intel gaudi accelerators Add initial support for intel Gaudi accelerators Nov 22, 2024
@merrymercy merrymercy merged commit 865233e into sgl-project:main Nov 23, 2024
@merrymercy merrymercy mentioned this pull request Nov 24, 2024
37 tasks
@zhaochenyang20 zhaochenyang20 mentioned this pull request Mar 3, 2025
22 tasks
timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants