Skip to content

Sync main into release/2.6 branch#1117

Merged
toyxu merged 5 commits into
release/2.6from
main
Nov 22, 2024
Merged

Sync main into release/2.6 branch#1117
toyxu merged 5 commits into
release/2.6from
main

Conversation

@toyxu

@toyxu toyxu commented Nov 22, 2024

Copy link
Copy Markdown
Contributor

Reset to bfdbaf4

mengfei25 and others added 5 commits November 21, 2024 10:45
Fix the bug where shared memory initialization is missing in the foreach
backbone

---------

Co-authored-by: Yutao Xu <yutao.xu@intel.com>
Primarily adopt better tuning for `scatter-gather` kernel launch
configurations.
torch_xpu_ops_sycl_kernels leads to around 1.83GB in size on windows,
splitting it to reduce the lib size.

New libs introduced in this PR:

torch_xpu_ops_sycl_tensor_srcs
torch_xpu_ops_sycl_norm_loss_srcs
torch_xpu_ops_sycl_poly_srcs
torch_xpu_ops_sycl_dist_srcs

---------

Co-authored-by: Feng Yuan <feng1.yuan@intel.com>
@toyxu toyxu requested a review from chuanqi129 November 22, 2024 06:14
@toyxu toyxu merged commit 1e32bbc into release/2.6 Nov 22, 2024
cfgfung pushed a commit that referenced this pull request Jan 23, 2025
Reset to
bfdbaf4

---------

Co-authored-by: mengfei25 <mengfei.li@Intel.com>
Co-authored-by: LuFengqing <fengqing.lu@intel.com>
Co-authored-by: Ratnam Parikh <114774508+ratnampa@users.noreply.github.com>
Co-authored-by: Feng Yuan <feng1.yuan@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants