Skip to content

Conversation

@jeffra
Copy link
Collaborator

@jeffra jeffra commented Feb 6, 2020

No description provided.

@jeffra jeffra merged commit 5494027 into master Feb 6, 2020
@jeffra jeffra deleted the jeffra/license_badge branch February 6, 2020 06:39
kouml pushed a commit to kouml/DeepSpeed that referenced this pull request Apr 3, 2020
rraminen pushed a commit to rraminen/DeepSpeed that referenced this pull request Apr 28, 2021
* adding example for ZeRO-2 CPU offload  (deepspeedai#29)

Co-authored-by: Jie <37380896+jren73@users.noreply.github.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jie <37380896+jren73@users.noreply.github.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
baodii added a commit to baodii/DeepSpeed that referenced this pull request Nov 14, 2023
* add sylomatic code into upstream

enable jit_load for sycl kernels

* find Python.h using general code

* * add SYCLAutoOpBuilder to support InferenceOpBuilder
* move scripts path to op_builder/xpu

* only change cuda files extension

* change third-party relative path to enabel python install

* extracty smaller functions from sycl_extension

* change from_blob in source code to avoid big part post processing

* run pre-commit

* add BF16 support

* add other OPBuilder. fused_adam done

* cpu_adam done

* all xpu OpBuilder done, need more test

* delete csrc/xpu

* delete useless files
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants