Skip to content

Conversation

@jeffra
Copy link
Collaborator

@jeffra jeffra commented May 29, 2020

No description provided.

jeffra and others added 30 commits May 27, 2020 16:00
* Transformer kernels

Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Elton Zheng <eltonz@microsoft.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Tunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
* add transformer kernel api

* moving docs to deepspeed_cuda classes from __init__

* moving docs to deepspeed_cuda classes from __init__

* make transformer config and layer two sections

* added Mock imports for kernel libs.

* some minor changes to class comments

* update for code review feedback

* update for review feedback

Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
* Bert Tutorial update

* Bert Tutorial update

* fix formatting and minor tweaks to recipe

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
RezaYazdaniAminabadi and others added 5 commits May 29, 2020 11:50
* update the tutorials for fine-tuning

* update the finetuning tutorial

* Changing the structure a bit

* Typo

* Expected Results for DS pretrained model

* Update bert-finetuning.md

* :remove and after yield

Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
@jeffra jeffra merged commit 734d899 into master May 29, 2020
@jeffra jeffra deleted the kernel-staging branch September 15, 2020 20:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants