Skip to content

Conversation

@ncilfone
Copy link
Contributor

@ncilfone ncilfone commented Jun 8, 2021

Added retain_graph as a kwarg to the main engine backward function -- this allows deepspeed's backward call to support multiple calls to backward with multiple losses since retain_graph is needed for the first backward call.

Closes #329

… this now supports multiple calls to backwards with multiple losses
@jeffra jeffra enabled auto-merge (squash) September 30, 2021 21:07
@rocm-mici
Copy link

Can one of the admins verify this patch?

@buttercutter
Copy link

@ncilfone @tjruwase May I ask what exact kind of branch conflicts for this pull request ?

@tjruwase
Copy link
Contributor

@buttercutter , I have resolved the merge conflict.

@ncilfone, FYI and apologies for delay on this.

@jeffra jeffra merged commit ba67bd9 into deepspeedai:master Jul 29, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Difference between loss.backward() and model_engine.backward(loss) ?

6 participants