
[FSDP] Fix param name prefixes for ignored modules#79955

Closed
awgu wants to merge 1 commit into pytorch:master from awgu:fsdp_prefix

Conversation

@awgu
Collaborator

@awgu awgu commented Jun 21, 2022

For ignored modules' parameters, we should also clean their parameter names since they will have the FSDP-specific prefixes.

This change only affects the prefixed parameter name keys in full_optim_state_dict() (i.e. optim state dict saving). Not having this change does not actually violate the correctness of the optim state dict save-load flow because it only requires that the keys are unique and internally consistent.

Either way, this PR now explicitly specifies that the parameter keys in the optim state dict should match the keys of the full model state dict.
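The FSDP wrapper registers parameters under internal module prefixes, so names gathered through the wrapped root include them even for ignored modules' parameters. A minimal sketch of the kind of name cleaning described above, using hypothetical stand-in prefix constants (the real constants live inside `torch.distributed.fsdp` and may differ by version):

```python
# Hypothetical stand-ins for the FSDP-internal wrapper prefixes; the
# actual constants are internal to torch.distributed.fsdp.
FSDP_PREFIXES = ("_fsdp_wrapped_module.", "_fpw_module.")

def clean_param_name(name: str) -> str:
    """Strip FSDP wrapper prefixes from a dotted parameter name so that
    optim state dict keys match the full model state dict keys."""
    for prefix in FSDP_PREFIXES:
        name = name.replace(prefix, "")
    return name

# An ignored module's parameter still picks up the wrapper prefixes when
# named through the FSDP root, so it must be cleaned the same way.
raw = "_fsdp_wrapped_module.layer1._fsdp_wrapped_module.ignored.weight"
print(clean_param_name(raw))  # layer1.ignored.weight
```

Without this cleaning, saved optim state dict keys for ignored parameters would keep the wrapper prefixes and diverge from the model state dict keys, which is the inconsistency this PR closes.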

@facebook-github-bot
Contributor

facebook-github-bot commented Jun 21, 2022


✅ No Failures (0 Pending)

As of commit ec35a3b (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.



@awgu awgu marked this pull request as ready for review June 21, 2022 18:21
@facebook-github-bot added the oncall: distributed label Jun 21, 2022
@facebook-github-bot
Contributor

@awgu has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@awgu awgu changed the title [FSDP] Fix param names for ignored modules [FSDP] Fix param name prefixes for ignored modules Jun 21, 2022
Contributor

@rohan-varma rohan-varma left a comment


great catch, LGTM

@awgu
Collaborator Author

awgu commented Jun 21, 2022

@pytorchbot merge

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@pytorchmergebot
Collaborator

@awgu your PR has been successfully merged.

@github-actions
Contributor

Hey @awgu.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

facebook-github-bot pushed a commit that referenced this pull request Jun 22, 2022

Pull Request resolved: #79955
Approved by: https://github.com/rohan-varma

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/0b0e65516d88b6ea5ac06598dc8247afe3c21d20

Reviewed By: rohan-varma

Differential Revision: D37320671

Pulled By: awgu

fbshipit-source-id: 576cf28f7f744fcb75f79af1fae7a3fd2c567d89
miladm pushed a commit to miladm/pytorch that referenced this pull request Jun 27, 2022
Pull Request resolved: pytorch#79955
Approved by: https://github.com/rohan-varma
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 25, 2026
Pull Request resolved: pytorch#79955
Approved by: https://github.com/rohan-varma

Labels

cla signed, Merged, oncall: distributed
