
Allow save /load control of Optimizer state dict #3894

Closed

aria1th wants to merge 10 commits into AUTOMATIC1111:master from aria1th:patch-13

Conversation

@aria1th
Collaborator

@aria1th aria1th commented Oct 29, 2022

Optimizers, especially Adam and its variants, are recommended to have their state saved and loaded.

This patch offers a way to save / load the optimizer state, and also adds support for user-selected optimizer types, such as "SGD", "Adam", etc.

If selecting the optimizer type is enabled, this line has to be changed for safety:
if hypernetwork.optimizer_state_dict:
to something like
if hypernetwork.optimizer_name == hypernetwork_optimizer_type and hypernetwork.optimizer_state_dict

to prevent loading the wrong state dict for a mismatching optimizer type. A minimal sketch of the intended flow is shown below.
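
A minimal sketch of the save / load flow described above, assuming plain PyTorch; the function names and the layout of the saved dict are hypothetical, not the exact patch code:

```python
import torch

# Hypothetical sketch: persist the optimizer state next to the hypernetwork
# weights, and only restore it when the selected optimizer type matches the
# one that produced the state dict.

def save_hypernetwork(hypernetwork_state: dict, optimizer, path: str, save_optimizer_state: bool = True):
    data = {"hypernetwork": hypernetwork_state}          # hypothetical file layout
    if save_optimizer_state:
        data["optimizer_name"] = type(optimizer).__name__     # e.g. "AdamW"
        data["optimizer_state_dict"] = optimizer.state_dict()  # momentum / variance buffers
    torch.save(data, path)

def maybe_load_optimizer_state(optimizer, data: dict, selected_optimizer_name: str):
    # Guard against restoring a state dict produced by a different optimizer type.
    if data.get("optimizer_name") == selected_optimizer_name and "optimizer_state_dict" in data:
        optimizer.load_state_dict(data["optimizer_state_dict"])
```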

Users will see a new option in the Training section:

image

This option should only be enabled when users plan to continue training in the future.

Training can continue without saving the optimizer state, but some users reported that training sometimes blew up when continued from a checkpoint... presumably because the optimizer restarts without its accumulated state.

Before releasing an HN, it is recommended to turn the option off (with the Apply button) before saving / interrupting training.

A file size comparison for a standard (1, 2, 1) network is shown below; it is roughly a 3x size difference.

image
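
The roughly 3x ratio is consistent with Adam-style optimizers keeping two moment tensors (exp_avg, exp_avg_sq) per trainable tensor, i.e. about two extra copies of the weights. A toy check of that reasoning, with illustrative shapes standing in for the HN layers:

```python
import torch

# Adam/AdamW keep exp_avg and exp_avg_sq for every trainable tensor,
# i.e. roughly two extra copies of the weights on top of the weights themselves.
params = [torch.nn.Parameter(torch.zeros(768, 1536)) for _ in range(4)]  # toy stand-in for HN layers
opt = torch.optim.AdamW(params, lr=1e-4)
sum(p.sum() for p in params).backward()
opt.step()  # populates opt.state with exp_avg / exp_avg_sq per parameter

weight_elems = sum(p.numel() for p in params)
state_elems = sum(t.numel() for s in opt.state.values()
                  for t in s.values() if torch.is_tensor(t))
print(weight_elems, state_elems)  # state is ~2x the weights -> ~3x total on disk
```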

Current Task

  • Save and load optimizer state dict
    People complained about the optimizer not resuming properly; this was because we didn't save the optimizer state dict.

  • Generalized way to save / load optimizers
    This generalizes the optimizer resuming process. It does not necessarily mean that more optimizer options will be offered immediately.

Future jobs:

  • Fix Hypernetwork multiplier value while training
    As far as I can tell from the code, the hypernetwork multiplier can be changed while training.

  • Add an option to specify the standard deviation + scale multiplier for initialization + nonzero bias initialization
    related - Improving Hypernetwork initialization #2740
    Analyzed data : Colab
    In short, Xavier and Kaiming initialization have too large a standard deviation for the weights compared to normal initialization.
    But rather than using magic numbers, the std should be parameterized, and we can use Xavier normally if we scale it (this is called gain in the PyTorch init functions). See the initialization sketch after this list.

  • Add an option to fix weight initialization seeds.
    This is for reproducing results.

  • Add an option to specify the dropout structure.
    A few examples have shown that the 1, 2, 2[Dropout], 1 structure is promising. These were actually bug-generated networks, whose structure cannot be reproduced once the bug is fixed.
    Instead of removing the functionality entirely, we should offer a detailed way to specify dropouts.
    Example : [0, 0.1, 0.15, 0] -> applies dropout at the second and third layers. The sequence should follow the layer structure, and the first and last values should always be 0. See the dropout sketch after this list.
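
A sketch of the parameterized initialization mentioned above, using PyTorch's existing gain argument; the default gain value here is illustrative, not a recommendation:

```python
import torch.nn as nn

# Hypothetical helper: Xavier normal scaled by an explicit gain instead of a
# magic number, plus an optional nonzero bias initialization.
def init_linear(layer: nn.Linear, gain: float = 0.2, bias_std: float = 0.0):
    nn.init.xavier_normal_(layer.weight, gain=gain)   # std scales linearly with gain
    if bias_std > 0:
        nn.init.normal_(layer.bias, mean=0.0, std=bias_std)
    else:
        nn.init.zeros_(layer.bias)
```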
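
And a sketch of the dropout specification, assuming a per-layer rate list that follows the width structure; names and layout are illustrative only:

```python
import torch.nn as nn

# Hypothetical builder: widths (1, 2, 2, 1) with matching dropout rates
# [0, 0.1, 0.15, 0]; the first and last entries are forced to stay 0.
def build_layers(dim: int, structure=(1, 2, 2, 1), dropout=(0, 0.1, 0.15, 0)):
    assert len(structure) == len(dropout)
    assert dropout[0] == 0 and dropout[-1] == 0, "never drop the input/output"
    layers = []
    for i in range(len(structure) - 1):
        layers.append(nn.Linear(int(dim * structure[i]), int(dim * structure[i + 1])))
        if dropout[i + 1] > 0:  # the rate is attached to the layer it follows
            layers.append(nn.Dropout(p=dropout[i + 1]))
    return nn.Sequential(*layers)

# build_layers(768) -> Linear(768,1536), Dropout(0.1), Linear(1536,1536), Dropout(0.15), Linear(1536,768)
```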

Optional

  • Quick-start in page / Offering references of previously trained HNs

  • Emphasize the importance of dataset quality

  • Grouping activations by type

  • Generalized ways to evaluate HNs properly

  • Hyperparameter tuning pipeline

  • Add ways to use multiple hypernetworks sequentially or in parallel

This uses shared options, which can be changed asynchronously.
HN releases should be done with this option OFF, unless the author plans to allow others to continue training from it.
@aria1th
Collaborator Author

aria1th commented Oct 30, 2022

- Resolving conflicts

