Skip to content

[REQUEST] Extend offload_states to support models with cpu-based optimizer #6596

@kfertakis

Description

@kfertakis

Is your feature request related to a problem? Please describe.
The issue is related to #5620 and #6011. The new offload_states API works only with FusedAdam GPU optimizer. Currently there is no way to offload a trainable model that is using a CPU-based optimizer like DeepSpeedCPUAdam.

Describe the solution you'd like
Extend #6011 to support offloading of a model configured with CPU-based DeepSpeedCPUAdam optimizer.

Thanks,

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions