NotImplementedError: offload_to_cpu=True and NO_SHARD is not supported yet

### System Info

I was using fsdp with settings "full_shard auto_wrap" on a A100 GPU. The training went well but was interupted when saving the checkpoints. The error stated `NotImplementedError: offload_to_cpu=True and NO_SHARD is not supported yet`. I understand that I am using a single GPU so fsdp defaluts to NO_SHAPR. However, I dont understand why offload_to_cpu was set to True. Or anywhere I can reset it to false?

### Who can help?

_No response_

### Information

- [ ] The official example scripts
- [ ] My own modified scripts

### Tasks

- [ ] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...)
- [ ] My own task or dataset (give details below)

### Reproduction

following https://github.com/lm-sys/FastChat to fine-tune an LLM

### Expected behavior

the error as stated.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NotImplementedError: offload_to_cpu=True and NO_SHARD is not supported yet #24874

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

NotImplementedError: offload_to_cpu=True and NO_SHARD is not supported yet #24874

Description

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions