
[BigModeling] Add missing check for quantized models#1652

Merged
younesbelkada merged 4 commits into main from fix-to-int8 on Jun 28, 2023

Conversation

@younesbelkada (Contributor)

What does this PR do?

Fixes huggingface/transformers#24540 and the failing test: https://github.com/huggingface/accelerate/actions/runs/5396594202/jobs/9800396637

Currently, on the main branch, loading a quantized model on a single GPU fails:

from transformers import AutoModelForCausalLM, AutoConfig, AutoTokenizer
import torch

model_path = "facebook/opt-350m"

config = AutoConfig.from_pretrained(model_path, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True, load_in_8bit=True, device_map="auto")

tokenizer = AutoTokenizer.from_pretrained(model_path)

input_text = "Describe the solar system."
input_ids = tokenizer(input_text, return_tensors="pt").input_ids.to("cuda")

outputs = model.generate(input_ids, max_length=100)
print(tokenizer.decode(outputs[0]))

In #1648 it seems that a check was missing before calling .to() on the model in the single-GPU dispatch path. Adding a small check circumvents this issue.

cc @sgugger @SunMarc

else:
    device = list(device_map.values())[0]
-   if device != "disk":
+   if device != "disk" and not getattr(model, "is_quantized", False):
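The guard can be sketched in isolation. This is a minimal mock, not the actual accelerate code: `FakeModel` and `maybe_move` are illustrative stand-ins, with `FakeModel.to()` mimicking the error a quantized transformers model raises when `.to()` is called with a device.

```python
# Minimal sketch of the fix, using a stand-in object instead of a real
# transformers model. `is_quantized` mimics the flag transformers sets
# on 8-bit/4-bit models.
class FakeModel:
    def __init__(self, is_quantized):
        self.is_quantized = is_quantized
        self.moved_to = None

    def to(self, device):
        if self.is_quantized:
            # Real quantized models raise when moved with .to();
            # this sketch mimics that behavior.
            raise ValueError("`.to` is not supported for quantized models.")
        self.moved_to = device
        return self


def maybe_move(model, device_map):
    device = list(device_map.values())[0]
    # The fix: skip .to() entirely when the model is quantized.
    if device != "disk" and not getattr(model, "is_quantized", False):
        model.to(device)
    return model


maybe_move(FakeModel(is_quantized=True), {"": 0})   # no longer raises
maybe_move(FakeModel(is_quantized=False), {"": 0})  # still moved as before
```

Without the `getattr` guard, the first call would hit the `ValueError` that the failing test reproduced.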
@younesbelkada (Contributor Author)

We should maybe also check the attributes is_loaded_in_8bit or is_loaded_in_4bit, as is_quantized was only recently introduced. WDYT?

@HuggingFaceDocBuilderDev commented Jun 28, 2023

The documentation is not available anymore as the PR was closed or merged.

@younesbelkada younesbelkada requested review from SunMarc and sgugger June 28, 2023 06:46
device = list(device_map.values())[0]
if device != "disk":
    # for backward compatibility
    is_quantized = getattr(model, "is_quantized", False) or getattr(model, "is_loaded_in_8bit", False)
@younesbelkada (Contributor Author)

getattr(model, "is_loaded_in_8bit", False) --> for backward compatibility for users that have an old version of transformers (before the 4-bit integration)
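The backward-compatibility trick is just a `getattr` fallback chain: check the new flag first, then the old one, defaulting to `False` when neither exists. A minimal sketch, where the three classes are illustrative stand-ins for models from different transformers versions:

```python
# Stand-ins for models produced by different transformers versions.
class Old8BitModel:
    # old transformers: only sets is_loaded_in_8bit
    is_loaded_in_8bit = True


class NewQuantizedModel:
    # recent transformers: sets is_quantized
    is_quantized = True


class PlainModel:
    # not quantized at all: neither attribute exists
    pass


def is_quantized(model):
    # New flag first, old flag as fallback; getattr's default covers
    # models that predate both attributes.
    return getattr(model, "is_quantized", False) or getattr(
        model, "is_loaded_in_8bit", False
    )


print(is_quantized(Old8BitModel()))       # True
print(is_quantized(NewQuantizedModel()))  # True
print(is_quantized(PlainModel()))         # False
```

This way the check never raises `AttributeError`, regardless of which transformers version produced the model.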

@sgugger (Collaborator) left a comment

Thanks for your PR, added a small comment.

Comment on lines 394 to 396
elif is_quantized:
    pass
else:
@sgugger (Collaborator)


Suggested change
-elif is_quantized:
-    pass
-else:
+elif not is_quantized:
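The suggestion is a pure refactor: an `elif cond: pass / else: body` ladder behaves the same as `elif not cond: body`. A quick check with hypothetical `move_verbose`/`move_compact` helpers (not the accelerate code, just the two branch shapes side by side):

```python
def move_verbose(device, is_quantized):
    # Original shape: an explicit no-op branch for quantized models.
    if device == "disk":
        return "offloaded"
    elif is_quantized:
        pass  # quantized models must not be moved with .to()
    else:
        return f"moved to {device}"


def move_compact(device, is_quantized):
    # Suggested shape: fold the no-op into the condition.
    if device == "disk":
        return "offloaded"
    elif not is_quantized:
        return f"moved to {device}"


# Both return the same result for every combination of inputs.
for args in [("disk", False), ("disk", True), (0, True), (0, False)]:
    assert move_verbose(*args) == move_compact(*args)
```

Dropping the empty branch removes a line and makes the "only move non-quantized models" intent explicit in the condition itself.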

@younesbelkada (Contributor Author)


Ah, I added it already in b93d93c, before your review :D

@SunMarc (Member) left a comment

Thanks for the work!
