Fix bfloat16/float16/float32 options by jerryzh168 · Pull Request #1369 · pytorch/ao

jerryzh168 · 2024-12-02T22:51:33Z

Summary:
There was some problems with previous implementation of bfloat16/float16/float32 since it does not convert activation to the correct dtype after quantization, this PR fixes it

Test Plan:
llama:

python generate.py --checkpoint_path $CHECKPOINT_PATH/$MODEL_REPO/model.pth --compile --compile_prefill --quantization autoquant-fp

same2:

server:
python server.py ~/checkpoints/sam2 large --port 8000 --host localhost --fast --use_autoquant

client:
time xargs -I {} curl -s -w "\n" -X POST http://localhost:8000/upload_rle -F 'image=@{}' < sav_val_image_paths_shuf_1000 > results/sav_val_masks_baseline_shuf_1000

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: There was some problems with previous implementation of bfloat16/float16/float32 since it does not convert activation to the correct dtype after quantization, this PR fixes it Test Plan: llama: ``` python generate.py --checkpoint_path $CHECKPOINT_PATH/$MODEL_REPO/model.pth --compile --compile_prefill --quantization autoquant-fp ``` same2: ``` server: python server.py ~/checkpoints/sam2 large --port 8000 --host localhost --fast --use_autoquant client: time xargs -I {} curl -s -w "\n" -X POST http://localhost:8000/upload_rle -F 'image=@{}' < sav_val_image_paths_shuf_1000 > results/sav_val_masks_baseline_shuf_1000 ``` Reviewers: Subscribers: Tasks: Tags:

pytorch-bot · 2024-12-02T22:51:36Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1369

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 7463228 with merge base ed76e9c ():

NEW FAILURE - The following job has failed:

Run Regression Tests / test-nightly (CPU Nightly, linux.4xlarge, --pre torch --index-url https://download.pytorch.org/wh... / linux-job (gh)
test/prototype/test_parametrization.py::TestFakeSparsity::test_jit_trace

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…ytorch#1369) * Only set up during the first sample * Cleaner

* Fix bfloat16/float16/float32 options Summary: There was some problems with previous implementation of bfloat16/float16/float32 since it does not convert activation to the correct dtype after quantization, this PR fixes it Test Plan: llama: ``` python generate.py --checkpoint_path $CHECKPOINT_PATH/$MODEL_REPO/model.pth --compile --compile_prefill --quantization autoquant-fp ``` same2: ``` server: python server.py ~/checkpoints/sam2 large --port 8000 --host localhost --fast --use_autoquant client: time xargs -I {} curl -s -w "\n" -X POST http://localhost:8000/upload_rle -F 'image=@{}' < sav_val_image_paths_shuf_1000 > results/sav_val_masks_baseline_shuf_1000 ``` Reviewers: Subscribers: Tasks: Tags: * ruff

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 2, 2024

cpuhrsch approved these changes Dec 2, 2024

View reviewed changes

jerryzh168 added topic: new feature Use this tag if this PR adds a new feature topic: bug fix Use this tag for PRs that fix bugs and removed topic: new feature Use this tag if this PR adds a new feature labels Dec 2, 2024

ruff

7463228

jerryzh168 merged commit 8a51e1a into pytorch:main Dec 3, 2024

yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024

Update Caching logic to only trigger on the first inference sample (p…

6eae887

…ytorch#1369) * Only set up during the first sample * Cleaner

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bfloat16/float16/float32 options#1369

Fix bfloat16/float16/float32 options#1369
jerryzh168 merged 2 commits into
pytorch:mainfrom
jerryzh168:autoquant-all

jerryzh168 commented Dec 2, 2024

Uh oh!

pytorch-bot Bot commented Dec 2, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jerryzh168 commented Dec 2, 2024

Uh oh!

pytorch-bot Bot commented Dec 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1369

❌ 1 New Failure

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pytorch-bot Bot commented Dec 2, 2024 •

edited

Loading