Gemlite fixes by HDCharles · Pull Request #1432 · pytorch/ao

HDCharles · 2024-12-18T14:38:56Z

Summary:

Shapes need to be divisible by 128, or they will not work with gemlite. fp32 accumulation is also needed for group size None on int4.
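To illustrate the first constraint, here is a minimal sketch of a shape check. The helper name `is_gemlite_compatible_shape` is hypothetical, and applying the check to both `in_features` and `out_features` is an assumption; the summary only states that shapes must be divisible by 128.

```python
# Hypothetical helper, not part of torchao or gemlite.
# Assumption: both dimensions of the linear weight must be multiples of 128.
REQUIRED_MULTIPLE = 128


def is_gemlite_compatible_shape(in_features: int, out_features: int) -> bool:
    """Return True if both dimensions are divisible by REQUIRED_MULTIPLE."""
    return (
        in_features % REQUIRED_MULTIPLE == 0
        and out_features % REQUIRED_MULTIPLE == 0
    )


if __name__ == "__main__":
    # 4096 = 32 * 128 and 11008 = 86 * 128, so this shape passes.
    print(is_gemlite_compatible_shape(4096, 11008))
    # 1000 is not a multiple of 128, so this shape would need a fallback.
    print(is_gemlite_compatible_shape(1000, 1024))
```

A caller could use a check like this to skip gemlite quantization for layers whose shapes do not qualify, rather than producing incorrect results.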

Test Plan:

python test_integration.py -k "test_gemlite" (new test for non-divisible shape)

python generate.py --checkpoint_path $CHECKPOINT_PATH/$MODEL_REPO/model.pth --compile --precision float16 --quantization gemlite-8-4-None --write_result benchmark_results.txt

python generate.py --checkpoint_path $CHECKPOINT_PATH/$MODEL_REPO/model.pth --compile --precision float16 --quantization gemlite-32-4-None --write_result benchmark_results.txt

(previously these gave nonsense responses)
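As a rough illustration of why the accumulation dtype can matter (this is not a reproduction of the gemlite kernel), the sketch below emulates a long reduction in which the accumulator is rounded to fp16 or to fp32 after every add. It uses only Python's `struct` module; the helper names are made up for this example, and whether this exact effect caused the nonsense responses is an assumption.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def to_fp32(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def accumulate(values, round_fn) -> float:
    """Sum values, rounding the running total after each add."""
    acc = 0.0
    for v in values:
        acc = round_fn(acc + v)
    return acc


# A long reduction of small, exactly representable terms.
values = [1.0] * 4096

fp16_sum = accumulate(values, to_fp16)
fp32_sum = accumulate(values, to_fp32)

# fp16 has an 11-bit significand, so the running total stalls at 2048:
# 2048 + 1 is not representable and rounds back to 2048.
print(fp16_sum)  # 2048.0
print(fp32_sum)  # 4096.0
```

With group size None, a single scale covers the whole reduction dimension, so it is plausible (though not confirmed in this thread) that long fp16 reductions of this kind lose enough precision to corrupt the output.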

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: shapes need to be divisible by 128 or they will not work with gemlite need fp32 accumulation for groupsize None on int4 Test Plan: python test_integration.py -k "test_gemlite" (new test for non divisible shape)a python generate.py --checkpoint_path $CHECKPOINT_PATH/$MODEL_REPO/model.pth --compile --precision float16 --quantization gemlite-8-4-None --write_result benchmark_results.txt python generate.py --checkpoint_path $CHECKPOINT_PATH/$MODEL_REPO/model.pth --compile --precision float16 --quantization gemlite-32-4-None --write_result benchmark_results.txta (previously these gave nonsense responses) Reviewers: Subscribers: Tasks: Tags:

pytorch-bot · 2024-12-18T14:39:00Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1432

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 1797c75 with merge base 33d57af:

NEW FAILURES - The following jobs have failed:

Code Analysis with Ruff / build (3.9) (gh)
Process completed with exit code 1.
PR Label Check / Check PR Labels (gh)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jerryzh168 · 2024-12-18T16:41:12Z

        return abs(other_event.event_time - self.event_time) * 1000


+def get_arch_name() -> str:


Why these changes? Is this a rebase issue?

@HDCharles

mobicham · 2024-12-18T17:11:27Z

+        if _layout.group_size == None and _layout.bit_width == 4:
+                from gemlite.core import GEMLITE_ACC_DTYPE
+                from gemlite.dtypes import DType
+                GEMLITE_ACC_DTYPE[DType.FP16] = DType.FP32


This will only work when all the layers use the same group_size, which is OK for now.
The other option would be to use this: https://github.com/mobiusml/gemlite/blob/master/gemlite/core.py#L87 but for now, let's keep it like this.

HDCharles · 2024-12-19

I tested this manually; it works in all cases, even when there are different group sizes.

mobicham · 2024-12-21

I mean when different layers use different settings within the same model, but let's not worry about that!
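The concern about mixed settings can be sketched with a toy model. Everything below is a hypothetical stand-in, not gemlite's real API: `ACC_DTYPE` mimics a module-level mapping like `GEMLITE_ACC_DTYPE`, and the sketch assumes the mapping is read when a layer runs. Under those assumptions, one layer that triggers the override changes the accumulation dtype for every layer in the process.

```python
# Toy stand-in for a module-level accumulation-dtype mapping (hypothetical).
ACC_DTYPE = {"fp16": "fp16"}


class ToyLayer:
    """Minimal layer record; not a real quantized layer."""

    def __init__(self, name, group_size, bit_width):
        self.name = name
        self.group_size = group_size
        self.bit_width = bit_width

    def acc_dtype(self):
        # Assumption: the shared mapping is consulted at call time.
        return ACC_DTYPE["fp16"]


def configure(layer):
    """Mirror the diff's condition: override globally for int4, group size None."""
    if layer.group_size is None and layer.bit_width == 4:
        ACC_DTYPE["fp16"] = "fp32"


layers = [ToyLayer("a", 64, 4), ToyLayer("b", None, 4)]
for layer in layers:
    configure(layer)

# Layer "a" did not ask for fp32 accumulation but gets it anyway,
# because the override is global rather than per layer.
result = {layer.name: layer.acc_dtype() for layer in layers}
print(result)  # {'a': 'fp32', 'b': 'fp32'}
```

In this toy model the outcome stays correct, since fp32 accumulation is the more conservative choice; the cost would be speed for layers that did not need it. A per-layer setting would avoid that, which appears to be the alternative the linked line in `core.py` points toward.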

jerryzh168 · 2024-12-19T00:42:51Z

Landed in #1435; please feel free to submit any follow-up fixes.

HDCharles requested a review from jerryzh168 December 18, 2024 14:39

facebook-github-bot added the CLA Signed label Dec 18, 2024

HDCharles requested a review from mobicham December 18, 2024 14:39

jerryzh168 reviewed Dec 18, 2024

jerryzh168 added a commit to jerryzh168/ao that referenced this pull request Dec 18, 2024

Fix gemlite shape issues

20b816f

Summary: Resubmitting fixes from @HDCharles in pytorch#1432 since that seems to have issues with rebase Test Plan: see pytorch#1432 Reviewers: Subscribers: Tasks: Tags:

mobicham reviewed Dec 18, 2024

jerryzh168 added a commit to jerryzh168/ao that referenced this pull request Dec 18, 2024

[resubmit] Gemlite fix

967e35a

Summary: Resubmitting pytorch#1432 since it has some rebase issues and we want to merge the fix asap Test Plan: see pytorch#1432 Reviewers: Subscribers: Tasks: Tags:

jerryzh168 mentioned this pull request Dec 18, 2024

[resubmit] Gemlite fix #1435

Merged

amdfaa pushed a commit that referenced this pull request Jan 10, 2025

[resubmit] Gemlite fix (#1435)

a2fd476

* [resubmit] Gemlite fix Summary: Resubmitting #1432 since it has some rebase issues and we want to merge the fix asap Test Plan: see #1432 Reviewers: Subscribers: Tasks: Tags: * ruff

HDCharles closed this Mar 19, 2025

HDCharles wants to merge 1 commit into main from 049_gemlite_fix

4 participants
