Skip to content
This repository was archived by the owner on Feb 25, 2025. It is now read-only.

Conversation

@dnfield
Copy link
Contributor

@dnfield dnfield commented Mar 22, 2023

Fixes flutter/flutter#122828

On Intel hardware, the subgroup size is coming out as 16 for the quad shader in particular. There are, however, 26 quads to compute. The test works fine on M1 hardware where the subgroup size stays at 32 for this shader, but fails on Intel.

This change makes both the test shaders safe against smaller threadgroup sizes.

Copy link
Contributor

@jonahwilliams jonahwilliams left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

RSLGTM

@dnfield
Copy link
Contributor Author

dnfield commented Mar 22, 2023

A fun note to add: I was scratching my head for a bit debugging this problem because if I simplified the quad shader even a little, it jumped back up to a subgroup size of 32 on the intel mac and everything looked good. I finally thought to check the size specifically when I noticed that the count value was consistently smaller than expected once the shader got more complex.

@dnfield dnfield added the autosubmit Merge PR when tree becomes green via auto submit App label Mar 22, 2023
@auto-submit auto-submit bot removed the autosubmit Merge PR when tree becomes green via auto submit App label Mar 22, 2023
@auto-submit
Copy link
Contributor

auto-submit bot commented Mar 22, 2023

auto label is removed for flutter/engine, pr: 40509, due to - The status or check suite Mac Host clang-tidy has failed. Please fix the issues identified (or deflake) before re-applying this label.

  • The status or check suite Linux Host clang-tidy has failed. Please fix the issues identified (or deflake) before re-applying this label.
  • The status or check suite Linux Benchmarks (no-upload) has failed. Please fix the issues identified (or deflake) before re-applying this label.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

autosubmit Merge PR when tree becomes green via auto submit App e: impeller

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Impeller] Failing GPU compute tests on Intel macOS

2 participants