[diffusion] Diffusion norm fusion for z-image #18762
mickqian merged 9 commits into sgl-project:main
could you also share the images generated before and after the fuse so we can check the precision?
Are we still looking to get this merged? @yingluosanqian I believe it should be ready for review
hi, i left some comments earlier. could you please take a look and fix them first?
if you mean this comment, i attached the images in the pr description!
Signed-off-by: Chi McIsaac <chixie.mcisaac@gmail.com> Co-authored-by: yihanc <yingluosanqian@gmail.com> Co-authored-by: Mick <mickjagger19@icloud.com>
Motivation
Speed up Z-Image DiT modulation by using the fused residual-form path:
residual + tanh(gate) * rmsnorm(x)
Initial kernel was authored by: yihan chen @yingluosanqian
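For readers who want to see what the residual form computes, here is a minimal pure-Python reference sketch. It is not the CuTeDSL kernel from this PR. The function names, the per-element `weight` argument, and the `eps` default are assumptions made for illustration only. It contrasts a three-pass unfused version with a single-pass version that reuses one RMS reduction, which is the idea behind fusing.

```python
import math


def rmsnorm(x, weight, eps=1e-6):
    # RMS normalisation over one row; `weight` and `eps` are illustrative assumptions.
    inv_rms = 1.0 / math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]


def unfused(residual, gate, x, weight, eps=1e-6):
    # Three separate passes over the row: norm, tanh-gate multiply, residual add.
    normed = rmsnorm(x, weight, eps)
    gated = [math.tanh(g) * n for g, n in zip(gate, normed)]
    return [r + v for r, v in zip(residual, gated)]


def fused(residual, gate, x, weight, eps=1e-6):
    # One reduction for the RMS statistic, then a single elementwise pass
    # computing residual + tanh(gate) * rmsnorm(x).
    inv_rms = 1.0 / math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [
        r + math.tanh(g) * (v * inv_rms * w)
        for r, g, v, w in zip(residual, gate, x, weight)
    ]
```

In terms of the general form norm(x) * tanh(scale) + shift, this sketch takes scale = gate and shift = residual. A zero gate leaves the residual unchanged, which makes a convenient sanity check.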
Modifications
fused_norm_tanh_mul_add CuTeDSL kernel (norm(x) * tanh(scale) + shift, used here in residual form as residual + tanh(gate) * rmsnorm(x)).
Benchmarking and Profiling
1x NVIDIA H100 80GB, Tongyi-MAI/Z-Image-Turbo
Baseline: main@4c6afbeea
Summary (Mean of 10 Runs)
Run-to-Run E2E Consistency
Precision OK

Baseline:
After:
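A before/after image comparison can also be backed by a number. The sketch below is an illustrative assumption rather than the check used in this PR. It computes the maximum absolute pixel difference and PSNR between two images given as flat lists of 8-bit pixel values. The helper names and the `peak` default are hypothetical.

```python
import math


def max_abs_diff(a, b):
    # Largest per-pixel deviation between the two flattened images.
    return max(abs(p - q) for p, q in zip(a, b))


def psnr(a, b, peak=255.0):
    # Peak signal-to-noise ratio in dB; identical images give infinity.
    mse = sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)
    return math.inf if mse == 0 else 10.0 * math.log10(peak * peak / mse)
```

Higher PSNR means the fused output is closer to the baseline. What threshold counts as acceptable is a judgment call this sketch does not settle.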

Checklist