[Graph] Add to_cuda() for Module class by hjjq · Pull Request #57 · hidet-org/hidet

hjjq · 2023-01-04T22:17:37Z

No description provided.

Add bias to Conv2d Module. Defaults to false for back compatibility, **this is different from torch default**. Towards #57

Add some necessary module components used frequently in Stable Diffusion's UNet. Includes fixes to module attribute access from LLM branch and work arounds for torch weight copying. Towards #57.

Add graph module for using flash attention and clarify some differences in flash attention vs torch sdpa. **Attention: (pun intended)** Softmax has temperature scaling option. Divides inputs by scalar, good explanation of numerical effects [here](https://medium.com/@harshit158/softmax-temperature-5492e4007f71). Used when softmax inputs QK are too big for float 16 (abs value > 65504). This usually means the numbers are so large that dividing by small (< 4) scalar has little effect. Stable diffusion does not use this, as torch spda supports float 32 (or somehow avoids NaNs from large values). No visual or significant numeric differences in this output layer noticed. Towards #57.

Define complete UNet, with forward pass broken into down, mid, and up sections. Useful diagrams [here](http://jalammar.github.io/illustrated-stable-diffusion/) Uses blocks defined in hidet-org#97. Heavily reduced version from diffusers containing only necessary features for stable diffusion v2-1. Towards hidet-org#57. --------- Co-authored-by: vadiklyutiy <156319763+vadiklyutiy@users.noreply.github.com>

Stable diffusion uses fundamentally the same positional embeddings, but since timesteps change, a cache is not possible. There's also small changes in tensor layouts and calculation parameters between the diffusers version and the one from Llama, so I've recreated it here for now. An abstract version that combines both version is TODO. Towards hidet-org#57.

Add UNet Down, Up, and Mid block definitions and attention transformer utility layer. Modules are designed so that kwargs passed to constructors are all the same config from huggingface with minimal changes - lots of shared values and too many parameters to list individually. Same kwargs are passed to nested objects. Open to other suggestions, although this is a single use case problem. Towards hidet-org#57.

Infrastructure for compiled stable diffusion app. Towards hidet-org#57

Define complete UNet, with forward pass broken into down, mid, and up sections. Useful diagrams [here](http://jalammar.github.io/illustrated-stable-diffusion/) Uses blocks defined in #97. Heavily reduced version from diffusers containing only necessary features for stable diffusion v2-1. Towards #57. --------- Co-authored-by: vadiklyutiy <156319763+vadiklyutiy@users.noreply.github.com>

Stable diffusion uses fundamentally the same positional embeddings, but since timesteps change, a cache is not possible. There's also small changes in tensor layouts and calculation parameters between the diffusers version and the one from Llama, so I've recreated it here for now. An abstract version that combines both version is TODO. Towards #57.

Add UNet Down, Up, and Mid block definitions and attention transformer utility layer. Modules are designed so that kwargs passed to constructors are all the same config from huggingface with minimal changes - lots of shared values and too many parameters to list individually. Same kwargs are passed to nested objects. Open to other suggestions, although this is a single use case problem. Towards #57.

Infrastructure for compiled stable diffusion app. Towards #57

Define complete UNet, with forward pass broken into down, mid, and up sections. Useful diagrams [here](http://jalammar.github.io/illustrated-stable-diffusion/) Uses blocks defined in #97. Heavily reduced version from diffusers containing only necessary features for stable diffusion v2-1. Towards #57. --------- Co-authored-by: vadiklyutiy <156319763+vadiklyutiy@users.noreply.github.com>

Stable diffusion uses fundamentally the same positional embeddings, but since timesteps change, a cache is not possible. There's also small changes in tensor layouts and calculation parameters between the diffusers version and the one from Llama, so I've recreated it here for now. An abstract version that combines both version is TODO. Towards #57.

Add UNet Down, Up, and Mid block definitions and attention transformer utility layer. Modules are designed so that kwargs passed to constructors are all the same config from huggingface with minimal changes - lots of shared values and too many parameters to list individually. Same kwargs are passed to nested objects. Open to other suggestions, although this is a single use case problem. Towards #57.

Infrastructure for compiled stable diffusion app. Towards #57

hjjq added 2 commits January 4, 2023 17:14

Add to_cuda() for Module class

e141e62

fix format

8e8283c

hjjq merged commit 4cb20c1 into hidet-org:main Jan 4, 2023

hjjq deleted the module_cuda branch January 4, 2023 22:33

yaoyaoding pushed a commit that referenced this pull request Apr 3, 2024

[Graph] Conv2d Bias (#92)

487aada

Add bias to Conv2d Module. Defaults to false for back compatibility, **this is different from torch default**. Towards #57

KTong821 added a commit to KTong821/hidet that referenced this pull request Apr 24, 2024

Stable Diffusion App Infra (hidet-org#103)

d53acab

Infrastructure for compiled stable diffusion app. Towards hidet-org#57

vadiklyutiy pushed a commit that referenced this pull request Jul 22, 2024

Stable Diffusion App Infra (#103)

b63caf9

Infrastructure for compiled stable diffusion app. Towards #57

vadiklyutiy pushed a commit that referenced this pull request Jul 23, 2024

Stable Diffusion App Infra (#103)

8f03f9e

Infrastructure for compiled stable diffusion app. Towards #57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Graph] Add to_cuda() for Module class#57

[Graph] Add to_cuda() for Module class#57
hjjq merged 2 commits intohidet-org:mainfrom
hjjq:module_cuda

hjjq commented Jan 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

hjjq commented Jan 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant