Skip to content

[Fixbug] Fix a bug in the minimum and maximum operator#102

Merged
yaoyaoding merged 2 commits intohidet-org:mainfrom
yaoyaoding:min-max
Feb 13, 2023
Merged

[Fixbug] Fix a bug in the minimum and maximum operator#102
yaoyaoding merged 2 commits intohidet-org:mainfrom
yaoyaoding:min-max

Conversation

@yaoyaoding
Copy link
Copy Markdown
Member

Fix #90.

@yaoyaoding yaoyaoding merged commit 33cfcbb into hidet-org:main Feb 13, 2023
@yaoyaoding yaoyaoding deleted the min-max branch February 13, 2023 23:59
KTong821 added a commit to KTong821/hidet that referenced this pull request Apr 24, 2024
Allow access to cluster attributes inside Hidet kernels. Launch kernels
with distributed shared memory.

See docs:


https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#distributed-shared-memory

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#thread-block-clusters

API:
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#cluster-group-cg

Towards supporting hidet-org#102 by adding cluster rank primitive in Hidet.

See `test_cluster.py` for example usage. To run test on Hopper machines
use `pytest --hopper`
vadiklyutiy pushed a commit that referenced this pull request Jul 22, 2024
Allow access to cluster attributes inside Hidet kernels. Launch kernels
with distributed shared memory.

See docs:


https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#distributed-shared-memory

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#thread-block-clusters

API:
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#cluster-group-cg

Towards supporting #102 by adding cluster rank primitive in Hidet.

See `test_cluster.py` for example usage. To run test on Hopper machines
use `pytest --hopper`
vadiklyutiy pushed a commit that referenced this pull request Jul 23, 2024
Allow access to cluster attributes inside Hidet kernels. Launch kernels
with distributed shared memory.

See docs:


https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#distributed-shared-memory

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#thread-block-clusters

API:
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#cluster-group-cg

Towards supporting #102 by adding cluster rank primitive in Hidet.

See `test_cluster.py` for example usage. To run test on Hopper machines
use `pytest --hopper`
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug] MinOp generates max code

1 participant