[Fix] NCCL API mismatch and NCCL primitive fix #301

Merged
soodoshll merged 3 commits into hidet-org:main from soodoshll:fix-distributed
Jul 1, 2023
@soodoshll
Collaborator

  • The `count` argument in NCCL APIs refers to the number of elements, not the number of bytes;
  • Fix the NCCL primitives so that they handle dynamic shapes correctly.
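
To illustrate the first point, here is a minimal Python sketch of the difference between an element count and a byte count. It is not the code from this PR: the helper names `nccl_count` and `num_bytes` and the `DTYPE_NBYTES` table are hypothetical and exist only for illustration. The one fact it relies on is that NCCL collectives such as `ncclAllReduce` take `count` as a number of elements of the given datatype.

```python
import math

# Illustrative byte widths for a few dtypes (hypothetical table, not Hidet's own).
DTYPE_NBYTES = {"float16": 2, "float32": 4, "float64": 8, "int32": 4}


def nccl_count(shape):
    """Number of elements in a buffer of this shape.

    This is the value NCCL expects as `count`. Computing it from the
    shape at call time also works when the shape is only known at runtime.
    """
    return math.prod(shape)


def num_bytes(shape, dtype):
    """Size of the buffer in bytes; passing this as `count` would be the bug."""
    return nccl_count(shape) * DTYPE_NBYTES[dtype]


shape = (4, 8)
print(nccl_count(shape))            # element count to pass to NCCL
print(num_bytes(shape, "float32"))  # byte size, larger by the dtype width
```

Passing the byte size where the element count is expected would make a float32 collective operate on four times as many elements as the buffer holds.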

@soodoshll soodoshll merged commit 664f9f0 into hidet-org:main Jul 1, 2023
@soodoshll soodoshll deleted the fix-distributed branch August 3, 2023 16:17
vadiklyutiy pushed a commit that referenced this pull request Dec 19, 2024
…. ) (#294)

[Ir][Primitives] add vectorized conversion instructions
[Ir][CuTe] add reduce primitives in cute (#295)
[Ir][CuTe] add mma primitives (#296)
[Ir][CuTe] add other primitives in cute (#297)
[Transforms][CuTe] add instruction selection pass (#298)
[Transforms][CuTe] add resolve bank conflict pass (#299)
[Transforms][CuTe] add resolve auto keywords pass (#300)
[Transforms][CuTe] add shared memory allocation pass (#301)
[Transforms][CuTe] add vectorize elementwise operation pass (#302)
[Transforms][CuTe] add analysis pass (#303)
[Transforms][CuTe] add canonicalization pass (#304)
[Transforms][CuTe] add deadcode elimination pass (#305)
[Transforms][CuTe] refactor cute lowering pass (#306)
[Graph][Ops] matmul cute (#307)
[Ir] cute miscs (#308)
[Tests] cute tests (#309)
[Chore] fix ci (#313)
---------

Co-authored-by: xiaocenxiaocen <xiao.zhang@centml.ai>
vadiklyutiy pushed a commit that referenced this pull request Dec 20, 2024
vadiklyutiy pushed a commit that referenced this pull request Dec 26, 2024