CANN: support gated linear attn#18653

Merged
hipudding merged 3 commits into ggml-org:master from hipudding:gla
Jan 16, 2026

@hipudding
Collaborator

This change adds support for the GGML_OP_GATED_LINEAR_ATTN operator in the CANN backend. The feature was implemented by YushengZhao (#17814). Because that submission was based on an outdated codebase, this PR rebases it so it can be merged.
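
For readers unfamiliar with the operator, here is a minimal single-head sketch of the gated linear attention recurrence as I understand it: per token, the state is decayed by the gate and updated with the outer product of key and value, `S = diag(g_t) * S + k_t^T v_t`, and the output is `o_t = scale * q_t * S`. The function name `gla_reference`, the flat row-major layout, and the single-head restriction are assumptions made for illustration; this is not the CANN kernel from this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical scalar reference for gated linear attention (one head).
// Layouts (assumed, flat row-major):
//   q, k, g : [n_tokens][d_k]
//   v       : [n_tokens][d_v]
//   state   : [d_k][d_v], updated in place and carried across calls
// Returns the output as [n_tokens][d_v].
std::vector<float> gla_reference(const std::vector<float>& q,
                                 const std::vector<float>& k,
                                 const std::vector<float>& v,
                                 const std::vector<float>& g,
                                 std::vector<float>& state,
                                 std::size_t n_tokens,
                                 std::size_t d_k,
                                 std::size_t d_v,
                                 float scale) {
    assert(q.size() == n_tokens * d_k);
    assert(k.size() == n_tokens * d_k);
    assert(g.size() == n_tokens * d_k);
    assert(v.size() == n_tokens * d_v);
    assert(state.size() == d_k * d_v);

    std::vector<float> out(n_tokens * d_v, 0.0f);
    for (std::size_t t = 0; t < n_tokens; ++t) {
        for (std::size_t i = 0; i < d_k; ++i) {
            const float q_i = q[t * d_k + i] * scale;
            const float k_i = k[t * d_k + i];
            const float g_i = g[t * d_k + i];
            for (std::size_t j = 0; j < d_v; ++j) {
                float& s = state[i * d_v + j];
                // Decay the previous state by the gate, then add k_i * v_j.
                s = s * g_i + k_i * v[t * d_v + j];
                // The output reads the already-updated state.
                out[t * d_v + j] += s * q_i;
            }
        }
    }
    return out;
}
```

A scalar loop like this is mainly useful as a correctness baseline when comparing against a backend implementation on small inputs.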

@hipudding hipudding marked this pull request as draft January 7, 2026 02:56
@github-actions github-actions bot added the ggml and Ascend NPU labels Jan 7, 2026
@hipudding
Collaborator Author

image

@hipudding hipudding marked this pull request as ready for review January 7, 2026 08:14
@hipudding hipudding requested a review from noemotiovon January 7, 2026 08:14
Collaborator

@noemotiovon noemotiovon left a comment

LGTM

赵禹昇 and others added 3 commits January 16, 2026 06:10
This change adds support for the GGML_OP_GATED_LINEAR_ATTN operator.
The feature was implemented by YushengZhao. Because the previous
submission was based on an outdated codebase, this PR was rebased to
merge.

Co-authored-by: YushengZhao <yusheng.chao@outlook.com>
Co-authored-by: hipudding <huafengchun@gmail.com>
Optimize gla for high performance
@hipudding
Collaborator Author

The failing test is not related to this PR.

@hipudding hipudding merged commit baa4ba0 into ggml-org:master Jan 16, 2026
139 of 145 checks passed

Labels

Ascend NPU, ggml

3 participants