Skip to content

allow head_dim for llama like gemma or mistral#32847

Closed
bzantium wants to merge 4 commits intohuggingface:mainfrom
bzantium:feature/#32846
Closed

allow head_dim for llama like gemma or mistral#32847
bzantium wants to merge 4 commits intohuggingface:mainfrom
bzantium:feature/#32846

Conversation

@bzantium
Copy link
Contributor

@bzantium bzantium commented Aug 16, 2024

What does this PR do?

Fixes #32846

Who can review?

@ArthurZucker

@bzantium bzantium mentioned this pull request Aug 17, 2024
@bzantium bzantium changed the title split head_dim from hidden_size for llama like gemma or mistral allow head_dim for llama like gemma or mistral Aug 17, 2024
@bzantium bzantium closed this Nov 5, 2024
@bzantium bzantium deleted the feature/#32846 branch November 5, 2024 06:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

split head_dim from hidden_size for llama like gemma or mistral

1 participant