
Add RMS norm and use it #187

Merged

ggerganov merged 2 commits into master from rms_norm on Mar 15, 2023

Conversation

@hoangmit
Contributor

Resolves issue #173.

@hoangmit hoangmit changed the title Add Rms norm Add RMS norm and use it Mar 15, 2023
Member

@ggerganov ggerganov left a comment


🦙

@ggerganov ggerganov merged commit 6eac39b into master Mar 15, 2023
@ggerganov ggerganov deleted the rms_norm branch March 15, 2023 22:42
blackhole89 pushed a commit that referenced this pull request Mar 15, 2023
* add ggml_rms_norm

* update op num
Deadsg pushed a commit to Deadsg/llama.cpp that referenced this pull request Dec 19, 2023
SamuelOliveirads pushed a commit to SamuelOliveirads/llama.cpp that referenced this pull request Dec 29, 2025
* iq1_m_r4: basics (quantize/dequantize)

* iq1_m_r4: Zen4 gemm

* iq1_m_r4: neon gemm

* iq1_m_r4: switch to q8_0_x4 also on AVX2/Zen4

With the deltas being per group of 8, we cannot make use
of the q8 sums stored in q8_1, so we get a tiny gain by
using q8_0_x4.

* iq1_m_r4: rename mul_mat_iq1_m_r4_q8_1 to mul_mat_iq1_m_r4_q8_0

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

2 participants