Skip to content

added ctx_size parameter#148

Merged
ggerganov merged 3 commits intoggml-org:masterfrom
RazeLighter777:fix_infinite
Mar 15, 2023
Merged

added ctx_size parameter#148
ggerganov merged 3 commits intoggml-org:masterfrom
RazeLighter777:fix_infinite

Conversation

@RazeLighter777
Copy link
Contributor

Adds a parameter called context size (-c for short) that allows taking the context size from the user's input.

Defaults to the same hardcoded 512.

@RazeLighter777
Copy link
Contributor Author

Usage -c N, --ctx_size N

@eous
Copy link

eous commented Mar 15, 2023

#71 (comment) feel free to take a look at this diff, theres probably two other lines you want to change. Probably ignore the EPS stuff though that should be changed too at some point.

@RazeLighter777
Copy link
Contributor Author

Not 100% sure if this is right but I added it.

@ggerganov ggerganov merged commit 2d64715 into ggml-org:master Mar 15, 2023
blackhole89 pushed a commit that referenced this pull request Mar 15, 2023
* added ctx_size parameter

* added it in more places

* Apply suggestions from code review

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
SamuelOliveirads pushed a commit to SamuelOliveirads/llama.cpp that referenced this pull request Dec 29, 2025
…iq4_k_r4 (ggml-org#148)

* Slightly better matrix x vector on Zen4/AVX2 for iq2_k_r4, iq3_k_r4, iq4_k_r4

More importantly: simplify.

* Minor

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants