llama : add benchmark example by slaren · Pull Request #2626 · ggml-org/llama.cpp

slaren · 2023-08-15T19:54:47Z

Adds an example for running performance benchmarks. Multiple values can be specified for each option, and it will run the matrix of all of them. Supports output to csv, json or markdown.

Example markdown output:

model	backend	n_gpu_layers	test	t/s
LLaMA 7B mostly Q4_0	CUDA	99	pp 512	2242.06 ± 24.26
LLaMA 7B mostly Q4_0	CUDA	99	tg 128	43.09 ± 0.41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : add benchmark example#2626

llama : add benchmark example#2626
slaren merged 19 commits intomasterfrom
llama-benchmark

slaren commented Aug 15, 2023 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

slaren commented Aug 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

slaren commented Aug 15, 2023 •

edited

Loading