inst : preview generation + improve prompt by ggerganov · Pull Request #98 · ggml-org/llama.vim

ggerganov · 2026-01-20T20:09:22Z

cont #96

Streaming response
Better system prompt
Continue instructions (<leader>llc)
Rerun instructions (<leader>llr)
Update default keymaps - use <leader>ll as common keymap

Next PRs:

Multiple suggestions using parallel n_cmpl
Auto-trigger instructions based on git diff?

llama.vim-inst-1-lq.mp4

alopatindev · 2026-01-22T21:13:54Z

let l:system_prompt .= "... Respond ONLY with the result of applying INSTRUCTION to SELECTION given the CONTEXT. .... Do not output any extra separators.\n"

let l:extra = s:ring_get_extra()

let l:system_prompt .= "\n"
let l:system_prompt .= "--- CONTEXT   --------------------------------------------------\n"
...

Looks kinda sad. Maybe at least serialize inputs as JSONs?

This for instance works okay with ollama most of the time:

curl -s http://localhost:11434/api/generate \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen2.5-coder:7b",
    "prompt": "You are a code-editing assistant that receives JSONs with \"selection\" value as input. Apply \"instruction\" value to the input, answer ONLY with single VALID minified JSON object with the ONLY value \"replacement\" that contains ONLY the output code. No markdown wrapping. {\"selection\":\"// TODO\",\"instruction\":\"implement quicksort in C\"}",
    "stream": false
  }' | jq -rM '.response' | jq -rM '.replacement'

ggerganov · 2026-01-22T21:29:15Z

Yeah, I was planning to change to json. Btw, I'm using gpt-oss-120b and it's solid.

ggerganov · 2026-01-24T15:36:11Z

@alopatindev Should be improved now - I find this feature quite useful.

If you give it a try would appreciate feedback.

alopatindev · 2026-01-24T16:25:52Z

Works for me (nvim 0.11.5), thanks!

Tested with --fim-qwen-3b-default and --fim-qwen-7b-default. I don't have enough VRAM to test with larger models.

ggerganov force-pushed the gg/inst-cont branch from 5d7f90d to 365b253 Compare January 24, 2026 14:27

ggerganov added 8 commits January 24, 2026 17:03

inst : preview generation + improve prompt

b0bd011

cont : add continuation

f8a9b24

cont : fix request state indexing

37aec5f

cont : fix stream parsing

243b8b0

cont : rerun instruction + remove callbacks

32e0700

cont : system prompt to json

4e9b005

cont : rework instruction state to be a map

bd15e43

docs : update

1d7e64d

ggerganov force-pushed the gg/inst-cont branch from db68abf to 1d7e64d Compare January 24, 2026 15:03

ggerganov added 2 commits January 24, 2026 17:07

cont : add recommended models

a8ef125

cont : display selected inst model

98cba39

ggerganov added 5 commits January 24, 2026 19:14

cont : fixes + add PREFIX and SUFFIX

0aa4a21

cont : minor

110f3cd

readme : cleanup

1682184

cont : use config n_prefix and n_suffix

8ca738d

readme : add example

16c636c

ggerganov merged commit 4cdf1dd into master Jan 24, 2026

ggerganov deleted the gg/inst-cont branch January 24, 2026 19:47

This was referenced Jan 29, 2026

inst : do not use json for system prompt #109

Merged

spec : add ngram-mod ggml-org/llama.cpp#19164

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

inst : preview generation + improve prompt#98

inst : preview generation + improve prompt#98
ggerganov merged 15 commits into
masterfrom
gg/inst-cont

ggerganov commented Jan 20, 2026 •

edited

Loading

Uh oh!

alopatindev commented Jan 22, 2026 •

edited

Loading

Uh oh!

ggerganov commented Jan 22, 2026

Uh oh!

ggerganov commented Jan 24, 2026

Uh oh!

alopatindev commented Jan 24, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ggerganov commented Jan 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alopatindev commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ggerganov commented Jan 22, 2026

Uh oh!

ggerganov commented Jan 24, 2026

Uh oh!

alopatindev commented Jan 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ggerganov commented Jan 20, 2026 •

edited

Loading

alopatindev commented Jan 22, 2026 •

edited

Loading

alopatindev commented Jan 24, 2026 •

edited

Loading