feat: completion and infill #164

giladgd · 2024-02-17T22:54:25Z

Description of change

feat: add LlamaCompletion that provides the ability to complete or infill text
feat: support configuring more options for getLlama when using "lastBuild"
fix: various bug fixes

Infill, also known as fill-in-middle, is used to generate a completion for an input that should connect to a given continuation.
For example, for a prefix input 123 and suffix input 789, the model is expected to generate 456 to make the final text be 123456789.

Not every model supports infill, so only those that do can be used for generating an infill.

How to generate a completion

import {fileURLToPath} from "url";
import path from "path";
import {getLlama, LlamaModel, LlamaContext, LlamaCompletion} from "node-llama-cpp";

const __dirname = path.dirname(fileURLToPath(import.meta.url));

const llama = await getLlama();
const model = new LlamaModel({
    llama,
    modelPath: path.join(__dirname, "models", "stable-code-3b.Q5_K_M.gguf")
});
const context = new LlamaContext({
    model,
    contextSize: Math.min(4096, model.trainContextSize)
});
const completion = new LlamaCompletion({
    contextSequence: context.getSequence()
});

const input = "const arrayFromOneToTwenty = [1, 2, 3,";
console.log("Input: " + input);

const res = await completion.generateCompletion(input);
console.log("Completion: " + res);

In this example I used this model

How to generate an infill

import {fileURLToPath} from "url";
import path from "path";
import {getLlama, LlamaModel, LlamaContext, LlamaCompletion, UnsupportedError} from "node-llama-cpp";

const __dirname = path.dirname(fileURLToPath(import.meta.url));

const llama = await getLlama();
const model = new LlamaModel({
    llama,
    modelPath: path.join(__dirname, "models", "stable-code-3b.Q5_K_M.gguf")
});
const context = new LlamaContext({
    model,
    contextSize: Math.min(4096, model.trainContextSize)
});
const completion = new LlamaCompletion({
    contextSequence: context.getSequence()
});

if (!completion.infillSupported)
    throw new UnsupportedError("Infill completions are not supported by this model");

const prefix = "const arrayFromOneToFourteen = [1, 2, 3, ";
const suffix = "10, 11, 12, 13, 14];";
console.log("prefix: " + prefix);
console.log("suffix: " + suffix);

const res = await completion.generateInfillCompletion(prefix, suffix);
console.log("Infill: " + res);

In this example I used this model

Pull-Request Checklist

Code is up-to-date with the master branch
npm run format to apply eslint formatting
npm run test passes with this change
This pull request links relevant issues as Fixes #0000
There are new or updated unit tests validating the change
Documentation has been updated to reflect this change
The new commits and pull request title follow conventions explained in pull request guidelines (PRs that do not follow this convention will not be merged)

…stBuild"`

ido-pluto

LGTM

github-actions · 2024-02-18T20:52:55Z

🎉 This PR is included in version 3.0.0-beta.11 🎉

The release is available on:

Your semantic-release bot 📦🚀

github-actions · 2024-09-24T18:12:57Z

🎉 This PR is included in version 3.0.0 🎉

The release is available on:

Your semantic-release bot 📦🚀

giladgd added 4 commits February 18, 2024 00:01

fix: vitest.config.ts

2b359e9

feat: add LlamaCompletion

0113f6e

feat: support configuring more options for getLlama when using `"la…

bdc250b

…stBuild"`

fix: bugs

dc0c80d

giladgd requested a review from ido-pluto February 17, 2024 22:54

giladgd self-assigned this Feb 17, 2024

giladgd added 3 commits February 18, 2024 01:12

chore: adapt to llama.cpp breaking interface change

5c6f654

test: add more tests

4202f00

fix: vitest.config.ts

b875e1a

ido-pluto approved these changes Feb 18, 2024

View reviewed changes

giladgd merged commit ede69c1 into beta Feb 18, 2024

giladgd deleted the gilad/completion branch February 18, 2024 18:38

giladgd mentioned this pull request Feb 18, 2024

feat: version 3.0 #105

Merged

17 tasks

github-actions bot added the released on @beta label Feb 18, 2024

giladgd mentioned this pull request Mar 16, 2024

feat: async operations #178

Merged

7 tasks

giladgd mentioned this pull request Jul 28, 2024

feat: Llama 3.1 support, Phi-3 support #273

Merged

7 tasks

github-actions bot added the released label Sep 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: completion and infill #164

feat: completion and infill #164

Uh oh!

giladgd commented Feb 17, 2024

Uh oh!

ido-pluto left a comment

Uh oh!

github-actions bot commented Feb 18, 2024

Uh oh!

github-actions bot commented Sep 24, 2024 •

edited by giladgd

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

feat: completion and infill #164

feat: completion and infill #164

Uh oh!

Conversation

giladgd commented Feb 17, 2024

Description of change

How to generate a completion

How to generate an infill

Pull-Request Checklist

Uh oh!

ido-pluto left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Feb 18, 2024

Uh oh!

github-actions bot commented Sep 24, 2024 • edited by giladgd Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

github-actions bot commented Sep 24, 2024 •

edited by giladgd

Loading