llama.vscode

Local LLM-assisted text completion extension for VS Code

Features

Auto-suggest on input
Accept a suggestion with Tab
Accept the first line of a suggestion with Shift + Tab
Accept the next word with Ctrl/Cmd + Right
Toggle the suggestion manually by pressing Ctrl + L
Control max text generation time
Configure scope of context around the cursor
Ring context with chunks from open and edited files and yanked text
Supports very large contexts even on low-end hardware via smart context reuse
Display performance stats

Installation

VS Code extension setup

Install the llama-vscode extension from the VS Code extension marketplace:

llama.cpp setup

The plugin requires a llama.cpp server instance to be running at the configured endpoint:

Mac OS

brew install llama.cpp

Any other OS

Either use the latest binaries or build llama.cpp from source. For more information how to run the llama.cpp server, please refer to the Wiki.

llama.cpp settings

Here are recommended settings, depending on the amount of VRAM that you have:

More than 16GB VRAM:

llama-server \
    -hf ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF \
    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 \
    --ctx-size 0 --cache-reuse 256

Less than 16GB VRAM:

llama-server \
    -hf ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF \
    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 \
    --ctx-size 0 --cache-reuse 256

Less than 8GB VRAM:

llama-server \
    -hf ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF \
    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 \
    --ctx-size 0 --cache-reuse 256

You can use any other FIM-compatible model that your system can handle. By default, the models downloaded with the -hf flag are stored in:

Mac OS: ~/Library/Caches/llama.cpp/
Linux: ~/.cache/llama.cpp
Windows: LOCALAPPDATA

Recommended LLMs

The plugin requires FIM-compatible models: HF collection

Examples

TODO: add examples

Implementation details

The extension aims to be very simple and lightweight and at the same time to provide high-quality and performant local FIM completions, even on consumer-grade hardware.

The initial implementation was done by Ivaylo Gardev @igardev using the llama.vim plugin as a reference
Techincal description: ggml-org/llama.cpp#9787

Other IDEs

Vim/Neovim: https://github.com/ggml-org/llama.vim

Name	Name	Last commit message	Last commit date
Latest commit ggerganov readme : minor simplifications of the server commands Jan 27, 2025 9e4503e · Jan 27, 2025 History 24 Commits
.vscode	.vscode	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025
src	src	feat: add status bar controls and completion toggles (ggml-org#6 )	Jan 22, 2025
.gitignore	.gitignore	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025
.vscode-test.mjs	.vscode-test.mjs	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025
.vscodeignore	.vscodeignore	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025
CHANGELOG.md	CHANGELOG.md	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025
LICENSE	LICENSE	license : update copyright notice	Jan 25, 2025
README.md	README.md	readme : minor simplifications of the server commands	Jan 27, 2025
eslint.config.mjs	eslint.config.mjs	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025
llama.png	llama.png	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025
package-lock.json	package-lock.json	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025
package.json	package.json	release : v0.0.5	Jan 23, 2025
tsconfig.json	tsconfig.json	First version of llama.vscode extension (ggml-org#2 )	Jan 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

llama.vscode

Features

Installation

VS Code extension setup

llama.cpp setup

Mac OS

Any other OS

llama.cpp settings

Recommended LLMs

Examples

Implementation details

Other IDEs

About

Releases

Packages

Languages

License

emcodem/llama.vscode

Folders and files

Latest commit

History

Repository files navigation

llama.vscode

Features

Installation

VS Code extension setup

llama.cpp setup

Mac OS

Any other OS

llama.cpp settings

Recommended LLMs

Examples

Implementation details

Other IDEs

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages