Home

Jump to bottom

igardev edited this page May 4, 2026 · 35 revisions

Version 0.0.46 is released (04.05.2026)

What is new

Multiline field for Edit with AI
Qwen3.5 models added as predefined (2B, 4B, 9B) - good for tools and chat
API Key is used (if needed and provided) on getting list of models on adding OpenAI Compatible model

Version 0.0.46 is released (29.04.2026)

What is new

llama.vscode could provide models for VS Code Copilot now:

Start tools model from llama-vscode (local or external)
In VS Code Copilot show the models list -> Other Models -> Manage Models
Make the models (all models available by the application serving the tools model are shown) you want to use visible (click on the left of the model name)
Select the desired model from Copilot and start using it

Not needed tools from Copilot could be unchecked to reduce contex size if local model is used.

Version 0.0.45 is released (04.03.2026)

What is new

Configurable debounce for inline completion requests - setting debounce_ms. llama-vscode will wait debounce_ms after a keystroke before sending a request to the LLM for inline code completion. If in the meantime there is another keystroke, the request for the previous keystroke is cancelled. Useful on low end hardware to avoid triggering code completion on every keystroke.
Notification "Extension is updated" is shown only on version change, not on every setting change (as was before)

Version 0.0.44 is released (03.03.2026)

What is new

Subagents implemented (with tool delegate_task) - now each agent, which has "Available as Subagent" checked could be used as a subagent
new agent - Unit Test Writer
new tool create_agent
new agent "Agent creator"
Files SOUL.md and USER.md (if available in the project root) will be added to the context

Setup instructions for llama.cpp server

Linux

Mac

Windows

More details about llama.cpp server

Features

Code completion

Edit with AI

Llama agent

Local ai runner

Chat with AI about llama vscode

Chat with AI

Generate commit message

Statusbar

Menu

Env

Manage completion models

Manage chat models

Manage embeddings models

Manage tools models

Manage envs

Model selection