-
Notifications
You must be signed in to change notification settings - Fork 123
Home
- Multiline field for Edit with AI
- Qwen3.5 models added as predefined (2B, 4B, 9B) - good for tools and chat
- API Key is used (if needed and provided) on getting list of models on adding OpenAI Compatible model
llama.vscode could provide models for VS Code Copilot now:
- Start tools model from llama-vscode (local or external)
- In VS Code Copilot show the models list -> Other Models -> Manage Models
- Make the models (all models available by the application serving the tools model are shown) you want to use visible (click on the left of the model name)
- Select the desired model from Copilot and start using it
Not needed tools from Copilot could be unchecked to reduce contex size if local model is used.
-
Configurable debounce for inline completion requests - setting debounce_ms. llama-vscode will wait debounce_ms after a keystroke before sending a request to the LLM for inline code completion. If in the meantime there is another keystroke, the request for the previous keystroke is cancelled. Useful on low end hardware to avoid triggering code completion on every keystroke.
-
Notification "Extension is updated" is shown only on version change, not on every setting change (as was before)
-
Subagents implemented (with tool delegate_task) - now each agent, which has "Available as Subagent" checked could be used as a subagent
-
new agent - Unit Test Writer
-
new tool create_agent
-
new agent "Agent creator"
-
Files SOUL.md and USER.md (if available in the project root) will be added to the context