Enhance VS Code Chat Provider Bridge to Fix Create-File (and other) Tool Calls by danielrobbins · Pull Request #190 · ggml-org/llama.vscode

danielrobbins · 2026-05-17T16:35:03Z

Summary

This PR fixes a core serialization bug in the VS Code chat provider bridge.

Before this change, llama-vscode flattened VS Code chat history into plain
text when building OpenAI-compatible chat requests. That discarded
structured tool metadata from earlier turns, including prior tool calls
and tool results.

In practice, this made tool-use sessions much less reliable. The model
could lose track of which tools had already been called, what arguments
had been passed, and what results had been returned. One visible failure
mode was tool-call hallucination: the model could claim that an action
had been taken, particularly file creation, without emitting the
corresponding tool call and receiving a tool result first.

What changed

Preserve structured VS Code tool metadata when translating chat history
into OpenAI-compatible messages
Emit assistant tool_calls instead of flattening prior tool calls into
plain text
Emit tool results as role: "tool" messages with tool_call_id
Normalize tool result payloads into stable string content
Forward VS Code tool-mode intent as tool_choice
Add a stronger tool-calling instruction when tools are available

Why this is needed

The previous bridge lost tool state between turns. That left the model
working from a text-only approximation of the conversation instead of the
actual tool-call history. As a result, the model could respond as if a
tool action had already happened without ever issuing the tool call.

This PR keeps the model grounded in the actual VS Code tool state and
makes the native chat/tooling path much more robust.

Scope

This PR is focused on the VS Code native chat provider bridge in
src/llama-chat-model-provider.ts.

Note: this branch currently also includes the prerequisite shim cleanup
from #187 so it compiles cleanly against current VS Code typings. That
overlap is not the main behavior change to review here; the main review
surface is the tool metadata preservation and request/response
serialization logic in the chat provider bridge.

Validation

Verified the branch compiles cleanly in the local build environment
Confirmed the chat provider now preserves tool calls and tool results
when building OpenAI-compatible messages (specifically resolving a file creation bug I repeatedly hit when using gemma4, and that other tool calls are still working.)
Confirmed tool-mode intent is forwarded through tool_choice

Before this change, llama-vscode flattened VS Code chat messages down to plain text when building OpenAI-compatible chat requests. That discarded structured tool metadata from earlier turns, including prior tool calls and tool results. In practice, this made tool-use sessions much less reliable. The model could lose track of which tools had already been called, what arguments had been passed, and what results had been returned. One visible failure mode was tool-call hallucination: the model could claim that an action had been taken, particularly new file creation, without emitting the corresponding tool call and receiving a tool result first. This change preserves VS Code tool metadata when translating chat history into OpenAI-compatible messages. Assistant turns now carry tool_calls, tool results are emitted as tool messages with tool_call_id, and tool result payloads are normalized into stable string content instead of being silently dropped. The request bridge now also forwards VS Code tool-mode intent as tool_choice and adds a stronger tool-calling instruction when tools are available. That tuning is defensive rather than foundational, but it helps reduce cases where the model responds with prose about tool usage instead of actually calling the tool. Together, these changes make the VS Code native chat/tooling path much more robust and keep the model grounded in the actual tool state instead of a flattened text-only approximation of the conversation. This branch still requires the separate chat-provider shim fix to build cleanly against current @types/vscode. That compile cleanup is handled independently so the behavioral serialization change stays reviewable on its own.

conflicts with current @types/vscode declarations. The original shim was added when llama-vscode needed to supply its own temporary LM chat-provider typings. That is no longer necessary because current @types/vscode releases already include the relevant language model chat APIs. Leaving the old shim in place causes duplicate declaration errors during TypeScript compilation. Properties such as modelOptions, tools, maxInputTokens, maxOutputTokens, and LanguageModelResponsePart are now defined both by @types/vscode and by the local shim, which breaks the build even though the runtime code itself is otherwise valid. This change removes the dead shim file entirely and keeps one small provider typing cleanup in llama-chat-model-provider.ts so the current tool metadata mapping aligns with the upstream VS Code types. After this cleanup, the repo compiles cleanly against the installed @types/vscode package without relying on duplicate local declarations. This cleanup is also required for the separate VS Code tool serialization fix to build cleanly. Without removing the obsolete shim, that branch still fails compilation on current @types/vscode before its behavioral changes can be validated in a normal TypeScript build.

danielrobbins added 2 commits May 17, 2026 11:54

danielrobbins changed the title ~~Fix/vscode tool serialization~~ Improve VS Code Chat Provider Bridge May 17, 2026

danielrobbins changed the title ~~Improve VS Code Chat Provider Bridge~~ Enhance VS Code Chat Provider Bridge to Fix Create-File (and other) Tool Calls May 17, 2026

danielrobbins mentioned this pull request May 17, 2026

VS Code Copilot Chat Improvements from Initial Testing #192

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance VS Code Chat Provider Bridge to Fix Create-File (and other) Tool Calls#190

Enhance VS Code Chat Provider Bridge to Fix Create-File (and other) Tool Calls#190
danielrobbins wants to merge 2 commits into
ggml-org:masterfrom
danielrobbins:fix/vscode-tool-serialization

danielrobbins commented May 17, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

danielrobbins commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changed

Why this is needed

Scope

Validation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

danielrobbins commented May 17, 2026 •

edited

Loading