Skip to content

Parallel completions

igardev edited this page Jan 5, 2026 · 3 revisions

Parallel Completions

Overview

Llama-vscode generates parallel code completions (default 3) if a version of llama.cpp after December, 6, 2025 (commit c42712b) is used. The next completion is shown by pressing Ctrl+], previous completion is shown by pressing Ctrl+[.
The setting max_parallel_completions determines how many completions are generated.

How to use it

  1. Run the completion model and start coding
  2. When a code completion is shown, press Ctrl+] to show the next completion, Ctrl+[ to show the previous completion
  3. Alternatively - you could hover over the shown completion and when the toolbar is shown click the arrows to show the other completions.

Settings:

  • max_parallel_completions: The max number of parallel completions to generate. Default is 3.
Screencast.from.2026-01-05.15-05-00.webm

Clone this wiki locally