feat: multiple display modes and json flag for list command #149
Open
sammwyy wants to merge 358 commits into AlexsJones:main from
Conversation
Signed-off-by: Alex <alexsimonjones@gmail.com>
- release.yml now excludes v*-mac tags (CLI + crate + homebrew only)
- New release-desktop.yml triggers on v*-mac tags
- Uses --bundles app to produce .app bundle without code signing
- Searches both target/ and llmfit-desktop/target/ for bundle
- Desktop releases no longer slow down normal CLI releases

Signed-off-by: Three Foxes (in a Trenchcoat) <threefoxesyes3inatrenchcoat@gmail.com>
Problem: Multi-GPU systems had their VRAM summed into a single pool, leading to overly optimistic model fit recommendations, since most inference runtimes (llama.cpp, Ollama, etc.) don't support tensor parallelism by default.

Changes:
- NVIDIA detection: group by model, keep max per-card VRAM (never sum)
- AMD ROCm detection: collect per-card VRAM, use max per-card
- Refactor nvidia-smi parsing into a separate testable function
- Update display text from "GB VRAM total" → "GB VRAM each"
- Add unit tests for multi-GPU parsing behavior

This gives more realistic recommendations by assuming models must fit on a single GPU unless explicitly configured for tensor parallelism.
fix: use per-card VRAM instead of summed for multi-GPU systems
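The per-card grouping described above can be sketched as follows. This is an illustrative stand-in, not llmfit's actual function: it assumes nvidia-smi was invoked with `--query-gpu=name,memory.total --format=csv,noheader,nounits`, and the function name is hypothetical.

```rust
// Hypothetical sketch of per-card VRAM selection: keep the single largest
// card instead of summing across GPUs.
fn max_vram_per_card(nvidia_smi_csv: &str) -> Option<(String, u64)> {
    nvidia_smi_csv
        .lines()
        .filter_map(|line| {
            // Each line looks like "NVIDIA GeForce RTX 3090, 24576" (MiB).
            let (name, mib) = line.rsplit_once(',')?;
            Some((name.trim().to_string(), mib.trim().parse::<u64>().ok()?))
        })
        // Never sum: the model must fit on one card.
        .max_by_key(|&(_, mib)| mib)
}

fn main() {
    let csv = "NVIDIA GeForce RTX 3090, 24576\nNVIDIA GeForce RTX 3090, 24576";
    if let Some((name, mib)) = max_vram_per_card(csv) {
        println!("{name}: {} GB VRAM each", mib / 1024);
    }
}
```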
fix: typo in CHANGELOG.md (suppor -> support)
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Fix compile warnings in providers and TUI
…AlexsJones#49)
- For dense models: use choose_quant before deciding GPU path
- For MoE models: try quantization hierarchy in moe_offload_path
- Add moe_memory_for_quant helper to compute MoE memory at a specific quant
- Add test_moe_offload_tries_lower_quantization test
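The "try quantization hierarchy" idea above can be sketched like this. The helpers here only model the shape of the logic: `moe_memory_for_quant` and the bytes-per-weight ratios are illustrative assumptions, not the project's real implementation.

```rust
// Quant names with assumed size ratios relative to Q8_0 (illustrative values).
const QUANTS: &[(&str, f64)] = &[("Q8_0", 1.0), ("Q5_K_M", 0.66), ("Q4_K_M", 0.56)];

// Hypothetical stand-in for moe_memory_for_quant: active-expert memory at the
// base quant, scaled by the candidate quant's size ratio.
fn moe_memory_for_quant(active_gb: f64, ratio: f64) -> f64 {
    active_gb * ratio
}

// Walk the hierarchy from highest to lowest quality; return the first quant
// whose MoE active-expert memory fits in available VRAM.
fn moe_offload_path(active_gb: f64, vram_gb: f64) -> Option<&'static str> {
    QUANTS
        .iter()
        .find(|&&(_, ratio)| moe_memory_for_quant(active_gb, ratio) <= vram_gb)
        .map(|&(name, _)| name)
}

fn main() {
    // 20 GB of active experts on a 12 GB card falls through to Q4_K_M.
    println!("{:?}", moe_offload_path(20.0, 12.0));
}
```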
- Add Remote Ollama instances section to README
- Documents OLLAMA_HOST env var for custom endpoints
- Addresses issue AlexsJones#40 - feature already exists but was undocumented
- Includes examples for remote servers, custom ports, Docker, etc.
docs: document OLLAMA_HOST environment variable for remote connections
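The documented behavior amounts to "honor OLLAMA_HOST when set, otherwise use Ollama's default local endpoint". A minimal sketch, assuming the default is `http://localhost:11434` (Ollama's standard port); the function names are illustrative, not llmfit's actual API:

```rust
use std::env;

// Pure helper so the fallback logic is testable without touching the process
// environment.
fn endpoint_from(host: Option<String>) -> String {
    host.unwrap_or_else(|| "http://localhost:11434".to_string())
}

// Honors OLLAMA_HOST, e.g. OLLAMA_HOST=http://192.168.1.50:11434 llmfit
fn ollama_endpoint() -> String {
    endpoint_from(env::var("OLLAMA_HOST").ok())
}

fn main() {
    println!("probing {}", ollama_endpoint());
}
```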
…ysfs Improve GPU identification fallback on Linux containers
- Rename llmfit-tui package to llmfit for crates.io continuity
- Add homepage and keywords to llmfit-core for publishing
- Update authors field to proper format
- Add version requirement for llmfit-core dependency

Fixes AlexsJones#58

Signed-off-by: Three Foxes (in a Trenchcoat) <threefoxesyes3inatrenchcoat@gmail.com>
- Publish llmfit-core first (dependency)
- Wait for crates.io index to update
- Then publish llmfit (depends on llmfit-core)

Signed-off-by: Three Foxes (in a Trenchcoat) <threefoxesyes3inatrenchcoat@gmail.com>
…/crates-io-metadata fix: correct crates.io metadata and prepare for publishing
ci: enable windows build targets
Signed-off-by: Alex <alexsimonjones@gmail.com>
Signed-off-by: Alex <alexsimonjones@gmail.com>
Signed-off-by: Alex <alexsimonjones@gmail.com>
Signed-off-by: Alex <alexsimonjones@gmail.com>
- Add RX 9060 XT (16GB) and RX 9060 (8GB) to estimate_vram_from_name()
- Fixes incorrect VRAM detection on Windows due to WMI UINT32 limitation
- Update comment to clarify this is the RDNA 4 series

Fixes AlexsJones#55

Signed-off-by: Three Foxes (in a Trenchcoat) <threefoxesyes3inatrenchcoat@gmail.com>
…/amd-rx-9060-vram fix: add AMD RX 9060 series to VRAM estimation database
…ndencies chore: Update dependencies
- test_gguf_source_deserialization — GgufSource JSON round-trips correctly
- test_gguf_sources_default_to_empty — models without gguf_sources in JSON default to []
- test_catalog_popular_models_have_gguf_sources — 5 well-known models (Llama-3.3-70B, Qwen2.5-7B, etc.) have non-empty gguf_sources in the catalog
- test_catalog_gguf_sources_have_valid_repos — every gguf_source in the catalog has owner/repo format, a non-empty provider, and contains GGUF
- test_catalog_has_significant_gguf_coverage — at least 25% of catalog models have GGUF sources (currently 30%)

providers.rs (7 tests):
- test_hf_name_to_gguf_candidates_generates_common_patterns — heuristic generates bartowski, ggml-org, TheBloke candidates
- test_hf_name_to_gguf_candidates_strips_owner — strips the Org/ prefix correctly
- test_lookup_gguf_repo_known_mappings — hardcoded mappings resolve for known models
- test_lookup_gguf_repo_unknown_returns_none — unknown models return None
- test_has_gguf_mapping_matches_known_models — boolean check works
- test_gguf_candidates_fallback_covers_major_providers — fallback covers all 3 providers, and all candidates end in -GGUF
- test_gguf_candidates_known_mapping_returns_single — hardcoded mapping returns exactly 1 result

Signed-off-by: AlexsJones <alexsimonjones@gmail.com>
Signed-off-by: AlexsJones <alexsimonjones@gmail.com>
Signed-off-by: AlexsJones <alexsimonjones@gmail.com>
Signed-off-by: AlexsJones <alexsimonjones@gmail.com>
Signed-off-by: AlexsJones <alexsimonjones@gmail.com>
Signed-off-by: AlexsJones <alexsimonjones@gmail.com>
Signed-off-by: AlexsJones <alexsimonjones@gmail.com>
The JSON output (--json flag and API) was missing `moe_offloaded_gb`, so MoE models showed only active-expert VRAM as `memory_required_gb` without indicating the additional RAM needed for inactive experts.

Add `moe_offloaded_gb` and `total_memory_gb` (VRAM + offloaded RAM) to both the display and API JSON serializers so consumers can see the full memory footprint.

Closes AlexsJones#230

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…-fields fix: surface MoE offloaded RAM in JSON output
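The relationship between the fields is simple: `total_memory_gb` is the sum of active-expert VRAM and offloaded RAM. A minimal sketch, assuming these three field names from the commit message; the surrounding function and the hand-rolled JSON are illustrative (the real code presumably uses the project's serializer):

```rust
// Hypothetical serializer sketch: total_memory_gb = VRAM + offloaded RAM,
// so JSON consumers see the full footprint of an MoE model.
fn fit_report_json(memory_required_gb: f64, moe_offloaded_gb: f64) -> String {
    let total_memory_gb = memory_required_gb + moe_offloaded_gb;
    format!(
        "{{\"memory_required_gb\":{memory_required_gb},\
         \"moe_offloaded_gb\":{moe_offloaded_gb},\
         \"total_memory_gb\":{total_memory_gb}}}"
    )
}

fn main() {
    // 18.5 GB of active experts in VRAM, 42 GB of inactive experts in RAM.
    println!("{}", fit_report_json(18.5, 42.0));
}
```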
Signed-off-by: Alex <alexsimonjones@gmail.com>
Signed-off-by: Alex <alexsimonjones@gmail.com>
Add support for Docker Desktop's built-in Model Runner as a fourth runtime provider alongside Ollama, llama.cpp, and MLX. Detection probes the OpenAI-compatible /v1/models endpoint on localhost:12434 (configurable via DOCKER_MODEL_RUNNER_HOST). Downloads use `docker model pull`. A new scraper (scripts/scrape_docker_models.py) queries Docker Hub's ai/ namespace and cross-references against the HF model database to produce an embedded catalog (docker_models.json) of confirmed available models. Only models verified in the catalog appear as downloadable via Docker. - Provider: detect, list installed, pull via docker CLI - TUI: status bar shows Docker availability, 'D' in Inst column, provider picker includes Docker Model Runner - Inst column refactored from enum to bitfield for extensibility - Makefile: `make update-catalogs` refreshes all scrapers and rebuilds Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
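The detection and download plumbing described above can be sketched as follows. This is a sketch under stated assumptions: the endpoint path, default port, and env var come from the commit message; the function names and URL handling are illustrative, and real detection would issue an HTTP GET against the returned URL.

```rust
use std::env;

// Build the probe URL for the OpenAI-compatible models endpoint, honoring
// DOCKER_MODEL_RUNNER_HOST and defaulting to Docker Model Runner's port.
fn model_runner_url(host: Option<String>) -> String {
    let host = host.unwrap_or_else(|| "http://localhost:12434".to_string());
    format!("{}/v1/models", host.trim_end_matches('/'))
}

// Argument vector for downloading a model via the docker CLI.
fn pull_command(model: &str) -> Vec<String> {
    ["docker", "model", "pull", model]
        .iter()
        .map(|s| s.to_string())
        .collect()
}

fn main() {
    println!("probe: {}", model_runner_url(env::var("DOCKER_MODEL_RUNNER_HOST").ok()));
    println!("pull:  {}", pull_command("ai/qwen2.5").join(" "));
}
```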
Signed-off-by: Alex <alexsimonjones@gmail.com>
Signed-off-by: Alex <alexsimonjones@gmail.com>
Signed-off-by: Alex <alexsimonjones@gmail.com>
Signed-off-by: Alex <alexsimonjones@gmail.com>
Bumps [docker/metadata-action](https://github.com/docker/metadata-action) from 5.10.0 to 6.0.0.
- [Release notes](https://github.com/docker/metadata-action/releases)
- [Commits](docker/metadata-action@c299e40...030e881)

updated-dependencies:
- dependency-name: docker/metadata-action
  dependency-version: 6.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major

Signed-off-by: dependabot[bot] <support@github.com>
Bumps [docker/setup-buildx-action](https://github.com/docker/setup-buildx-action) from 3 to 4.
- [Release notes](https://github.com/docker/setup-buildx-action/releases)
- [Commits](docker/setup-buildx-action@v3...v4)

updated-dependencies:
- dependency-name: docker/setup-buildx-action
  dependency-version: '4'
  dependency-type: direct:production
  update-type: version-update:semver-major

Signed-off-by: dependabot[bot] <support@github.com>
Bumps [tauri-build](https://github.com/tauri-apps/tauri) from 2.5.5 to 2.5.6.
- [Release notes](https://github.com/tauri-apps/tauri/releases)
- [Commits](tauri-apps/tauri@tauri-build-v2.5.5...tauri-build-v2.5.6)

updated-dependencies:
- dependency-name: tauri-build
  dependency-version: 2.5.6
  dependency-type: direct:production
  update-type: version-update:semver-patch

Signed-off-by: dependabot[bot] <support@github.com>
Replaces the abbreviated Chinese README with a full translation covering all sections: install, usage (TUI/CLI/REST API), how it works, model database, project structure, runtime providers, platform support, contributing, OpenClaw integration, and alternatives. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…ation Feat: Docs/chinese translation
…ctions/docker/metadata-action-6.0.0 chore(deps): bump docker/metadata-action from 5.10.0 to 6.0.0
…match fix: prefer exact matches in info selection
…uri-build-2.5.6 chore(deps): bump tauri-build from 2.5.5 to 2.5.6
…ctions/docker/setup-buildx-action-4 chore(deps): bump docker/setup-buildx-action from 3 to 4
I moved the display.rs file from the TUI (CLI mode) into its own module, src/display/mod.rs, as a generic trait.
I created two implementations, json_mode.rs and table_mode.rs, with the possibility of adding more in the future.
This way, each file handles its own implementation. In the CLI, I simply initialize the display mode based on the global "--json" flag instead of using an if statement in each subcommand.
I also added support for the "--json" flag for the "list" command.
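The refactor described above can be sketched as a trait with one implementation per output mode, chosen once from the global flag. Trait and method names here are illustrative, not the PR's exact API:

```rust
// Generic display trait: each output mode lives in its own implementation,
// mirroring the json_mode.rs / table_mode.rs split described in the PR.
trait DisplayMode {
    fn render_list(&self, models: &[&str]) -> String;
}

struct TableMode;
struct JsonMode;

impl DisplayMode for TableMode {
    fn render_list(&self, models: &[&str]) -> String {
        models.join("\n")
    }
}

impl DisplayMode for JsonMode {
    fn render_list(&self, models: &[&str]) -> String {
        let items: Vec<String> = models.iter().map(|m| format!("\"{m}\"")).collect();
        format!("[{}]", items.join(","))
    }
}

// One selection point instead of an `if json { ... }` branch in every subcommand.
fn make_display(json: bool) -> Box<dyn DisplayMode> {
    if json { Box::new(JsonMode) } else { Box::new(TableMode) }
}

fn main() {
    let display = make_display(true); // from the global --json flag
    println!("{}", display.render_list(&["llama3", "qwen2.5"]));
}
```

Subcommands then only ever call `render_list` (or its siblings) on the boxed trait object, so adding a third mode is a new file plus one arm in the constructor.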