feat(embedder): add Cohere dense embedder by Dicoangelo · Pull Request #941 · volcengine/OpenViking

Dicoangelo · 2026-03-24T18:09:05Z

Summary

Adds CohereDenseEmbedder using Cohere's Embed API v2 (/v2/embed)
Supports models: embed-v4.0, embed-english-v3.0, embed-multilingual-v3.0, embed-*-light-v3.0
Server-side dimension reduction for embed-v4.0 (256/512/1024/1536) via output_dimension
Client-side truncation + L2 renormalization fallback for v3 models
Asymmetric retrieval via input_type (search_query / search_document)
Batch embedding with 96-item chunking (Cohere API limit)
Full factory integration: provider validation, dimension auto-resolution, _create_embedder registry
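
For reviewers unfamiliar with the v3 fallback, the client-side truncation and L2 renormalization step might look roughly like the sketch below. The function name `truncate_and_renormalize` is hypothetical and not taken from the PR; it only illustrates the technique named in the summary.

```python
import math
from typing import List


def truncate_and_renormalize(vector: List[float], dimension: int) -> List[float]:
    """Keep the first `dimension` components, then rescale to unit L2 norm.

    Hypothetical helper: illustrates the v3 fallback described in the summary,
    not the actual code in cohere_embedders.py.
    """
    if dimension <= 0 or dimension > len(vector):
        raise ValueError(
            f"dimension must be in 1..{len(vector)}, got {dimension}"
        )
    truncated = vector[:dimension]
    norm = math.sqrt(sum(x * x for x in truncated))
    if norm == 0.0:
        # An all-zero prefix cannot be normalized; return it unchanged.
        return truncated
    return [x / norm for x in truncated]


unit = truncate_and_renormalize([3.0, 4.0, 12.0], 2)
```

Renormalizing after truncation keeps cosine and dot-product scores comparable, since a truncated vector no longer has unit length on its own.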

Changes

New: openviking/models/embedder/cohere_embedders.py — CohereDenseEmbedder class
Modified: openviking/models/embedder/__init__.py — export + __all__ entry
Modified: openviking_cli/utils/config/embedding_config.py — "cohere" in provider validation, dimension resolution, factory registry

Config example

{
  "embedding": {
    "dense": {
      "provider": "cohere",
      "model": "embed-v4.0",
      "api_key": "your-cohere-api-key",
      "dimension": 1024
    }
  }
}
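
As a rough illustration of how a config like the one above could map onto a /v2/embed request body, the sketch below builds the JSON payload without making a network call. The helper name `build_embed_payload` and the `is_query` flag are assumptions for illustration; the field names (`model`, `texts`, `input_type`, `embedding_types`, `output_dimension`) follow Cohere's Embed API v2, but the PR's actual request-building code may differ.

```python
from typing import Any, Dict, List

# Output sizes listed in the PR summary for embed-v4.0.
V4_DIMENSIONS = {256, 512, 1024, 1536}


def build_embed_payload(
    dense_config: Dict[str, Any], texts: List[str], is_query: bool
) -> Dict[str, Any]:
    """Build a request body for POST /v2/embed (hypothetical helper)."""
    model = dense_config["model"]
    payload: Dict[str, Any] = {
        "model": model,
        "texts": list(texts),
        # Asymmetric retrieval: queries and documents use different input types.
        "input_type": "search_query" if is_query else "search_document",
        "embedding_types": ["float"],
    }
    dimension = dense_config.get("dimension")
    # Only embed-v4.0 gets server-side reduction; v3 models are truncated client-side.
    if model == "embed-v4.0" and dimension is not None:
        if dimension not in V4_DIMENSIONS:
            raise ValueError(f"unsupported dimension for {model}: {dimension}")
        payload["output_dimension"] = dimension
    return payload


config = {"provider": "cohere", "model": "embed-v4.0", "api_key": "x", "dimension": 1024}
query_payload = build_embed_payload(config, ["what is OpenViking?"], is_query=True)
```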

Test plan

Validated config parsing with provider: "cohere" passes Pydantic validation
Tested with live Cohere API: 528+ vectors indexed, 237 embeddings, 0 errors
Semantic search quality verified: 0.43-0.55 relevance scores on targeted queries
Batch processing tested with 96-item chunking on 12K+ token document
Dimension auto-resolution tested: embed-v4.0 → 1536 default, configurable to 1024
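
The 96-item chunking exercised in the test plan can be sketched as follows. `chunk_texts` and `COHERE_MAX_BATCH` are illustrative names, not identifiers confirmed by the PR.

```python
from typing import Iterator, List, Sequence

# Per-request text limit for the Cohere embed endpoint, as stated in the PR summary.
COHERE_MAX_BATCH = 96


def chunk_texts(
    texts: Sequence[str], size: int = COHERE_MAX_BATCH
) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` texts, preserving order."""
    if size <= 0:
        raise ValueError("size must be positive")
    for start in range(0, len(texts), size):
        yield list(texts[start:start + size])


batches = list(chunk_texts([f"doc-{i}" for i in range(200)]))
batch_sizes = [len(batch) for batch in batches]
```

Because the slices are consecutive and ordered, concatenating the per-batch embeddings gives a result aligned with the input list.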

CLAassistant · 2026-03-24T18:09:12Z

All committers have signed the CLA.

github-actions · 2026-03-24T18:09:52Z

Failed to generate code suggestions for PR

MaojiaSheng · 2026-03-26T05:17:07Z

@Dicoangelo Thanks, but there are some conflicts that need to be resolved.

Adds CohereDenseEmbedder using Cohere's Embed API v2.
- Supports embed-v4.0, embed-english-v3.0, embed-multilingual-v3.0
- Server-side dimension reduction for embed-v4.0 (256/512/1024/1536)
- Client-side truncation + renormalization fallback for v3 models
- Asymmetric search via input_type (search_query/search_document)
- Batch embedding with 96-item chunking (Cohere API limit)
- Full factory integration: provider validation, dimension resolution

16 tests covering:
- Init validation (api_key required, defaults, model dimensions)
- Dimension handling (v4 server-side, v3 client-side truncation, invalid dims)
- Embedding calls (single, batch, query vs document input_type)
- output_dimension sent for embed-v4.0
- Error handling (API errors → RuntimeError)
- Resource cleanup (close)

Extends RerankConfig with a provider field and an api_key for Cohere. Adds CohereRerankClient with the same interface as the VikingDB RerankClient. HierarchicalRetriever auto-selects the rerank backend based on provider.

Config example: "rerank": {"provider": "cohere", "api_key": "...", "threshold": 0.15}

Quality improvement: META tokenomics query 0.55 → 0.77 relevance score.

9 tests covering:
- Rerank batch scoring with index-to-order mapping
- Empty input handling
- API error graceful fallback (returns None)
- Original order preservation from Cohere's sorted response
- Resource cleanup
- RerankConfig provider auto-detection (cohere/vikingdb/empty)
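
The index-to-order mapping covered by these tests could look something like the sketch below. Cohere's rerank response returns results sorted by relevance, each carrying the `index` of the original document and a `relevance_score`; the helper name `scores_in_input_order` and the 0.0 default for missing entries are assumptions, not the PR's confirmed behavior.

```python
from typing import Any, Dict, List


def scores_in_input_order(
    results: List[Dict[str, Any]], num_documents: int
) -> List[float]:
    """Map relevance-sorted rerank results back to the original document order.

    Hypothetical helper. Documents missing from `results` keep a score of 0.0,
    which is an assumption made for this sketch.
    """
    scores = [0.0] * num_documents
    for item in results:
        index = item["index"]
        if 0 <= index < num_documents:
            scores[index] = float(item["relevance_score"])
    return scores


# Results as returned by the API: sorted by score, not by input position.
sorted_results = [
    {"index": 2, "relevance_score": 0.9},
    {"index": 0, "relevance_score": 0.4},
    {"index": 1, "relevance_score": 0.1},
]
ordered_scores = scores_in_input_order(sorted_results, 3)
```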

Giving the reranker more vector candidates to evaluate improves precision. With Cohere rerank-v3.5, 10 candidates give the cross-encoder enough material to find the best match without excessive latency.

…lient.from_config()

Cohere was special-cased in hierarchical_retriever.py while openai/litellm went through the centralized RerankClient.from_config() dispatch. This commit adds CohereRerankClient.from_config() and routes it through the same path.

Also fixes a bug where from_config() used config.provider directly instead of _effective_provider(), which meant auto-detected providers (e.g. api_key without explicit provider="cohere") would not dispatch correctly.

github-project-automation bot added this to OpenViking project Mar 24, 2026

github-project-automation bot moved this to Backlog in OpenViking project Mar 24, 2026

MaojiaSheng approved these changes Mar 26, 2026

Dicoangelo added 5 commits March 29, 2026 00:32

perf(retrieve): increase GLOBAL_SEARCH_TOPK from 5 to 10

5fd06bb

Dicoangelo force-pushed the feat/cohere-embedder branch from ac0552f to 5fd06bb on March 29, 2026 04:34

MaojiaSheng approved these changes Mar 30, 2026

MaojiaSheng merged commit 9bcdaf4 into volcengine:main Mar 30, 2026
2 checks passed

github-project-automation bot moved this from Backlog to Done in OpenViking project Mar 30, 2026

MaojiaSheng merged 6 commits into volcengine:main from Dicoangelo:feat/cohere-embedder
