fix: text and character limits on Cohere embedding API #376

jimfingal · 2025-02-27T19:30:50Z

In a previous PR, I added support for retrieving more than one embedding at once from the Cohere embedding API.

@romenlee pointed out that this causes problems with a large number of texts. It looks like the Cohere embedding API has a limit of either 96 texts, or 2048 characters.

This PR implements more intelligent batching that respects these limits -- it batches texts in chunks that are at most 96 texts or 2048 characters, submits them to the embedding API, and combines the results.

There is a complexity vs. efficiency tradeoff call to make here for the maintainers. This code is much more complex than the original naive code, pre #350 -- but it is much more efficient with network calls so will execute faster. An alternative if we don't want to take on this complexity would be to just revert #350, which would restore functionality for embedding large texts at the cost of giving up performance improvements.

Make cohere batching more intelligent

9db5419

jimfingal changed the title ~~bugfix: text and character limits on Cohere embedding API~~ fix: text and character limits on Cohere embedding API Feb 27, 2025

jimfingal mentioned this pull request Feb 27, 2025

Enhance embed_documents to make use of Cohere's ability to request multiple embeddings at once #350

Merged

michaelnchin added this to the 2025 March Release 1 milestone Mar 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: text and character limits on Cohere embedding API #376

fix: text and character limits on Cohere embedding API #376

jimfingal commented Feb 27, 2025

fix: text and character limits on Cohere embedding API #376

Are you sure you want to change the base?

fix: text and character limits on Cohere embedding API #376

Conversation

jimfingal commented Feb 27, 2025