
Change Fast Embedder adapter using langchain class #571

Merged: 3 commits merged into cheshire-cat-ai:develop on Nov 23, 2023

Conversation

valentimarco
Member

Description

Replace the Fast Embedder adapter with the LangChain class, which already supports a cache folder (core/local_cache, created at runtime if it doesn't exist).
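
For readers who want to see the idea in code, here is a minimal sketch of what the description implies, not the code from this PR. It assumes LangChain's `FastEmbedEmbeddings` class with its `model_name` and `cache_dir` parameters, imported from `langchain.embeddings` as in late-2023 releases (newer releases expose it from `langchain_community.embeddings`), and it needs the `fastembed` package installed. The helper names `ensure_cache_dir` and `build_embedder` are hypothetical.

```python
import os


def ensure_cache_dir(path="core/local_cache"):
    """Create the cache folder at runtime if it does not exist yet."""
    os.makedirs(path, exist_ok=True)  # no error if the folder is already there
    return path


def build_embedder(model_name="intfloat/multilingual-e5-large",
                   cache_dir="core/local_cache"):
    """Build a LangChain FastEmbed embedder that caches models in cache_dir.

    Assumption: the import path below matches the installed LangChain version.
    The import is done lazily so the helper above works without LangChain.
    """
    from langchain.embeddings import FastEmbedEmbeddings

    return FastEmbedEmbeddings(
        model_name=model_name,
        cache_dir=ensure_cache_dir(cache_dir),
    )
```

Calling `build_embedder()` would download the model into the cache folder on first use and reuse it afterwards, so the folder should live on a persistent volume.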

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

Checklist:

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • Tested Embedder class before this PR

@pieroit
Member

pieroit commented Nov 16, 2023

So no external container?
How much does the core image grow?

@valentimarco
Member Author

It depends on the size of the model:

| model | dim | description | size_in_GB |
| --- | --- | --- | --- |
| BAAI/bge-small-en | 384 | Fast English model | 0.20 |
| BAAI/bge-small-en-v1.5 | 384 | Fast and Default English model | 0.13 |
| BAAI/bge-base-en | 768 | Base English model | 0.50 |
| BAAI/bge-base-en-v1.5 | 768 | Base English model, v1.5 | 0.44 |
| sentence-transformers/all-MiniLM-L6-v2 | 384 | Sentence Transformer model, MiniLM-L6-v2 | 0.09 |
| intfloat/multilingual-e5-large | 1024 | Multilingual model, e5-large. Recommend using this model for non-English languages | 2.24 |

The default is intfloat/multilingual-e5-large, but every model that gets selected is downloaded and cached in the volume folder. I don't know the combined size of the Cat plus the embedder, but I don't think disk space will be a problem.
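
Since every model that gets used stays in the cache, the footprint adds up as users switch models. A rough way to estimate it from the table above is sketched below. The sizes are copied from the table; the `MODELS` dictionary and the `cache_size_gb` helper are hypothetical names used only for illustration.

```python
# Model name -> (embedding dimension, size in GB), copied from the table above.
MODELS = {
    "BAAI/bge-small-en": (384, 0.20),
    "BAAI/bge-small-en-v1.5": (384, 0.13),
    "BAAI/bge-base-en": (768, 0.50),
    "BAAI/bge-base-en-v1.5": (768, 0.44),
    "sentence-transformers/all-MiniLM-L6-v2": (384, 0.09),
    "intfloat/multilingual-e5-large": (1024, 2.24),
}


def cache_size_gb(model_names):
    """Estimate the cache size in GB for the given models (duplicates count once)."""
    total = sum(MODELS[name][1] for name in set(model_names))
    return round(total, 2)  # round away floating-point noise


if __name__ == "__main__":
    # Default model plus one switch to BAAI/bge-base-en.
    print(cache_size_gb(["intfloat/multilingual-e5-large", "BAAI/bge-base-en"]))
    # Worst case: every model in the table has been used at least once.
    print(cache_size_gb(MODELS))
```

With these numbers the default model plus BAAI/bge-base-en comes to 2.74 GB, and caching all six models comes to 3.6 GB.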

@valentimarco
Member Author

Fresh install with only intfloat/multilingual-e5-large installed
[screenshot: firefox_EwzNbdRmH4]

Switched to BAAI/bge-base-en, which takes 0.50 GB of disk space!
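
To check numbers like these without a screenshot, the cache folder can be measured directly. The sketch below uses only the standard library; the function names are hypothetical, the default path is the `core/local_cache` folder mentioned in the description, and 1 GB is assumed to mean 10^9 bytes.

```python
import os


def dir_size_bytes(path):
    """Sum the sizes of all regular files below path, skipping symlinks."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            if not os.path.islink(file_path):
                total += os.path.getsize(file_path)
    return total


def dir_size_gb(path="core/local_cache"):
    """Return the folder size in GB, assuming 1 GB = 10**9 bytes."""
    return dir_size_bytes(path) / 10**9


if __name__ == "__main__":
    print(f"{dir_size_gb():.2f} GB")
```

Running it before and after switching models shows how much each download adds to the volume.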
@pieroit pieroit merged commit 7d07401 into cheshire-cat-ai:develop Nov 23, 2023