You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
## Description
Changes:
1. Add text embedding models benchmarks
2. Add new text embedding models to docs
3. Add text embedding models descriptions
### Type of change
- [ ] Bug fix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to not work as expected)
- [x] Documentation update (improves or adds clarity to existing
documentation)
### Checklist
- [x] I have performed a self-review of my code
- [x] I have commented my code, particularly in hard-to-understand areas
- [x] I have updated the documentation accordingly
- [x] My changes generate no new warnings
|[all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)| English | 256 | 384 | All-round model tuned for many use-cases. Trained on a large and diverse dataset of over 1 billion training pairs. |
116
+
|[all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2)| English | 384 | 768 | All-round model tuned for many use-cases. Trained on a large and diverse dataset of over 1 billion training pairs. |
117
+
|[multi-qa-MiniLM-L6-cos-v1](https://huggingface.co/sentence-transformers/multi-qa-MiniLM-L6-cos-v1)| English | 511 | 384 | This model was tuned for semantic search: Given a query/question, it can find relevant passages. It was trained on a large and diverse set of (question, answer) pairs. |
118
+
|[multi-qa-mpnet-base-dot-v1](https://huggingface.co/sentence-transformers/multi-qa-mpnet-base-dot-v1)| English | 512 | 768 | This model was tuned for semantic search: Given a query/question, it can find relevant passages. It was trained on a large and diverse set of (question, answer) pairs. |
116
119
117
120
**`Max Tokens`** - the maximum number of tokens that can be processed by the model. If the input text exceeds this limit, it will be truncated.
118
121
@@ -122,22 +125,31 @@ function App() {
122
125
123
126
### Model size
124
127
125
-
| Model | XNNPACK [MB]|
126
-
| ---------------- | :----------: |
127
-
| ALL_MINILM_L6_V2 | 91 |
128
+
| Model | XNNPACK [MB]|
129
+
| -------------------------- | :----------: |
130
+
| ALL_MINILM_L6_V2 | 91 |
131
+
| ALL_MPNET_BASE_V2 | 438 |
132
+
| MULTI_QA_MINILM_L6_COS_V1 | 91 |
133
+
| MULTI_QA_MPNET_BASE_DOT_V1 | 438 |
128
134
129
135
### Memory usage
130
136
131
-
| Model | Android (XNNPACK) [MB]| iOS (XNNPACK) [MB]|
Times presented in the tables are measured as consecutive runs of the model. Initial run times may be up to 2x longer due to model loading and initialization.
139
148
:::
140
149
141
-
| Model | iPhone 16 Pro (XNNPACK) [ms]| iPhone 14 Pro Max (XNNPACK) [ms]| iPhone SE 3 (XNNPACK) [ms]| Samsung Galaxy S24 (XNNPACK) | OnePlus 12 (XNNPACK) [ms]|
0 commit comments